Samar Singh

India

@singh_samar4028

Dell- Principal Software Engineer

Badges

Problem Solving
Python
Sql

Certifications

Work Experience

  • Principal Data Engineer

    Dell•  July 2017 - Present

    Professional Summary: 14 years of industry experience in the field of Data Engineering, Analytics & Machine Learning. Hands-on expertise in Data Engineering, Machine Learning, IoT, DevOps, System Engineering and System Architecture of platforms, products and solutions for Industrial and Enterprise Applications. Worked & Led projects which involves real time data ingestion using Kafka into spark, transformation with Spark streaming, building ML Pipeline with model updating in batch mode, Cassandra used as intermediate data storage, Automation with Apache Airflow, Jenkin for CI/CD Pipeline, Containerization with Docker and Kubernetes distributed container management. Performance tuning on jobs running on Yarn Cluster. Query performance optimization, partitioning on cluster to achieve best parallel computing. Experience working on AWS S3 for distributed storage, PuTTY and AWS EMR for cluster computing. Packaging code in JAR Files and building portable Container with Docker to run application on cloud computing. Worked extensively on SQL databases, Query performance tuning, ETL pipeline building. Worked on Informatica, Teradata, SAP HANA to build ETL pipeline. Worked on Tableau for data visualization and Analytics. Build ML model for time series forecasting for sales data under “Dell Advance Analytics” Organization. Worked on supervised/unsupervised machine learning, NLP, re-enforcement Learning for various projects. Tools/Technologies- ETL, OOPs, Data Structures & Algo, SQL/NO SQL DBs, Pyspark, Tensor flow, MongoDB, Hive, HDFS, AWS EMR, S3, HBase/HDFS, Informatica, Tableau, Knime, Teradata, Kubernetes, Hadoop/MapReduce Languages – Python, Scala, R Libraries - NumPy, Pandas, Scikit-Learn, PyTorch Developer Tools : IntelliJ, VS code, Pycharm,Anaconda3, Git, Google Collab

  • Team Lead

    accenture•  March 2014 - June 2017

    Role - Team Lead Worked on multiple Data Engineering projects for multiple clients. Few are listed below. 1. Cargill 2. GE 3. Nestle 4. Reckitt Benckiser 5. McCormick Key Area of work –  Time Series Forecasting – Implemented ARIMA model  Building Data Pipeline, Deploying model in Production  Data gathering, Data analysis, Data Cleansing  Understanding Business requirement, Identifying data Sources, Data Modeling  Worked on projects typically involving data validation, preparation, modeling to building vehicle level saturation response curves.  Involved in organizing and directing quality work efforts, connecting statistical solutions with business problems

  • Software Engineer

    Trident Energy•  June 2007 - March 2014

    Role- SCM Analytics Manager Key Area of work-  Sales Analytics, Data Visualization with Tableau  Net Sales realization, Profitability Maximization analysis  Data extraction and modeling, workflows, Test Case Building  Data acquisition, Interaction with Source System, Data Mapping  Demand Forecasting, A/B testing, Hypothesis testing  Supply Chain Analytics  Lean Manufacturing, TOC, Six Sigma, Kaizen  Contribution per machine hour analysis

Education

  • NIT, Jalandhar (Dr. B.R. Ambedkar National Institute of Technology)

    Electronics & Communication Engineering, B.Tech•  June 2003 - May 2007

Skills

singh_samar4028 has not updated skills details yet.