Akshat Agrawal

United States

@95Akshat243

Senior Data Engineer

Badges

Problem Solving
Python
Sql

Certifications

Work Experience

  • Senior Data Engineer

    IBM•  December 2021 - Present•  Chicago, IL

    • Help clients transform their business and solve complex challenges using data and Agile methodology • Reduced process time by 90% to analyze employee skills by creating an NLP model to infer skills from multiple documents • Developed efficient Data Services APIs and Spark clusters using Python, FastAPI, MongoDB, PySpark, and IBM Cloud • Built AI Chatbot using IBM Watson Assistant, IBM Cloud, and SoulMachines to enhance customer service • Led data team to design a gamified solution by extracting job-related performance data to drive behavioral changes • Optimized the existing pipeline to reduce the JSON file size by 80% to incorporate the full set of ~20k technicians • Built data structures, performed data modeling, and built 8 ETL pipelines & 74 scripts in Palantir using PySpark and SQL • Provided ideas in client-facing design thinking and sprint planning workshops regarding scoping and data-related tasks • Built a Rolling Badge pipeline by implementing incremental update logic and built PowerBI reports • Reduced the process time by 95% by automating the process of adding new XP levels and notifications • Supporting senior leadership on POV and RFP to sign a contract for an integrated forecasting & planning project

  • Data Engineer

    Ryder Systems Inc.•  March 2021 - December 2021•  Remote

    • Developed advanced interactive reports and provided insights using Power BI in an Agile development environment • Built ETL pipelines in Azure Data Factory and Alteryx using SQL Server, Amazon Redshift, and Azure managed instances • Created Logic Apps to move data from FTP sites to blob storage for ETL ingestion • Helped client reduce carbon footprint by 50% by providing insights on the Power BI report created using DAX formulas • Optimized SQL DB and pipeline performance by 70% by implementing incremental refresh and indexing

  • Machine Learning Researcher

    Ontoadaptive LLC.•  June 2020 - February 2021•  Chicago, IL

    • Research and build new state-of-the-art NLP models in Agile environment to improve AI Health Monitoring and Alerting App • Implemented LSTM, CNN, Hierarchical and Hybrid models for Text Classification; Improved model accuracy from 87% to 94% • Built end to end web application using Python Flask API and PostgreSQL in Google Cloud Platform (GCP) • Built an ETL pipeline running on Spark distributed clusters using Apache Beam and Cloud DataFlow to deploy the ML model

  • Data Science Intern

    The Joint Commission•  January 2020 - May 2020•  Chicago, IL

    • Automated process of de-identifying Protected Health Information (PHI) from safety reports; Reduced process time by 91% • Deployed machine learning models using Python, Spacy and NLTK packages on Azure ML Studio; Built an AI web application

  • Systems Engineer

    Infosys Ltd•  August 2016 - May 2020•  Pune, India

    • Reduced the manual ticket resolution time by 40% by automating the alert monitoring system using Python and ServiceNow • Generated and analyzed server performance and log reports using SQL and Tableau for senior-level management • Developed migration and restore strategies of database servers during disaster drill with a 2-hour recovery time objective

Education

  • University of Illinois at Chicago, Chicago

    Computer Information Systems, MS•  August 2018 - May 2020

  • NIT, Surat (Sardar Vallabhbhai National Institute of Technology)

    Electrical Engineering & Computer Science, B.Tech•  July 2012 - May 2016

Skills

SQL
Python(Intermediate)
Python(Advanced)
Algorithm
Azure
AWS (Amazon Web Services)
MongoDB
Machine Learning
Flask
RESTful API
Git