Kartikeya Singh

India

@kartikeyasingh_4

Badges

Problem Solving
Python
Sql

Certifications

Work Experience

  • Associate Software Engineer II - Data

    Optum•  June 2022 - Present

    Created a high-performance ETL script with AWS Glue for the anonymization of 1 Billion records of PHI and PII, leading to enhanced data integrity and privacy compliance. Overhauled a complex Java JAR Spark job into a streamlined PySpark data pipeline, resulting in a 20% reduction in processing time while simplifying the creation of 50 tables and enhancing maintainability of the codebase. Optimized data processing by consolidating two Big Data pipelines into a single, efficient pipeline, utilizing dataframes to achieve a savings of 15TB of storage in Amazon S3. Spearheaded development of sample datasets, converting large datasets into efficient n% subsets, significantly reducing testing time for developers. Improved testing coverage and workflow efficiency reducing testing time by 50%. Optimized and reduced the runtime of 2 Spark applications using techniques such as shuffle partitions, checkpoint(), cache(), repartition(), broadcast(), and Window functions to enhance performance and efficiency.

Education

  • Kalinga Institute of Industrial Technology

    B.Tech, Electronics and computer science•  July 2018 - June 2022•  CGPA: 8.5

  • Jingle Bell Academy

    Higher Secondary•  April 2016 - April 2017•  Percentage: 84.6

  • Grammar Academy

    Secondary•  April 2014 - April 2015•  CGPA: 9.6

Skills

AWS Glue
AWS Step Functions
AWS Lambda
AWS EMR
Terraform
GIT
MySQL
AWS Redshift
Apache Spark
Apache Kafka
Python
C++
SQL
PL/pgSQL