Akhilesh Pratap Shahi
Data Engineer
Email
[email protected]
Phone
+91-844-7020-911
akhileshshahi
GitHub
shahiakhilesh1304
Location
Bangalore, India
Visa Status
Open For Relocation (US, Europe, Asia)
Profile

Results-driven Data Engineer with over 5 years of experience in designing and optimizing scalable ETL/ELT pipelines and event-driven architectures. Proven expertise in Apache Spark, Kafka, and Azure Databricks, with a track record of cutting data processing time by 35% and reducing transfer costs by 90%. Strong background in building real-time analytics systems and high-throughput APIs for telecom, healthcare, and mobility sectors.

Skills
Programming & Scripting

Python, Java, Scala, Shell Scripting, J2EE, HTML5, JavaScript, Bootstrap

Performance & Optimization

Performance Tuning, Query Optimization, Caching Strategies

Databases & Warehousing

MongoDB, MySQL, Google BigQuery, Apache Druid, Data Warehousing, OLAP/OLTP, SQL, NoSQL, Data Governance, Data Quality

Big Data & Distributed Computing

Apache Spark, PySpark, Apache Kafka, Apache Hadoop, Hive

Cloud & Tools

Azure Databricks, Azure Delta Lake, Azure Data Factory, Azure Blob Storage

DevOps & Containers

Docker, Kubernetes (basics), Git, GitHub

Data Engineering & ETL

ETL/ELT Pipelines, Data Modeling (Dimensional, Relational), Snowflake Schema, ER Diagrams, Data Wrangling, Data Transformation, Data Aggregation, Data Annotation, Data Retention, Data Backup

Data Analysis & Visualization

Pandas, NumPy, SciPy, Matplotlib, Seaborn, Tableau

Orchestration & Workflow Tools

Apache Airflow, CI/CD basics

Professional Experience
03/2023 – present
Data Engineer, Reliance Jio Infocomm Pvt Ltd
Bangalore, India
  • Developed and maintained Spark-based ETL pipelines for user behavior data, processing 1.6B+ records/day.
  • Reduced audience retrieval time by 90% through efficient API design for cohort-based targeting.
  • Increased campaign performance by 22% via re-targeting workflows based on real-time activity signals.
  • Built data ingestion APIs and automation frameworks, cutting prep time by 96% and cloud costs by 90%.
10/2022 – 02/2023
Software Engineer, MpHrx
Gurgaon, India
  • Created end-to-end data ingestion pipelines using Azure Databricks and PySpark.
  • Improved claims data accuracy by 20% and boosted pipeline throughput by 45%.
  • Optimized MongoDB and Hibernate queries, reducing DB latency by 50%.
03/2022 – 07/2022
Software Developer, SAR GROUP (Lectrix E-Vehicle)
Gurgaon, India
  • Engineered Spring Boot APIs for real-time vehicle telemetry; reduced response latency by 40%.
  • Modeled high-volume MongoDB collections to support IoT sensor data storage and analytics.
01/2020 – 03/2022
Software Developer (Co-Founder), Fintree Global Research
Lucknow, India
  • Built scalable backend infrastructure using Spring Boot and Java for a data research platform.
  • Led architecture and deployment of the MVP product, improving system performance by 28%.
Key Projects
03/2023 – present
Cohort Intelligence, Data Engineer - JIO
  • Designed Spark jobs and APIs for segmenting 1.6B+ users into behavioral cohorts.
  • Increased pipeline throughput by 36.8% and cut manual overhead by 35%.
11/2023 – present
Retargeting, Data Engineer - JIO
  • Implemented segmentation logic using BPID & IFA identifiers for re-engagement campaigns.
  • Boosted campaign engagement rates by 22% via behavior-based user targeting.
08/2024 – present
AIMS API, Data Engineer - JIO
  • Built a Flask API managing 100M+ user IDs for behavioral classification.
  • Enabled real-time updates to marketing logic, improving targeting speed by 40%.
05/2023 – 07/2025
UIDUpload, Data Engineer - JIO
  • Developed a high-throughput Flask API for uploading user cohorts with >99.8% accuracy.
  • Improved activation consistency across SMS, RCS, and email channels.
02/2024 – 04/2025
Point Of Interest, Data Engineer - JIO
  • Created ETL pipelines aggregating diverse data sources for user interest mapping.
  • Enhanced mapping precision and reduced query latency by 30%.
12/2024 – 03/2024
JioTV Analytics, Data Engineer - JIO
  • Designed a Kafka-Spark-Druid pipeline for real-time viewer behavior analytics.
  • Decreased dashboard refresh time by 50%, enabling near real-time insight delivery.
04/2024 – present
Campaign Data API, Data Engineer - JIO
  • Created a Flask API to parse dynamic audience rules (include/exclude conditions).
  • Cut campaign data prep time by 96% and reduced transfer costs by 90%.
07/2023 – present
Custom Audience API, Data Engineer - JIO
  • Developed Spark-triggering APIs for dynamic cohort campaign orchestration.
  • Improved personalization and campaign delivery accuracy across channels.
12/2022 – 02/2022
Patient Data Visibility, Backend Developer - MpHrx
  • Built a high-efficiency Hibernate data layer for structured patient data.
  • Reduced average DB response time by 50%, improving analytics responsiveness.
01/2022 – 02/2022
Intercambio, Data Engineer - MpHrx
  • Unified claim ingestion from 5+ healthcare data providers for UNIMEDO using PySpark and Azure.
  • Enhanced accuracy by 20% and reduced data load latency by 40%.
Education
08/2024 – present
Woolf University, M.S. in Computer Science

Specialized in ML/AI

Valletta, Malta
08/2015 – 04/2019
Maharishi Markandeshwar (Deemed to be University), B.Tech

Major in Computer Science

Ambala, India
Certificates
Recognition and Achievements
  • Guest Lecturer – Python, BSA Engineering College
  • Former Member – Computer Society of India
  • Winner – Hackathons, Mono Acts, and Inter-college Theater Events
  • Vice President – Trojan Society | President – Pratibimb Theatre Club