3 years building production ETL/ELT pipelines | 2 Microsoft Azure certifications | Processed 10,000+ hours of multimedia data | Mentoring 20+ aspiring data engineers
Data Engineer with 3+ years of experience building production ETL/ELT pipelines and AI-powered data systems across Azure and AWS. Proven track record designing data warehouses with dimensional modeling, automating data ingestion from hybrid sources, and optimizing large-scale multimedia processing pipelines. Specialized in healthcare data systems with hands-on expertise in vector databases, semantic search, and generative AI integration.
- •Microsoft Certified: Fabric Data Engineer Associate (DP-700) – Microsoft, October 2025
- •Microsoft Certified: Azure Data Engineer Associate (DP-203) – Microsoft, September 2024
Programming & Databases: Python, SQL, PySpark, SQL Server, PostgreSQL, MySQL
Cloud & Big Data: Azure Data Factory, Microsoft Fabric, Databricks, AWS (S3, EC2), Azure Storage
Data Architecture: Data Warehousing, Dimensional Modeling (Star/Snowflake), SCD Type 2, ETL/ELT Pipelines
Analytics & Tools: Power BI, Git, Data Quality Validation, Pipeline Monitoring