FlowCV Logo
Summary
  • 3+ Years of Experience in Data Engineering with Diversified Tools and Technologies.
  • Languages: Arabic, English (IELTS: 7 Bands), and Urdu.
  • Two Gold Medals.
  • Data Modeling Concepts such as Logical and Physical Data Modeling using Dimension Modeling.
  • Strong Experience in Communicating with Relevant Stakeholders in Requirement Analysis and Presenting them Multiple Data Solutions.
Skills

SQL, Python, AWS Cloud services, Azure Cloud Services, Data Warehousing, Data Governance, ETL/ELT, Snowflake, PowerDesigner, NGS Data Analysis, Matillion, Scripting, DBeaver, Normalization, Dimensional Modelling, Master Data Management, Confluence, Jira.

Professional Experience
Senior Data Engineer, ADLAB Solutions
Feb 2025 – present | Islamabad, Pakistan
  • Architected and developed end-to-end data solutions, integrating APIs, databases, and cloud platforms like AWS, Snowflake, and S3 to support data warehousing, analytics, and reporting needs.
  • Utilized Python and SQL scripting to automate data pipelines and integrate disparate systems, ensuring data accessibility and scalability.
  • .Spearheaded the development and optimization of data workflows across SFTP, HubSpot, Dropbox, Ticketmaster, and Salesforce systems, ensuring seamless data extraction, transformation, and loading (ETL) processes for various business applications.
  • Utilized Control-M for job scheduling and workload automation, ensuring smooth orchestration of complex data workflows across systems and environments.
  • Tools, Technologies, and Skills: AWS (Lambda, Glue, S3, RDS), Snowflake, Data Warehousing, ELT/ETL, SQL Scripting, Python, Data Analytics, API Integration, HubSpot, Salesforce, SFTP, Ticketmaster, Dropbox, CRM, ERB, Control-M, Data Provisioning, Data Governance, Data Integration, Data Security, Cloud Data Solutions.
  • Senior Data Engineer, MetaSol pk
    Jan 2024 – Jan 2025 | Islamabad, Pakistan

    · Architected end-to-end data warehousing solutions, including staging areas, data marts, data warehouses, and operational data stores, to support analytics and reporting needs. Used Azure Synapse Analytics and Azure Data Lake to build flexible data models that catered to both current and future business needs.

    · Spearheaded the design, development, and maintenance of scalable data pipelines using Azure Cloud Services, ensuring seamless data integration and processing across the organization. Leveraged Azure Data Factory (ADF) and Azure Databricks to deliver high-performance, reliable solutions that met evolving business requirements.

    · Optimized OLAP systems by tuning cube structures, aggregations, and indexing strategies for improved query performance.

    · Implemented process optimizations for ETL workflows, reducing execution times and increasing efficiency. Led troubleshooting initiatives to quickly identify and resolve issues, ensuring minimal downtime and data accuracy with Azure Data Factory and Azure Databricks.

    · Developed and optimized physical and logical data models in Azure Synapse Analytics and Azure SQL Database, enhancing system performance and reducing latency. Improved query response times for both batch and real-time data processing.

    · Tuned and optimized Azure-based database systems (Azure SQL Database, Azure Synapse Analytics) to improve storage efficiency, query performance, and processing speed.

    · Led data provisioning and access management, ensuring data security and governance. Provided strategic direction to safeguard critical data assets while ensuring they were accessible and compliant with relevant regulations.

    · Ensured high data quality standards by rigorously monitoring, reconciling, and validating ETL processes. Applied Azure Data Factory and Azure Monitor to maintain the integrity of data in the data warehouse.

    · Tools, Technologies, Skills, & Languages: Azure Data Factory, Azure Synapse Analytics, Azure Databricks, Azure SQL Database, Azure Data Lake, SQL, Indexing, Python, PySpark, Data Warehousing, ETL/ELT, Data Migration, Enterprise Data Catalog, Incremental Loading, Semi-Structured Data, SDLC, Data Pruning, Distributed Data Processing, Delta Lake, Medallion Architecture, Big Data.

    Data Engineer, DBiz.ai Client: T-Mobile US
    Mar 2022 – Dec 2023 | Sydney, Australia

    · Developed, tested, and maintained robust, scalable data pipelines using industry-best practices to ensure seamless data flow and processing across the organization. Consistently delivered optimized solutions that supported evolving business requirements, improving pipeline performance and reliability using AWS Cloud Services.

    · Played a key role in designing the architecture for data warehousing solutions, including staging areas, data warehouse, data marts, and operational data stores. Delivered data models and architectures that supported both current and future analytics and reporting needs.

    · Led troubleshooting efforts for ETL processes, quickly identifying and resolving issues to minimize downtime and ensure data accuracy. Implemented process improvements that reduced ETL execution time and increased efficiency.

    1 / 2

    · Tuned and optimized database systems to ensure high performance, recommending and implementing enhancements that improved storage, query performance, and processing speed.

    · Automated critical database processes, creating scripts and tools that improved operational efficiency and reduced manual workloads.

    · Owned data provisioning and access management, providing strategic direction on data security and governance to safeguard critical data assets while ensuring accessibility.

    · Ensured the integrity of data within the data warehouse through rigorous monitoring, reconciliation, and validation of ETL processes, consistently achieving high data quality standards.

    · Tools, Technologies, Skills, & Languages: AWS Lambda, S3, AWS Step Function, AWS SNS, AWS Glue, AWS RDS, AWS Aurora, Snowflake, DBeaver, SQL, Python, Boto3, Master Model, Schema Enforcement, Date-Warehousing, ETL/ELT, CDC, Incremental loading, Hashing Algorithm, SCD.

     

    Software Engineer, Hello World Tech
    Sep 2021 – Feb 2022 | RY Khan, Pakistan
  • Created a Property Agency Website & an Online Shopping Website.
  • Internee, Devicon Software House
    Jun 2021 – Aug 2021 | RY Khan, Pakistan
  • I Have Applied Programming Practically Here, Created Multiple DEMO Projects.
  • Projects
    Samba Project, Azure Data Engineer
    May 2023 – Dec 2023

    · Technologies Used: Azure Data Factory (ADF), Azure Synapse Analytics, Azure SQL Database, Azure Blob Storage.

    · Designed and implemented end-to-end ETL pipelines using Azure ADF.

    · Integrated Azure Synapse Analytics for large-scale data processing and real-time analytics, enabling efficient transformation and aggregation of large healthcare datasets.

    · Optimized data flows for enhanced performance, reducing data processing times by 15% through performance tuning and leveraging Synapse's parallel processing capabilities.

    · Established data governance and security protocols, ensuring compliance with internal security standards across all data pipelines using masking, tokenization and row access policy.

    Khaity Inc. Project, Image Analysis Software That Detect Diseases in Rice and Sugarcane
    Jan 2021 – Aug 2021
  • Integrated the Datasets using Online Public Databases and Pre-processed them.
  • Developed a Random Forest Model (ML) that Provides the Base of the Program to Detect Different Diseases & Abnormalities on the leaves.
  • Education
    COMSATS University Islamabad, Master of Science in Bioinformatics
    Aug 2022 – Jun 2024 | Islamabad, Pakistan.

    Related Courses: Biostatistics for Bioinformatics, Next Generation Sequencing Data Analysis, Bioinformatics, and Computer-Aided Drug Design.

    CGPA: 3.41 out of 4.

    Thesis Title & Work: AI's Radiogenomic Symphony Mapping Lung Cancer through Gene-image Correlation:

  • Built Multi-models and Deep Learning Model for Survival Analysis That Outperformed Previous Models ROC-AUC in Literature.
  • Built CNN and DNN Models to Predict EGFR Mutation Status in Lung Cancer Using CT Scans.
  • Khwaja Fareed University of Engineering & Information Technology, Bachelor of Science in Bioinformatics
    Aug 2018 – Jun 2022 | Rahimyar Khan, Pakistan.

    Related Courses: Data Mining, Bioinformatics Software Engineering, AI, Modeling & Simulation, Graphics & Visualization, DBMS, Probability & Statistic, OOP, Data Structures & Algorithms, and Programming Fundamentals.

    Gold Medalist with CGPA: 3.83 out of 4.

    Awards

    Two Gold Medals, Three Merit Scholarships, Three Years Volunteering Awards, NGIRI Award (Ignite).

    Languages
    English (IELTS: 7 Bands)|Arabic|Urdu
    Certifications

    Google Data Analytics Professional Certificate,

    Microsoft Office Specialist(MOS) Certification,

    Python Kaggle's Certificate,

    SQL Udemy's Certificate,

    Matillion Udemy's Certificate,

    Snowflake Udemy's Certificate,

    Data Warehouse Udemy's Certificate,

    ML Specialization Coursera's Certificate,

    Databricks Lakehouse Udemy’s Certificate.

    Publications
    A Comprehensive NSCLC Radiogenomics Review Covering the latest AI Approaches, Statistical Analysis Methods, and The Most Important Gene and Image-based Biomarkers, Clinical Lung Cancer
    Repurposing drug candidates for CSSV virus through virtual screening technique, Changhua Journal of Medicine.

    DOI: 10.6501/CJM.202306_21(2).0001.

    Interests
    Table Tennis, Football, Hiking, and Gaming.
    2 / 2