- •3+ Years of Experience in Data Engineering with Diversified Tools and Technologies.
- •Languages: Arabic, English (IELTS: 7 Bands), and Urdu.
- •Two Gold Medals.
- •Data Modeling Concepts such as Logical and Physical Data Modeling using Dimension Modeling.
- •Strong Experience in Communicating with Relevant Stakeholders in Requirement Analysis and Presenting them Multiple Data Solutions.
SQL, Python, AWS Cloud services, Azure Cloud Services, Data Warehousing, Data Governance, ETL/ELT, Snowflake, PowerDesigner, NGS Data Analysis, Matillion, Scripting, DBeaver, Normalization, Dimensional Modelling, Master Data Management, Confluence, Jira.
· Architected end-to-end data warehousing solutions, including staging areas, data marts, data warehouses, and operational data stores, to support analytics and reporting needs. Used Azure Synapse Analytics and Azure Data Lake to build flexible data models that catered to both current and future business needs.
· Spearheaded the design, development, and maintenance of scalable data pipelines using Azure Cloud Services, ensuring seamless data integration and processing across the organization. Leveraged Azure Data Factory (ADF) and Azure Databricks to deliver high-performance, reliable solutions that met evolving business requirements.
· Optimized OLAP systems by tuning cube structures, aggregations, and indexing strategies for improved query performance.
· Implemented process optimizations for ETL workflows, reducing execution times and increasing efficiency. Led troubleshooting initiatives to quickly identify and resolve issues, ensuring minimal downtime and data accuracy with Azure Data Factory and Azure Databricks.
· Developed and optimized physical and logical data models in Azure Synapse Analytics and Azure SQL Database, enhancing system performance and reducing latency. Improved query response times for both batch and real-time data processing.
· Tuned and optimized Azure-based database systems (Azure SQL Database, Azure Synapse Analytics) to improve storage efficiency, query performance, and processing speed.
· Led data provisioning and access management, ensuring data security and governance. Provided strategic direction to safeguard critical data assets while ensuring they were accessible and compliant with relevant regulations.
· Ensured high data quality standards by rigorously monitoring, reconciling, and validating ETL processes. Applied Azure Data Factory and Azure Monitor to maintain the integrity of data in the data warehouse.
· Tools, Technologies, Skills, & Languages: Azure Data Factory, Azure Synapse Analytics, Azure Databricks, Azure SQL Database, Azure Data Lake, SQL, Indexing, Python, PySpark, Data Warehousing, ETL/ELT, Data Migration, Enterprise Data Catalog, Incremental Loading, Semi-Structured Data, SDLC, Data Pruning, Distributed Data Processing, Delta Lake, Medallion Architecture, Big Data.
· Developed, tested, and maintained robust, scalable data pipelines using industry-best practices to ensure seamless data flow and processing across the organization. Consistently delivered optimized solutions that supported evolving business requirements, improving pipeline performance and reliability using AWS Cloud Services.
· Played a key role in designing the architecture for data warehousing solutions, including staging areas, data warehouse, data marts, and operational data stores. Delivered data models and architectures that supported both current and future analytics and reporting needs.
· Led troubleshooting efforts for ETL processes, quickly identifying and resolving issues to minimize downtime and ensure data accuracy. Implemented process improvements that reduced ETL execution time and increased efficiency.
· Tuned and optimized database systems to ensure high performance, recommending and implementing enhancements that improved storage, query performance, and processing speed.
· Automated critical database processes, creating scripts and tools that improved operational efficiency and reduced manual workloads.
· Owned data provisioning and access management, providing strategic direction on data security and governance to safeguard critical data assets while ensuring accessibility.
· Ensured the integrity of data within the data warehouse through rigorous monitoring, reconciliation, and validation of ETL processes, consistently achieving high data quality standards.
· Tools, Technologies, Skills, & Languages: AWS Lambda, S3, AWS Step Function, AWS SNS, AWS Glue, AWS RDS, AWS Aurora, Snowflake, DBeaver, SQL, Python, Boto3, Master Model, Schema Enforcement, Date-Warehousing, ETL/ELT, CDC, Incremental loading, Hashing Algorithm, SCD.
· Technologies Used: Azure Data Factory (ADF), Azure Synapse Analytics, Azure SQL Database, Azure Blob Storage.
· Designed and implemented end-to-end ETL pipelines using Azure ADF.
· Integrated Azure Synapse Analytics for large-scale data processing and real-time analytics, enabling efficient transformation and aggregation of large healthcare datasets.
· Optimized data flows for enhanced performance, reducing data processing times by 15% through performance tuning and leveraging Synapse's parallel processing capabilities.
· Established data governance and security protocols, ensuring compliance with internal security standards across all data pipelines using masking, tokenization and row access policy.
Related Courses: Biostatistics for Bioinformatics, Next Generation Sequencing Data Analysis, Bioinformatics, and Computer-Aided Drug Design.
CGPA: 3.41 out of 4.
Thesis Title & Work: AI's Radiogenomic Symphony Mapping Lung Cancer through Gene-image Correlation:
Related Courses: Data Mining, Bioinformatics Software Engineering, AI, Modeling & Simulation, Graphics & Visualization, DBMS, Probability & Statistic, OOP, Data Structures & Algorithms, and Programming Fundamentals.
Gold Medalist with CGPA: 3.83 out of 4.
Two Gold Medals, Three Merit Scholarships, Three Years Volunteering Awards, NGIRI Award (Ignite).
Google Data Analytics Professional Certificate,
Microsoft Office Specialist(MOS) Certification,
Python Kaggle's Certificate,
SQL Udemy's Certificate,
Matillion Udemy's Certificate,
Snowflake Udemy's Certificate,
Data Warehouse Udemy's Certificate,
ML Specialization Coursera's Certificate,
Databricks Lakehouse Udemy’s Certificate.
DOI: 10.6501/CJM.202306_21(2).0001.