Data Engineer with 3+ years of experience in designing and optimizing ETL pipelines and data workflows. Proficient in Python, SQL, and AWS technologies (Snowflake, Matillion, Airflow), and Tableau, with a proven ability to transform and integrate large datasets to drive actionable insights and support business growth.
Tech Stack: AWS (Glue, S3, Lambda, AppFlow, Step Functions, Athena), Kinesis, CloudFormation, Python
Tech Stack: Python 3. x, PySpark, SQL, AWS (Lambda, Glue, Redshift, S3), GitHub, JIRA
Python | SQL | Spark
S3 | RDS | Redshift | Glue | Cloud Formation Template | Lambda | AppFlow | Step Functions
MySQL | MongoDB | Cassandra | HBase
Git | Airflow
Spark | Hadoop | Hive | Kafka | Sqoop
Docker | Pyspark
Snowflake | Matillion
Designed and implemented a data pipeline using Matillion to load and transform data from AWS S3 into Snowflake, ensuring data integrity and enabling advanced analytics and visualization in Power BI.
Tech Stack: Matillion, AWS S3, Snowflake, Power BI
Designed and developed an ETL pipeline to export data from the MySQL transaction database to AWS Redshift for data analysis.
Tech Stack: Apache Airflow, PySpark, Amazon Redshift, S3, Apache Kafka