Big Data Engineer with 5+ years of experience in all phases of the software development life cycle. Passionate about Big Data and Machine Learning technologies and the delivery of effective solutions through creative problem-solving. Track record of building large scale systems using Big Data and Machine Learning technologies.
Python | SQL | Spark
S3 | EC2 | EMR | RDS | Redshift | Glue | CloudWatch | ECS
MySQL | MongoDB | Cassandra | HBase
Data Factory | Databricks | Functions | Blob | Synapse | Delta Lake
Spark | Hadoop | Hive | Kafka | Sqoop
Pandas | Numpy | Sklearn | PySpark | Pytorch |
Matplotlib | Seaborn | TFX
Docker | Docker Compose | GitHub Actions | MLflow | Git | DVC | Airflow
Cloud Storage | Compute Engine | Dataproc | BigQuery | Dataflow | GKE | AlloyDB
Categorization of financial product and service complaints registered by consumers.
Tech: Python, PySpark, Grafana, Prometheus, AWS, Azure
Tech Stack: Apache Airflow, PySpark, Apache Kafka, Amazon S3, AWS Glue, Amazon Redshift