Senior Data Engineer
Acentra HealthData Engineer
Parkland HealthJunior Data Engineer
LawnStarterData Analyst
LawnStarterBachelor's degree in Computer Science
The University of Texas at Austin- •Python
•SQL (Advanced query optimization, window functions, CTES)
•Scala
•PySpark
•Bash (scripting)
- •Snowflake
•Amazon Redshift
•BigQuery
•Dimensional Modeling (Star & Snowflake Schema)
•Data Lake Architecture
•Data Mart Design
- •Terraform (Infrastructure as Code)
• CI/CD for Data Pipelines
• Git/GitHub
• Azure DevOps
Amazon Web Services (AWS):
•S3
•Redshift
•Glue
•EMR
•Lambda
•EC2
•RDS
Microsoft Azure:
•Azure Data Factory
•Azure Synapse
•Azure Data Lake Storage (ADLS)
•Azure Functions
Google Cloud Platform (GCP):
•BigQuery
•Cloud Storage
•Dataflow
- •Apache Spark (batch & performance tuning)
• Databricks
• AWS EMR
• Azure Synapse Analytics
• GCP Dataflow
- •ETL/ELT Pipeline Design
•Apache Airflow
•Azure Data Factory (ADF)
•Workflow automation
•Data pipeline monitoring & alerting
- •Healthcare Claims Processing
• X12 Transactions
• HL7 and FHIR Standards
• Clinical Data Modeling
• CMS Regulatory Reporting
• HIPAA and PHI Governance
• ICD Coding Concepts
• Healthcare Data Interoperability
• Clinical and Financial Data Pipelines
- •Great Expectations
• Data validation frameworks
• HIPAA-compliant data handling
- •Healthcare data transformations
- •Apache Kafka
• AWS Kinesis
- •Tableau
• Looker
• KPI Development
• A/B Testing Analytics
• Business Metrics Reporting