Python
Scala
Java
SQL
JavaScript
Go
Rust
R
C#
AWS (EMR, Lambda, S3, SageMaker, EC2)
Azure (Data Factory, Synapse, OpenAI, Kubernetes, APIM)
GCP (BigQuery, Dataflow, Vertex AI)
FHIR
HL7
C-CDA
EDI
Azure AD
Azure API Management
OAuth2
PyTorch
Spark
Scikit-learn
Transformers
TensorFlow
Semantic Kernel
FastAPI
Databricks
Delta Lake
Hive
Redshift
Snowflake
Kafka
Airflow
dbt
Docker
Kubernetes
Terraform
Linux
CI/CD
Ding X., Yan C., Zhao Y., Yang Z. (2018). Efficient Processing of TopK Dominating Queries on Incomplete Data Using MapReduce. *ICCCS 2018*, Cloud Computing and Security, pp. 78–89.
https://credentials.databricks.com/aa12012c-d1ae-195-a99c-2b95d99ffa2#acc.apZlwUGe
https://www.credly.com/badges/1351a19d-0020-3f3-8fa0-16d8583bceb0/public_url
https://credentials.databricks.com/6abe7e2-163a-3ad-ab2f-bee8999a90f#acc.sgXrZzbq
https://learn.microsoft.com/en-us/users/jonroosevelt/transcript/dlozriqzx8g9wm
https://www.credly.com/badges/33bd7b0-5301-7b-b91d-68b5275e627/public_url
RAG, Llama Index, LangChain/LangGraph/LangSmith, LLM fine-tuning, evaluation/governance, agentic frameworks, MCP, prompt engineering, vector databases, agentic memory
Spark, Databricks, Delta Lake, Airflow, Kafka, Hive, Redshift, Snowflake, AWS EMR, Azure Data Factory, dbt, medallion architecture, structured streaming
AWS (EMR, Lambda, S3, SageMaker, EC2)
Azure (Data Factory, Synapse, OpenAI, Kubernetes, APIM)
GCP (BigQuery, Dataflow, Vertex AI)
FHIR, HL7 (v2/v3), C-CDA, EDI, HIPAA/GDPR/CCPA compliance, EHR/EMR integration, data normalization, clinical NLP
PyTorch, Scikit-learn, Transformers, TensorFlow, ML pipelines, CI/CD, model evaluation, experiment tracking, MLflow
Docker, Kubernetes, Terraform, Linux, CI/CD automation, Azure AD, API Management, secure API design, identity integration