Salma Mohsen
Data Scientist
Work Experience
AI Engineer, Deloitte
Jan 2024 – present | Cairo, Egypt
Data Scientist, MF Strategy, Part-time
2023 – 2024 | Lausanne, Switzerland
Data Scientist, Al Tadamun Microfinance Foundation
Jan 2023 – Jan 2024 | Cairo, Egypt
Education
Postgraduate Diploma in AI and Machine Learning, Institute of Information Technology (ITI)
9-month program powered by École Pour l'Informatique et les Techniques Avancées
Project grade: excellent
Skills
Programming Languages
Python, SQL, R, Linux shell scripting, C++
Architecture & Design
Solution Architecture, Software Design Patterns (Factory, Pipeline, Strategy), Modular System Design, Scalable Cloud Architecture.
Mathematics and Statistical Analysis
- Hypothesis Testing, Confidence Intervals, and A/B Testing
- Descriptive and Inferential Statistics
- Linear Algebra (matrix operations, eigenvectors, SVD)
- Calculus (gradients, optimization in ML models)
- Cost Functions and Loss Metrics (MSE, Cross-Entropy, LogLoss)
Machine Learning and Deep Learning
- Supervised Learning: Regression, Classification (Logistic Regression, Random Forest, XGBoost, SVM, etc.)
- Unsupervised Learning: Dimensionality Reduction (PCA, t-SNE), Clustering (KMeans, DBSCAN, etc.)
- Computer Vision: Object Detection and Image Classification using CNN variants (e.g., VGG, ResNet, YOLO), Image Preprocessing and Enhancement (OpenCV, PIL)
- Time-Series Forecasting: Trend, Seasonality, and Noise Decomposition; Forecast Modeling using ARIMA, RNN, and LSTM
- NLP: Text Preprocessing, Sentiment Analysis, NER, Text Classification, Summarization, Keyword Extraction
- Model Optimization: Feature engineering, cross-validation, hyperparameter tuning
- Machine Learning Frameworks: scikit-learn, TensorFlow, PyTorch, MLlib
Deployment
- Deployment Frameworks: FastAPI, Flask
- Containerization: Docker
Cloud
Azure, AWS, Databricks, Dataiku
Database and Data Engineering
- Databases: PostgreSQL, Snowflake, MongoDB
- ETL & Data Pipelines: Design and deployment of scalable ETL workflows using Airflow, Azure Data Factory (ADF), and dbt
- Data Modeling: Relational and document-based schema design, normalization, incremental loading strategies
- Data Migration & Validation: End-to-end data migration using dbt; automated validation via SQL queries and conditional logic
- Data Processing: PySpark, Pandas
- Query Optimization: SQL tuning for performance, partitioning, indexing strategies
- Data Governance: Schema lifecycle management (e.g., Alembic + SQLAlchemy), data quality checks, versioned deployments
Data Visualization and Dashboards
- Business Intelligence: Power BI, Tableau
- Python Visualization: Matplotlib, Seaborn, Plotly Dash
- Skills: Interactive Dashboard Design, KPI Tracking, Visual Storytelling
LLM Agents
- LLM Theory: Transformer architecture, self-attention, multi-head attention, encoder-decoder models, positional encoding, fine-tuning methods
- LLM Agent Frameworks: LangChain, LangGraph, Google ADK
- Prompt Engineering: Structured and optimized prompts for summarization, classification, and extraction tasks
- Retrieval-Augmented Generation (RAG): Integrated Milvus vector databases for context-aware information retrieval
- Speech-to-Text: WhisperX, NeMo
- LLM Agent Design: Multi-agent workflows, task orchestration
Certificates