Experienced Data Scientist with a demonstrated history of working in the IT industry. Collected data and drew insights to improve business operations and solve business problems.
Tech: LangChain, vector database, RAG, OpenAI, LLM, GitHub Actions, EC2
Tech: OpenAI, RAG, LangChain, Pinecone, AWS S3, Next.js, TypeScript, Tailwind CSS
Tech: XGBoost, LSTM, PyTorch, TensorFlow, Hugging Face
Tech: CNN, Data Augmentation, TensorFlow, Adam, ReLU, Softmax
Tech: Oversampling (SMOTE), Recall, Precision, weighted F1, ExtraTree classifier, Streamlit
Tech: KNN Imputer, Label Encoder, Cross-Validation, CatBoost and RF regressors, Optuna
LangGraph, smolagents, CrewAI, Pydantic AI
DNN, CNN, RNN, Transfer learning, LSTM, LLM, GenAI, RAG
Python, Java, HTML and CSS
scikit-learn, SciPy, statsmodels, pandas, NumPy, Seaborn, Matplotlib, Selenium, BeautifulSoup4, NLTK, TensorFlow, Keras, PyTorch, LangChain, OpenAI, Hugging Face, SHAP, Boto3, Flask, FastAPI, Streamlit
SQL
AWS (SageMaker, S3, Lex, Lambda, Bedrock, RDS, EC2, API Gateway, Secrets Manager), GCP (BigQuery, Dataproc, Dataflow, Vertex AI, Cloud SQL, etc.)
Classification, Regression, Clustering, Decision Trees, K-Means Clustering, Hierarchical Clustering
Predictive analysis, Hypothesis Testing and Confidence Intervals, Principal Component Analysis, LDA and Dimensionality Reduction
Git
Unix
Excel, GCP Data Studio
n8n, Make.com, Zapier