Sagar TateData Scientist
Profile
Experienced Data Scientist with a demonstrated history of working in IT industry. Collected data and drew insights to improve business operations and solve business problems.
Personal Projects
WeldCraft, Hybrid Search, RAG
Apr 2024 – Apr 2024
Tech: Langchain, Vector databse, RAG, Openai, llm, github actions, ec2
Nov 2023 – present
Tech: OpanAI, RAG, Langchain, Pinecone, AWS S3, NextJs, Typescript, Tailwindcss
Feb 2024 – May 2024
Tech: XGBoost, LSTM, Pytorch, TensorFlow, HuggingFace
May 2022 – May 2022
Tech: CNN, Data Augmentation, TensorFlow, Adam, Relu, SoftMax
Mar 2022 – Mar 2022
Tech: Oversampling (SMOTE), Recall, Precision, weighted f1, ExtraTree class., Streamlit
Feb 2022 – Feb 2022
Tech: KNN Imputer, Label Encoder, Cross-Validation, Catboost and RF regressor, Optuna
Skills
Machine Learning — Classification, Regression, clustering, Decision Trees, K-Means Clustering, hierarchical clustering, Deep Learning — DNN, CNN, RNN, Transfer learning, LSTM, LLM, GenAI, RAG, Statistical Methods — Predictive analysis, Hypothesis Testing and Confidence Intervals, Principal Component Analysis, LDA and Dimensionality Reduction, Programming Languages and tools — Python, Java, HTML and CSS, Version Control Tools — Git, Python Libraries — Sklearn, Scipy, Statsmodel, Pandas, NumPy, Seaborn, Matplotlib, Selenium, BS4, NLTK, TensorFlow, Keras, pytorch, langchain, openai, hugginface, shap, Boto3, Flask, Fastapi, Streamlit, Scripting Language — Unix, Database Language — SQL, Data Reporting Tool — Excel, GCP Data Studio, Cloud Tools — AWS (sagemaker, s3, Lex, Lamda, Bedrock, RDS, EC2, API gateway, secret manager), GCP (Bigquery, | Dataproc, Dataflow, Vertex AI, cloud SQL, etc.)
Professional Experience
Dec 2022 – Aug 2024 | Remote, India
Oct 2021 – Dec 2022 | Pune, India
Dec 2018 – Oct 2021 | Pune, India
Certificates
Education
Apr 2021 – Apr 2022
Jul 2014 – Jul 2018