Name: Atharv Jangam

Job Role: Data Engineer

Experience: 3 Years

Address: Bloomington, USA

Skills

SQL: 85%
Python: 90%
Data Visualization: 90%
Statistical Analysis: 85%
Machine Learning: 80%
Cloud Platforms (Azure, AWS): 85%
Big Data Technologies: 80%
Communication: 90%

About

About Me

With a master's degree in Data Science and experience as a Data Engineer/Analyst, I have a solid foundation in data analysis, Azure, and AWS. I am proficient in data engineering, ETL processes, data warehousing, SQL, Python, statistical analysis, hypothesis testing, customer behavior analysis, and machine learning. I have a record of leading impactful projects, providing actionable insights, and mentoring team members effectively.

  • Profile: Data Analyst & Engineer
  • Domain: Retail, Ecommerce, Healthcare & Startup
  • Education: Master of Science in Data Science
  • Language: English, Hindi, Marathi
  • Skills: Python, SQL, NoSQL, R, MySQL, PostgreSQL, MongoDB
  • Frameworks: PySpark, Hadoop, Pandas, Scikit-learn, TensorFlow, MLlib, Seaborn, Altair, DAX
  • BI Tools: Microsoft Power BI, Looker & Tableau
  • Interest: Traveling, Travel Photography, Teaching

LinkedIn

Resume

I am a Data Science graduate from Indiana University Bloomington with a strong foundation in data engineering and analysis, machine learning, and cloud computing. Recently certified as an Azure Data Engineer Associate (DP-203), I am passionate about leveraging data to drive business decisions and improve processes. I am currently seeking a data analyst role where I can apply my analytical skills and technical knowledge to contribute to impactful projects.

Experience


Feb 2024 - Present

Senior Consultant - ML

Indiana University Bloomington

Hoosier Community Networks, part of Indiana University, enhances the vitality and resilience of rural Indiana communities by partnering with residents, businesses, and organizations.

  • Developed a Collaborative Filtering machine learning model to recommend local services and businesses, increasing personalized engagement and boosting visibility by 32%.
  • Collaborated with a cross-functional team to gather data via web scraping, cleaned and visualized the data with Python, and created a user-friendly map, leading to over 1,000 community members accessing local resources.
  • Implemented Agile methodologies to streamline community outreach programs, enhancing team collaboration and efficiency.

Aug 2019 - Jul 2022

Data Engineer - Machine Learning

Stylopedia Technology Private Limited

Stylopedia is a dynamic startup that executes projects for third-party clients, with a focus on delivering innovative technological solutions.

  • Led the implementation of Azure Event Hub and Synapse for real-time data processing, reducing data latency to under 5 seconds and improving user experience by 73%.
  • Spearheaded the implementation of model training and deployment automation using Azure ML Studio, cutting retraining time by 50% and significantly boosting predictive accuracy.
  • Directed the integration of Gradient Boosting models into the data pipeline, doubling user engagement (2x repeat bookings) by enhancing personalization.
  • Managed the creation of ETL pipelines with Azure Data Factory, optimizing workflows to reduce processing time by 43% and cut costs by 18%.



Education


Aug 2022 - May 2024

Master of Science in Data Science

Indiana University Bloomington

GPA: 3.6/4

Aug 2017 - Jul 2021

BEng in Computer Engineering

University of Pune

CGPA: 8.27/10

Achievements

Here are some of my notable certifications and achievements.

Azure Data Engineer Associate (DP-203)

The Azure Data Engineer Associate (DP-203) certification demonstrates proficiency in integrating, transforming, and consolidating data using Azure services like Synapse Analytics, Data Factory, Stream Analytics, and Databricks. It covers skills in SQL, Python, Scala, and modern data architecture patterns.

Gen AI 360 Certificate by Activeloop

This course builds foundational skills in Retrieval Augmented Generation (RAG) for Production with LangChain & LlamaIndex. It emphasizes hands-on experience, ensuring proficiency in integrating RAG solutions into various applications to enhance data-driven decision-making and AI-powered functionalities.

Google Data Analytics Specialization

The Google Data Analytics Professional Certificate includes eight courses with hands-on, practice-based assessments. It prepares individuals for introductory roles in data analytics, with skills in spreadsheets, SQL, Tableau, and R.

Award-Winning Data Visualization Project

We won $1,500 in the AEI Lab's data visualization competition at Indiana University. Our project used Python for data cleaning and advanced visualization, with libraries including Pandas, Matplotlib, Seaborn, Plotly, Altair, Streamlit, Geopy, and Pydeck.

Data Analytics Consulting Virtual Internship

Completed KPMG's Data Analytics Consulting Virtual Internship, covering Data Quality Assessment, Data Insights, and Data Presentation. This program prepared me for roles in data analytics through hands-on, practice-based assessments.

Startup India Learning Program

Completed the free online entrepreneurship course by Startup India, developed by Invest India and UpGrad. This 4-week program includes lessons from 40+ top Indian founders, advancing entrepreneurial ideas through structured learning.

Projects

Below are some sample data science projects.

AdventureWorks Sales Analysis with Azure

Developed a Power BI dashboard for Adventure Works, highlighting Product Performance, Customer Demographics, and Sales Overview. Used PySpark in Databricks for ETL, transforming the data through a silver-to-gold pipeline.

Climate Change Analysis

Identified a 2°C global land temperature shift and presented mitigation strategies using advanced data visualization techniques in Tableau, including heat maps, tree maps, and histograms, analyzing 50 years of time series data.

DocuChat AI with RAG and GenAI

DocuChat AI is a document chat application that uses Retrieval-Augmented Generation (RAG) to provide efficient and accurate responses to user queries. This project leverages OpenAI's LLM, LangChain, and Pinecone for indexing and querying documents, enhancing query response times and overall system performance.


Healthcare Analytics - Using Machine Learning

Healthcare organizations face increasing pressure to enhance patient outcomes. This project uses healthcare analytics and machine learning, applying quantitative and qualitative data analysis, to predict patient Length of Stay (LOS).

Intelligent Personal Memory Assistant (IPMA)

The Intelligent Personal Memory Assistant (IPMA) helps users remember important information proactively. Using voice queries and a natural language user interface, IPMA can recall details about events and belongings and can schedule reminders.

ATS Resume Expert Using GenAI

This project helps job seekers improve their resumes by providing detailed analysis against job descriptions. The tool highlights the strengths and weaknesses of a resume, identifies missing keywords, and suggests areas for improvement to increase the chances of getting hired.


More projects on GitHub

I enjoy addressing business issues and discovering the untold stories in data.


GitHub

Contact

Contact Me

Below are the details to reach out to me!

Address

Bloomington, Indiana

Contact Number

+ 1 812 7785 857

Email Address

atharvjangam30@gmail.com




Let's Connect!

Feel free to reach out to me for collaborations, opportunities, or any inquiries. I'm always open to connecting with new people!