Hi, I'm Shivangi Gupta! I have a strong background in Mathematics and a passion for transforming data into actionable insights. Currently, I specialize in data analysis and visualization, leveraging tools like Python, MySQL, Excel, and Power BI to uncover trends and drive decision-making.
- Programming & Data Analysis: Python (Pandas, NumPy, Tkinter)
- Visualization: Power BI, Matplotlib, Seaborn, Plotly.express
- Database Management: MySQL
- Excel & Reporting: Pivot Tables, VLOOKUP, Conditional Formatting, Dashboards
I enjoy working with data to uncover patterns, improve decision-making, and drive business growth. Explore my projects to see how I apply analytics to real-world problems.
My resume (PDF).
This repository showcases my skills, shares my projects, and tracks my progress in data analytics.
In this section, I list my data analytics projects and briefly describe the technology stack used for each.
Code: Analyzing the Factors affecting Student Performance.ipynb
Goal: To determine which factors influence student exam performance.
Description: The project analyzed a dataset of student exam performance. The dataset included study habits (hours studied, frequency of study sessions), attendance, parental involvement, and socioeconomic factors. The work involved loading the data, cleaning and preprocessing it, performing exploratory data analysis (EDA), and analyzing the correlation between each factor and exam performance.
Skills: Data cleaning, data analysis, correlation matrices, data visualization.
Technology: Python, Pandas, NumPy, Seaborn, Matplotlib.
Results:
- The analysis revealed that attendance and number of hours studied are the most influential factors for exam scores.
- Parental involvement and access to resources also have notable effects, while factors such as gender and school type do not significantly impact exam performance.
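The correlation step described above can be sketched in plain Python. This is a minimal illustration, not the notebook's actual code: the column names, the toy values, and the helper `rank_factors` are all hypothetical. In the notebook itself, pandas' `DataFrame.corr()` computes the same Pearson coefficient by default.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

def rank_factors(data, target):
    """Rank every non-target column by the absolute value of its correlation with target."""
    scores = {
        name: pearson(values, data[target])
        for name, values in data.items()
        if name != target
    }
    return sorted(scores.items(), key=lambda item: abs(item[1]), reverse=True)

# Hypothetical toy data, not taken from the project dataset.
students = {
    "hours_studied": [5, 10, 15, 20, 25],
    "attendance": [60, 75, 70, 90, 100],
    "exam_score": [55, 62, 70, 78, 85],
}

ranking = rank_factors(students, "exam_score")
for factor, r in ranking:
    print(f"{factor}: r = {r:.2f}")
```

Ranking by absolute value keeps strongly negative relationships near the top as well. Correlation alone does not establish that a factor causes higher scores.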
Goal: To analyze sales performance, customer behavior, and key revenue drivers in an e-commerce business.
Code: E-Commerce Analysis.ipynb
Description: This project involved analyzing an e-commerce sales dataset, which included product details, categories, prices, review scores, and monthly sales figures. The analysis covered data cleaning, exploratory data analysis (EDA), sales trends, customer segmentation, and revenue distribution. Key aspects included identifying top-selling products, seasonal trends, and the impact of review scores on sales performance.
Skills: Data cleaning, data analysis, correlation matrices, customer segmentation, sales trend analysis, data visualization.
Technology: Python, Pandas, Matplotlib, Seaborn.
Results:
- Electronics has the highest total sales, while Clothing has the lowest.
- Toys have the highest average review count.
- Health products have the highest average price, which could suggest that this category deals with more premium or specialized items.
- Books, Home & Kitchen, and Sports show balanced metrics across total sales, price, and review count.
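Per-category comparisons like the ones above come from grouping rows by category and aggregating each group. The sketch below shows that idea with the standard library only. The field names (`category`, `price`, `review_count`, `sales`) and the sample rows are assumptions for illustration, not the project's data. With pandas, a `groupby` followed by `agg` would do the same job.

```python
from collections import defaultdict

# Hypothetical rows; field names and values are invented for illustration.
products = [
    {"category": "Electronics", "price": 300.0, "review_count": 120, "sales": 5000},
    {"category": "Electronics", "price": 500.0, "review_count": 80, "sales": 7000},
    {"category": "Clothing", "price": 40.0, "review_count": 200, "sales": 900},
    {"category": "Toys", "price": 25.0, "review_count": 400, "sales": 2000},
]

def summarize_by_category(rows):
    """Return total sales, average price, and average review count per category."""
    groups = defaultdict(list)
    for row in rows:
        groups[row["category"]].append(row)

    summary = {}
    for category, items in groups.items():
        count = len(items)
        summary[category] = {
            "total_sales": sum(item["sales"] for item in items),
            "avg_price": sum(item["price"] for item in items) / count,
            "avg_review_count": sum(item["review_count"] for item in items) / count,
        }
    return summary

summary = summarize_by_category(products)

# Category with the highest total sales in the toy data.
top_category = max(summary, key=lambda name: summary[name]["total_sales"])
print(top_category, summary[top_category])
```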
Code: Word Guessing Game.ipynb
Goal: To develop an interactive word guessing game with categorized difficulty levels and hints.
Description: This project involved building a Python-based word guessing game where players guess words from categories like celebrities, movies, fruits, and colors. The game includes easy, medium, and hard levels, along with hints to assist players. It was designed using Python functions, loops, and conditional statements to enhance user experience.
Skills: Python programming, user input handling, logic building, game development.
Technology: Python (json, os, and random modules).
Results:
- A fully functional game that provides an engaging and interactive experience, with dynamic difficulty levels and hints to enhance playability.
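The core loop of such a game can be sketched as follows. This is a simplified illustration under stated assumptions, not the notebook's code: the word bank, the hints, the three-attempt limit, and the function names `pick_word`, `is_correct`, and `play_round` are all hypothetical.

```python
import random

# Hypothetical word bank: category -> level -> list of (word, hint) pairs.
WORD_BANK = {
    "fruits": {
        "easy": [("apple", "Said to keep the doctor away")],
        "hard": [("pomegranate", "Red fruit full of seeds")],
    },
    "colors": {
        "easy": [("red", "Color of a stop sign")],
        "hard": [("turquoise", "Blue-green gemstone color")],
    },
}

def pick_word(category, level, rng=random):
    """Pick a random (word, hint) pair for the given category and level."""
    return rng.choice(WORD_BANK[category][level])

def is_correct(guess, answer):
    """Compare a guess with the answer, ignoring case and surrounding spaces."""
    return guess.strip().lower() == answer.lower()

def play_round(word, hint, guesses, max_attempts=3):
    """Consume up to max_attempts guesses; return True if one of them is correct."""
    for attempt, guess in zip(range(1, max_attempts + 1), guesses):
        if is_correct(guess, word):
            return True
        if attempt == 1:
            # Reveal the hint after the first wrong guess.
            print(f"Hint: {hint}")
    return False

word, hint = pick_word("fruits", "easy")
won = play_round(word, hint, ["pear", " Apple "])
print("You win!" if won else f"The word was {word}.")
```

Passing the guesses in as an iterable keeps the logic easy to test. An interactive version could supply them from `input()` instead of a list.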
Code: Covid-19 Data Analysis.sql
Description: The dataset contains records of COVID-19 cases, deaths, and recoveries for each country/region from January 2020 to June 2021. The analysis focused on data cleaning and mortality rates.
Skills: Data cleaning, data imputation, statistical analysis, exploratory data analysis (EDA)
Technology: MySQL Workbench, SQL
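A mortality-rate query of the kind this project relies on might look like the sketch below. It runs the SQL through Python's built-in `sqlite3` module as a stand-in for MySQL so the example is self-contained. The table name `covid_cases`, its columns, and the sample rows are assumptions, and each row is assumed to hold counts for one reporting period so that summing them is meaningful. `SUM`, `NULLIF`, and `ROUND` are also available in MySQL, so the query should carry over with little or no change.

```python
import sqlite3

# In-memory SQLite database used as a stand-in for MySQL.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE covid_cases ("
    "country TEXT, confirmed INTEGER, deaths INTEGER, recovered INTEGER)"
)

# Hypothetical sample rows, not real COVID-19 figures.
conn.executemany(
    "INSERT INTO covid_cases VALUES (?, ?, ?, ?)",
    [
        ("A", 1000, 20, 900),
        ("A", 1000, 30, 950),
        ("B", 500, 5, 400),
        ("C", 0, 0, 0),
    ],
)

# Mortality rate = deaths / confirmed cases, as a percentage.
# NULLIF turns a zero denominator into NULL instead of a division error.
query = """
SELECT country,
       SUM(deaths) AS total_deaths,
       SUM(confirmed) AS total_confirmed,
       ROUND(100.0 * SUM(deaths) / NULLIF(SUM(confirmed), 0), 2) AS mortality_rate_pct
FROM covid_cases
GROUP BY country
ORDER BY country
"""

rows = conn.execute(query).fetchall()
for row in rows:
    print(row)
```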
Code: Instagram User Analytics.sql
Goal: To analyze and understand the behavior, preferences, and interactions of specific users on the platform.
Description: The project focuses on individual user-level data and provides insights that can be valuable for personalized content, targeted marketing, and enhancing user engagement. We will efficiently extract relevant information such as user engagement metrics, demographic insights, and content performance.
Skills: Data cleaning, exploratory data analysis (EDA), trend analysis
Technology: MySQL Workbench, SQL
Code: Online Retail Data Analysis.pbix
Dashboard: Dashboard
Goal: To review the company's data and provide key insights that help the CEO and CMO make strategic decisions for the coming year.
Description: An online retail store has hired you as a consultant to review their data and provide insights that would be valuable to the CEO and CMO of the business. The business has been performing well and the management wants to analyse what the major contributing factors are to the revenue so they can strategically plan for next year. Draft the relevant analytics and insights that would help evaluate the current business performance and suggest metrics that would enable them to make the decision on expansion.
Skills: data cleaning, data visualization, dashboards
Technology: Excel, Power BI
Code: Forage-PwC-Power-BI-Job-Simulation.pbix
Goal: To provide data-driven insights for a telecom client through Power BI dashboards, helping the company optimize customer service, retention strategies, and diversity efforts.
Description: This project is part of the PwC Power BI Job Simulation, where I developed interactive dashboards for a leading telecom company. The goal was to analyze key business challenges and present actionable insights through data storytelling.
The project included three key tasks:
- Customer Service Dashboard – Analyzed agent performance, customer satisfaction trends, and call handling metrics to optimize customer support.
- Customer Retention Dashboard – Identified churn risk factors, customer lifetime value (CLV), and proactive retention strategies to improve customer retention.
- Diversity & Inclusion Dashboard – Examined gender representation, promotion rates, and turnover trends to enhance diversity at the executive level.
Each dashboard provided clear KPIs and data-driven narratives to support strategic decision-making for the company.
Skills: Data Cleaning & Transformation, Data Visualization & Storytelling, Customer Churn Analysis, Business Intelligence Reporting, Diversity & Inclusion Analytics
Technology: Power BI, Excel, Data Modeling, DAX, SQL
Results:
- The dashboards enabled the telecom client to optimize customer service operations, reduce churn through proactive engagement, and implement targeted strategies to improve executive diversity.
Code: MediBuddy Insurance.xls
Goal: To identify whether key factors such as gender, BMI, age, smoking status, geographic location, and number of dependents influence insurance claims and policy costs.
Description: This project explores health insurance claims using Excel, focusing on key factors like age, BMI, smoking status, and dependents. It includes exploratory data analysis (EDA), statistical testing (regression), and data visualization for estimating insurance costs.
Skills: Statistical Analysis, Reporting, Case study Analysis
Results:
- Smokers have significantly higher insurance costs.
- BMI and age play a crucial role in determining claim amounts.
- Gender has minimal impact.
- Geographic location does not significantly affect insurance charges.
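The regression step was done in Excel, but the underlying calculation can be sketched in Python. The example below fits a one-variable least-squares line of charges against a smoker indicator. The data values and the function name `fit_line` are hypothetical, and the project's actual regression may have used more predictors. Excel's `SLOPE` and `INTERCEPT` functions compute the same two quantities.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = intercept + slope * x for one predictor."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (
        sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
        / sum((x - mean_x) ** 2 for x in xs)
    )
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical toy data: 1 = smoker, 0 = non-smoker; charges in arbitrary currency units.
smoker = [0, 0, 1, 1]
charges = [4000, 6000, 20000, 24000]

slope, intercept = fit_line(smoker, charges)
print(f"baseline charge: {intercept:.0f}, extra charge for smokers: {slope:.0f}")
```

With a 0/1 predictor, the intercept is the average charge for non-smokers and the slope is the difference between the two group averages, which makes the coefficient easy to interpret.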
Panjab University, Chandigarh
Master of Science - Mathematics
June 2022
Panjab University, Chandigarh
Bachelor of Science - Mathematics
June 2020
- Labmentix Internship (March 2025 - Present)
- Hype Intern (September 2024)
- Glorivita CraftTech Solutions (August 2024)
- Mentorness Community (June 2024 - July 2024)
- Innobytes Services (May 2024 - June 2024)
- Trainity (January 2024 - March 2024)
Here's a list of the certificates I have:
- Basic Python Certificate | HackerRank
- Intermediate SQL Certificate | HackerRank
- Basic Problem-Solving | HackerRank
- Training Certificate for Prompt Engineering for Generative AI | Internshala
- Data Analytics Training Certificate | Trainity
- LinkedIn: @ShivangiGupta
- Gmail: Shivangigupta.2599@gmail.com