Skip to content

Shivi2599/Data_Analysis_Portfolio

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 

Repository files navigation

Shivangi Gupta - Data Analysis Portfolio

About

Hi, I'm Shivangi Gupta! I have a strong background in Mathematics and a passion for transforming data into actionable insights. Currently, I specialize in data analysis and visualization, leveraging tools like Python, MySQL, Excel, and Power BI to uncover trends and drive decision-making.

Skills and Tools

  • Programming & Data Analysis: Python (Pandas, NumPy, Tkinter)
  • Visualization: Power BI, Matplotlib, Seaborn, Plotly.express
  • Database Management: MySQL
  • Excel & Reporting: Pivot Tables, VLOOKUP, Conditional Formatting, Dashboards

I enjoy working with data to uncover patterns, improve decision-making, and drive business growth. Explore my projects to see how I apply analytics to real-world problems.

My Resume in pdf.

This is a repository to showcase skills, share projects and track my progress in Data Analytics related topics.

Table of Contents

Portfolio Projects

In this section I will list data analytics projects briefly describing the technology stack used to solve cases.

Analyzing the Factors affecting Student Performance

Code: Analyzing the Factors affecting Student Performance.ipynb

Goal: To determine what factors infuence student exam performance.

Description: The project focused on analyzing a dataset of students performance in exams. The dataset included study habits(Hours Studied, frequency of study sessions), attendance, parental involvement and socioeconomic factors. The project involved loading the data, cleaning and preprocessing it, performing exploratory data analysis (EDA), analyzing the correlation between factors and student exam performance.

Skills: Data cleaning, data analysis, correlation matrices, data visualization.

Technology: Python, Pandas, Numpy, Seaborn, Matplotlib.

Results:

  • Using Python functions the analysis revealed that attendance and number of hours studied are the most influential factors affecting exam scores.
  • Parental involvement, access to resources also have notable effects. Factors such as gender and school type do not significantly impact exam performance

E-Commerce Analysis

Goal: To analyze sales performance, customer behavior, and key revenue drivers in an e-commerce business.

Code: E-Commerce Analysis.ipynb

Description: This project involved analyzing an e-commerce sales dataset, which included product details, categories, prices, review scores, and monthly sales figures. The analysis covered data cleaning, exploratory data analysis (EDA), sales trends, customer segmentation, and revenue distribution. Key aspects included identifying top-selling products, seasonal trends, and the impact of review scores on sales performance.

Skills: Data cleaning, data analysis, correlation matrices,customer segmentation, sales trend analysis, data visualization.

Technology: Python, Pandas, Matplotlib, Seaborn.

Results:

  • Using Python functions revealed that Electronics has the highest total sales, while clothing has the lowest total sales.
  • Toys have the highest average review count.
  • Health products have the highest average price, which could suggest that this category deals with more premium or specialized items.
  • Books, Home & Kitchen, and Sports show balanced metrics across total sales, price, and review count.

Word Guessing Game

Code: Word Guessing Game.ipynb

Goal: To develop an interactive word guessing game with categorized difficulty levels and hints.

Description: This project involved building a Python-based word guessing game where players guess words from categories like celebrities, movies, fruits, and colors. The game includes easy, medium, and hard levels, along with hints to assist players. It was designed using Python functions, loops, and conditional statements to enhance user experience.

Skills: Python programming, user input handling, logic building, game development.

Technology: Python, json, os, Random module.

Results:

  • A fully functional game that provides an engaging and interactive experience, with dynamic difficulty levels and hints to enhance playability.

Covid-19 Data Analysis

Code: Covid-19 Data Analysis.sql

Description: The dataset contains records of covid-19 cases, deaths, and recovered by each country/region from January, 2020 to June, 2021. The analysis focused on data cleaning, mortality rate.

Skills: Data cleaning,data imputing, statistical analysis, exploratory data analysis(EDA)

Technology: MySQL Workbench, SQL

Instagram User Analytics

Code: Instagram User Analytics.sql

Goal: To analyze and understand the behavior, preferences, and interactions of specific users on the platform.

Description: The project focuses on individual user-level data and provides insights that can be valuable for personalized content, targeted marketing, and enhancing user engagement. We will efficiently extract relevant information such as user engagement metrics, demographic insights, and content performance.

Skills: Data cleaning, exploratory data analysis(EDA), trend analysis

Technology: MySQL Workbench, SQL

Online Retail Data Analysis

Code: Online Retail Data Analysis.pbix

Dashboard: Dashboard

Goal: To review the company data to provide key insights to help CEO and CMO make strategic decisions for coming year.

Description: An online retail store has hired you as a consultant to review their data and provide insights that would be valuable to the CEO and CMO of the business. The business has been performing well and the management wants to analyse what the major contributing factors are to the revenue so they can strategically plan for next year. Draft the relevant analytics and insights that would help evaluate the current business performance and suggest metrics that would enable them to make the decision on expansion.

Skills: data cleaning, data visualization, dashboards

Technology: Excel, Power BI

Forage PwC Power BI Job Simulation

Code: Forage-PwC-Power-BI-Job-Simulation.pbix

Goal: To provide data-driven insights for a telecom client through Power BI dashboards, helping the company optimize customer service, retention strategies, and diversity efforts.

Description: This project is part of the PwC Power BI Job Simulation, where I developed interactive dashboards for a leading telecom company. The goal was to analyze key business challenges and present actionable insights through data storytelling.

The project included three key tasks:

  • Customer Service Dashboard – Analyzed agent performance, customer satisfaction trends, and call handling metrics to optimize customer support.
  • Customer Retention Dashboard – Identified churn risk factors, customer lifetime value (CLV), and proactive retention strategies to improve customer retention.
  • Diversity & Inclusion Dashboard – Examined gender representation, promotion rates, and turnover trends to enhance diversity at the executive level. Each dashboard provided clear KPIs and data-driven narratives to support strategic decision-making for the company.

Skills: Data Cleaning & Transformation, Data Visualization & Storytelling, Customer Churn Analysis, Business Intelligence Reporting, Diversity & Inclusion Analytics

Technology: Power BI, Excel, Data Modeling, DAX, SQL

Results:

  • The dashboards enabled the telecom client to optimize customer service operations, reduce churn through proactive engagement, and implement targeted strategies to improve executive diversity.

MediBuddy Insurance Project

Code: MediBuddy Insurance.xls

Goal: Identify if the key factors like gender, BMI, age, smoking status, geographic location, and number of dependents influences insurance claims and policy costs.

Description: This project explores health insurance claims using Excel, focusing on key factors like age, BMI, smoking status, and dependents. It includes exploratory data analysis (EDA), statistical testing (regression), data visualization for estimating insurance costs.

Skills: Statistical Analysis, Reporting, Case study Analysis

Results:

  • Smokers have significantly higher insurance costs.
  • BMI and age play a crucial role in determining claim amounts.
  • Gender has minimal impact.
  • Geographic location does not significantly affect insurance charges.

Education

Panjab University, Chandigarh
Master of Science - Mathematics
June 2022

Panjab University, Chandigarh
Bachelors of Science - Mathematics
June 2020

Internships Undertaken

  • Labmentix Internship (March 2025 - Present)
  • Hype Intern (September 2024)
  • Glorivita CraftTech Solutions (August 2024)
  • Mentorness Community(June 2024 - July 2024)
  • Innobytes Services (May 2024 - June 2024)
  • Trainity (January 2024 - March 2024)

Certificates

Here's a list of the ones I have:

  • Basic Python Certificate | HackerRank
  • Intermediate SQL Certificate | HackerRank
  • Basic Problem-Solving | HackerRank
  • Training Certificate for Prompt Engineering for Generative AI | Internshala
  • Data Analytics Training Certificate | Trainity

Contacts

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors