Data Analyst
Technical Skills: Python, SQL, R, Tableau, Power BI
Education
-
Masters in Data Analytics |
Northeastern University (Aug 2025) |
-
Master of Science in Physics |
Pune University (May 2017) |
-
Bachelor of Science in Physics |
Pune University (May 2015) |
Work Experience
Data Analyst @ Madden Global Solutions (Jan 2025 - Present)
- Designed and implemented a Python-based ETL pipeline using pandas to automate Circana data processing into Google Looker Studio for a BI reporting system from scratch to track weekly KPIs. Enabled real-time reporting for 14 clients and reduced manual reporting by 80%.
- Built an integrated supply chain analytics platform using Excel Macros to achieve 98% in-stock rates and a 15% sales increase.
- Implemented a cost-benefit analysis framework using Excel to evaluate promotional campaigns, delivering a 235% ROI.
- Automated Excel-based financial budget watch reports by extracting and transforming SAP data to track $76 million in CPG shipments.
- Identified supply chain and compliance gaps at the store level, analyzing POS data using Excel, and recovered $3.9 million in lost sales for a client.
Graduate Teaching Assistant @ Northeastern University (Apr 2024 - Present)
- Provided academic support to 160 students in Machine Learning and Statistics, focusing on GLM, KNN models, regression, regularization, and hypothesis testing using Chi-Square, ANOVA, T-tests, etc., through personalized and group study sessions.
- Guided 60 students in understanding Hadoop Distributed File System (HDFS), and big data analytics using tools like Cloudera, Apache Spark, Apache Hadoop, Hue, and Impala, contributing to 70% of students achieving an A-grade.
Research Analyst @ Northeastern University (Oct 2024 - Apr 2025)
- Analyzed over 5 million U.S. job postings from Lightcast using R; engineered a cosine similarity–based deduplication pipeline on cleaned Document-Term Matrices, flagging over 10 thousand redundant postings with ≥95% match.
- Deployed an interactive Tableau dashboard linked to hosted datasets to visualize credentialing demand (certifications vs. degrees) across.
Data Analyst @ Madden Global Solutions (Jan 2025 - Present)
- Designed and implemented a Python-based ETL pipeline using pandas to automate Circana data processing into Google Looker Studio for a BI reporting system from scratch to track weekly KPIs. Enabled real-time reporting for 14 clients and reduced manual reporting by 80%.
- Built an integrated supply chain analytics platform using Excel Macros to achieve 98% in-stock rates and a 15% sales increase.
- Implemented a cost-benefit analysis framework using Excel to evaluate promotional campaigns, delivering a 235% ROI.
- Automated Excel-based financial budget watch reports by extracting and transforming SAP data to track $76 million in CPG shipments.
- Identified supply chain and compliance gaps at the store level, analyzing POS data using Excel, and recovered $3.9 million in lost sales for a client.
Subject Matter Expert @ Infinity Learn (Oct 2022 - Apr 2023)
- Collaborated with 10 vendors to develop over 6,000 assessments and more than 500 science articles, achieving a 93% acceptance rate during the content review process.
- Co-ordinated in-house content processes, including YouTube video reviews, assessments, and article creation, while leveraging Excel for
continuous performance analysis and feedback resulting in a performance rating of 4.0/4.0.
Subject Matter Expert @ LIDO Learning (Apr 2022 - Aug 2022)
- Led a 6-member product development team, utilizing JIRA, to build and design engaging EdTech solutions, resulting in error-free
instructional products with a 3.9/4 stakeholder satisfaction rating.
- • Utilized SQL to query student data from the LMS, driving a 25% improvement in outcomes through data-informed content creation.
Tutor Trainer @ LIDO Learning (May 2021 - Mar 2022)
- Mentored 200+ teachers and 400+ Business Development Associates, elevating instruction quality and resulting in improved customer retention and 18% new customer growth.
- Analyzed Tutor data using advanced Excel, created dashboards using Power BI to develop an Online Teaching Practices program using Articulate Rise, resulting in a 30% increase in tutor delivery ratings and driving a 15% revenue growth within a month.
Curriculum Designer @ LIDO Learning (Oct 2019 - Apr 2021)
- Designed and developed immersive EdTech products, collaborating with the product team on strategy and research, resulting in highly
engaging educational content that achieved a 3.95/4.00 stakeholder rating.
- Collaborated with the Data Analytics team to gather student data using LMS and perform analysis to drive student growth by 12%.
- Mentored and onboarded 4 new curriculum designers and helped them achieve a rating of 3.80/4.00 in their first month.
Program Associate @ LIDO Learning (Oct 2019 - Apr 2021)
- Managed stakeholder relationships and facilitated communication between the organization and 20 low-income public schools across the
city, ensuring alignment of services throughout the academic year.
- Implemented a comprehensive training program for 35 instructors, enhancing Math and Science delivery in Public Schools, resulting in improved academic performance and a 150% increase in client base.
- Led yearlong boot camp interventions in 30 low-income public schools to increase the basic math operation skills of middle school students by 65%.
- Organized the “Pune Science Festival 2018,” a two-day science exhibition, with the assistance of over 50 public school student volunteers, drawing in a crowd of over 2,500 visitors.
Projects
Engaging Worlds, Gold Medal - MIT Education Hackathon ‘24 – Python, Convai, Anthropic
Link
- Developed an AI-powered immersive learning platform that integrates interactive experiences with automated assessment across
multiple devices, including VR.
- Integrated NLP for real-time student interaction analysis, generating actionable insights for educators.
FIFA Players Analytics – R, Excel
Link
- Conducted extensive Exploratory Data Analysis (EDA) on 18,483 FIFA player records, identifying key correlations and patterns in player attributes, market values, and wages.
- Developed a Logistic regression model to classify players with high or low wages with a 0.93 AUC score and a regularization model to predict
players’ potential ratings with 98.92% accuracy using R.
- Performed hypothesis testing to compare overall ratings between European and South American players, revealing statistically significant differences (p-value <0.05) and providing insights for regional scouting strategies.
Kidney Failure Production – Python, SQL, Excel
Link
- Led a team of 4 members to build a Kidney Failure Prediction model using Logistic Regression and GBT models with 95.6% accuracy.
- Combined 6 datasets related to demographics, medical history, lab test, diet of the patients with over 143 variables using SQL to create
train and test datasets to build a model.
NYPD Crime Analysis – Excel, Tableau
Link
- Discovered that 20% of male perpetrators involved in molestation-related crimes are young adults after performing data cleaning and
Exploratory Data Analysis (EDA) using Excel.
- Developed interactive Tableau dashboards connecting various data sources and defining calculations to provide crime insights.
Drug Misuse Database Management System – SQL, R, Excel
Link
- Developed SQL database design to study drug consumption among different age groups over the period and deployed it on Azure.
- Utilized Excel for data transformation and R for data visualization of drug consumption patterns.
Analysis of Magneto Hydrodynamic Waves and Energy Transport in the Solar Corona - IDL
Link
- Investigated the coronal heating problem using Magneto-Hydrodynamics (MHD) to study energy transport and dissipation via waves in the solar atmosphere.
- Analyzed properties of propagating waves in the solar corona using data from the Solar Dynamics Observatory (SDO). Performed spatial and temporal image analysis on multi-wavelength observations using Interactive Data Language (IDL) and SolarSoft.
- Conducted Fourier analysis to study frequency distributions and power spectra at various coronal heights.