About me

I am Gauri, a Data Analytics graduate of Northeastern University and an AWS Certified Professional. I am interested in the full data analysis workflow: data collection, preparation and transformation, efficient storage, and data mining, leading to reports and visualizations that serve business requirements. I am open to roles in Data Analytics and Data Science where I can help organizations use business data and cloud services effectively to make data-driven decisions.

Academic Projects

  • All about data jobs

    This project presents a detailed report analyzing jobs in the data field by location, employment type, company size, and years of experience, using data pivoting and massaging techniques in Python pandas. All charts and plots are created in Datawrapper and hosted on Google Sites at the link below!

  • Sales data analysis

    This project presents a report that classifies inventory products as either worth continuing to sell or ready to discard, using machine learning techniques from Python libraries such as sklearn. Interactive visualizations are created with Flourish and Plotly Chart Studio and hosted on Google Sites at the link below!
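
As an illustration of the data pivoting mentioned for the data jobs project, here is a minimal sketch in pandas. The sample rows, the column names (`location`, `employment_type`, `years_experience`), and the helper `mean_experience` are hypothetical stand-ins, not the project's actual dataset or code.

```python
import pandas as pd

# Hypothetical sample rows; the real project's columns and values may differ.
jobs = pd.DataFrame(
    {
        "location": ["US", "US", "US", "IN", "IN"],
        "employment_type": ["FT", "FT", "PT", "FT", "FT"],
        "years_experience": [2, 6, 1, 3, 5],
    }
)


def mean_experience(df: pd.DataFrame) -> pd.DataFrame:
    """Pivot to one row per location and one column per employment type,
    holding the mean years of experience for each combination."""
    return df.pivot_table(
        values="years_experience",
        index="location",
        columns="employment_type",
        aggfunc="mean",
    )


summary = mean_experience(jobs)
# Combinations with no rows (here IN / PT) come out as NaN.
csv_text = summary.to_csv()  # CSV text that a charting tool could import
```

A wide table like `summary` is a convenient shape for a grouped bar chart, which is one reason to pivot before charting.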

Database Project

  • SQL Server Data ingestion
  • ERD Database design
  • SQL queries Data analysis
  • SQL views Power BI dashboard

This is a food delivery database project built with MS SQL Server and the MS Power BI reporting tool. The GitHub project is up to date with the restaurant database design document, the final ERD document, the SQL DDL statements that implement the database, and the SQL DML statements used to create views and visualizations in MS Power BI.

Web scraping Project

This repository contains the Python code for a web scraping application hosted on the cloud using the AWS Command Line Interface. The application scrapes web pages to collect data about various Disney movies. Reports are then built on top of the collected data for analytics.

AWS services used: AWS CLI, AWS S3, AWS QuickSight, AWS Glue
Python libraries used: BeautifulSoup, requests, boto3, json, pandas (DataFrames)
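
To give a feel for the scraping step, here is a minimal sketch using BeautifulSoup and json. The HTML fragment, its field names, the helper names, and the bucket and key arguments are all hypothetical; the actual pages, parsing rules, and storage layout used in the repository may differ. The upload helper is shown for orientation only and is not called here.

```python
import json

from bs4 import BeautifulSoup

# Hypothetical fragment standing in for a page fetched with requests.get(url).text.
SAMPLE_HTML = """
<table class="infobox">
  <tr><th>Title</th><td>Example Movie</td></tr>
  <tr><th>Release date</th><td>1 January 2000</td></tr>
  <tr><td>Row without a header cell</td></tr>
</table>
"""


def parse_infobox(html: str) -> dict:
    """Collect header/value pairs from table rows into a flat dict."""
    soup = BeautifulSoup(html, "html.parser")
    record = {}
    for row in soup.find_all("tr"):
        header = row.find("th")
        value = row.find("td")
        # Skip rows that do not have both a header cell and a value cell.
        if header is not None and value is not None:
            record[header.get_text(strip=True)] = value.get_text(strip=True)
    return record


def upload_record(record: dict, bucket: str, key: str) -> None:
    """Assumed upload step: store one record as JSON in S3 (needs AWS credentials)."""
    import boto3  # imported here so parsing works without boto3 installed

    boto3.client("s3").put_object(Bucket=bucket, Key=key, Body=json.dumps(record))


record = parse_infobox(SAMPLE_HTML)
payload = json.dumps(record)
```

Keeping parsing separate from fetching and uploading means the parsing logic can be checked against a saved HTML string without network access or AWS credentials.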