Simple ETL pipeline to extract information from CSV, LOG, JSON files and load it into MySQL database using Python and SQL language.
-
Updated
Feb 23, 2024 - Python
Simple ETL pipeline to extract information from CSV, LOG, JSON files and load it into MySQL database using Python and SQL language.
Exploratory Data Analysis to uncover factors data lead to employee attrition.
Wrangling the WeRateDogs datasets to showcase data gathering, assessing, cleaning, and documentation skills.
A total package of what data science is all about. from dashboard building to data wrangling, sql, data collection, vizualization, webscrapping to presentaion.
Movies data analysis to produce visuals and insights about the data-set of 10,000 movies.
Capstone project of Udacity Data Analyst Nanodegree. Focus on advanced visualizations to explore data and to communicate insights and patterns. Final slide deck is made with Jupyter notebook with interactive HTML slides (based on reveal.js).
Ford GoBike 2019 Dataset is a dataset for the bikeshare system, in this study I have presented the data on the slides file as a part of the visualization Learning process of the Data Analysis Nanodegree of Udacity.
A SQL project that extracts insights from a baseball database using advanced queries, window functions and aggregations to analyze baseball player performance, team statistics, and salary data to support strategic decision-making in player recruitment, budget allocation, and performance evaluation.
Pipeline for curating intracranial EEG data (2022-2025)
project in Udacity Data Analyst Nanodegree. This project focused on advanced data gathering (several sources incl twitter API), wrangling and cleaning of data. Plus 2 reports.
This project, carried out in Jupyter Notebook, aims to explore the main Data Analysis techniques with Python tools. Pandas, Numpy, Seaborn, Matplotlib, Plotly and sklearn are used. Divided into three notebooks, I separate the data cleaning, data analysis and machine learning part. For more details and goals, see README
Machine learning, signal processing pipeline used to identify song name from user input (hum/whistle to song).
Factors that affect manufacturing GDP SA perspective
🌌 Data analysis to answer questions about one of the most successful movie franchises of all time: Star Wars.
Data Wrangling Project from the Udacity Data Analytics Nano Degree
I study how reported economic growth was undergoing change in 2004 to 2019 testing for the number of criminal records and other characteristics.
Predictive Model for BRENT price movements
Coursera Data Science Specialization Capstone Project
Add a description, image, and links to the wrangling-cleaning topic page so that developers can more easily learn about it.
To associate your repository with the wrangling-cleaning topic, visit your repo's landing page and select "manage topics."