Covid-19 Analysis
Covid-19 Analysis is a comprehensive project that examines various aspects of COVID-19, including cases, deaths, vaccinations, policies, mortality, and mobility. The goal is to analyze the impact of COVID-19 globally and on a country-wise level, assessing how different regions handled the pandemic.
- To provide a detailed analysis of COVID-19 trends worldwide.
- To assess the effectiveness of policies and healthcare responses.
- To offer forecasting for the next 30 days using machine learning techniques.
- To facilitate clustering and regression analysis for deeper insights.
The primary motivation behind this project was to build an end-to-end data analysis solution using real-world data. This project not only enhanced my technical, analytical, and machine learning skills but also provided a deeper understanding of COVID-19 trends. Inspired by "Our World in Data," I decided to develop an interactive dashboard with advanced analytical capabilities.
- Case fatality rate (CFR) analysis
- Weekly/Biweekly growth trends
- Cases and deaths per million
- Policy impact on cases, deaths, mobility, and vaccination
- Reproduction rate trends
- Testing efficiency and healthcare capacity
- Mobility trends across different countries
- Vaccination trends, attitudes, and manufacturer statistics
- Excess mortality trends and projections
- Forecasting: 30-day predictions for COVID-19 trends
- Clustering: Grouping similar countries based on COVID-19 response
- Regression Analysis: Finding correlations between policies, cases, deaths, and vaccinations
- Language: Python (3.9.x)
- Frameworks & Libraries: Dash, Plotly, Pandas, NumPy, Scikit-learn, Matplotlib, Seaborn
- Development Environment: PyCharm
- Python Version: 3.9.x
- Minimum RAM: 8GB (16GB recommended for optimal performance)
- Required Libraries: Install via
pip install -r requirements.txt
- Clone or download the project repository.
- Install the required dependencies:
pip install -r requirements.txt
- Download the necessary dataset from Google Drive: COVID-19 Data
- Extract the two folders from the downloaded data and place them inside the
Covid_19_Projectfolder. - Run the dashboard:
python dashboard.py
- The dashboard will launch in your browser, ready for analysis.
- Left Panel: Contains filters (country selection, analysis type).
- Right Panel: Displays visualizations (charts, maps, tables).
- Interactivity: Users can switch between different types of analyses.
- Data Availability: If no visualization appears, the dataset may not contain data for that country.
Primary Data Source: Our World in Data
- Vaccination Data:
- Mathieu, E., Ritchie, H., Ortiz-Ospina, E. et al. A global database of COVID-19 vaccinations. Nat Hum Behav (2021). https://doi.org/10.1038/s41562-021-01122-8
- Testing Data:
- Hasell, J., Mathieu, E., Beltekian, D. et al. A cross-country database of COVID-19 testing. Sci Data 7, 345 (2020). https://doi.org/10.1038/s41597-020-00688-8
Covid_19_Project/
├── Analysis_Scripts/ # All analysis scripts
├── Cleaned_data/ # Processed datasets
├── data_cleaning_scripts/ # Data cleaning scripts
├── ML_Models/ # Machine learning models and scripts
├── OWID_Covid_Data/ # Original datasets (Download separately)
├── Images/ # Dashboard snapshots
├── dashboard.py # Main entry point
CasesDeathAnalysisVaccinationAnalysisExcessMortalityAnalysisMobilityAnalysisPolicyAnalysisTestingHealthcareAnalysis
- Class Structure & Library Selection: Choosing the best structure for classes and deciding between Matplotlib/Seaborn vs. Plotly for visualization.
- Dash Integration: Creating a dynamic and interactive layout while maintaining performance.
- Data Processing: Cleaning and merging large datasets efficiently.
- Machine Learning Implementation: Integrating forecasting and clustering models into the analysis.
Below are some snapshots of the dashboard visualizations:
- Cases and Deaths per Million by Country
- Case Fatality Rate by Country
- Case Fatality Rate - Map View
- Case Fatality Rate - Table View
- Health Capacity Over Time in Australia
- Mobility vs. Case Growth in Argentina
- Policy Effectiveness by Country
- Testing Rates by Country
- Testing Rates and Healthcare Capacity by Country
- Vaccination Rates Over Time in Argentina
- This project is free to use.
- Users are encouraged to modify and extend it as needed.
- I would appreciate credit if someone uses or builds upon this project.