A course project (MA4240: Applied Statistics) on US demographics and cardiovascular diseases using the BRFSS (Behavorial Risk Factor Surveillance System) 2021 survey data by CDC (Centers for Disease Control and Prevention).
The project involves the following:
- Analysis of US demographic data: confidence interval estimation for heights and weights of the US population.
- Analysis of proportion of population involved in various health and lifestyle factors that may lead to heart diseases, like diabetes, age, smoking, alcohol consumption, and so on.
- Hypothesis testing on normally distributed numerical features.
- Hypothesis testing on categorical features: association of factors like diabetes, arthritis, and age with heart diseases.