Tools: Python · Pandas · Matplotlib · Seaborn · Power BI
Dataset: IBM HR Analytics Dataset (Kaggle) — 1,470 employees
Status: ✅ Complete
Why are employees leaving the company? This analysis identifies the key drivers of attrition and provides data-backed retention recommendations for the HR team.
- Data Cleaning — removed irrelevant columns, handled missing values
- EDA — Python (Pandas, Matplotlib, Seaborn)
- Dashboard — Power BI interactive dashboard
- Overall attrition rate is 16.1% — 237 out of 1,470 employees left
- Sales department has the highest attrition rate
- Employees working OverTime are 3x more likely to leave
- Age group 18-25 has the highest attrition rate
- Employees who left earned $2,000 less per month than those who stayed
- Reduce overtime — strongest predictor of attrition
- Focus retention on Sales dept — highest risk department
- Review salary bands for junior employees — income gap is significant
| File | Description |
|---|---|
notebooks/analysis.ipynb |
Full Python analysis |
outputs/chart1_department.png |
Attrition by department |
outputs/chart2_overtime.png |
Overtime impact |
outputs/chart3_age_group.png |
Attrition by age group |
outputs/chart4_job_satisfaction.png |
Job satisfaction analysis |
outputs/chart5_income.png |
Income distribution |
outputs/dashboard_screenshot.png |
Power BI dashboard |
