Skip to content

Latest commit

 

History

History
16 lines (8 loc) · 1006 Bytes

File metadata and controls

16 lines (8 loc) · 1006 Bytes

MVP

The goal of the project is to predict the success of a Kickstarter campaign.

The baseline classification model was built with small subset of features (goal in USD, in US, campaign duration) which has F1 score of 0.75 and ROC score of 0.65.

After experimenting with different subsets of features, the model performed best with all the features, which has F1 score of 0.77 and ROC score of 0.77. Confusion matrix shown as below.

Comparing different models, Random Forest outperforms the other models with an AUC score of 0.79 and Logistic Regression has the second highest AUC score.

My next step is to try tuning the hyperparameters for Logistic Regression and Random Forest models to see if I can get a better result. I will also try out XGBoost for any improvement on model performance.