Skip to content
#

selection-bias

Here are 12 public repositories matching this topic...

Longform data analysis article arguing every “dataset” is actually three: Observed (captured rows), Missing (what should exist but doesn’t), and Excluded (what filters/joins/dropna removed). Includes dataset accounting, join-loss and missingness audits, segmentation checks, and practical templates to prevent biased KPIs and wrong conclusions.

  • Updated Feb 16, 2026

Bank marketing ML model (92% ROC-AUC) with XGBoost + Platt scaling. EDA-driven binning, handles 93% class imbalance, addresses data leakage & selection bias. 16 docs covering nuances & business impact.

  • Updated Dec 29, 2025
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the selection-bias topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the selection-bias topic, visit your repo's landing page and select "manage topics."

Learn more