Skip to content
View Karant15's full-sized avatar

Block or report Karant15

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Karant15/README.md

Hi, I'm Karan Trivedi

MS Data Analytics | Webster University (Dec 2024) Piscataway, NJ | Open to US opportunities (Remote & Relocate) | STEM OPT Active


What I Do

I build data-driven solutions that solve real business problems - not just notebooks that sit on a laptop.

7+ years of combined experience across healthcare, recruitment, and business analytics - including managing data relationships with 30+ NHS hospitals in the UK. Now applying that domain knowledge to data science.

Lean Six Sigma Black Belt - I don't just find problems in data. I frame them as business solutions.


Open To

Track 1 - Analytics: Data Analyst · Business Analyst · Healthcare Analyst · BI Analyst Track 2 - Healthcare Recruitment: Account Manager · Client Success Manager · Workforce Analytics Manager

7+ years managing NHS hospital accounts + MS Data Analytics + LSS Black Belt = rare combination for both tracks


12 Projects | May-September 2026

Building in public. One new deployment every 2 weeks.

# Project Type Status Live
1 Healthcare Workforce Analytics Dashboard Python · Streamlit Live Open
2 Supply Chain KPI Dashboard + DMAIC + SQL Python · SQL · Streamlit Live Open
3 Healthcare Readmission ML Pipeline XGBoost · SHAP · Streamlit Live Open
4 SQL Business Analytics Dashboard SQL · SQLite · Python · Streamlit Live Open
5 Supply Chain Power BI Dashboard Power BI · DAX Building Releasing May 2026
6 HR Attrition ML Pipeline + SHAP XGBoost · SHAP · SQL · Streamlit Planned Releasing June 2026
7 Demand Forecasting ML Prophet · ARIMA · XGBoost · SQL Planned Releasing June 2026
8 LLM Chat With Data Tool LangChain · OpenAI · SQL · Streamlit Planned Releasing June 2026
9 Finance Fraud Detection ML XGBoost · SHAP · SQL · Streamlit Planned Releasing July 2026
10 Resume Analyzer AI Tool LangChain · Hugging Face · SQL Planned Releasing July 2026
11 Healthcare RAG Document Q&A LangChain · ChromaDB · RAG Planned Releasing August 2026
12 Cricket Analytics Dashboard Python · Plotly · SQL · Streamlit Planned Releasing August 2026

Featured Projects

Healthcare Workforce Analytics Dashboard - LIVE

Analyzed 9.6M real US Medicare records to identify physician staffing gaps across all 50 states

  • Processed 1.1M unique providers across 104 medical specialties
  • Built interactive 5-tab Streamlit dashboard with US choropleth maps
  • Applied Lean Six Sigma DMAIC framework to structure recruitment gap analysis
  • Identified Wyoming (97.7%), Vermont and Alaska as most critically underserved states
  • Full analysis run locally on 9.6M records - dashboard shows 50k representative sample
  • Live: https://karan-healthcare-analytics.streamlit.app
  • Stack: Python · Pandas · Plotly · Streamlit · CMS Medicare Data

SQL Business Analytics Dashboard - LIVE

25 advanced SQL queries across healthcare and supply chain real datasets - window functions, CTEs, subqueries

  • 6 basic, 9 intermediate, 9 advanced SQL queries across two real datasets
  • Window functions: RANK DENSE_RANK ROW_NUMBER NTILE LAG LEAD running totals moving averages
  • CTEs for ABC inventory classification and cross-domain state comparison
  • Live SQL Query Explorer - write and run any SQL against real database in browser
  • Cross-domain analysis combining CMS Medicare + DataCo supply chain in one SQLite DB
  • Live: https://karan-sql-analytics.streamlit.app/
  • Stack: SQL · SQLite · Python · Plotly · Streamlit

Healthcare Readmission ML Pipeline - LIVE

End-to-end ML pipeline predicting 30-day hospital readmission risk - 101,745 real patient records

  • Trained and compared 4 models: Logistic Regression, Random Forest, Gradient Boosting, XGBoost
  • XGBoost selected with ROC-AUC 0.598 - best balance for imbalanced medical data
  • SMOTE oversampling to handle 11.2% minority class imbalance
  • SHAP explainability showing clinicians exactly why a patient is flagged high risk
  • Live: https://karan-healthcare-ml.streamlit.app
  • Stack: Python · XGBoost · SHAP · SMOTE · Streamlit · UCI Diabetes Dataset

Supply Chain KPI Dashboard + DMAIC + SQL - LIVE

Analyzed 180,519 real orders - found that 57% of deliveries are late across 23 global regions

  • Only 42.7% on-time delivery rate - Central Africa worst at 60.7% late rate
  • 15 SQL queries via SQLite covering late rates, revenue, customer segments
  • ABC inventory segmentation identifying Class A products driving 80% of revenue
  • Full DMAIC Six Sigma structured analysis - Define through Control
  • Live: https://karan-supply-chain.streamlit.app
  • Stack: Python · SQL · SQLite · Plotly · Streamlit · DMAIC

Human Capital Analysis

Predicting employee turnover to reduce hiring costs

  • Analyzed 15,000+ employee records using Logistic Regression and Decision Trees
  • Achieved 90% prediction accuracy - job satisfaction identified as top turnover driver
  • Stack: R · Logistic Regression · Decision Trees · k-NN · SVM
  • Repo: Human-Capital-Analysis

Bank Loan Risk Model

Loan default prediction reducing misclassification cost by $3M

  • Built Logistic Regression and Decision Tree models on 5,960 loan applicants
  • Improved sensitivity to 80.65%, reducing false negatives
  • Stack: R · Logistic Regression · Decision Trees
  • Repo: Bank-Loan-Decision-Making-Analysis

Consumer Segmentation Analysis

Customer segmentation and brand loyalty prediction

  • Segmented 600 consumer profiles using K-Means clustering
  • Built for AXANTEUS market research agency
  • Stack: R · K-Means · Random Forest · Logistic Regression
  • Repo: Consumer-Segmentation-Analysis

Currently Learning

Course Platform Status
Data Analysis: SQL · Power BI · Tableau · Excel Udemy SQL complete - Power BI in progress
Google Data Analytics Professional Certificate Coursera In progress
Microsoft PL-300 Power BI Associate Microsoft Learn In progress
Unilever Supply Chain Analytics Coursera In progress

Tech Stack

Languages:        Python · R · SQL (basic through advanced window functions and CTEs)
Visualization:    Plotly · Streamlit · Power BI · Tableau · Seaborn
ML/Analytics:     Scikit-learn · XGBoost · SHAP · Logistic Regression · Decision Trees
                  Random Forest · Clustering · Time Series · Predictive Modeling
Imbalance:        SMOTE (imbalanced-learn)
Database:         SQL · SQLite · PostgreSQL · MySQL · Excel (Advanced) · DAX
AI/LLM:           LangChain · OpenAI API · Hugging Face · ChromaDB RAG (coming soon)
Process:          Lean Six Sigma Black Belt · DMAIC · SIPOC · RCA · FMEA
Domain:           Healthcare · Supply Chain · HR Analytics · Finance · Recruitment

Certifications

  • Lean Six Sigma Black Belt - Benchmark Six Sigma (2021)
  • Lean Six Sigma Green Belt - Benchmark Six Sigma (2021)
  • MS Data Analytics - Webster University (Dec 2024) | GPA 3.31
  • Google Data Analytics - Coursera (in progress)
  • Microsoft PL-300 Power BI - Microsoft Learn (in progress)

Experience Highlights

Senior Accounts Manager - ID Medical LLP (Healthcare Staffing, UK) Managed data relationships with 30+ NHS hospitals · Improved forecasting accuracy 25% · 15% YoY revenue growth

Senior Recruitment Consultant - QX KPO Services 453 shifts booked in one month · £25,000 revenue · Led 4-member analytics team

International Peer Mentor & Writing Coach - Webster University CRLA Level 2 Certified · Improved student outcomes 94%


Let's Connect

krntrivedi@gmail.com LinkedIn Healthcare Dashboard Supply Chain Dashboard ML Pipeline GitHub


12 projects in progress. One new deployment every 2 weeks. Check back soon.

Popular repositories Loading

  1. Human-Capital-Analysis Human-Capital-Analysis Public

    Predicting employee turnover using Logistic Regression, Decision Tree, k-NN & SVM on 14,999 employees. Decision Tree achieved 97% accuracy & 0.97 AUC. Built in R. Dataset: 10 attributes.

    R

  2. Bank-Loan-Decision-Making-Analysis Bank-Loan-Decision-Making-Analysis Public

    Predicting home improvement loan defaults using Logistic Regression & Decision Tree in R. 77.47% accuracy, 80.65% sensitivity, $1.165M cost reduction. Dataset: 5,960 applicants | 13 variables.

    R

  3. Consumer-Segmentation-Analysis Consumer-Segmentation-Analysis Public

    Consumer segmentation & brand loyalty prediction for 600 profiles using K-Means clustering, Logistic Regression & Random Forest. Built for AXANTEUS market research agency. Built in R.

    R

  4. Karant15 Karant15 Public

  5. Healthcare-Workforce-Analytics Healthcare-Workforce-Analytics Public

    Analyzing 9.6M US Medicare records to identify physician staffing gaps and recruitment priorities across 50 states

  6. Supply-Chain-Analytics Supply-Chain-Analytics Public

    Analyzed 180,519 supply chain orders to identify delivery failures and inventory gaps using Lean Six Sigma DMAIC framework