Skip to content
View AjayTiwari94's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report AjayTiwari94

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
AjayTiwari94/README.md
Typing SVG

B.Tech, Computer Science & Engineering (Artificial Intelligence)
Noida Institute of Engineering & Technology  |  Batch 2022-26  |  CGPA: 8.63

   

About Me

Fresher Data Analyst with hands-on experience in SQL, Python, Power BI, and Excel. Built three end-to-end analytics projects covering payment fraud detection, customer churn segmentation, and workforce trend analysis; each grounded in real datasets with verified findings. Two months of industry experience at Ethara AI (Gurugram) evaluating LLM outputs across multiple models at scale.

Currently open to Data Analyst and Business Analyst roles across Noida, Gurugram, Bengaluru, Mumbai, and Chennai.


Tech Stack

Languages & Querying

Python MySQL PostgreSQL

Visualization & BI

Power BI Tableau Matplotlib Seaborn

Libraries & Tools

Pandas NumPy Excel Jupyter Git

Analytics Capabilities

Customer Segmentation Churn Modeling Fraud Detection KPI Reporting EDA Trend Analysis


Experience

LLM Post-Training Intern, Data Analysis & Model Evaluation

Ethara AI  |  Gurugram  |  Feb 2026 to Apr 2026

  • Rated 300+ LLM outputs on instruction following, truthfulness, prompt adherence, and verbosity across 5 project types including STEM reasoning at JEE Advanced level; daily output scaled from 20 to 50 tasks
  • Completed 40 multi-model comparative evaluations across Gemini, GPT, Claude, Kimi, and GLM to identify failure points used for model improvement; team scaled from 15 to 50 members based on submission priority

Projects

MySQL Python Power BI Excel

  • Sampled 55,554 transactions from a 6.3M-record PaySim dataset; confirmed 0 duplicates and 0 nulls in SQL and Python
  • Found 18% of customers drove 62% of $8.97bn total transaction value
  • Fraud concentrated in TRANSFER (0.94%, 49 cases) and CASH_OUT (0.33%, 51 cases); zero fraud in PAYMENT, CASH_IN, or DEBIT
  • High-value transactions had 18% greater fraud probability (0.20% vs 0.17%); built a Power BI dashboard with 4 KPIs and an interactive slicer

MySQL Excel

  • Built a 3-tier churn segmentation model on 8,000 users (25.89% overall churn) where single-metric models failed to separate groups
  • Segmented into High Risk (492 users, 28.46%), Medium Risk (2,106, 26.54%), and Low Risk (5,402, 25.40%), each mapped to a retention action
  • Found Student + Mobile the highest-churn combination at 29.92% through subscription x device cross-analysis; followed by Family + Mobile (27.82%) and Family + Desktop (27.52%)
  • Skip rate was the strongest behavioral indicator: 0.3049 (churned) vs 0.2985 (retained)

MySQL Python Excel

  • Cleaned 3,642 raw records down to 1,995 by removing 1,647 rows (45%) with missing layoff count and percentage
  • US led with 256,559 layoffs; India second at 51,234; Post-IPO stage had the highest exposure at 204,132 layoffs
  • 2022 was the peak year at 160,661 layoffs; Q1 2023 alone added 125,000+
  • Consumer industry most impacted at ~45,182 layoffs; single largest event: 12,000 employees from one company

🏆 Achievements

Award Details Year
1st Place, Start-Up Arena, NIT Trichy Ranked 1st among 80-100 teams (300+ participants); qualified for national finale 2026
INSPIRE Scholarship, Government of India Awarded to top 1% in Class 12 board exams nationwide 2021

GitHub Stats

 




"Code with purpose, analyze with curiosity, build with impact."

Pinned Loading

  1. Banking-Customer-Risk-Transaction-Analysis Banking-Customer-Risk-Transaction-Analysis Public

    Analyzed over 55K banking transactions using MySQL to uncover fraud patterns, highlighting high-risk transaction types and value-based fraud insights using Power BI.

    Jupyter Notebook 1

  2. Customer_Churn_Analysis-Spotify Customer_Churn_Analysis-Spotify Public

    SQL-based analysis of Spotify user behavior to identify churn patterns, build a rule-based risk segmentation model, and design targeted retention strategies.

  3. Layoff_Trend_And_Workforce_Analysis Layoff_Trend_And_Workforce_Analysis Public

    This repository contains a complete end-to-end data cleaning workflow performed using MySQL on a real-world dataset. It demonstrates how to identify inconsistencies, remove duplicate records, handl…

    1

  4. Power-BI_Dashboards Power-BI_Dashboards Public

    Collection of interactive Power BI dashboards analyzing real-world datasets to uncover trends, insights, and data-driven decision patterns.

  5. Data_Analysis-Excel Data_Analysis-Excel Public

    This repository showcases a complete data analysis project performed entirely in Microsoft Excel, covering everything from raw data cleaning to actionable insights. It demonstrates how powerful Exc…

    1

  6. Data_Analysis-Tableau Data_Analysis-Tableau Public

    Tableau-based data analysis project using 30M+ Airbnb records joined from Listings, Calendar, and Reviews files to build interactive dashboards and visual insights.