Skip to content
View zehuanyu's full-sized avatar
  • 20:36 (UTC -07:00)

Block or report zehuanyu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
zehuanyu/README.md

Hi, I'm Zehuan Yu

I have a background in data science, mathematics, robotics, and applied engineering, with hands-on experience in Python, SQL, C++, machine learning, data visualization, ROS 2, and simulation.

My GitHub focuses on practical data and engineering projects, including used-car price modeling, autonomous racing simulation, computer vision detection, API-based analytics, biostatistical modeling, and C++ object-oriented programming.


Featured Projects

Used Car Depreciation

A machine learning project for analyzing used-car depreciation patterns and estimating resale prices from listing-level vehicle attributes.

This project builds a practical valuation workflow, including data cleaning, exploratory analysis, feature engineering, LightGBM modeling, benchmark comparison, segment-level error analysis, and a Streamlit app for interactive vehicle price estimation.

Highlights:

  • Cleaned and processed large-scale used-car listing data
  • Engineered depreciation-related features such as vehicle age, mileage transforms, and engine displacement
  • Trained a LightGBM regression model for resale price prediction
  • Benchmarked model performance against simpler baseline models
  • Built a Streamlit app to demonstrate price estimation and depreciation curves

Tech Stack: Python, Pandas, LightGBM, Scikit-learn, Streamlit, Matplotlib, Regression Modeling


F1TENTH Race Planner

A ROS 2 autonomous racing simulation project using LiDAR-based local planning in the F1TENTH simulator.

This project combines wall following, follow-the-gap, local trajectory planning, predictive speed control, racing-line biasing, and ML-assisted tuning. It demonstrates how robotics simulation and data-driven parameter tuning can be used to improve autonomous vehicle behavior.

Highlights:

  • Built a LiDAR-based local racing planner for the F1TENTH simulator
  • Combined wall following, follow-the-gap, and local planning logic
  • Added predictive speed control based on curvature, clearance, braking distance, and steering smoothness
  • Used parameter tuning and ML-assisted adjustment to improve simulated driving performance
  • Organized configs, scripts, models, and test results for reproducible evaluation

Tech Stack: ROS 2, Python, F1TENTH, LiDAR, NumPy, YAML, Simulation


YOLOv8 Traffic Sign Detection

A computer vision project for traffic sign detection using YOLOv8 and custom lightweight architecture variants.

This project compares a standard YOLOv8 baseline with attention and feature-fusion extensions such as CBAM, LCFE, IMCMD, and YOLO-TS-style fusion. The goal is to improve small-object traffic sign detection and evaluate model robustness across day and night driving conditions.

Highlights:

  • Trained and evaluated YOLOv8-based traffic sign detection models
  • Compared baseline, attention-based, and feature-fusion model variants
  • Evaluated mAP, parameter efficiency, and day/night robustness
  • Organized experiment configs, scripts, reports, and result summaries for reproducible analysis

Tech Stack: Python, PyTorch, Ultralytics YOLOv8, OpenCV, Object Detection, Computer Vision


Spotify Year-End Music Trend Analyzer

A Python data analysis project comparing 2020 and 2021 year-end music trends using Spotify API data, SQLite, SQL queries, and Matplotlib visualizations.

Highlights:

  • Collected playlist data from the Spotify Web API
  • Stored artist and track records in SQLite
  • Used SQL joins to compare artists across years
  • Identified artists appearing in both years and artists unique to each year
  • Generated visualizations for artist overlap and trend comparison

Tech Stack: Python, Spotify API, SQLite, SQL, Matplotlib, JSON


HIV Dynamics Biostatistics Simulation

A Python-based mathematical biology project that simulates HIV infection dynamics and immune response using ordinary differential equations.

Highlights:

  • Modeled interactions between healthy CD4+ T-cells, infected cells, virions, and immune response cells
  • Used SciPy ODE solvers for numerical simulation
  • Compared immune response behavior under different parameter settings
  • Visualized CTLp and CTLe trends over time
  • Compiled results into a research report and presentation

Tech Stack: Python, NumPy, SciPy, Matplotlib, ODE Modeling, Biostatistics


Intro-to-Programming-Cpp-Projects

A semester-long C++ programming portfolio covering console applications, games, ciphers, object-oriented design, and simulation.

Highlights:

  • Built projects including a focaccia calculator, Rock-Paper-Scissors game, cipher program, Battleship game, and elevator simulation
  • Practiced C++ fundamentals, functions, loops, strings, file input, and class design
  • Developed larger multi-file programs using object-oriented programming

Tech Stack: C++, Object-Oriented Programming, File I/O, Console Applications


Technical Skills

Programming: Python, SQL, C++
Data Science & Analytics: Pandas, Scikit-learn, LightGBM, SQLite, Matplotlib, Streamlit, API Data Collection, Data Cleaning
Machine Learning: Regression Modeling, Model Evaluation, Feature Engineering, Benchmarking, Error Analysis
Computer Vision: PyTorch, Ultralytics YOLOv8, OpenCV, Object Detection
Robotics & Simulation: ROS 2, F1TENTH, LiDAR, Odometry, Ackermann Drive, Simulation
Tools: Git, GitHub, Ubuntu, WSL2, VS Code


Current Focus

I am currently building projects at the intersection of data science, business analytics, machine learning, automotive systems, and robotics simulation. My strongest interests are data analytics, business intelligence, applied machine learning, data-driven engineering, and practical software tools for solving real-world problems.

Pinned Loading

  1. Used-Car-Depreciation Used-Car-Depreciation Public

    Machine learning project for used-car depreciation analysis and resale price prediction using Python, LightGBM, EDA, error analysis, and Streamlit.

    Python

  2. F1tenth-Race-Planner F1tenth-Race-Planner Public

    ROS 2 LiDAR-based local racing planner for the F1TENTH simulator with wall following, follow-the-gap, speed control, and ML-assisted tuning.

    Python

  3. YOLOv8-Traffic-Sign-Detection YOLOv8-Traffic-Sign-Detection Public

    YOLOv8-based traffic sign detection project with custom attention and feature-fusion variants, day/night robustness analysis, and mAP benchmarking.

    Python

  4. Spotify-Year-End-Music-Trend-Analyzer Spotify-Year-End-Music-Trend-Analyzer Public

    A Python music data analysis project comparing 2020 and 2021 year-end trends using Spotify API data, SQLite, SQL, and Matplotlib.

    Python

  5. HIV-Dynamics-Biostatistics-Simulation HIV-Dynamics-Biostatistics-Simulation Public

    Python biostatistics project modeling HIV infection dynamics and immune response using ODE simulation, SciPy, and Matplotlib.

    Python

  6. Intro-to-Programming-Cpp-Projects Intro-to-Programming-Cpp-Projects Public

    Semester-long C++ programming portfolio covering console applications, games, ciphers, OOP, and simulation.

    C++