Skip to content
View ShubhamSaini01's full-sized avatar

Block or report ShubhamSaini01

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
ShubhamSaini01/README.md

Typing SVG

Nice to meet you! ๐Ÿ‘‹

Hi, I'm Shubham Saini, a seasoned Computer Vision Engineer with a robust academic background from Birla Institute of Technology and Science, Pilani. My expertise spans Image Processing, Computer Vision, performance optimization, and inference engineering, with specialized proficiency in high-performance computing (HPC), GPU programming, and AI accelerators.

I'm skilled in C++, Object-Oriented Programming (OOP), advanced Memory Management techniques, OpenCV, CUDA, TensorRT, ONNX, and parallel computing frameworks. My projects frequently involve optimizing deep learning models for accelerated inference, developing efficient and scalable software solutions, and leveraging my extensive experience with GPU architectures.

I'm passionate about solving challenging problems at the intersection of software engineering and AI infrastructure, continually aiming for peak performance, reliability, and innovation.

About Me

  • ๐ŸŒ Experienced Computer Vision Engineer
  • ๐Ÿ’ป Skilled in C++(14,17), high Performance Computing, Multi-Threading, PyTorch, CUDA, TensorRT, OpenCV.
  • ๐ŸŽ“ Birla Institute of Technology and Science, Pilani graduate

๐Ÿ’ฌ Ask me about ...

Feel free to explore my repositories to learn more about my projects and contributions.

Pinned Loading

  1. Chess-Coach-Agent Chess-Coach-Agent Public

    AI agent that analyzes PGN games using Stockfish + Gemini to give personalized chess coaching.

    C++ 3 1

  2. Attention-the-Flash-Way Attention-the-Flash-Way Public

    Cuda 1

  3. trt-infer-bench trt-infer-bench Public

    C++ 1

  4. OnnxInference OnnxInference Public

    CMake

  5. ternary-cnn ternary-cnn Public

    Ternary CNN implementation for CIFAR-10: PyTorch model with {-1, 0, +1} constrained weights + AVX-VNNI optimized C++ inference engine. Includes training pipeline, bitpacked weight export, and full โ€ฆ

    C++