ChelsaMJ/ReadMe.md

Final Year B.Tech CSE Student | Data Science Enthusiast

Cat GIF

About Me

  • Passionate about Data Structures & Algorithms, Data Science, and Full-Stack Development
  • Experienced in Python, Java, SQL, and Web Technologies
  • Interested in Machine Learning, Topic Modeling (LDA), and Big Data Tools like Hadoop & Hive
  • Currently exploring Advanced DSA and Scalable Data Processing
  • Internship: Data Science Intern @ Celebal Technologies

Tech Stack

Languages

Frameworks & Tools

Pandas, NumPy, spaCy, NLTK

Featured Projects

A Natural Language Processing project that detects and classifies hate speech from text data.
Highlights:

  • Used TF-IDF vectorization and Logistic Regression for classification
  • Preprocessed datasets to handle imbalanced classes using SMOTE
  • Tuned model hyperparameters to reach a high F1-score
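
The class-balancing step can be illustrated with a small sketch. This is not the project's code: it is a standard-library-only approximation of the SMOTE idea (interpolating between a minority sample and one of its nearest minority neighbours). The function name `smote_like_oversample`, its parameters, and the sample points are all hypothetical.

```python
import random

def smote_like_oversample(minority, n_new, k=2, seed=0):
    """Create n_new synthetic points by interpolating between a minority
    sample and one of its k nearest minority neighbours (SMOTE-style).
    Hypothetical helper for illustration only."""
    rng = random.Random(seed)

    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest neighbours of `base` inside the minority class, excluding itself
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: sq_dist(base, p),
        )[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic

# Made-up 2-D feature vectors standing in for minority-class samples
minority = [(1.0, 2.0), (1.5, 1.8), (2.0, 2.2), (1.2, 2.5)]
new_points = smote_like_oversample(minority, n_new=6)
```

Each synthetic point lies on the line segment between two real minority samples, so the new data stays inside the region the minority class already occupies. In practice a library implementation such as the one in imbalanced-learn would normally be used instead.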

Unsupervised learning project applying Latent Dirichlet Allocation (LDA) to the 20 Newsgroups dataset for topic discovery.
Highlights:

  • Implemented text preprocessing: tokenization, stopword removal, and lemmatization
  • Applied Gensim’s LDA model to extract coherent topics
  • Visualized results using pyLDAvis for better interpretability

A Retrieval-Augmented Generation (RAG) chatbot that answers natural-language queries about real loan applicant data.
Highlights:

  • Ingests and chunks structured loan datasets for semantic search
  • Builds a vector search index using TF-IDF + FAISS
  • Retrieves relevant data chunks and uses Gemini 1.5 Pro to generate human-like answers
  • Deployed via Streamlit for an interactive user experience
    Tech Stack: Python, Streamlit, FAISS, TF-IDF, Google GenerativeAI (Gemini), Pandas
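
The retrieval half of this pipeline can be sketched with the standard library alone. This is an illustration under assumptions, not the project's implementation: FAISS is replaced by a brute-force cosine-similarity scan, the Gemini generation step is left out, and the function names and loan records are made up.

```python
import math
from collections import Counter

def tokenize(text):
    return text.lower().split()

def build_tfidf_index(chunks):
    """Return (idf, vectors): smoothed IDF weights and one sparse
    TF-IDF vector (term -> weight) per chunk."""
    docs = [Counter(tokenize(c)) for c in chunks]
    n_docs = len(docs)
    # Document frequency: number of chunks that contain each term
    df = Counter(term for doc in docs for term in doc)
    idf = {t: math.log((1 + n_docs) / (1 + d)) + 1.0 for t, d in df.items()}
    vectors = [{t: tf * idf[t] for t, tf in doc.items()} for doc in docs]
    return idf, vectors

def cosine(a, b):
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

def retrieve(query, chunks, idf, vectors, top_k=1):
    """Rank chunks by cosine similarity to the query (brute force;
    a FAISS index would take over this step at scale)."""
    q_counts = Counter(tokenize(query))
    # Terms unseen at indexing time carry no weight
    q_vec = {t: tf * idf[t] for t, tf in q_counts.items() if t in idf}
    ranked = sorted(
        range(len(chunks)),
        key=lambda i: cosine(q_vec, vectors[i]),
        reverse=True,
    )
    return [chunks[i] for i in ranked[:top_k]]

# Made-up records standing in for chunked loan data
chunks = [
    "applicant 101 income 52000 loan approved",
    "applicant 102 income 18000 loan rejected",
    "applicant 103 self employed loan approved",
]
idf, vectors = build_tfidf_index(chunks)
top = retrieve("which applicant was rejected", chunks, idf, vectors)
```

In a full RAG flow, the retrieved chunks would then be placed into the prompt sent to the language model.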

An improved word-level autocorrector that suggests accurate spelling corrections using advanced NLP techniques.
Highlights:

  • Integrated spaCy for lemmatization and NLTK corpora for real vocabulary coverage
  • Ranked suggestions by Brown corpus frequency for realistic recommendations
  • Edit-distance-based candidate generation with robust error handling
  • Vastly improved over v1.0, which relied on a limited custom vocabulary
    Tech Stack: Python, spaCy, NLTK, Brown Corpus, Edit-Distance Algorithm
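
A minimal sketch of edit-distance candidate generation with frequency ranking, in the style of Peter Norvig's well-known spelling corrector. It is not the project's code: `toy_freq` is a made-up stand-in for Brown corpus counts, and `edits1` and `suggest` are hypothetical names.

```python
import string

def edits1(word):
    """All strings one edit away from `word`
    (one delete, transpose, replace, or insert)."""
    letters = string.ascii_lowercase
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = [l + r[1:] for l, r in splits if r]
    transposes = [l + r[1] + r[0] + r[2:] for l, r in splits if len(r) > 1]
    replaces = [l + c + r[1:] for l, r in splits if r for c in letters]
    inserts = [l + c + r for l, r in splits for c in letters]
    return set(deletes + transposes + replaces + inserts)

def suggest(word, freq, max_suggestions=3):
    """Return known words within one edit, most frequent first."""
    word = word.lower()
    if word in freq:
        return [word]  # already a known word, nothing to correct
    candidates = {w for w in edits1(word) if w in freq}
    # Sort by descending frequency, breaking ties alphabetically
    return sorted(candidates, key=lambda w: (-freq[w], w))[:max_suggestions]

# Made-up counts; a real run would use corpus frequencies instead
toy_freq = {"the": 500, "then": 120, "they": 200, "than": 90, "hello": 15}
suggestions = suggest("thn", toy_freq)
```

Ranking by corpus frequency means that when several real words are equally close to the typo, the more common word is offered first.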

An Android application designed to connect entrepreneurs with mentors and foster collaboration.
Highlights:

  • User profiles for entrepreneurs and mentors
  • Real-time chat and messaging system
  • Article posting and discussion threads for knowledge sharing
  • Clean and user-friendly Material Design UI
    Tech Stack: Java (Android), XML, Firebase (Authentication & Realtime Database)

GitHub Stats

My GitHub Stats
GitHub Streak
Top Languages
GitHub Activity Graph

LeetCode Stats

LeetCode Profile LeetCode Badge

Connect with Me

LinkedIn

"Code. Debug. Learn. Repeat."

Pinned

  1. Exploratory-Data-Analysis-of-Crime-Rates-in-U.S.-Cities-and-States (Public)

    This project explores crime data to uncover patterns and relationships between different crime types and socio-economic factors. Using data-driven techniques, we aim to identify key trends and prov…

    Jupyter Notebook

  2. Music-Recommendation-System (Public)

    Python Capstone project - This NLP-powered system recommends songs based on lyrics and textual descriptions, aligning seamlessly with your preferences.

    Jupyter Notebook

  3. Hate-Speech-Detection (Public)

    A deep learning project for detecting hate speech in tweets using NLP techniques and a Keras-based neural network classifier.

    Jupyter Notebook

  4. Practo-upGrad-Internship (Public)

    Business Analytics Internship: summer internship completed with upGrad and Practo.

    Jupyter Notebook

  5. Entrenet-Android-App (Public)

    Android project: an app for entrepreneurs to connect with mentors, chat with them, post articles, and start discussions.

    Kotlin

  6. Network-Traffic-Anomaly-Detection (Public)

    Anomaly detection system for network traffic using machine learning models, built as part of the Bachelor of Technology final-year project.

    Jupyter Notebook