SPARC (Indian Sign Language Recognition System)
Team: NAMO NIRVANA (Team ID: 94943)
Problem Statement ID: SIH25247
Theme: Miscellaneous | Category: Hardware
Smart India Hackathon 2025
- Harsh Yadav (Team Lead)
- Avishkar Jaiswal
- Samriddhi Ganguly
- Samyak Jain
- Harshit Singh
- Thakur Akshayakumar Raj
SPARC is a specialized Indian Sign Language (ISL) Recognition System. It is designed to interpret the complex, dynamic, and bimanual gestures unique to ISL, translating them into text/speech in real-time.
Unlike generic sign language models, SPARC focuses specifically on the temporal dynamics of ISL—where the movement is just as important as the pose.
- Over 5 million deaf individuals in India.
- Fewer than 250 certified ISL interpreters.
- The Gap: Most existing solutions focus on static alphabets (A-Z). ISL is a full language with grammar and continuous motion. SPARC solves for words and sentences.
This repository implements a multi-stage deep learning pipeline, evolving from standard LSTMs to State-of-the-Art (SOTA) architectures.
- Input normalization: every video is resampled to a fixed length of 45 frames.
- Feature Extraction: MediaPipe Holistic extracts 258 Keypoints per frame:
- Pose (132): 33 landmarks × 4 values (x, y, z, visibility) capturing body orientation & arm movement.
- Left Hand (63) + Right Hand (63): 21 landmarks × 3 coordinates each, for fine-grained finger articulation.
- Augmentation Strategy: To ensure robustness, we implement:
- Gaussian Noise Injection (Simulating sensor noise).
- Spatial Scaling (Handling different body sizes).
- Temporal Warping (Handling different signing speeds).
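The normalization and augmentation steps above can be sketched in NumPy, assuming each clip is stored as a `(num_frames, 258)` keypoint array (function names and noise/scale ranges here are illustrative, not the repo's tuned values):

```python
import numpy as np

def sample_to_fixed_length(seq: np.ndarray, target: int = 45) -> np.ndarray:
    """Uniformly resample a (frames, 258) sequence to exactly `target` frames."""
    idx = np.linspace(0, len(seq) - 1, target).round().astype(int)
    return seq[idx]

def augment(seq: np.ndarray, rng: np.random.Generator) -> np.ndarray:
    """Apply the three augmentations described above."""
    # 1. Gaussian noise injection (simulates sensor/landmark jitter).
    out = seq + rng.normal(0.0, 0.01, size=seq.shape)
    # 2. Spatial scaling (handles different body sizes).
    out = out * rng.uniform(0.9, 1.1)
    # 3. Temporal warping: resample at a random speed, then back to 45 frames.
    speed = rng.uniform(0.8, 1.2)
    warped_len = max(2, int(len(out) * speed))
    idx = np.linspace(0, len(out) - 1, warped_len).round().astype(int)
    return sample_to_fixed_length(out[idx])

rng = np.random.default_rng(0)
clip = rng.normal(size=(60, 258))     # a raw 60-frame clip of keypoints
x = sample_to_fixed_length(clip)      # (45, 258) network input
x_aug = augment(x, rng)               # same shape, perturbed copy
print(x.shape, x_aug.shape)           # (45, 258) (45, 258)
```

Resampling by index keeps the augmented clip aligned with the fixed 45-frame input the models expect.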
We researched and implemented three distinct tiers of models:
- Structure: 3 stacked LSTM layers (64-128-256 units) + Dense classification head.
- Use Case: Fast, lightweight recognition for basic vocabulary.
- Current Deployment: Optimized for low-latency CPU inference.
- Improvements: Added Batch Normalization, Dropout (0.3), and L2 Regularization.
- Activation: Switched to `tanh` for stable gradient flow.
- Result: Higher accuracy on unseen test subjects.
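A minimal Keras sketch of this improved baseline, assuming 45×258 inputs and the 16-class vocabulary (layer sizes follow the 64-128-256 stack described above; the repo's exact hyperparameters may differ):

```python
import tensorflow as tf
from tensorflow.keras import layers, models, regularizers

def build_improved_lstm(num_classes: int = 16) -> tf.keras.Model:
    """3 stacked LSTMs (64-128-256) with BatchNorm, Dropout(0.3), L2, tanh."""
    model = models.Sequential([
        layers.Input(shape=(45, 258)),
        layers.LSTM(64, return_sequences=True, activation="tanh",
                    kernel_regularizer=regularizers.l2(1e-4)),
        layers.BatchNormalization(),
        layers.Dropout(0.3),
        layers.LSTM(128, return_sequences=True, activation="tanh",
                    kernel_regularizer=regularizers.l2(1e-4)),
        layers.BatchNormalization(),
        layers.Dropout(0.3),
        layers.LSTM(256, activation="tanh",
                    kernel_regularizer=regularizers.l2(1e-4)),
        layers.Dense(64, activation="relu"),
        layers.Dense(num_classes, activation="softmax"),
    ])
    model.compile(optimizer="adam", loss="categorical_crossentropy",
                  metrics=["accuracy"])
    return model

model = build_improved_lstm()
print(model.output_shape)  # (None, 16)
```

Dropout and L2 together are what push generalization to unseen signers; the final softmax maps to the 16-word vocabulary.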
- SOTA Architecture: A hybrid Spatial-Temporal design.
- Stream 1 (Spatial): LSTM with Self-Attention mechanisms to focus on hand-face interaction.
- Stream 2 (Temporal): Temporal Convolutional Networks (TCN) to capture fast motion dynamics.
- Fusion: Attention-based fusion layer combines both streams for the final prediction.
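The attention-based fusion can be illustrated in NumPy: given one embedding per stream, a scoring vector weights the spatial and temporal features before the classifier sees them (dimensions and names here are illustrative, and the scoring vector would be learned during training):

```python
import numpy as np

def softmax(x: np.ndarray) -> np.ndarray:
    e = np.exp(x - x.max())
    return e / e.sum()

def attention_fuse(spatial: np.ndarray, temporal: np.ndarray,
                   score_w: np.ndarray) -> np.ndarray:
    """Weight the two stream embeddings by attention scores and combine."""
    streams = np.stack([spatial, temporal])   # (2, d)
    scores = streams @ score_w                # one scalar score per stream
    weights = softmax(scores)                 # (2,), sums to 1
    return weights @ streams                  # (d,) fused embedding

rng = np.random.default_rng(1)
d = 256
spatial_feat = rng.normal(size=d)   # e.g. from the LSTM + self-attention stream
temporal_feat = rng.normal(size=d)  # e.g. from the TCN stream
score_w = rng.normal(size=d)        # hypothetical learned scoring vector
fused = attention_fuse(spatial_feat, temporal_feat, score_w)
print(fused.shape)  # (256,)
```

Because the weights are input-dependent, the model can lean on the spatial stream for held poses and the temporal stream for fast motion.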
Alongside the AI model, we engineered a Rule-Based Heuristic Engine (realtime-detection.py) that gives instant feedback on static cultural signs using geometric rules:
- Namaste: Calculates wrist-to-wrist distance and palm symmetry.
- I am Indian: Triangulates Hand-Eyebrow-Shoulder positions.
- Water/Doctor/Home: Custom geometric signatures.
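As a sketch of this geometric approach, a "Namaste" check over normalized MediaPipe coordinates might look like the following (the thresholds and landmark arguments are illustrative, not the repo's tuned values):

```python
import numpy as np

def is_namaste(left_wrist, right_wrist, left_palm, right_palm,
               dist_thresh: float = 0.08, sym_thresh: float = 0.05) -> bool:
    """Heuristic: wrists close together and palms vertically symmetric."""
    lw, rw = np.asarray(left_wrist), np.asarray(right_wrist)
    lp, rp = np.asarray(left_palm), np.asarray(right_palm)
    wrist_dist = np.linalg.norm(lw - rw)   # wrist-to-wrist distance
    palm_offset = abs(lp[1] - rp[1])       # vertical palm asymmetry
    return bool(wrist_dist < dist_thresh and palm_offset < sym_thresh)

# Hands pressed together at chest height -> detected
print(is_namaste([0.48, 0.55], [0.52, 0.55], [0.49, 0.45], [0.51, 0.45]))  # True
# Hands far apart -> not detected
print(is_namaste([0.30, 0.55], [0.70, 0.55], [0.30, 0.45], [0.70, 0.45]))  # False
```

Each sign in the engine reduces to a handful of such distance and symmetry tests, which is why it runs instantly without a neural network.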
- Dataset: INCLUDE 50 + Custom NAMO NIRVANA Dataset.
- Vocabulary: 16 Classes (Hello, Thank you, Please, Good Morning, etc.).
- Training Scale: 1000+ Videos with 5x Augmentation.
- Accuracy:
- Validation: 74.6%
- Real-Time Test: 84.0%
- Clone the repository
- Install Dependencies:
pip install -r requirements.txt
This uses the LSTM network to recognize dynamic words ("Hello", "How are you"):
python deploy-code.py

For checking specific static signs (Namaste, Indian, etc.):
python realtime-detection.py
(Or use RUN-REALTIME-DEMO.bat on Windows)
If you want to add new words to the ISL dictionary:
# Prepare data in 'training-data/' folder
python train-improved-model.py --epochs 100 --augment 5

Developed by Team NAMO NIRVANA for Smart India Hackathon 2025.