Conceived, Architected, and Engineered by Shrey Bansal.
Live Production Environment: voce-ai.vercel.app
Voce.AI represents the frontier of human-computer interaction. It is not a chat app; it is a Real-Time Cognitive Coaching Platform built to test the extreme limits of low-latency artificial intelligence orchestration. By fusing the Socratic Method with instantaneous neural audio processing, Voce.AI provides a premium workspace for interview mastery, linguistic refinement, and dialectical growth.
This repository is built to go beyond standard architectures, delivering enterprise-grade resilience, high-fidelity design, and near-zero-latency vocal feedback loops.
Most AI applications rely on simple text-in, text-out API wrappers. Voce.AI shatters that paradigm.
- The "Socratic Synchronization" Loop: The AI does not just answer questions; it is engineered to interrogate. Through carefully crafted system prompts, the models act as expert examiners (e.g., IELTS, Technical Lead), using active dialectical questioning to force the user to articulate deeper thoughts.
- Deterministic Report Generation: Beyond conversation, the system captures telemetry from the semantic exchange. It parses the dialogue and generates a structured, multi-dimensional Performance Report, grading the user on fluency, vocabulary, and cognitive pacing—stored instantly via Convex.
- Untouchable "Lavender Glass" Aesthetic: The UI isn't a pre-built template. It is a completely bespoke design system built on the `oklch` perceptual color space, 32px volumetric backdrop-filters, and 300ms cubic-bezier kinetic transitions, making the digital space feel like a luxurious, responsive physical room.
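The report-generation stage described above can be sketched as a pure scoring function over the conversation transcript. This is a minimal illustration only: the `Turn` shape, the metric names, and the formulas below are assumptions for the sketch, not the production rubric.

```typescript
// Sketch: turn a transcript into a structured, multi-dimensional score
// object ready to persist via Convex. All metrics below are illustrative.

interface Turn {
  speaker: "user" | "ai";
  text: string;
  startMs: number; // when the turn began
  endMs: number;   // when the turn ended
}

interface PerformanceReport {
  fluencyWpm: number;         // user speaking rate, words per minute
  vocabularyRichness: number; // unique words / total words (type-token ratio)
  cognitivePacingMs: number;  // mean user response delay after an AI turn
}

function generateReport(transcript: Turn[]): PerformanceReport {
  const userTurns = transcript.filter((t) => t.speaker === "user");
  const words = userTurns.flatMap((t) =>
    t.text.toLowerCase().split(/\s+/).filter(Boolean),
  );

  // Fluency: total user words over total user speaking time.
  const speakingMs = userTurns.reduce((ms, t) => ms + (t.endMs - t.startMs), 0);
  const fluencyWpm = speakingMs > 0 ? (words.length * 60_000) / speakingMs : 0;

  // Vocabulary: type-token ratio of the user's words.
  const vocabularyRichness = words.length > 0 ? new Set(words).size / words.length : 0;

  // Pacing: average gap between an AI turn ending and the user's reply starting.
  const gaps: number[] = [];
  for (let i = 1; i < transcript.length; i++) {
    if (transcript[i].speaker === "user" && transcript[i - 1].speaker === "ai") {
      gaps.push(transcript[i].startMs - transcript[i - 1].endMs);
    }
  }
  const cognitivePacingMs =
    gaps.length > 0 ? gaps.reduce((a, b) => a + b, 0) / gaps.length : 0;

  return { fluencyWpm, vocabularyRichness, cognitivePacingMs };
}
```

Keeping the scoring pure (no I/O) lets the Convex mutation that stores the report stay a thin wrapper and keeps the rubric unit-testable.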
Voce.AI orchestrates a complex symphony of cutting-edge technologies to maintain a seamless, sub-second conversation loop.
```mermaid
graph TD
    %% User Interaction Layer
    User((User Voice)) -->|Microphone Input| A[Neural Canvas UI<br/>React 19 / Next.js 15]

    %% Audio & Identity Pipeline
    A -->|Authentication Token| Stack[Stack Auth<br/>Identity & Security]
    A -->|Raw Audio Stream| Groq[Groq Engine<br/>Whisper Speech-to-Text]

    %% Core Intelligence Engine
    Groq -->|Transcribed Text| Router[Intelligence Router<br/>Next.js Server Actions]
    Router -->|Contextual Socratic Prompt| OR[OpenRouter Hub<br/>Gemini 2.0 / Llama 3]

    %% Synthesis & Analytics Pipeline
    OR -->|AI Cognitive Response| Feedback[Report Generator<br/>Performance Telemetry]
    OR -->|AI Cognitive Response| TTS[ElevenLabs<br/>Neural Vocal Synthesis]

    %% Persistence & Output
    Feedback -->|Atomic Mutation| Convex[(Convex DB<br/>Real-Time Sync)]
    TTS -->|Low-Latency Audio Stream| User

    %% Telemetry
    Router -.->|Metrics Sync| Obs[Scope-A Observability<br/>Prometheus & Grafana]

    style A fill:#2D1B4E,stroke:#B28DFF,stroke-width:2px,color:#fff
    style Groq fill:#111,stroke:#00E676,stroke-width:2px,color:#fff
    style OR fill:#111,stroke:#2962FF,stroke-width:2px,color:#fff
    style TTS fill:#111,stroke:#FF3D00,stroke-width:2px,color:#fff
    style Convex fill:#FFBD51,stroke:#E65100,stroke-width:2px,color:#111
```
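The conversation loop in the diagram can be sketched as a single orchestration function with each external service injected as a dependency, which is what keeps the pipeline testable without live API keys. The stage names and types here are illustrative assumptions, not the actual server-action API:

```typescript
// Sketch of one conversation turn: audio in -> transcript -> Socratic
// response -> audio out. Each external service is injected as a function.

interface PipelineStages {
  transcribe: (audio: ArrayBuffer) => Promise<string>;       // Groq Whisper STT
  reason: (prompt: string) => Promise<string>;               // OpenRouter LLM
  synthesize: (text: string) => Promise<ArrayBuffer>;        // ElevenLabs TTS
  persistReport: (d: { user: string; ai: string }) => Promise<void>; // Convex
}

// The report mutation runs in parallel with speech synthesis, so the user
// hears the reply without waiting on persistence.
async function runTurn(stages: PipelineStages, audio: ArrayBuffer): Promise<ArrayBuffer> {
  const userText = await stages.transcribe(audio);
  const aiText = await stages.reason(userText);
  const [speech] = await Promise.all([
    stages.synthesize(aiText),
    stages.persistReport({ user: userText, ai: aiText }),
  ]);
  return speech;
}
```

In the real app each stage would wrap the corresponding provider SDK or HTTP call; in tests they can be stubbed with in-memory fakes.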
- Next.js 15 (App Router): Edge-ready server-side rendering for unparalleled SEO and initial load speeds.
- React 19: Utilizing the bleeding-edge concurrent features for perfectly smooth UI rendering.
- Tailwind CSS v4: A highly optimized, utility-first styling engine configured with our custom `oklch` Lavender-Glass tokens.
- Groq (Whisper): Chosen over standard STT providers for its LPU (Language Processing Unit) architecture, delivering instantaneous text transcription.
- OpenRouter: Abstracting the LLM layer allows dynamic switching between Gemini 2.0 Flash, Claude, and Llama to find the absolute best reasoning engine for the specific Socratic persona.
- ElevenLabs: Uncanny, human-identical text-to-speech synthesis that provides the platform with its empathetic, authoritative voice.
- Convex: The nervous system of Voce.AI. It replaces complex Redux states and traditional REST APIs with real-time, reactive database subscriptions.
- Stack Auth: Enterprise-grade identity management seamlessly integrated with Next.js middleware to protect user sessions and reports.
- Prometheus & Grafana: The application doesn't just run; it is heavily monitored. We track AI inference latency, API token consumption, and system jitter in real time, ensuring extreme economic and technical efficiency.
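The dynamic model switching the OpenRouter layer enables can be sketched as a persona-to-model routing table with fallbacks. The persona names and OpenRouter model IDs below are illustrative assumptions, not the production configuration:

```typescript
// Sketch: route each Socratic persona to a preferred model, falling back
// down the chain when a model is unavailable. IDs are illustrative only.

type Persona = "ielts-examiner" | "tech-lead" | "socratic-coach";

const MODEL_ROUTES: Record<Persona, string[]> = {
  // First entry is preferred; later entries are fallbacks.
  "ielts-examiner": ["google/gemini-2.0-flash-001", "meta-llama/llama-3.3-70b-instruct"],
  "tech-lead": ["anthropic/claude-3.5-sonnet", "google/gemini-2.0-flash-001"],
  "socratic-coach": ["google/gemini-2.0-flash-001", "anthropic/claude-3.5-sonnet"],
};

// Pick the best model for a persona given the set currently healthy.
function routeModel(persona: Persona, available: Set<string>): string {
  for (const model of MODEL_ROUTES[persona]) {
    if (available.has(model)) return model;
  }
  // Last resort: the persona's preferred model; let the API surface errors.
  return MODEL_ROUTES[persona][0];
}
```

Centralizing the routing in one table means swapping the reasoning engine for a persona is a one-line change rather than a refactor.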
Voce.AI is currently deployed globally via Vercel with edge-caching enabled. To run the neural canvas locally:
```bash
# 1. Clone the Masterpiece
git clone https://github.com/shreybansal365/Voce.AI.git
cd Voce.AI

# 2. Install Dependencies
npm install

# 3. Environment Configuration
# You must provide keys for Convex, Stack Auth, Groq, OpenRouter, and ElevenLabs in .env.local
```
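For reference, the `.env.local` might look like the sketch below. The variable names follow each provider's common conventions, but treat every name here as an assumption and verify it against your own dashboards:

```env
# Convex (names are assumptions -- check your Convex dashboard)
CONVEX_DEPLOYMENT=...
NEXT_PUBLIC_CONVEX_URL=...

# Stack Auth (names are assumptions -- check your Stack Auth project settings)
NEXT_PUBLIC_STACK_PROJECT_ID=...
NEXT_PUBLIC_STACK_PUBLISHABLE_CLIENT_KEY=...
STACK_SECRET_SERVER_KEY=...

# AI providers (names are assumptions -- check each provider's docs)
GROQ_API_KEY=...
OPENROUTER_API_KEY=...
ELEVENLABS_API_KEY=...
```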
```bash
# 4. Initialize Local Server
npm run dev
```

Shrey Bansal
Full-Stack AI Architect & UI/UX Visionary
Creator of the Voce.AI platform, responsible for the end-to-end engineering, from the low-latency WebRTC audio pipelines to the bespoke Lavender Glass UI architecture.
© 2026 Voce.AI | Shrey Bansal. Build the Future. Perfect the Sound of Intelligence.