🚀 Sagent

Real-Time AI Voice Call Agent Platform

Talk. Listen. Think. Respond — in real time.

🧠 What is Sagent?

Sagent is a real-time AI voice call agent that can:

📞 Make outbound calls
📲 Handle inbound calls
🧠 Understand speech using LLMs
🗣️ Respond with natural voice
📡 Stream live transcripts to a dashboard

⚡ Core Capabilities

🔁 Streaming pipeline: STT → LLM → TTS
📡 Live transcript via WebSocket
🧑‍💼 Multi-tenant architecture
⚙️ Configurable AI agent (prompt-driven)
📞 Twilio call integration (inbound + outbound)

🏗️ Architecture Overview

flowchart RL
    Tenant --> Frontend[Dashboard]
    Frontend --> Backend
    Backend --> |WebSocket| Frontend

    Backend --> Twilio
    Twilio --> Backend
    
    subgraph Server-side
        Backend[FastAPI] --> AI[AI: STT + LLM + TTS]
        AI --> Backend
        direction TB
        Backend --> DB[(PostgreSQL)]
    end

    Twilio <--> Lead

✨ Key Features

🔁 Real-time voice interaction (STT → LLM → TTS)
📡 Live transcript streaming (WebSocket)
🧑‍💼 Multi-tenant architecture
⚙️ Configurable AI agent (prompt-based behavior)
📞 Outbound & inbound call support
🗂️ Call history with transcripts & recordings
📱 Phone-like UI dashboard

🎯 Why This Project Stands Out

Real-time AI system (not batch or async)
Full-stack architecture (FastAPI + React)
Voice + LLM + Telephony integration
Production-ready design (multi-tenant, scalable)

🎥 Demo Preview (Coming Soon)

Live call + real-time transcript streaming UI

🧩 Tech Stack

Backend

FastAPI (Python)
PostgreSQL (Render)
WebSocket (real-time streaming)

Frontend

React (TypeScript)
Tailwind CSS

AI & Voice

STT: ElevenLabs Scribe (Realtime)
LLM: OpenAI API
TTS: ElevenLabs Flash

Telephony

Twilio (calls + recordings)

Hosting

Render

📁 Project Structure

Sagent/
├── backend/      # FastAPI backend
├── frontend/     # React dashboard
├── docs/         # system design documents
├── infra/        # deployment configs
└── README.md

🔄 Core Flow

Outbound Call

sequenceDiagram
    participant Twilio
    participant Backend
    participant AI
    participant UI

    UI->>Backend: Start Call
    Backend->>Twilio: Initiate Call

    loop Conversation
        Twilio->>Backend: Audio
        Backend->>AI: Process
        AI->>Backend: Response
        Backend->>Twilio: Voice
        Backend->>UI: Transcript
    end

📞 Use Cases

AI sales agent (cold calls)
customer support automation
appointment booking
AI receptionist
voice-based SaaS demos

🎯 Design Principles

Real-time first (low-latency streaming)
Modular architecture (clean separation)
Scalable by design (multi-tenant ready)
AI-centric (prompt-driven behavior)

🚀 Getting Started

1. Clone the repo

git clone https://github.com/oceanstar88/sagent.git
cd sagent

2. Setup backend

cd backend
pip install -r requirements.txt
uvicorn app.main:app --reload

3. Setup frontend

cd frontend
npm install
npm run dev

4. Configure environment

Create .env file:

DATABASE_URL=
JWT_SECRET=

TWILIO_ACCOUNT_SID=
TWILIO_AUTH_TOKEN=
TWILIO_PHONE_NUMBER=

ELEVENLABS_API_KEY=
OPENAI_API_KEY=

📡 Demo Capabilities

Start a call from dashboard
Receive inbound call
Watch live transcript
Review call history

📚 Documentation

Detailed system design available in docs

Includes:

system architecture
AI engine design
backend & frontend design
API spec
sequence diagrams

🔮 Future Improvements

call analytics dashboard
CRM integration
multi-agent orchestration
voice cloning
multilingual support

👨‍💻 Author

Built as a high-performance AI voice agent system demo for showcasing real-time AI + telephony integration.

⭐️ Summary

Sagent demonstrates:

real-time AI systems
voice + LLM integration
full-stack engineering capability
production-level architecture

This is not just a demo — it's a foundation for real AI voice products.

⭐ If you find this interesting, consider starring the repo!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🚀 Sagent

Real-Time AI Voice Call Agent Platform

🧠 What is Sagent?

⚡ Core Capabilities

🏗️ Architecture Overview

✨ Key Features

🎯 Why This Project Stands Out

🎥 Demo Preview (Coming Soon)

🧩 Tech Stack

Backend

Frontend

AI & Voice

Telephony

Hosting

📁 Project Structure

🔄 Core Flow

Outbound Call

📞 Use Cases

🎯 Design Principles

🚀 Getting Started

1. Clone the repo

2. Setup backend

3. Setup frontend

4. Configure environment

📡 Demo Capabilities

📚 Documentation

🔮 Future Improvements

👨‍💻 Author

⭐️ Summary

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
backend		backend
docs		docs
frontend		frontend
infra		infra
scripts		scripts
.env.example		.env.example
README.md		README.md
docker-compose.yml		docker-compose.yml

Folders and files

Latest commit

History

Repository files navigation

🚀 Sagent

Real-Time AI Voice Call Agent Platform

🧠 What is Sagent?

⚡ Core Capabilities

🏗️ Architecture Overview

✨ Key Features

🎯 Why This Project Stands Out

🎥 Demo Preview (Coming Soon)

🧩 Tech Stack

Backend

Frontend

AI & Voice

Telephony

Hosting

📁 Project Structure

🔄 Core Flow

Outbound Call

📞 Use Cases

🎯 Design Principles

🚀 Getting Started

1. Clone the repo

2. Setup backend

3. Setup frontend

4. Configure environment

📡 Demo Capabilities

📚 Documentation

🔮 Future Improvements

👨‍💻 Author

⭐️ Summary

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages