AlphaAvatar is a self-hostable Omni Personal Assistant framework designed to evolve into an intelligent personal butler β a continuous, personalized, and proactive assistant that can remember, understand, plan, and act on behalf of the user.
It is built around a plugin-based real-time Agent architecture, combining:
- π§ Memory for long-term user, assistant, and tool interaction history
- 𧬠Persona for user understanding, identity continuity, and personalization
- π‘ Reflection for self-improvement and long-term behavioral adaptation
- π Planning for task decomposition, reminders, and future-oriented actions
- βοΈ Behavior for response style, workflow policy, and proactive assistance
- π§° Tools through MCP, RAG, DeepResearch, and external integrations
- π Virtual Character for real-time voice/avatar interaction
β¨ Fully self-hostable and privacy-first β AlphaAvatar can run locally or on your own infrastructure, giving you control over your data, memory, tools, and behavior.
|
|
|
|
|
|
π‘ AlphaAvatar is not just a chatbot. It is a foundation for building stateful, proactive, multimodal, and self-evolving personal AI assistants.
|
Self-improving memory module for Omni-Avatar. |
Automatic extraction and real-time matching of multimodal user persona. |
|
A self-improvement module that reflects on memory, behavior, and interaction history. |
Long-horizon planning module for tasks, reminders, goals, and multi-step workflows. |
|
Controls response style, workflow policy, tool-use behavior, and proactive assistance rules. |
The real-time generated virtual character that visually represents the Avatar during interactions. |
|
Allow AlphaAvatar to access the network and perform single-step/multi-step inference through a separate Agent service to search for more accurate content. |
Allow AlphaAvatar to access Documents/Skills (user-uploaded/generated by the Reflection module/URL access) to obtain document-related information. |
|
Allows AlphaAvatar to discover and call real-world external tools such as databases, email, calendars, APIs, and productivity apps. |
Provide AlphaAvatar with a sandbox environment to interact with the external world or with other agents, thereby enabling multi-agent interaction and exploration. |
-
[2026/05] Released AlphaAvatar version 0.5.4:
- Added LanceDB-backed MCP tool retrieval, enabling AlphaAvatar to semantically search relevant MCP tools from Agent queries.
- Refactored system prompt and runtime prompt composition, improved Persona runtime state tracking, added temporary-user to real-user identity merging, and improved RAG runtime behavior.
- Released AlphaAvatar version 0.5.5: Fixed the inference runner registration lifecycle for production
startmode, ensuring plugins runners are registered after config parsing and before LiveKit creates the inference executor.
-
[2026/04] Released AlphaAvatar version 0.5.3:
- Added localized Markdown backup for the Memory plugin.
- Added LanceDB as the default local VDB option when Qdrant credentials are not provided.
-
[2026/03] Released AlphaAvatar version 0.5.0:
-
[2026/02] Released AlphaAvatar version 0.4.0:
- Added RAG support through RAG-Anything.
- Optimized the Memory and DeepResearch modules.
- Released AlphaAvatar version 0.4.1: Fixed Persona plugin bugs and added a new MCP plugin.
-
[2026/01] Released AlphaAvatar version 0.3.0:
- Added DeepResearch support through the Tavily API.
- Released AlphaAvatar version 0.3.1: Added tool-call memory extraction during userβassistant interactions.
2025 Release History
-
[2025/12] Released AlphaAvatar version 0.2.0:
- Added AIRI Live2D-based virtual character display.
-
[2025/11] Released AlphaAvatar version 0.1.0:
- Added automatic memory extraction.
- Added automatic user persona extraction and matching.
Install stable AlphaAvatar version from PyPI:
uv venv .my-env --python 3.11
source .my-env/bin/activate
pip install alpha-avatar-agentsInstall latest AlphaAvatar version from GitHub:
git clone --recurse-submodules https://github.com/AlphaAvatar/AlphaAvatar.git
cd AlphaAvatar
uv venv .venv --python 3.11
source .venv/bin/activate
uv sync --all-packagesStart your agent in dev mode to connect it to LiveKit and make it available from anywhere on the internet.
π§© Step 1. Configure Environment Variables
cd AlphaAvatar
# Copy template
cp .env.template .env.devEdit .env.dev and set required environment variables.
π¦ Step 2. Download Required Files
alphaavatar download-filesβ Step 3. Run the Agent
ENV_FILE=.env.dev alphaavatar dev examples/agent_configs/voice/pipeline_openai_tools.yaml
# or
ENV_FILE=.env.dev alphaavatar dev examples/agent_configs/mm/pipeline_openai_tools.yamlTo see more supported modes, please refer to the LiveKit doc.
To see more examples, please refer to the Examples README
AlphaAvatar supports multiple Access Channels, allowing different types of users β from end users to developers β to interact with the system.
AlphaAvatar is built around a real-time multimodal Agent runtime. It separates streaming interaction, model reasoning, runtime context, plugin state, tool execution, and channel adapters.
AlphaAvatar Runtime
βββββββββββββββββββ
ββββββββββββββββββββββββββββββββββββββββββββββββ
β LiveKit Room β
β audio / text / video / data streams β
ββββββββββββββββββββββββ¬ββββββββββββββββββββββββ
β
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββ
β AgentSession β
β real-time turn detection / streaming IO β
ββββββββββββββββββββββββ¬ββββββββββββββββββββββββ
β
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββ
β AvatarEngine β
β Realtime Model Pipeline β
β STT / VAD / LLM / TTS / video-ready input β
ββββββββββββββββββββββββ¬ββββββββββββββββββββββββ
β
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββ
β Prompt & Runtime Context Layer β
β system prompt / runtime prompt / time / mode β
β memory / persona / plan / reflection / rules β
ββββββββββββββββββββββββ¬ββββββββββββββββββββββββ
β
ββββββββββββββββββββββββΌββββββββββββββββββββββββ
β β β
βΌ βΌ βΌ
ββββββββββββββββββββ ββββββββββββββββββββ ββββββββββββββββββββ
β Memory Plugin β β Persona Plugin β β Tools Plugins β
β conversation β β user profile β β MCP / RAG β
β tool memory β β speaker / face β β DeepResearch β
β avatar memory β β runtime state β β external APIs β
ββββββββββββββββββββ ββββββββββββββββββββ ββββββββββββββββββββ
ββββββββββββββββββββ ββββββββββββββββββββ ββββββββββββββββββββ
β Reflection Pluginβ β Planning Plugin β β Behavior Plugin β
β self-analysis β β tasks / plans β β response policy β
β adaptation β β reminders β β workflow rules β
ββββββββββββββββββββ ββββββββββββββββββββ ββββββββββββββββββββ
β
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββ
β UserPath Workspace β
β local data / cache / logs / memory / RAG β
β persona state / artifacts / user files β
ββββββββββββββββββββββββ¬ββββββββββββββββββββββββ
β
ββββββββββββββββββββββββ΄ββββββββββββββββββββββββ
β β
βΌ βΌ
ββββββββββββββββββββββββ ββββββββββββββββββββββββ
β Native Web / App β β Channel Adapters β
β voice / text / video β β WhatsApp / future β
β WebRTC / UI β β messaging platforms β
ββββββββββββββββββββββββ ββββββββββββββββββββββββ
AlphaAvatar now provides a browser-based realtime demo interface built on LiveKit.
π Try the Web Demo: https://www.alphaavatar.ai/demo
The Web Demo supports:
- ποΈ Real-time voice interaction
- π¬ Text chat with the Avatar
- π· Camera preview and video-ready interaction
- π Agent audio playback
- π Virtual character / avatar stage
- π§ Full plugin support, including Memory, Persona, RAG, MCP, and DeepResearch
- π Browser timezone metadata, enabling AlphaAvatar to understand local login time
The Web Demo is the recommended way to try AlphaAvatar with a full realtime multimodal experience.
Interact with AlphaAvatar directly inside messaging platforms.
Capabilities:
- π¬ Text-based conversation
- π€ Voice message interaction
- π§° Tool invocation via chat interface
π¦ Channel introduction: README
Make sure AlphaAvatar Agent is already running (see Quick Start above).
ENV_FILE=.env.dev sh examples/channels/start_whatsapp.shπ‘ The WhatsApp channel runs as an independent bridge process and connects to the Agent runtime.
A dedicated AlphaAvatar mobile application providing:
- ποΈ Real-time voice communication
- π Live2D / Virtual character visualization
- π§ Persistent memory & persona
Developers can immediately access AlphaAvatar via the LiveKit Playground.
π https://agents-playground.livekit.io/
After starting your AlphaAvatar server:
- Connect to your LiveKit instance
- Configure the Agent name in the Playground (must match
avatar_name, default:Assistant) to enable Explicit Dispatch. - Connect to the agent room
- Start testing real-time interaction
Supported capabilities:
- ποΈ Voice interaction
- π§ Memory extraction
- π RAG retrieval
- π§° MCP tool invocation
- π Virtual character display
π‘ AlphaAvatar is currently developer-first, with a Web Demo available for realtime interaction.
More user-facing web and mobile experiences are under active development.

