AlphaAvatar is an Omni-Avatar personal assistant framework designed to evolve into your intelligent personal butler.
Fully self-hostable and privacy-first: you can deploy AlphaAvatar locally or on your own infrastructure, with full control over your data, memory, and behavior.
It is built around a plugin-based Agent architecture, combining:
- Full-modality Memory
- Dynamic Persona understanding
- Self-improving Reflection
- Long-term Planning & Execution
- External Tool integrations (MCP / RAG / DeepResearch)
- Real-time Virtual Character (Avatar)
Together, these components enable AlphaAvatar to move beyond a traditional chatbot into a continuous, personalized, and proactive assistant system.
Our goal is to build an AI that continuously learns, remembers, and acts on behalf of the user: a true personal life manager rather than a passive responder.
With the combination of internal Agent plugins and external tools, AlphaAvatar can:
- Track and analyze personal metrics (health, fitness, sleep, study progress)
- Provide long-term insights and trend analysis
- Suggest improvements based on historical patterns
- Organize personal notes, documents, and knowledge
- Retrieve relevant information via RAG
- Build a personal knowledge base over time
- Schedule tasks and reminders
- Proactively notify based on context and priority
- Break down long-term goals into actionable steps
- Plan multi-step workflows (learning plans, projects, research)
- Call tools automatically to complete tasks
- Maintain consistency across long time horizons
- Understand user preferences, habits, and personality
- Provide highly personalized responses
- Maintain continuity across conversations and modalities
- Search, research, and summarize real-world information
- Integrate with tools (email, database, APIs, messaging apps)
- Act as a bridge between user intent and external systems
Unlike traditional AI assistants, AlphaAvatar is:
- Stateful: remembers and evolves with you
- Proactive: acts, not just responds
- Composable: fully plugin-driven architecture
- Omni-modal: supports text, voice, and visual interaction
- Extensible: easily integrates new tools and capabilities

AlphaAvatar is not just an assistant: it is your long-term AI companion and personal operating system.
| Plugin | Description |
|--------|-------------|
| Memory | Self-improving memory module for Omni-Avatar. |
| Persona | Automatic extraction and real-time matching of the user's full-modality persona. |
| Reflection | An optimizer for Omni-Avatar that automatically builds an internal knowledge base for avatars. |
| Planning | Plans over longer time frames so the agent's actions stay sequential and reliable. |
| Workflow | Controls AlphaAvatar's behavior logic and process flow. |
| Avatar | The real-time generated virtual character that visually represents the Avatar during interactions. |
| DeepResearch | Lets AlphaAvatar access the network and perform single-step or multi-step inference through a separate Agent service to find more accurate content. |
| RAG | Lets AlphaAvatar access documents and skills (user-uploaded, generated by the Reflection module, or fetched via URL) to obtain document-related information. |
| MCP | Lets AlphaAvatar access real-world external tools such as databases, email, and social media. |
| Sandbox | Provides a sandbox environment for interacting with the external world or with other agents, enabling multi-agent interaction and exploration. |
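To make the plugin-driven design concrete, here is a minimal sketch of how such plugins could compose. All names here (`AvatarPlugin`, `PluginRegistry`, `on_message`) are hypothetical illustrations, not AlphaAvatar's actual API:

```python
from typing import Dict, List

class AvatarPlugin:
    """Hypothetical base class each plugin (Memory, Persona, ...) extends."""
    name: str = "base"

    def on_message(self, text: str, context: dict) -> None:
        """Hook called on every user turn; plugins enrich the shared context."""

class MemoryPlugin(AvatarPlugin):
    name = "memory"

    def __init__(self) -> None:
        self.history: List[str] = []

    def on_message(self, text: str, context: dict) -> None:
        self.history.append(text)               # persist the turn
        context["memory"] = list(self.history)  # expose history to later plugins

class PersonaPlugin(AvatarPlugin):
    name = "persona"

    def on_message(self, text: str, context: dict) -> None:
        # Toy heuristic: infer one persona trait from the raw text.
        if "morning" in text.lower():
            context.setdefault("persona", {})["chronotype"] = "early riser"

class PluginRegistry:
    """Runs registered plugins in order for each incoming message."""

    def __init__(self) -> None:
        self._plugins: List[AvatarPlugin] = []

    def register(self, plugin: AvatarPlugin) -> None:
        self._plugins.append(plugin)

    def dispatch(self, text: str) -> dict:
        context: dict = {}
        for plugin in self._plugins:
            plugin.on_message(text, context)
        return context

registry = PluginRegistry()
registry.register(MemoryPlugin())
registry.register(PersonaPlugin())
ctx = registry.dispatch("I run every morning before work")
```

The point of the sketch is only the composition pattern: each capability is an isolated plugin contributing to a shared context, so new plugins can be added without touching the core loop.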
- [2026/03] Released AlphaAvatar 0.5.0: added the MCP plugin, enabling retrieval and concurrent invocation of MCP tools.
- Released AlphaAvatar 0.5.1: added WhatsApp channel support via the Baileys driver, connecting WhatsApp to the AlphaAvatar Agent.
- Released AlphaAvatar 0.5.2: added the AlphaAvatar Voice plugin with support for the Voice.ai TTS API, providing a wider variety of speaker options.
- [2026/02] Released AlphaAvatar 0.4.0: added RAG support via the RAG-Anything library and optimized the Memory and DeepResearch modules.
- Released AlphaAvatar 0.4.1: fixed Persona plugin bugs and added a new MCP plugin.
- [2026/01] Released AlphaAvatar 0.3.0: added DeepResearch support via the Tavily API.
- Released AlphaAvatar 0.3.1: added tool calls during user-assistant interactions to the Memory module.
- [2025/12] Released AlphaAvatar 0.2.0: added AIRI Live2D-based virtual character display.
- [2025/11] Released AlphaAvatar 0.1.0: added automatic memory extraction and automatic user persona extraction and matching.
Install the stable AlphaAvatar version from PyPI:

```shell
uv venv .my-env --python 3.11
source .my-env/bin/activate
pip install alpha-avatar-agents
```

Install the latest AlphaAvatar version from GitHub:

```shell
git clone --recurse-submodules https://github.com/AlphaAvatar/AlphaAvatar.git
cd AlphaAvatar
uv venv .venv --python 3.11
source .venv/bin/activate
uv sync --all-packages
```

Start your agent in dev mode to connect it to LiveKit and make it available from anywhere on the internet.
Step 1. Configure Environment Variables

```shell
cd AlphaAvatar
# Copy the template
cp .env.template .env.dev
```

Edit `.env.dev` and set the required environment variables.
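As a rough sketch (the authoritative variable list is in `.env.template`), a LiveKit-plus-OpenAI pipeline typically needs at least credentials like these:

```
# LiveKit connection (LiveKit Cloud or self-hosted server)
LIVEKIT_URL=wss://your-project.livekit.cloud
LIVEKIT_API_KEY=<your LiveKit API key>
LIVEKIT_API_SECRET=<your LiveKit API secret>

# LLM / voice provider used by the example pipeline configs
OPENAI_API_KEY=<your OpenAI API key>
```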
Step 2. Download Required Files

```shell
alphaavatar download-files
```

Step 3. Run the Agent
```shell
ENV_FILE=.env.dev alphaavatar dev examples/agent_configs/pipeline_openai_airi.yaml
# or
ENV_FILE=.env.dev alphaavatar dev examples/agent_configs/pipeline_openai_tools.yaml
```

To see more supported modes, please refer to the LiveKit docs.
To see more examples, please refer to the Examples README.
AlphaAvatar supports multiple Access Channels, allowing different types of users, from end users to developers, to interact with the system.
```
           AlphaAvatar Runtime
           ───────────────────
    ┌──────────────────────────────┐
    │         AgentSession         │
    │         AvatarEngine         │
    │  (LLM / Memory / RAG / MCP)  │
    └──────────────┬───────────────┘
                   │
            InputDispatcher
                   │
             InputEnvelope
                   │
    ┌──────────────┴───────────────┐
    │                              │
Channel Adapters             Native Inputs
(Ingress Layer)               (Web / App)
    │                              │
    ▼                              ▼
WhatsApp / WeChat / Slack   audio / text / video
    │                              │
    └──────────────┬───────────────┘
                   ▼
            OutputDispatcher
                   │
    ┌──────────────┴───────────────┐
    │                              │
Channel Egress              Native Output
(Messaging APIs)            (WebRTC / UI)
```
AlphaAvatar uses a Channel Adapter architecture to decouple runtime logic from communication channels.
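The decoupling can be illustrated with a toy dispatcher. The names `InputEnvelope` and `InputDispatcher` come from the diagram above, but the fields and methods here are assumptions for illustration, not the real implementation:

```python
from dataclasses import dataclass
from typing import Callable, Dict

@dataclass
class InputEnvelope:
    """Normalized message shape every ingress channel must produce."""
    channel: str    # e.g. "whatsapp", "web"
    user_id: str
    modality: str   # "text" | "audio" | "video"
    payload: str

class InputDispatcher:
    """Routes normalized envelopes to a single runtime handler."""

    def __init__(self, handler: Callable[[InputEnvelope], str]) -> None:
        self._handler = handler

    def submit(self, envelope: InputEnvelope) -> str:
        return self._handler(envelope)

def whatsapp_adapter(raw: Dict[str, str]) -> InputEnvelope:
    """Ingress adapter: maps a WhatsApp-style message into the envelope."""
    return InputEnvelope("whatsapp", raw["from"], "text", raw["body"])

def runtime_handler(env: InputEnvelope) -> str:
    # The runtime only ever sees envelopes, never channel-specific payloads.
    return f"[{env.channel}:{env.modality}] {env.payload}"

dispatcher = InputDispatcher(runtime_handler)
reply = dispatcher.submit(whatsapp_adapter({"from": "u1", "body": "hello"}))
```

Because each adapter converges on one envelope type, adding a new channel (Slack, WeChat, a native app) only requires a new adapter, with no change to the runtime.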
Browser-based interface for real-time interaction. This will become the official AlphaAvatar user interface.
- Real-time voice & multimodal communication
- Full plugin support (Memory / RAG / MCP / etc.)
- Virtual character display
Interact with AlphaAvatar directly inside messaging platforms.
Capabilities:
- Text-based conversation
- Voice message interaction
- Tool invocation via chat interface

Channel introduction: README
Make sure AlphaAvatar Agent is already running (see Quick Start above).
```shell
ENV_FILE=.env.dev sh examples/channels/start_whatsapp.sh
```

The WhatsApp channel runs as an independent bridge process and connects to the Agent runtime.
A dedicated AlphaAvatar mobile application providing:
- Real-time voice communication
- Live2D / virtual character visualization
- Persistent memory & persona
This is the primary access channel for AlphaAvatar today.
Developers can immediately access AlphaAvatar via the LiveKit Playground.
https://agents-playground.livekit.io/
After starting your AlphaAvatar server:
- Connect to your LiveKit instance
- Configure the Agent name in the Playground (must match `avatar_name`, default: `Assistant`) to enable Explicit Dispatch
- Connect to the agent room
- Start testing real-time interaction
Supported capabilities:
- Voice interaction
- Memory extraction
- RAG retrieval
- MCP tool invocation
- Virtual character display
AlphaAvatar is currently developer-first. Web and mobile experiences are actively under development.