TEN-framework/ten-framework

GitHub: TEN-framework/ten-framework

一个开源的实时多模态对话 AI 框架,提供完整的语音交互开发工具链和丰富的 Agent 示例。

Stars: 10185 | Forks: 1219

![Image](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/831a60992d002728.jpg) [![TEN Releases](https://img.shields.io/github/v/release/ten-framework/ten-framework?color=369eff&labelColor=gray&logo=github&style=flat-square)][ten-releases] [![Coverage Status](https://coveralls.io/repos/github/TEN-framework/ten-framework/badge.svg?branch=main)][coverage] [![Release Date](https://img.shields.io/github/release-date/ten-framework/ten-framework?labelColor=gray&style=flat-square)][ten-releases] [![Commits](https://img.shields.io/github/commit-activity/m/TEN-framework/ten-framework?labelColor=gray&color=pink)][commit-activity] [![Issues closed](https://img.shields.io/github/issues-search?query=repo%3ATEN-framework%2Ften-framework%20is%3Aclosed&label=issues%20closed&labelColor=gray&color=green)][issues-closed] [![Contributors](https://img.shields.io/github/contributors/ten-framework/ten-framework?color=c4f042&labelColor=gray&style=flat-square)][contributors] [![GitHub license](https://img.shields.io/badge/License-Apache_2.0_with_certain_conditions-blue.svg?labelColor=%20%23155EEF&color=%20%23528bff)][license] [![Ask DeepWiki](https://deepwiki.com/badge.svg)][deepwiki] [![ReadmeX](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/eaaad06c8a002729.svg)][readmex] [![README in English](https://img.shields.io/badge/English-lightgrey)][lang-en-readme] [![简体中文操作指南](https://img.shields.io/badge/简体中文-lightgrey)][lang-zh-readme] [![日本語のREADME](https://img.shields.io/badge/日本語-lightgrey)][lang-jp-readme] [![README in 한국어](https://img.shields.io/badge/한국어-lightgrey)][lang-kr-readme] [![README en Español](https://img.shields.io/badge/Español-lightgrey)][lang-es-readme] [![README en Français](https://img.shields.io/badge/Français-lightgrey)][lang-fr-readme] [![README in Italiano](https://img.shields.io/badge/Italiano-lightgrey)][lang-it-readme] [![TEN-framework%2Ften_framework | Trendshift](https://trendshift.io/api/badge/repositories/11978)][trendshift] [Official Site][official-site] • [Documentation][documentation] • [Blog][blog]

Table of Contents
- [Welcome to TEN][welcome-to-ten] - [Agent Examples][agent-examples-section] - [Quick Start with Agent Examples][quick-start] - [Localhost][localhost-section] - [Codespaces][codespaces-section] - [Agent Examples Self-Hosting][agent-examples-self-hosting] - [Deploying with Docker][deploying-with-docker] - [Deploying with other cloud services][deploying-with-other-cloud-services] - [Stay Tuned][stay-tuned] - [TEN Ecosystem][ten-ecosystem-anchor] - [Questions][questions] - [Contributing][contributing] - [Code Contributors][code-contributors] - [Contribution Guidelines][contribution-guidelines] - [License][license-section]
## 欢迎使用 TEN TEN is an open-source framework for real-time multimodal conversational AI. [TEN Ecosystem][ten-ecosystem-anchor] includes [TEN Framework][ten-framework], [Agent Examples][agent-examples-repo], [VAD][ten-vad], [Turn Detection][ten-turn-detection] and [Portal][ten-portal].
| Community Channel | Purpose | | ---------------- | ------- | | [![Follow on X](https://img.shields.io/twitter/follow/TenFramework?logo=X&color=%20%23f5f5f5)][follow-on-x] | Follow TEN Framework on X for updates and announcements | | [![Discord TEN Community](https://img.shields.io/badge/Discord-Join%20TEN%20Community-5865F2?style=flat&logo=discord&logoColor=white)][discord-invite] | Join our Discord community to connect with developers | | [![Follow on LinkedIn](https://custom-icon-badges.demolab.com/badge/LinkedIn-TEN_Framework-0A66C2?logo=linkedin-white&logoColor=fff)][linkedin] | Follow TEN Framework on LinkedIn for updates and announcements | | [![Hugging Face Space](https://img.shields.io/badge/Hugging%20Face-TEN%20Framework-yellow?style=flat&logo=huggingface)][hugging-face] | Join our Hugging Face community to explore our spaces and models | | [![WeChat](https://img.shields.io/badge/TEN_Framework-WeChat_Group-%2307C160?logo=wechat&labelColor=darkgreen&color=gray)][wechat-discussion] | Join our WeChat group for Chinese community discussions |
## Agent 示例
![Image](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/a9c0fb474c002732.jpg) Multi-Purpose Voice Assistant — This low-latency, high-quality real-time assistant supports both RTC and [WebSocket][websocket-example] connections, and you can extend it with [Memory][memory-example], [VAD][voice-assistant-vad-example], [Turn Detection][voice-assistant-turn-detection-example], and other extensions. See the [Example code][voice-assistant-example] for more details.
![divider](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/121749c5a2002734.jpg) ![divider](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/a871724965002736.jpg)
![Image](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/3729334b10002740.jpg) Doodler — A doodle board that turns spoken or typed prompts into simple hand-drawn sketches, complete with a crayon palette and real-time drawing. [Example code][doodler-example]
![divider](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/121749c5a2002734.jpg) ![divider](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/a871724965002736.jpg)
![Image](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/4c10089063002742.jpg) Speaker Diarization — Real-time diarization that detects and labels speakers, the Who Likes What game shows an interactive use case. [Example code][speechmatics-diarization-example]
![divider](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/121749c5a2002734.jpg) ![divider](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/a871724965002736.jpg)
![Image](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/b1a58d632a002745.jpg) Lip Sync Avatars — Works with multiple avatar vendors, the main character features Kei, an anime character with MotionSync-powered lip sync, and also supports realistic avatars from Trulience, HeyGen, and Tavus. See the [Example code][voice-assistant-live2d-example] for different Live2D characters.
![divider](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/121749c5a2002734.jpg) ![divider](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/a871724965002736.jpg)
![Image](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/5c889ac9ff002752.jpg) SIP Call — SIP extension that enables phone calls powered by TEN. [Example code][voice-assistant-sip-example]
![divider](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/121749c5a2002734.jpg) ![divider](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/a871724965002736.jpg)
![Image](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/8c76f9b35c002755.jpg) Transcription — A transcription tool that transcribes audio to text. [Example code][transcription-example]
![divider](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/121749c5a2002734.jpg) ![divider](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/a871724965002736.jpg)
![Image](https://github.com/user-attachments/assets/3d60f1ff-0f82-4fe7-b5c2-ac03d284f60c) ESP32-S3 Korvo V3 — Runs TEN agent example on the Espressif ESP32-S3 Korvo V3 development board to integrate LLM-powered communication with hardware. See the [integration guide][esp32-guide] for more details.
[![](https://img.shields.io/badge/-Back_to_top-gray?style=flat-square)][readme-top]
## Agent 示例快速入门 ### 本地环境 #### 步骤 ⓵ - 先决条件 | Category | Requirements | | --- | --- | | **Keys** | • Agora [App ID][agora-app-id] and [App Certificate][agora-app-certificate]
• [OpenAI][openai-api] API key
• [Deepgram][deepgram] ASR
• [ElevenLabs][elevenlabs] TTS | | **Installation** | • [Docker][docker] / [Docker Compose][docker-compose]
• [Node.js (LTS) v18][nodejs] | | **Minimum System Requirements** | • CPU >= 2 cores
• RAM >= 4 GB |
![divider](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/121749c5a2002734.jpg) ![divider](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/a871724965002736.jpg)
标签:Agent 示例, ASR, DLL 劫持, DNS解析, Docker, LLM, MITM代理, NLP, TEN Framework, TTS, Unmanaged PE, 云服务, 人工智能, 人机交互, 大语言模型, 安全防御评估, 实时多模态, 实时通信, 对话式AI, 开发框架, 开源项目, 日志审计, 深度学习, 用户模式Hook绕过, 语音AI, 语音交互, 语音识别, 请求拦截, 逆向工具