TEN-framework/ten-framework
GitHub: TEN-framework/ten-framework
一个开源的实时多模态对话 AI 框架,提供完整的语音交互开发工具链和丰富的 Agent 示例。
Stars: 10185 | Forks: 1219

[][ten-releases]
[][coverage]
[][ten-releases]
[][commit-activity]
[][issues-closed]
[][contributors]
[][license]
[][deepwiki]
[][readmex]
[][lang-en-readme]
[][lang-zh-readme]
[][lang-jp-readme]
[][lang-kr-readme]
[][lang-es-readme]
[][lang-fr-readme]
[][lang-it-readme]
[][trendshift]
[Official Site][official-site] •
[Documentation][documentation] •
[Blog][blog]
Table of Contents
- [Welcome to TEN][welcome-to-ten] - [Agent Examples][agent-examples-section] - [Quick Start with Agent Examples][quick-start] - [Localhost][localhost-section] - [Codespaces][codespaces-section] - [Agent Examples Self-Hosting][agent-examples-self-hosting] - [Deploying with Docker][deploying-with-docker] - [Deploying with other cloud services][deploying-with-other-cloud-services] - [Stay Tuned][stay-tuned] - [TEN Ecosystem][ten-ecosystem-anchor] - [Questions][questions] - [Contributing][contributing] - [Code Contributors][code-contributors] - [Contribution Guidelines][contribution-guidelines] - [License][license-section]
| Community Channel | Purpose | | ---------------- | ------- | | [][follow-on-x] | Follow TEN Framework on X for updates and announcements | | [][discord-invite] | Join our Discord community to connect with developers | | [][linkedin] | Follow TEN Framework on LinkedIn for updates and announcements | | [][hugging-face] | Join our Hugging Face community to explore our spaces and models | | [][wechat-discussion] | Join our WeChat group for Chinese community discussions |
## Agent 示例
 Multi-Purpose Voice Assistant — This low-latency, high-quality real-time assistant supports both RTC and [WebSocket][websocket-example] connections, and you can extend it with [Memory][memory-example], [VAD][voice-assistant-vad-example], [Turn Detection][voice-assistant-turn-detection-example], and other extensions. See the [Example code][voice-assistant-example] for more details.
 
 Doodler — A doodle board that turns spoken or typed prompts into simple hand-drawn sketches, complete with a crayon palette and real-time drawing. [Example code][doodler-example]
 
 Speaker Diarization — Real-time diarization that detects and labels speakers, the Who Likes What game shows an interactive use case. [Example code][speechmatics-diarization-example]
 
 Lip Sync Avatars — Works with multiple avatar vendors, the main character features Kei, an anime character with MotionSync-powered lip sync, and also supports realistic avatars from Trulience, HeyGen, and Tavus. See the [Example code][voice-assistant-live2d-example] for different Live2D characters.
 
 SIP Call — SIP extension that enables phone calls powered by TEN. [Example code][voice-assistant-sip-example]
 
 Transcription — A transcription tool that transcribes audio to text. [Example code][transcription-example]
 
 ESP32-S3 Korvo V3 — Runs TEN agent example on the Espressif ESP32-S3 Korvo V3 development board to integrate LLM-powered communication with hardware. See the [integration guide][esp32-guide] for more details.
[][readme-top]
## Agent 示例快速入门
### 本地环境
#### 步骤 ⓵ - 先决条件
| Category | Requirements |
| --- | --- |
| **Keys** | • Agora [App ID][agora-app-id] and [App Certificate][agora-app-certificate]• [OpenAI][openai-api] API key
• [Deepgram][deepgram] ASR
• [ElevenLabs][elevenlabs] TTS | | **Installation** | • [Docker][docker] / [Docker Compose][docker-compose]
• [Node.js (LTS) v18][nodejs] | | **Minimum System Requirements** | • CPU >= 2 cores
• RAM >= 4 GB |
 
标签:Agent 示例, ASR, DLL 劫持, DNS解析, Docker, LLM, MITM代理, NLP, TEN Framework, TTS, Unmanaged PE, 云服务, 人工智能, 人机交互, 大语言模型, 安全防御评估, 实时多模态, 实时通信, 对话式AI, 开发框架, 开源项目, 日志审计, 深度学习, 用户模式Hook绕过, 语音AI, 语音交互, 语音识别, 请求拦截, 逆向工具