2025-07-18 13:14:28 +08:00

475 lines
12 KiB
Markdown
Executable File
Raw Permalink Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
title: xiaozhi-esp32-server
description: 基于ESP32的小智开源服务端轻量级且高效的语音交互服务
---
# xiaozhi-esp32-server
<div class="project-header">
<div class="project-logo">
<img src="./images/logo.png" alt="xiaozhi-esp32-server Logo" onerror="this.src='/py-xiaozhi/images/logo.png'; this.onerror=null;">
</div>
<div class="project-badges">
<span class="badge platform">ESP32</span>
<span class="badge language">Python</span>
<span class="badge status">活跃开发中</span>
</div>
</div>
<div class="project-intro">
<p>xiaozhi-esp32-server是为开源智能硬件项目<a href="https://github.com/78/xiaozhi-esp32" target="_blank">xiaozhi-esp32</a>提供的后端服务,根据<a href="https://ccnphfhqs21z.feishu.cn/wiki/M0XiwldO9iJwHikpXD5cEx71nKh" target="_blank">小智通信协议</a>使用Python实现帮助您快速搭建小智服务器。</p>
</div>
## 适用人群
本项目需要配合ESP32硬件设备使用。如果您已经购买了ESP32相关硬件且成功对接过虾哥部署的后端服务并希望独立搭建自己的`xiaozhi-esp32`后端服务,那么本项目非常适合您。
<div class="warning-box">
<h3>⚠️ 重要提示</h3>
<ol>
<li>本项目为开源软件与对接的任何第三方API服务商包括但不限于语音识别、大模型、语音合成等平台均不存在商业合作关系不为其服务质量及资金安全提供任何形式的担保。建议使用者优先选择持有相关业务牌照的服务商并仔细阅读其服务协议及隐私政策。本软件不托管任何账户密钥、不参与资金流转、不承担充值资金损失风险。</li>
<li>本项目成立时间较短,还未通过网络安全测评,请勿在生产环境中使用。如果您在公网环境中部署学习本项目,请务必在配置文件<code>config.yaml</code>中开启防护。</li>
</ol>
</div>
## 核心特性
<div class="features-container">
<div class="feature-item">
<div class="feature-icon">🔄</div>
<h3>通信协议</h3>
<p>基于<code>xiaozhi-esp32</code>协议通过WebSocket实现数据交互</p>
</div>
<div class="feature-item">
<div class="feature-icon">💬</div>
<h3>对话交互</h3>
<p>支持唤醒对话、手动对话及实时打断,长时间无对话时自动休眠</p>
</div>
<div class="feature-item">
<div class="feature-icon">🧠</div>
<h3>意图识别</h3>
<p>支持使用LLM意图识别、function call函数调用减少硬编码意图判断</p>
</div>
<div class="feature-item">
<div class="feature-icon">🌐</div>
<h3>多语言识别</h3>
<p>支持国语、粤语、英语、日语、韩语默认使用FunASR</p>
</div>
<div class="feature-item">
<div class="feature-icon">🤖</div>
<h3>LLM模块</h3>
<p>支持灵活切换LLM模块默认使用ChatGLMLLM也可选用阿里百炼、DeepSeek、Ollama等</p>
</div>
<div class="feature-item">
<div class="feature-icon">🔊</div>
<h3>TTS模块</h3>
<p>支持EdgeTTS默认、火山引擎豆包TTS等多种TTS接口满足语音合成需求</p>
</div>
<div class="feature-item">
<div class="feature-icon">📝</div>
<h3>记忆功能</h3>
<p>支持超长记忆、本地总结记忆、无记忆三种模式,满足不同场景需求</p>
</div>
<div class="feature-item">
<div class="feature-icon">🏠</div>
<h3>IOT功能</h3>
<p>支持管理注册设备IOT功能支持基于对话上下文语境下的智能物联网控制</p>
</div>
<div class="feature-item">
<div class="feature-icon">🖥️</div>
<h3>智控台</h3>
<p>提供Web管理界面支持智能体管理、用户管理、系统配置等功能</p>
</div>
</div>
## 部署方式
本项目提供两种部署方式,请根据您的具体需求选择:
<div class="deployment-table">
<table>
<thead>
<tr>
<th>部署方式</th>
<th>特点</th>
<th>适用场景</th>
</tr>
</thead>
<tbody>
<tr>
<td><strong>最简化安装</strong></td>
<td>智能对话、IOT功能数据存储在配置文件</td>
<td>低配置环境,无需数据库</td>
</tr>
<tr>
<td><strong>全模块安装</strong></td>
<td>智能对话、IOT、OTA、智控台数据存储在数据库</td>
<td>完整功能体验</td>
</tr>
</tbody>
</table>
</div>
详细部署文档请参考:
- [Docker部署文档](https://github.com/xinnan-tech/xiaozhi-esp32-server/blob/main/docs/Deployment.md)
- [源码部署文档](https://github.com/xinnan-tech/xiaozhi-esp32-server/blob/main/docs/Deployment_all.md)
## 支持平台列表
xiaozhi-esp32-server支持丰富的第三方平台和组件
### LLM 语言模型
<div class="platform-item">
<h4>接口调用</h4>
<p><strong>支持平台:</strong>阿里百炼、火山引擎豆包、深度求索、智谱ChatGLM、Gemini、Ollama、Dify、Fastgpt、Coze</p>
<p><strong>免费平台:</strong>智谱ChatGLM、Gemini</p>
<p><em>实际上任何支持openai接口调用的LLM均可接入使用</em></p>
</div>
### TTS 语音合成
<div class="platform-item">
<h4>接口调用</h4>
<p><strong>支持平台:</strong>EdgeTTS、火山引擎豆包TTS、腾讯云、阿里云TTS、CosyVoiceSiliconflow、TTS302AI、CozeCnTTS、GizwitsTTS、ACGNTTS、OpenAITTS</p>
<p><strong>免费平台:</strong>EdgeTTS、CosyVoiceSiliconflow(部分)</p>
<h4>本地服务</h4>
<p><strong>支持平台:</strong>FishSpeech、GPT_SOVITS_V2、GPT_SOVITS_V3、MinimaxTTS</p>
<p><strong>免费平台:</strong>FishSpeech、GPT_SOVITS_V2、GPT_SOVITS_V3、MinimaxTTS</p>
</div>
### ASR 语音识别
<div class="platform-item">
<h4>接口调用</h4>
<p><strong>支持平台:</strong>DoubaoASR</p>
<h4>本地服务</h4>
<p><strong>支持平台:</strong>FunASR、SherpaASR</p>
<p><strong>免费平台:</strong>FunASR、SherpaASR</p>
</div>
### 更多组件
- **VAD语音活动检测**支持SileroVAD本地免费使用
- **记忆存储**支持mem0ai1000次/月额度、mem_local_short本地总结免费
- **意图识别**支持intent_llm通过大模型识别意图、function_call通过大模型函数调用完成意图
## 参与贡献
xiaozhi-esp32-server是一个活跃的开源项目欢迎贡献代码或提交问题反馈
- [GitHub仓库](https://github.com/xinnan-tech/xiaozhi-esp32-server)
- [问题反馈](https://github.com/xinnan-tech/xiaozhi-esp32-server/issues)
- [致开发者的公开信](https://github.com/xinnan-tech/xiaozhi-esp32-server/blob/main/docs/contributor_open_letter.md)
<style>
.project-header {
display: flex;
align-items: center;
margin-bottom: 2rem;
}
.project-logo {
width: 100px;
height: 100px;
margin-right: 1.5rem;
}
.project-logo img {
width: 100%;
height: 100%;
object-fit: contain;
}
.project-badges {
display: flex;
flex-wrap: wrap;
gap: 0.5rem;
}
.badge {
display: inline-block;
padding: 0.25rem 0.75rem;
border-radius: 1rem;
font-size: 0.85rem;
font-weight: 500;
}
.badge.platform {
background-color: var(--vp-c-brand-soft);
color: var(--vp-c-brand-dark);
}
.badge.language {
background-color: rgba(59, 130, 246, 0.2);
color: rgb(59, 130, 246);
}
.badge.status {
background-color: rgba(16, 185, 129, 0.2);
color: rgb(16, 185, 129);
}
.project-intro {
font-size: 1.1rem;
line-height: 1.6;
margin-bottom: 2rem;
padding: 1.5rem;
background-color: var(--vp-c-bg-soft);
border-radius: 8px;
border-left: 4px solid var(--vp-c-brand);
}
.warning-box {
margin: 2rem 0;
padding: 1.5rem;
background-color: rgba(234, 179, 8, 0.1);
border-left: 4px solid rgba(234, 179, 8, 0.8);
border-radius: 8px;
}
.warning-box h3 {
color: rgb(234, 179, 8);
margin-top: 0;
}
.features-container {
display: grid;
grid-template-columns: repeat(auto-fill, minmax(300px, 1fr));
gap: 1.5rem;
margin: 2rem 0;
}
.feature-item {
background-color: var(--vp-c-bg-soft);
border-radius: 8px;
padding: 1.5rem;
transition: transform 0.3s ease, box-shadow 0.3s ease;
border: 1px solid var(--vp-c-divider);
}
.feature-item:hover {
transform: translateY(-5px);
box-shadow: 0 5px 15px rgba(0, 0, 0, 0.1);
}
.feature-icon {
font-size: 2rem;
margin-bottom: 1rem;
}
.feature-item h3 {
color: var(--vp-c-brand);
margin-top: 0;
margin-bottom: 0.5rem;
}
.deployment-table {
margin: 2rem 0;
overflow-x: auto;
}
.deployment-table table {
width: 100%;
border-collapse: collapse;
}
.deployment-table th,
.deployment-table td {
padding: 1rem;
text-align: left;
border-bottom: 1px solid var(--vp-c-divider);
}
.deployment-table th {
background-color: var(--vp-c-bg-soft);
font-weight: 500;
}
.platform-item {
margin: 1.5rem 0;
padding: 1.5rem;
background-color: var(--vp-c-bg-soft);
border-radius: 8px;
}
.platform-item h4 {
color: var(--vp-c-brand);
margin-top: 0;
margin-bottom: 1rem;
}
.platform-item p {
margin: 0.5rem 0;
}
.demo-videos {
margin: 2rem 0;
}
.video-grid {
display: grid;
grid-template-columns: repeat(auto-fill, minmax(280px, 1fr));
gap: 1.5rem;
}
.video-item {
display: block;
text-decoration: none;
color: inherit;
border-radius: 8px;
overflow: hidden;
transition: transform 0.3s ease;
background-color: var(--vp-c-bg-soft);
}
.video-item:hover {
transform: translateY(-5px);
}
.video-thumbnail {
width: 100%;
aspect-ratio: 16 / 9;
overflow: hidden;
}
.video-thumbnail img {
width: 100%;
height: 100%;
object-fit: cover;
transition: transform 0.3s ease;
}
.video-item:hover .video-thumbnail img {
transform: scale(1.05);
}
.video-title {
padding: 1rem;
font-weight: 500;
}
.demo-more {
text-align: center;
margin-top: 1.5rem;
}
.demo-more a {
display: inline-block;
padding: 0.5rem 1.5rem;
background-color: var(--vp-c-brand);
color: white;
border-radius: 4px;
text-decoration: none;
transition: background-color 0.3s ease;
}
.demo-more a:hover {
background-color: var(--vp-c-brand-dark);
}
.related-projects {
display: grid;
grid-template-columns: repeat(auto-fill, minmax(300px, 1fr));
gap: 1.5rem;
margin: 2rem 0;
}
.project-card {
background-color: var(--vp-c-bg-soft);
border-radius: 8px;
padding: 1.5rem;
display: flex;
flex-direction: column;
border: 1px solid var(--vp-c-divider);
transition: transform 0.3s ease, box-shadow 0.3s ease;
}
.project-card:hover {
transform: translateY(-5px);
box-shadow: 0 5px 15px rgba(0, 0, 0, 0.1);
}
.project-card h3 {
color: var(--vp-c-brand);
margin-top: 0;
margin-bottom: 1rem;
}
.project-link {
margin-top: auto;
display: inline-block;
padding: 0.5rem 1rem;
background-color: var(--vp-c-brand);
color: white;
text-decoration: none;
border-radius: 4px;
text-align: center;
transition: background-color 0.3s ease;
}
.project-link:hover {
background-color: var(--vp-c-brand-dark);
}
.contributors {
display: grid;
grid-template-columns: repeat(auto-fill, minmax(300px, 1fr));
gap: 2rem;
margin: 2rem 0;
}
.contributor {
background-color: var(--vp-c-bg-soft);
border-radius: 8px;
padding: 1.5rem;
text-align: center;
border: 1px solid var(--vp-c-divider);
}
.contributor img {
width: 120px;
height: 60px;
object-fit: contain;
margin-bottom: 1rem;
}
.contributor h4 {
color: var(--vp-c-brand);
margin-top: 0;
margin-bottom: 0.5rem;
}
@media (max-width: 768px) {
.project-header {
flex-direction: column;
align-items: flex-start;
}
.project-logo {
margin-bottom: 1rem;
}
.contributors {
grid-template-columns: 1fr;
}
.related-projects {
grid-template-columns: 1fr;
}
.features-container {
grid-template-columns: 1fr;
}
}
</style>