Go to file

Dong Ning d6b1414967 camera 优化：在原有的RGB565处理下，容易超时改为JPEG格式 (#1029 )

* camera 优化

feat(camera): 修改摄像头配置为JPEG格式并优化图像处理逻辑

将摄像头输出格式从RGB565改为JPEG以提高传输效率，同时调整JPEG质量为10
重构预览图像处理逻辑，支持直接处理JPEG格式并自动转换为RGB565
优化Explain方法中的JPEG队列处理，减少内存分配和拷贝操作

* 修复代码缩进

调整代码缩进格式以提升可读性

* fix(esp32_camera): 修复RGB565格式预览图像的字节序问题

添加字节交换处理，将大端序转换为小端序，确保预览图像显示正确

* 使用旧的处理方式

* refactor(esp32_camera): 移除preview_buffer_直接使用preview_image_.data

2025-08-14 22:11:15 +08:00

.github

fix: ci idf.py not found (#1030 )

2025-08-05 22:18:28 +08:00

docs

添加了一个声波配网的HTML界面 (#996 )

2025-08-02 14:13:35 +08:00

main

camera 优化：在原有的RGB565处理下，容易超时改为JPEG格式 (#1029 )

2025-08-14 22:11:15 +08:00

partitions/v1

feat: add emote_gfx UI for EchoEar (#1022 )

2025-08-01 18:07:13 +08:00

scripts

feat: Add locales with OGG sounds (#1057 )

2025-08-12 18:41:00 +08:00

.clangd

add labplus-Ledong-v2 and labplus-mpython-v3 board: (#897 )

2025-07-17 00:12:53 +08:00

.gitignore

feat: add emote_gfx UI for EchoEar (#1022 )

2025-08-01 18:07:13 +08:00

CMakeLists.txt

Bump to 1.8.7

2025-08-13 11:50:27 +08:00

LICENSE

Update LICENSE

2025-07-22 10:06:11 +08:00

README_en.md

v1.8.0: Audio 代码重构与低功耗优化 (#943 )

2025-07-19 22:45:22 +08:00

README_ja.md

v1.8.0: Audio 代码重构与低功耗优化 (#943 )

2025-07-19 22:45:22 +08:00

README.md

v1.8.0: Audio 代码重构与低功耗优化 (#943 )

2025-07-19 22:45:22 +08:00

sdkconfig.defaults

Fix custom wakeword for dual mic (#1018 )

2025-08-01 13:30:17 +08:00

sdkconfig.defaults.esp32

添加esp32支持唤醒词 (#782 )

2025-06-09 04:43:46 +08:00

sdkconfig.defaults.esp32c3

v1.7.3: 参考ESP-HI，为所有C3板子增加10多KB可用SRAM

2025-06-10 01:33:20 +08:00

sdkconfig.defaults.esp32c6

添加微雪电子esp32-c6-lcd-1.69、esp32-c6-Touch-lcd-1.69的支持 (#816 )

2025-06-13 18:20:22 +08:00

sdkconfig.defaults.esp32p4

fix p4 compiling errors

2025-06-08 02:39:53 +08:00

sdkconfig.defaults.esp32s3

在S3芯片上使用更多的PSRAM，解决立创开发板拍照时可能出现内存不足的问题

2025-06-24 04:59:00 +08:00

README_en.md

An MCP-based Chatbot

(English | 中文 | 日本語)

Video

👉 Human: Give AI a camera vs AI: Instantly finds out the owner hasn't washed hair for three days【bilibili】

👉 Handcraft your AI girlfriend, beginner's guide【bilibili】

Introduction

This is an open-source ESP32 project, released under the MIT license, allowing anyone to use it for free, including for commercial purposes.

We hope this project helps everyone understand AI hardware development and apply rapidly evolving large language models to real hardware devices.

If you have any ideas or suggestions, please feel free to raise Issues or join the QQ group: 1011329060

Control Everything with MCP

As a voice interaction entry, the XiaoZhi AI chatbot leverages the AI capabilities of large models like Qwen / DeepSeek, and achieves multi-terminal control via the MCP protocol.

Features Implemented

Wi-Fi / ML307 Cat.1 4G
Offline voice wake-up ESP-SR
Supports two communication protocols (Websocket or MQTT+UDP)
Uses OPUS audio codec
Voice interaction based on streaming ASR + LLM + TTS architecture
Speaker recognition, identifies the current speaker 3D Speaker
OLED / LCD display, supports emoji display
Battery display and power management
Multi-language support (Chinese, English, Japanese)
Supports ESP32-C3, ESP32-S3, ESP32-P4 chip platforms
Device-side MCP for device control (Speaker, LED, Servo, GPIO, etc.)
Cloud-side MCP to extend large model capabilities (smart home control, PC desktop operation, knowledge search, email, etc.)

Hardware

Breadboard DIY Practice

See the Feishu document tutorial:

👉 "XiaoZhi AI Chatbot Encyclopedia"

Breadboard demo:

Supports 70+ Open Source Hardware (Partial List)

Software

Firmware Flashing

For beginners, it is recommended to use the firmware that can be flashed without setting up a development environment.

The firmware connects to the official xiaozhi.me server by default. Personal users can register an account to use the Qwen real-time model for free.

👉 Beginner's Firmware Flashing Guide

Development Environment

Cursor or VSCode
Install ESP-IDF plugin, select SDK version 5.4 or above
Linux is better than Windows for faster compilation and fewer driver issues
This project uses Google C++ code style, please ensure compliance when submitting code

Developer Documentation

Custom Board Guide - Learn how to create custom boards for XiaoZhi AI
MCP Protocol IoT Control Usage - Learn how to control IoT devices via MCP protocol
MCP Protocol Interaction Flow - Device-side MCP protocol implementation
A detailed WebSocket communication protocol document

Large Model Configuration

If you already have a XiaoZhi AI chatbot device and have connected to the official server, you can log in to the xiaozhi.me console for configuration.

👉 Backend Operation Video Tutorial (Old Interface)

For server deployment on personal computers, refer to the following open-source projects:

xinnan-tech/xiaozhi-esp32-server Python server
joey-zhou/xiaozhi-esp32-server-java Java server
AnimeAIChat/xiaozhi-server-go Golang server

Other client projects using the XiaoZhi communication protocol:

huangjunsen0406/py-xiaozhi Python client
TOM88812/xiaozhi-android-client Android client

Star History

Languages

C++ 74.8%

C 15.9%

Python 7.8%

CMake 1.3%

HTML 0.2%

README_en.md

An MCP-based Chatbot

Video

Introduction

Control Everything with MCP

Features Implemented

Hardware

Breadboard DIY Practice

Supports 70+ Open Source Hardware (Partial List)

Software

Firmware Flashing

Development Environment

Developer Documentation

Large Model Configuration

Related Open Source Projects

Star History