mirror of https://github.com/78/xiaozhi-esp32.git synced 2026-02-27 22:36:35 +00:00

Go to file

Xiaoxia 71c86ab62b Fix setupui (#1777 )

* Enhance GitHub Actions artifact download script

- Updated the output directory structure to save downloaded files in a version-specific subdirectory (releases/<version>).
- Added a new function to determine the default releases directory path relative to the script's location.
- Improved artifact renaming logic to handle known extensions more robustly and ensure compatibility with filenames containing dots.

* Refactor UI setup in ElectronEmojiDisplay and OttoEmojiDisplay classes

- Moved SetupChatLabel call in ElectronEmojiDisplay to ensure it is executed after the parent UI is initialized, preventing potential issues with container validity.
- Updated SetupUI in OttoEmojiDisplay to release the display lock before calling SetEmotion, avoiding deadlock scenarios during UI setup.

* Add multiline chat message support in display configuration

- Introduced a new Kconfig option to enable multiline chat message display in the default mode.
- Updated the LCD display setup to accommodate a dynamic height bottom bar for multiline messages.
- Modified the configuration files for the waveshare-esp32-s3-epaper-1.54 board to include the new chat message setting.

* Update font and emoji settings for Magiclick boards; enhance bottom bar visibility logic in LCD display

- Changed the default text and emoji fonts for Magiclick S3 2P4 and S3 2P5 boards to Noto fonts.
- Improved bottom bar visibility logic in LcdDisplay to hide when there is no content, ensuring a cleaner UI experience.

2026-02-19 20:10:27 +08:00

.github

Bump project version to 2.2.3 and fix release.py (#1771 )

2026-02-17 15:54:22 +08:00

docs

Change the Bluetooth device name to "Xiaozhi-Blufi" in blufi provisioning. (#1701 )

2026-01-28 01:51:42 +08:00

main

Fix setupui (#1777 )

2026-02-19 20:10:27 +08:00

partitions

Add support for both hardware versions of waveshare-s3-epaper-1.54 (#1583 )

2026-02-19 16:52:47 +08:00

scripts

Fix setupui (#1777 )

2026-02-19 20:10:27 +08:00

.clang-format

Add clang-format (#1694 )

2026-01-26 20:42:31 +08:00

.gitignore

Update esp-ml307 component version to 3.6.2 to support UART DMA (#1724 )

2026-02-02 09:53:06 +08:00

CMakeLists.txt

Bump project version to 2.2.3 and fix release.py (#1771 )

2026-02-17 15:54:22 +08:00

LICENSE

Update LICENSE

2025-07-22 10:06:11 +08:00

README_ja.md

Fix RNDIS board and enhance camera initialization (#1702 )

2026-01-28 16:11:26 +08:00

README_zh.md

Fix RNDIS board and enhance camera initialization (#1702 )

2026-01-28 16:11:26 +08:00

README.md

Fix RNDIS board and enhance camera initialization (#1702 )

2026-01-28 16:11:26 +08:00

sdkconfig.defaults

Fix lichuang-dev camera (#1290 )

2025-10-14 20:44:44 +08:00

sdkconfig.defaults.esp32

Detect wake word model from index.json (#1211 )

2025-09-17 08:31:51 +08:00

sdkconfig.defaults.esp32c3

Switch to 2.0 branch (#1152 )

2025-09-04 15:41:28 +08:00

sdkconfig.defaults.esp32c5

feat: add esp-spot c5 (#1462 )

2025-11-20 15:52:49 +08:00

sdkconfig.defaults.esp32c6

Switch to 2.0 branch (#1152 )

2025-09-04 15:41:28 +08:00

sdkconfig.defaults.esp32p4

feat: support JPEG input (#1455 )

2025-11-18 20:34:22 +08:00

sdkconfig.defaults.esp32s3

Enhance memory management in asset download and OTA processes by repl… (#1716 )

2026-01-31 22:58:08 +08:00

README.md

An MCP-based Chatbot

(English | 中文 | 日本語)

Introduction

👉 Human: Give AI a camera vs AI: Instantly finds out the owner hasn't washed hair for three days【bilibili】

👉 Handcraft your AI girlfriend, beginner's guide【bilibili】

As a voice interaction entry, the XiaoZhi AI chatbot leverages the AI capabilities of large models like Qwen / DeepSeek, and achieves multi-terminal control via the MCP protocol.

Version Notes

The current v2 version is incompatible with the v1 partition table, so it is not possible to upgrade from v1 to v2 via OTA. For partition table details, see partitions/v2/README.md.

All hardware running v1 can be upgraded to v2 by manually flashing the firmware.

The stable version of v1 is 1.9.2. You can switch to v1 by running git checkout v1. The v1 branch will be maintained until February 2026.

Features Implemented

Wi-Fi / ML307 Cat.1 4G
Offline voice wake-up ESP-SR
Supports two communication protocols (Websocket or MQTT+UDP)
Uses OPUS audio codec
Voice interaction based on streaming ASR + LLM + TTS architecture
Speaker recognition, identifies the current speaker 3D Speaker
OLED / LCD display, supports emoji display
Battery display and power management
Multi-language support (Chinese, English, Japanese)
Supports ESP32-C3, ESP32-S3, ESP32-P4 chip platforms
Device-side MCP for device control (Speaker, LED, Servo, GPIO, etc.)
Cloud-side MCP to extend large model capabilities (smart home control, PC desktop operation, knowledge search, email, etc.)
Customizable wake words, fonts, emojis, and chat backgrounds with online web-based editing (Custom Assets Generator)

Hardware

Breadboard DIY Practice

See the Feishu document tutorial:

👉 "XiaoZhi AI Chatbot Encyclopedia"

Breadboard demo:

Supports 70+ Open Source Hardware (Partial List)

Software

Firmware Flashing

For beginners, it is recommended to use the firmware that can be flashed without setting up a development environment.

The firmware connects to the official xiaozhi.me server by default. Personal users can register an account to use the Qwen real-time model for free.

👉 Beginner's Firmware Flashing Guide

Development Environment

Cursor or VSCode
Install ESP-IDF plugin, select SDK version 5.4 or above
Linux is better than Windows for faster compilation and fewer driver issues
This project uses Google C++ code style, please ensure compliance when submitting code

Developer Documentation

Custom Board Guide - Learn how to create custom boards for XiaoZhi AI
MCP Protocol IoT Control Usage - Learn how to control IoT devices via MCP protocol
MCP Protocol Interaction Flow - Device-side MCP protocol implementation
MQTT + UDP Hybrid Communication Protocol Document
A detailed WebSocket communication protocol document

Large Model Configuration

If you already have a XiaoZhi AI chatbot device and have connected to the official server, you can log in to the xiaozhi.me console for configuration.

👉 Backend Operation Video Tutorial (Old Interface)

For server deployment on personal computers, refer to the following open-source projects:

xinnan-tech/xiaozhi-esp32-server Python server
joey-zhou/xiaozhi-esp32-server-java Java server
AnimeAIChat/xiaozhi-server-go Golang server
hackers365/xiaozhi-esp32-server-golang Golang server

Other client projects using the XiaoZhi communication protocol:

huangjunsen0406/py-xiaozhi Python client
TOM88812/xiaozhi-android-client Android client
100askTeam/xiaozhi-linux Linux client by 100ask
78/xiaozhi-sf32 Bluetooth chip firmware by Sichuan
QuecPython/solution-xiaozhiAI QuecPython firmware by Quectel

Custom Assets Tools:

78/xiaozhi-assets-generator Custom Assets Generator (Wake words, fonts, emojis, backgrounds)

About the Project

This is an open-source ESP32 project, released under the MIT license, allowing anyone to use it for free, including for commercial purposes.

We hope this project helps everyone understand AI hardware development and apply rapidly evolving large language models to real hardware devices.

If you have any ideas or suggestions, please feel free to raise Issues or join our Discord or QQ group: 994694848

Star History

Languages

C++ 73.3%

C 15.2%

Python 6.9%

Roff 3%

CMake 1.4%

Other 0.2%

README.md

An MCP-based Chatbot

Introduction

Version Notes

Features Implemented

Hardware

Breadboard DIY Practice

Supports 70+ Open Source Hardware (Partial List)

Software

Firmware Flashing

Development Environment

Developer Documentation

Large Model Configuration

Related Open Source Projects

About the Project

Star History