Abstract:
This project presents the development of an interactive robot that uses an ESP32 as
the main controller and integrates with AI-based services over the HTTP and MQTT protocols. The robot detects wake words, records voice input, sends it to an AI server for processing, and plays back the response either as streamed audio or from local files. Text-to-speech (TTS), SD card storage, and servo-controlled physical expressions were implemented to enhance user interaction. The system aims to provide a smart, responsive, and engaging experience in human-robot communication, with potential applications in education, customer
service, and entertainment.