WSEAS Transactions on Systems
Print ISSN: 1109-2777, E-ISSN: 2224-2678
Volume 23, 2024
Generative Language Model Technology Integrated into an IoT Device for the Development of a Voice Assistant
Authors: ,
Abstract: The integration of artificial intelligence technologies into IoT devices has opened new possibilities for interaction with the environment through voice assistants, such as ChatGPT, improving interaction with smart devices in sectors such as home, health, and education. However, the adoption of these technologies faces challenges due to device heterogeneity, the need for interoperability, and concerns about data privacy and security. The objective of this research is to develop an IoT device that integrates artificial intelligence technologies and generative language models for a voice assistant, covering the design of a voice recognition system, the implementation of efficient communication with the model, the coordination between ESP32 microcontrollers and the integration of a voice synthesis system. The results show that the system can send queries to ChatGPT and receive responses in real time, validating its ability to handle natural language processing. Furthermore, speech synthesis, using Audio.h library and the MAX98357 module, have demonstrated effective text-to-audio conversion, while the integration of the INMP441 microphone and the Google Cloud Speech-to-Text platform ensures voice capture and processing. In conclusion, the operation of the IoT device and its real-time interaction with the ChatGPT API were validated to obtain an efficient text-to-speech conversion, being scalable for future improvements.
Search Articles
Keywords: Generative language model, artificial intelligence, ChatGPT, Internet of Things, voice assistant, LLM, ESP32
Pages: 521-530
DOI: 10.37394/23202.2024.23.54