Author:
Valerie Michelle,Salamah Irma,Lindawati
Abstract
This paper presents the development and evaluation of a personal assistant robot prototype with advanced speech recognition and natural language processing (NLP) capabilities. Powered by a Raspberry Pi microprocessor, it is the core component of the robot's hardware. It is designed to receive commands and promptly respond by performing the requested actions, utilizing integrated speech recognition and NLP technologies. The prototype aims to enhance meeting efficiency and productivity through audio-to-text conversion and high-quality image capture. Results show excellent performance, with accuracy rates of 100% in Indonesian and 99% in English. The efficient processing speed, averaging 9.07 seconds per minute in Indonesian and 15.3 seconds per minute in English, further enhances the robot's functionality. Additionally, integrating a high-resolution webcam enables high-quality image capture at 1280 x 720 pixels. Real-time integration with Google Drive ensures secure storage and seamless data management. The findings highlight the prototype's effectiveness in facilitating smooth interactions and effective communication, leveraging NLP for intelligent language understanding. Integrating NLP-based speech recognition, visual documentation, and data transfer provides a comprehensive platform for managing audio, text, and image data. The personal assistant robot prototype presented in this research represents a significant advancement in human-robot interaction, particularly in meeting and collaborative work settings. Further refinements in NLP can enhance efficiency and foster seamless human-robot interaction experiences.
Subject
Marketing,Economics and Econometrics,General Materials Science,General Chemical Engineering
Reference32 articles.
1. Navigation and Formation of Swarm Robotics with Local Positioning System
2. A. de Barcelos Silva et al., “Intelligent personal assistants: A systematic literature review,” Expert Syst. Appl., vol. 147, p. 113193, 2020, doi: 10.1016/j.eswa.2020.113193.
3. A. Tur and D. Traum, “Comparing Approaches to Language Understanding for Human-Robot Dialogue: An Error Taxonomy and Analysis,” in Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022, pp. 5813–5820.
4. M. Omar, S. Choi, D. Nyang, and D. Mohaisen, “Robust natural language processing: Recent advances, challenges, and future directions,” IEEE Access, 2022.
5. L. Riccioli, “Artificial Intelligence: Innovation for Society 5.0,” Available SSRN 4457016, 2023.