%0 Journal Article %T 智能交互系统的设计与实现:基于人脸识别与语音识别技术
Design and Implementation of Intelligent Interaction System: Based on Face Recognition and Speech Recognition Technology %A 李玲 %J Computer Science and Application %P 81-93 %@ 2161-881X %D 2024 %I Hans Publishing %R 10.12677/csa.2024.146144 %X 本文介绍了一种基于人脸识别与语音识别技术相融合的多模态智能交互系统。该系统由人脸识别模块及语音识别模块两大部分组成。通过集成openMV摄像头、麦克风阵列以及openMV IDE软件环境,开发一种多模态系统,该系统能够实现特征点提取与检测,并结合这些功能进行语音增强、语音识别和人脸识别。openMV摄像头进行图像采集,并在openMV IDE软件端执行特征点检测算法,捕捉用户的面部特征,实现身份验证和用户信息的获取。同时,麦克风阵列将负责捕获声音信号。语音增强模块通过运用基于时频卷积网络(TFCN)的轻量级语音增强算法,抑制背景噪声,保持目标语音的失真尽可能低,实现对目标语音的增强。语音识别模块实现了从语音到文本的转换,提升系统的智能化水平。该系统可广泛应用于智能家居领域,具体来说,可以应用于智能门锁,该系统可以自动识别家庭成员的面孔,实现无钥匙进入。此外,语音识别模块可以识别出特定的语音命令,如“开门”或“关门”,从而进一步增加智能门锁的便捷性和安全性。实验结果表明,本智能交互系统通过融合人脸识别与语音识别技术,成功开发了一种多模态智能交互系统。这一集成化的设计不仅体现了系统的高效性和稳定性,更预示了该系统在未来广泛应用中的巨大潜力和实用价值。
This paper introduces a multi-modal intelligent interaction system based on the integration of face recognition and speech recognition technology. The system consists of two parts: face recognition module and voice recognition module. By integrating the openMV camera, microphone array and openMV IDE software environment, a multi-modal system is developed, which can realize feature point extraction and detection, and combine these functions for voice enhancement, speech recognition and face recognition. The openMV camera collects images and executes a feature point detection algorithm on the openMV IDE software to capture the user’s facial features and realize authentication and user information acquisition. At the same time, the microphone array will be responsible for capturing the sound signal. The speech enhancement module uses a lightweight speech enhancement algorithm based on time-frequency convolution network (TFCN) to suppress background noise, keep the distortion of the target voice as low as possible, and realize the enhancement of the target voice. The speech recognition module realizes the conversion from voice to text and improves the intelligent level of the system. The system can be widely used in the field of smart home. Specifically, it can be applied to smart door locks. The system can automatically identify the faces of family members and achieve keyless entry. In addition, the voice recognition module can recognize specific voice commands, such as “open the door” or “close the door”, thus further increasing the convenience and security of the smart door lock. The experimental results show that this intelligent interaction system has successfully developed a multi-modal intelligent interaction system by integrating face recognition and speech recognition technology. This integrated design not only reflects the efficiency and stability of the system, but also indicates the great potential and practical value of the system in its wide application in the future. %K 人脸识别,openMV,单通道语音增强,语音识别,智能交互系统
Face Recognition %K openMV %K Single-Channel Voice Enhancement %K Voice Recognition %K Intelligent Interaction System %U http://www.hanspub.org/journal/PaperInformation.aspx?PaperID=89909