Abstract
To analyze research progress in multimodal human-computer interaction (MMHCI) and explore future research directions for the field, this paper reviews and summarizes MMHCI in terms of its origin, modality classification, and application prospects, and identifies its development trends and challenges. Research and practical applications in the visual, auditory, tactile, and gesture modalities are relatively mature. Eye tracking, facial expression, and body language have received wide attention and discussion in academic research, but their adoption in practical applications remains relatively low. Research on olfaction and gustation is still at a developmental stage because of the complexity and diversity of these senses. MMHCI is trending toward open databases, highly intelligent cross-modal interaction, and cross-device interaction, but it still faces many challenges in modality fusion, data collection, and privacy and security. By sorting and summarizing the relevant research results on MMHCI, this paper provides a reference for subsequent research.