首页|一体机式虚拟数字人的设计与实现

一体机式虚拟数字人的设计与实现

扫码查看
虚拟数字人是人工智能与元宇宙应用的交叉点,也是当今线上与线下人机交互的新兴渠道之一.虚拟数字人涉及控制引擎、自然语言处理、3D图形渲染、语音识别与合成等技术领域,需要软硬件栈多层次的协同设计.为此,基于一体机部署模式的OMHuman虚拟数字人解决方案提出一套松耦合式控制引擎,采用独立显卡实现图形渲染,并通过自研算法在Intel OpenVINO计算引擎上实现人工智能模型推理,解决了传统方案在语音—动作协同控制等诸多方面的不足,同时兼顾了最终用户体验、开发成本与部署成本.比较测试表明,OMHuman虚拟数字人模型推理性能为传统引擎的2~3倍,图形渲染效率为核芯显卡的2倍,能够以自然的方式满足人机交互需求,目前已在虚拟主持人、智能数据分析师等场景得到成功应用.
Design and Implementation of All-in-One-Box Virtual Digital Human
Virtual digital humans are the intersection of artificial intelligence and metaverse applications,involving technology fields such as control engines,natural language processing,3D graphics rendering,speech recognition and synthesis,and require multi-level collaborative design of software and hardware stacks.To this end,a loosely coupled control engine is proposed for the OMHuman virtual digital human solu-tion based on the all-in-one machine deployment mode.It uses an independent graphics card to achieve graphics rendering and implements ar-tificial intelligence model inference on the Intel OpenVINO computing engine through self-developed algorithms.This solves the shortcomings of traditional solutions in voice action collaborative control and other aspects,while also taking into account the end user experience,develop-ment costs,and deployment costs.Comparative tests have shown that the reasoning performance of the OMHuman virtual digital human model is 2-3 times that of traditional engines,and the graphics rendering efficiency is twice that of core graphics cards.It can meet human-computer interaction needs in a natural way and has been successfully applied in scenarios such as virtual hosts and intelligent data analysts.

virtual digital humanartificial intelligenceall-in-one-boxcontrol enginenatural language processinggraphics rendering

黄林、林健、徐驰、罗明宇、王武双、鲁晓丹

展开 >

东云睿连(武汉)计算技术有限公司,湖北 武汉 430200

虚拟数字人 人工智能 一体机 控制引擎 自然语言处理 图形渲染

武汉东湖高新区第十三批3551光谷人才计划Intel Marketing Exchange项目

M165581394

2024

软件导刊
湖北省信息学会

软件导刊

影响因子:0.524
ISSN:1672-7800
年,卷(期):2024.23(7)
  • 4