Design and Implementation of an All-in-One-Box Virtual Digital Human

黄林¹, 林健¹, 徐驰¹, 罗明宇¹, 王武双¹, 鲁晓丹¹

Author information

1. 东云睿连(武汉)计算技术有限公司, Wuhan, Hubei 430200, China

Abstract

Virtual digital humans sit at the intersection of artificial intelligence and metaverse applications and are one of the emerging channels for online and offline human-computer interaction. They involve technical fields such as control engines, natural language processing, 3D graphics rendering, and speech recognition and synthesis, and they require co-design across multiple layers of the software and hardware stack. To this end, OMHuman, a virtual digital human solution based on the all-in-one-box deployment mode, proposes a loosely coupled control engine, uses a discrete graphics card for graphics rendering, and runs artificial intelligence model inference on the Intel OpenVINO computing engine through self-developed algorithms. This addresses the shortcomings of traditional solutions in areas such as voice-motion coordinated control, while balancing end-user experience, development cost, and deployment cost. Comparative tests show that OMHuman's model inference performance is 2 to 3 times that of traditional engines and that its graphics rendering efficiency is twice that of integrated graphics. The system can meet human-computer interaction needs in a natural way and has been successfully applied in scenarios such as virtual hosts and intelligent data analysts.

Key words

virtual digital human/artificial intelligence/all-in-one-box/control engine/natural language processing/graphics rendering

Funding

13th batch of the 3551 Optics Valley Talent Program, Wuhan East Lake High-tech Development Zone (M165)

Intel Marketing Exchange project (581394)

Publication year

2024

Software Guide (软件导刊)

Hubei Information Society (湖北省信息学会)

Impact factor: 0.524

ISSN: 1672-7800

Number of references: 4
