YOLOX-S声光信息融合目标识别算法

扫码查看

原文链接

万方数据
维普

中文摘要：针对现代战场单一探测手段的局限性和单模态目标识别存在信息不全面、易受噪声干扰等缺点,提出一种融合声光两种模态的目标识别方法.该方法利用深度卷积残差网络对声纹信息的对数梅尔频谱系数特征进行提取,使用YOLOX-S网络对目标进行光学特征提取,并计算目标的像空间位置与类别信息,然后在YOLOX-S模型预测部分的解耦头中引入用于处理声音特征的支路,将目标的光学特性与声学特性在YOLOX-S检测头分类支路上进行空间归一化,使视觉数据与声纹数据在同一可拼接域上进行映射与融合,对目标的声光融合特征进行识别推理.在自建数据集上进行验证,实验结果表明声纹信息和图像信息融合可以提供更全面的感知能力,使得目标的检测和识别更加准确和可靠.

外文标题：YOLOX-S Based Acousto-optical Information Fusion Target Recognition Algorithm

外文摘要：In view of the limitations of single detection methods in modern battlefield and the shortcomings of single mode target recognition such as incomplete information and easy to be disturbed by noise,a new target recognition method combining two modes of sound and light was proposed.In this method,the log-mel spectral coefficient features of voiceprint information were extracted by deep convolutional residual network,the optical features of the target were extracted by YOLOX-S network,and the image space position and category informa-tion of the target were calculated.Then,a branch for processing sound features was introduced into the decou-pling head of the prediction part of the YOLOX-S model.The optical and acoustic characteristics of the target were spatially normalized on the classification branch of the YOLOX-S detection head,so that the visual data and voicing data could be mapped and fused in the same concatenable domain,and the acousto-optical fusion fea-tures of the target could be identified and reasoned.The experimental results showed that the fusion of voice-print information and image information could provide a more comprehensive perception capability and make the detection and recognition of objects more accurate and reliable.

外文关键词：

target recognitionfeature fusionYOLOX-Svoiceprint features

作者：

杨茸宇、刘凤丽、郝永平

展开 >

作者单位：

沈阳理工大学机械工程学院,辽宁沈阳 110159

沈阳理工大学辽宁省先进制造技术与装备重点实验室,辽宁沈阳 110159

关键词：

目标识别特征融合 YOLOX-S 声纹特征

基金：

装备预研重点实验室基金项目

项目编号：

2021JCJQLB055009

出版年：

2024

探测与控制学报

中国兵工学会西安机电信息研究所机电工程与控制国家级重点实验室

探测与控制学报

CSTPCD北大核心

影响因子：0.267

ISSN：1008-1194

年,卷(期)：2024.46(5)