一种分步池化的声纹特征聚合方法

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：在基于神经网络的说话人特征提取模型中,不同的池化方法会影响声纹特征的聚合效果.与传统池化方法相比,一些结合注意力机制的池化方法表现出更强的特征聚合能力.基于此,提出一种分步池化的声纹特征聚合方法,并在公开数据集上进行实验.结果表明,所提方法能有效改善声纹特征的聚合效果,提高声纹识别的准确率.

外文标题：An Acoustic Feature Aggregation Method with Stepwise Pooling

外文摘要：In a speaker feature extraction model based on neural networks,different pooling methods can affect the aggregation effect of voiceprint features.Compared with traditional pooling methods,some pooling methods that combine attention mechanisms exhibit stronger feature aggregation capabilities.Based on this,a step-by-step pooling method for voiceprint feature aggregation is proposed,and experiments are conducted on publicly available datasets.The results show that the proposed method can effectively improve the aggregation effect of voiceprint features and enhance the accuracy of voiceprint recognition.

外文关键词：

voiceprint recognitionneural networkspoolingdeep learning

作者：

冯坤、和椿皓

展开 >

作者单位：

保定市排水服务中心,河北保定 071051

河北大学电子信息工程学院,河北保定 071000

关键词：

声纹识别神经网络池化深度学习

出版年：

2024

DOI：

10.16311/j.audioe.2024.03.007

电声技术

电视电声研究所(中国电子科技集团公司第三研究所)

电声技术

影响因子：0.259

ISSN：1002-8684

年,卷(期)：2024.48(3)

参考文献量11