一种基于预训练模型的语音深度伪造算法识别方法

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：为提高语音深度伪造算法识别模型的准确性和对未知伪造算法识别的泛化性,文章提出一种基于预训练模型的识别方法.基于真伪语音数据集,微调训练HuBERT预训练模型,并基于模型输出的深层嵌入特征构建流形空间,通过度量不同伪造算法下语音数据流形空间的测地线距离进行伪造算法的判定.实验表明,所提方法可以较为有效地实现对已知和未知伪造算法的识别.

外文标题：A Pre-trained Model Based Recognition Method for Speech Deepfake Algorithms

外文摘要：To improve the accuracy of the recognition model of speech deepfake algorithms and the generalization of the recognition of unknown deepfake algorithms,a recognition method based on the pre-trained model is proposed.Based on the real and fake speech dataset,the HuBERT pre-trained model is fine-tuned and the manifold space is constructed based on the deep embedded features output from the model,and the determination of deepfake algorithms is carried out by measuring the geodesic distances between the manifold spaces of different deepfake algorithms.Experiments show that the proposed method can realize the recognition of known and unknown deepfake algorithms more effectively.

外文关键词：

deepfake algorithm recognitionpre-trained modelsmanifold measurement

作者：

田野、罗曦、许斌、葛珊、张向阳

展开 >

作者单位：

中国电子科技集团公司第三研究所,北京 100015

关键词：

深度伪造算法识别预训练模型流形测度

出版年：

2024

DOI：

10.16311/j.audioe.2024.02.009

电声技术

电视电声研究所(中国电子科技集团公司第三研究所)

电声技术

影响因子：0.259

ISSN：1002-8684

年,卷(期)：2024.48(2)

参考文献量12