电声技术2024,Vol.48Issue(2) :28-31,35.DOI:10.16311/j.audioe.2024.02.009

一种基于预训练模型的语音深度伪造算法识别方法

A Pre-trained Model Based Recognition Method for Speech Deepfake Algorithms

田野 罗曦 许斌 葛珊 张向阳
电声技术2024,Vol.48Issue(2) :28-31,35.DOI:10.16311/j.audioe.2024.02.009

一种基于预训练模型的语音深度伪造算法识别方法

A Pre-trained Model Based Recognition Method for Speech Deepfake Algorithms

田野 1罗曦 1许斌 1葛珊 1张向阳1
扫码查看

作者信息

  • 1. 中国电子科技集团公司第三研究所,北京 100015
  • 折叠

摘要

为提高语音深度伪造算法识别模型的准确性和对未知伪造算法识别的泛化性,文章提出一种基于预训练模型的识别方法.基于真伪语音数据集,微调训练HuBERT预训练模型,并基于模型输出的深层嵌入特征构建流形空间,通过度量不同伪造算法下语音数据流形空间的测地线距离进行伪造算法的判定.实验表明,所提方法可以较为有效地实现对已知和未知伪造算法的识别.

Abstract

To improve the accuracy of the recognition model of speech deepfake algorithms and the generalization of the recognition of unknown deepfake algorithms,a recognition method based on the pre-trained model is proposed.Based on the real and fake speech dataset,the HuBERT pre-trained model is fine-tuned and the manifold space is constructed based on the deep embedded features output from the model,and the determination of deepfake algorithms is carried out by measuring the geodesic distances between the manifold spaces of different deepfake algorithms.Experiments show that the proposed method can realize the recognition of known and unknown deepfake algorithms more effectively.

关键词

深度伪造算法识别/预训练模型/流形测度

Key words

deepfake algorithm recognition/pre-trained models/manifold measurement

引用本文复制引用

出版年

2024
电声技术
电视电声研究所(中国电子科技集团公司第三研究所)

电声技术

影响因子:0.259
ISSN:1002-8684
参考文献量12
段落导航相关论文