首页|Frequency offset correction in speech without detecting pitch

Frequency offset correction in speech without detecting pitch

扫码查看
Radio-transmitted speech sometimes contains a residual frequency shift or offset, resulting from incorrect demodulation in single-sideband channels。 Frequency-shifted speech can mask speaker identity and reduce intelligibility。 Therefore, frequency offset will degrade the performance of downstream speech technologies。 Existing offset correction methods require a pitch estimate of the speech signal, which is difficult in noisy radio channels。 We present a new, automatic algorithm for detecting and correcting frequency offset, based on third-order modulation spectral analysis。 Our method is remarkably simple and does not require pitch estimation。 We provide derivations, examples, and a pilot study demonstrating how offset correction improves speaker verification for radio-transmitted speech。

Modulation spectrumfrequency offsetsingle-sidebandspeaker recognitionspeech enhancement

Clark, Pascal、Mallidi, Sri Harish、Jansen, Aren、Hermansky, Hynek

展开 >

Human Language Technology Center of Excellence, Center for Language and Speech Processing, Johns Hopkins University, Baltimore, Maryland USA|c|

IEEE International Conference on Acoustics, Speech and Signal Processing

Vancouver(CA)

2013 IEEE International Conference on Acoustics, Speech and Signal Processing

7020-7024

2013