An Acoustic Feature Aggregation Method with Stepwise Pooling
In a speaker feature extraction model based on neural networks,different pooling methods can affect the aggregation effect of voiceprint features.Compared with traditional pooling methods,some pooling methods that combine attention mechanisms exhibit stronger feature aggregation capabilities.Based on this,a step-by-step pooling method for voiceprint feature aggregation is proposed,and experiments are conducted on publicly available datasets.The results show that the proposed method can effectively improve the aggregation effect of voiceprint features and enhance the accuracy of voiceprint recognition.