联系客服
客服二维码

联系客服获取更多资料

微信号:LingLab1

客服电话:010-82185409

意见反馈
关注我们
关注公众号

关注公众号

linglab语言实验室

回到顶部
VOICED-UNVOICED CLASSIFICATION OF SPEECH USING AUTOCORRELATION MATRIX

575 阅读 2020-02-11 15:12:57 上传

In this paper, a fast method for voiced-unvoiced classification of speech signals is introduced. The suggested method makes the V-UV decision, using signal energy, the peak-to-peak differerence of the autocorrelation function, number of zero crossings of the autocorrelation function and the unit delay autocorrelation coefficient all together.

5. KAYNAKÇA

[1] Fisher, E., Tabrikian, J., Dubnov, S., "Generalized likelihoodratio test for voiced-unvoiced decision in noisy speech using the harmonic model," IEEE Transactions on Audio, Speech, and Language Processing, vol.14, no.2, pp. 502- 510, March 2006.

[2] E. Fisher, J. Tabrikian and S. Dubnov, “Generalized Likelihood Ratio Test for Voiced-Unvoiced Decision in Noisy Speech Using the Harmonic Model,” IEEE Transactions on Audio, Speech, and Language Processing, Vol. 14, No. 2, 2006, pp. 502-510. doi:10.1109/TSA.2005.857806

[3] S. Ahmadi. and A. S. Spanias, “Cepstrum-based pitch detection using a new statistical V/UV classification algorithm,” IEEE Trans. Speech Audio Pro., vol. 7 No. 3, pp. 333-338, 1999

[4] Nemer, E., Goubran, R., Mahmoud, S., “Robust voice activity detection using higher-order statistics in the LPC residual domain,” IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 217 231, Mar. 2001.

[5] B. Atal and M. Schroeder, “Predictive Coding of Speech Signals and Subjective Error Criteria,” IEEE Transactions on Acoustics, Speech and Signal Processing, Vol. 27, No. 3, 2003, pp. 247-254. doi:10.1109/TASSP.1979.1163237

[6] S. Imai, “Cepstral Analysis Synthesis on the Mel Frequency Scale,” IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 8, 2003, pp. 93-96.

[7] B. Atal and L. Rabiner, “A Pattern Recognition Approach to Voicedunvoiced-Silence Classification with Applications to Speech Recognition,” IEEE Transactions on Acoustics, Speech and Signal Processing, Vol. 24, No. 3, 2003, pp. 201-212. doi:10.1109/TASSP.1976.1162800

[8] R. J. McAulay and T. F. Quatieri, “Pitch Estimation and Voicing Detection Based on a Sinusoidal Speech Model,” International Conference on Acoustics, Speech, and Signal Processing, Vol. 1, 1990, pp. 249-252. doi:10.1109/ICASSP.1990.115585

[9] L. Rabiner, “On the Use of Autocorrelation Analysis for Pitch Detection,” IEEE Transactions on Acoustics, Speech and Signal Processing, Vol. 25, No. 1, 2003. pp. 24-33. doi:10.1109/TASSP.1977.1162905

[10] Y. Qi and B. R. Hunt, “Voiced-Unvoiced-Silence Classifications of Speech Using Hybrid Features and a Network Classifier,” IEEE Transactions on Speech and Audio Processing, Vol. 1, No. 2, 2002, pp. 250-255. doi:10.1109/89.222883

[11] Li Hui, Bei-qian Dai, Lu Wei, "A Pitch Detection Algorithm Based on AMDF and ACF", IEEE International Conference ICASSP 2006 , vol. 1, p. I-I, 2006

[12] J. A. Marks, “Real time speech classification and pitch detection,” Communications and Signal Processing, pp. 1-6, June , 1988.

点赞
收藏
表情
图片
附件