OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

International Journal of Advances in Telecommunications, Electrotechnics, Signals and Systems 2012

Source Separation via Spectral Masking for Speech Recognition Systems

DOI: 10.11601/ijates.v1i2-3.16

Gustavo Fernandes Rodrigues,Thiago de Souza Siqueira,Ana Cláudia Silva de Souza,Hani Camille Yehia

Full-Text Cite this paper Add to My Lib

Abstract:

In this paper we present an insight into the use of spectral masking techniques in time-frequency domain, as a preprocessing step for the speech signal recognition. Speech recognition systems have their performance negatively affected in noisy environments or in the presence of other speech signals. The limits of these masking techniques for different levels of the signal-to-noise ratio are discussed. We show the robustness of the spectral masking techniques against four types of noise: white, pink, brown and human speech noise (bubble noise). The main contribution of this work is to analyze the performance limits of recognition systems using spectral masking. We obtain an increase of 18% on the speech hit rate, when the speech signals were corrupted by other speech signals or bubble noise, with different signal-to-noise ratio of approximately 1, 10 and 20 dB. On the other hand, applying the ideal binary masks to mixtures corrupted by white, pink and brown noise, results an average growth of 9% on the speech hit rate, with the same different signal-to-noise ratio. The experimental results suggest that the masking spectral techniques are more suitable for the case when it is applied a bubble noise, which is produced by human speech, than for the case of applying white, pink and brown noise.

Full-Text

Contact Us

[email protected]

QQ:3279437679

WhatsApp +8615387084133