Journal of Xidian University ›› 2019, Vol. 46 ›› Issue (5): 162-170.doi: 10.19665/j.issn1001-2400.2019.05.023

Previous Articles     Next Articles

Speech enhancement based on the modified phase using signal-to-noise ratio information and time-frequency characteristics

JIA Hairong,WANG Weimei,JI Huifang   

  1. College of Information and Computer, Taiyuan University of Technology, Taiyuan 030024, China
  • Received:2019-06-12 Online:2019-10-20 Published:2019-10-30

Abstract:

Aiming for the problem that the harmonic model-based phase spectrum speech enhancement algorithm can only reconstruct the phase of voiced segment, which leads to speech distortion and auditory discontinuity, a new method to improve phase reconstruction by using signal-to-noise ratio (SNR) information and time-frequency features is proposed. First, the time-frequency characteristics related to phase distortion are introduced and the decision threshold is calculated. Then the phase deviation between noisy speech and clean speech is calculated by using the signal-to-noise ratio information. The two comparisons further estimate the phase of voiced and unvoiced speech, which can effectively improve the coherence of speech. Finally, the reconstructed phase is combined with the amplitude estimation of the improved binary hypothesis model and the speech enhancement is performed. Experiments on different speeches in different noise backgrounds show that phase deviation of the new algorithm is closer to the original signal. Compared with the comparison algorithm, the signal-to-noise ratio of the enhanced speech is increased by 2.39dB on average, and the perceptual evaluation of speech quality is increased by 0.12 on average, which effectively reduces the speech distortion and improves speech intelligibility.

Key words: phase reconstruction, SNR information, time-frequency characteristics, decision threshold, phase deviation

CLC Number: 

  • TN912.35

Baidu
map