J4 ›› 2013, Vol. 40 ›› Issue (3): 14-19+94. doi: 10.3969/j.issn.1001-2400.2013.03.003

• Original Articles •

Packet-layer model for voice quality assessment using two-level temporal pooling scheme

JIANG Liangliang;YANG Fuzheng;REN Guangliang   

  1. (State Key Lab. of Integrated Service Networks, Xidian Univ., Xi'an 710071, China)
  • Received: 2012-10-08 Online: 2013-06-20 Published: 2013-07-29
  • Contact: JIANG Liangliang E-mail: lljiang@stu.xidian.edu.cn

Abstract:

At the same packet loss rate, different packet loss patterns can yield significantly different voice quality. To address this, a packet-layer model for voice quality assessment that reflects the effect of packet loss patterns on voice quality is presented. First, the coding bit-rate and packet loss information are obtained by analyzing the packet header; on this basis, the quality of each frame is measured using additional information about silence detection and error propagation. Then the voice sequence is divided into variable-length groups of frames (GOFs), and a short-term temporal pooling method is applied to obtain the quality of each GOF. Finally, the overall voice quality is determined by long-term temporal pooling of the GOF qualities. Because it predominantly emphasizes the strongest impairments, the proposed two-level temporal pooling scheme describes the effect of different packet loss patterns on voice quality well. Experimental results show that, compared with the E-model in ITU-T Recommendation G.107, the presented model increases the Pearson correlation coefficient (PCC) by about 0.0129 and decreases the root mean squared error (RMSE) by about 0.0234.
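
The two-level structure described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the grouping rule or the pooling functions, so everything here is an assumption made for illustration: the GOF boundary rule (a new group starts at the beginning of each loss burst, or after a hypothetical `max_len` frames), the use of Minkowski pooling of impairments at both levels, the exponents `p_short` and `p_long`, and the quality ceiling `q_max`. Frame quality is taken as given; silence detection and error propagation are not modeled.

```python
from typing import List, Sequence


def split_into_gofs(frame_quality: Sequence[float], lost: Sequence[bool],
                    max_len: int = 50) -> List[List[float]]:
    """Split per-frame qualities into variable-length groups of frames (GOFs).

    Assumed rule (not from the paper): a new GOF starts at the first lost
    frame of each loss burst, or when the current GOF reaches max_len frames.
    """
    gofs: List[List[float]] = []
    current: List[float] = []
    prev_lost = False
    for quality, is_lost in zip(frame_quality, lost):
        starts_burst = is_lost and not prev_lost
        if current and (starts_burst or len(current) >= max_len):
            gofs.append(current)
            current = []
        current.append(quality)
        prev_lost = is_lost
    if current:
        gofs.append(current)
    return gofs


def pool_emphasizing_worst(values: Sequence[float], p: float,
                           q_max: float = 4.5) -> float:
    """Minkowski pooling of impairments (q_max - value).

    For p > 1 the largest impairments dominate the result, which is one
    simple way to emphasize the strongest impairments.
    """
    if not values:
        raise ValueError("cannot pool an empty sequence")
    impairments = [max(q_max - v, 0.0) for v in values]
    mean_p = sum(d ** p for d in impairments) / len(impairments)
    return q_max - mean_p ** (1.0 / p)


def two_level_pooling(frame_quality: Sequence[float], lost: Sequence[bool],
                      p_short: float = 2.0, p_long: float = 4.0) -> float:
    """Short-term pooling inside each GOF, then long-term pooling over GOFs."""
    gofs = split_into_gofs(frame_quality, lost)
    gof_quality = [pool_emphasizing_worst(g, p_short) for g in gofs]
    return pool_emphasizing_worst(gof_quality, p_long)


if __name__ == "__main__":
    n = 20
    # Two toy loss patterns with the same loss rate (4 of 20 frames).
    bursty = [8 <= i <= 11 for i in range(n)]
    scattered = [i in (2, 7, 12, 17) for i in range(n)]
    for name, pattern in (("bursty", bursty), ("scattered", scattered)):
        # Toy frame quality: 4.5 if received, 1.0 if lost.
        quality = [1.0 if is_lost else 4.5 for is_lost in pattern]
        print(name, round(two_level_pooling(quality, pattern), 3))
```

With these toy inputs the two patterns share the same loss rate yet pool to different scores, which is the behavior a plain average over frames cannot capture. The direction and size of the difference depend entirely on the assumed grouping rule and exponents, so the numbers say nothing about the published model's accuracy.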

Key words: voice quality assessment, temporal pooling, packet loss, quality of service

CLC Number: 

  • TN912
