利用两级时域联合的包层语音质量评价模型

doi:10.3969/j.issn.1001-2400.2013.03.003

Abstract

Abstract:

Aiming at the problem that the voice qualities corresponding to different packet loss patterns show significant differences at the same packet loss rate, a packet-layer model for voice quality assessment, which well reflects the effect of the packet loss patterns on the voice quality, is presented. First, the information about the coding bit-rate and packet loss is obtained by analyzing the packet header, on the basis of which the frame quality is measured with the further information about silence detection and error propagation. Then the voice sequence is divided into groups of frames (GOFs) with a variable length and a short-term temporal pooling method is employed to obtain the GOF quality. Finally, the overall voice quality is determined by the long-term temporal pooling of the GOF qualities. The proposed two-level temporal pooling scheme well describes the effect of different packet loss patterns on the voice quality since the strongest impairments are predominately emphasized. Experimental results show that the presented model can lead to an increment of about 0.0129 in the Pearson Correlation coefficient (PCC) and a decrement of about 0.0234 in the Root Mean Squared Error (RMSE) compared with the E-model in ITU-T recommendation G.107.

Key words: voice quality assessment, temporal pooling, packet loss, quality of service

CLC Number:

TN912

JIANG Liangliang;YANG Fuzheng;REN Guangliang. Packet-layer model for voice quality assessment using two-level temporal pooling scheme[J].J4, 2013, 40(3): 14-19+94.

References

［1］ Daengsi T, Preechayasomboon A, Wutiwiwatchai C, et al. A Study of VoIP Quality Evaluation: User Perception of Voice Quality from G.729, G. 711 and G. 722 ［C］//Proc of the 9th Annual IEEE Consumer Communications and Networking Conference. Piscataway: IEEE Computer Society, 2012: 342-345.
［2］ Jelassi S, Rubino G, Melvin H, et al. Quality of Experience of VoIP Service: a Survey of Assessment Approaches and Open Issues ［J］. IEEE Communications Surveys ＆ Tutorials, 2012, 14(2): 491-513.
［3］ Rix A W, Beerends J G, Kim D S, et al. Objective Assessment of Speech and Audio Quality: Technology and Applications ［J］. IEEE Trans on Audio, Speech, and Language Processing, 2006, 14(6): 1890-1901.
［4］ ITU-T Recommendation P.861. Objective Quality Measurement of Telephone-band (300-3400 Hz) Speech Codecs ［S］. Geneva: Telecommunication Standardization Sector, 1996.
［5］ ITU-T Recommendation P.862. Perceptual Evaluation of Speech Quality (PESQ), an Objective Method for End to End Speech Quality Assessment of Narrowband Telephone Networks and Speech Codecs ［S］. Geneva: Telecommunication Standardization Sector, 2001.
［6］ Moller S, Chan W, Cote N, et al. Speech Quality Estimation: Models and Trends ［J］. IEEE Signal Processing Magazine, 2011, 28(6): 18-28.
［7］ Narwaria M, Lin W, McLoughlin I V, et al. Nonintrusive Quality Assessment of Noise Suppressed Speech with Mel-Filtered Energies and Support Vector Regression ［J］. IEEE Trans on Audio, Speech, and Language Processing, 2012, 20(4): 1217-1232.
［8］ Takahashi A, Yoshino H, Kitawaki N. Perceptual QoS Assessment Technologies for VoIP ［J］. IEEE Communications Magazine, 2004, 42(7): 28-34.
［9］ Clark A. Modeling the Effects of Burst Packet Loss and Recency on Subjective Voice Quality ［C］//Proc of IP-Telephony Workshop. New York: IEEE, 2001: 123-127.
［10］ ITU-T SG12 Temporary Document TD 297. Updated Draft Terms of Reference for P.NAMS ［S］. Geneva: Telecommunication Standardization Sector, 2010.
［11］ ITU-T Recommendation G.107. The E-model, a Computational Model for Use in Transmission Planning ［S］. Geneva: Telecommunication Standardization Sector, 2002.
［12］ Jelassi S, Rubino G. A Comparison Study of Automatic Speech Quality Assessors Sensitive to Packet Loss Burstiness ［C］//Proc of the 8th Annual IEEE Consumer Communications and Networking Conference. Piscataway: IEEE Computer Society, 2011: 415-420.
［13］ Yang Fuzheng, Jiang Liangliang, Li Xiao. Real-time Quality Assessment for Voice over IP ［J］. Concurrency and Computation: Practice and Experience, 2012, 24(11): 1192-1199.
［14］ Radhakrishnan K, Larijani H, Buggy T. A Non-intrusive Method to Assess Voice Quality over Internet ［C］//Proc of the 2010 International Symposium on Performance Evaluation of Computer and Telecommunication Systems. Piscataway: IEEE Computer Society, 2010: 380-386.
［15］李维, 杨付正. 考虑包内容特性的网络语音质量评价模型［J］. 西安电子科技大学学报, 2011, 38(2): 23-28.
Li Wei, Yang Fuzheng. No-reference Quality Assessment Model for Networked Speech Based on the Content Feature of Packets ［J］. Journal of Xidian University, 2011, 38(2): 23-28.
［16］ Rohaly A M, Lu J, Franzen N R, et al. Comparison of Temporal Pooling Methods for Estimating the Quality of Complex Video Sequences ［C］//Proc of SPIE Human Vision and Electronic Imaging Conference. Bellingham: SPIE, 1999: 218-226.
［17］ Rix A, Beerends J, Hollier M, et al. Perceptual Evaluation of Speech Quality (PESQ)-a New Method for Speech Quality Assessment of Telephone Networks and Codecs ［C］//Proc of IEEE International Conference on Acoustics, Speech, and Signal Processing. Salt Lake City: IEEE, 2001: 749-752.
［18］ ITU Rep. COM12-D97-E. Packet Loss Distributions and Packet Loss Models ［S］. Geneva: ITU-T Study Group 12, 2003.

[1]	CAO Jing;WU Junsheng;YANG Wenchao;WANG Shuochen. Multi-metric cross layer routing protocol for cognitive radio ad hoc networks [J]. Journal of Xidian University, 2018, 45(2): 77-83.
[2]	DUAN Ruimeng;MA Wenping;LUO Wei;ZHAO Feifei. Fully distributed resource allocation strategy in Femtocell networks [J]. Journal of Xidian University, 2017, 44(3): 36-42.
[3]	SONG Jiarun;XU Ziqiang;YANG Fuzheng. Parametric-planning model combining the transmission characteristics of network video service [J]. Journal of Xidian University, 2016, 43(2): 168-173.
[4]	YANG Peng;LI Xiangpan;LIU Dou . Delay-aware scheduling algorithm for enhancing video services QoS in the LTE [J]. J4, 2015, 42(2): 140-145.
[5]	MENG Xianjia;MA Jianfeng;LU Di;WANG Yichuan. Trust and behavioral modeling based two layer service selection [J]. J4, 2014, 41(4): 198-204.
[6]	PENG Fei;DENG Haojiang;LIU Lei. QoS-aware temporal prediction model for personalized service recommendation [J]. J4, 2013, 40(4): 174-180.
[7]	LIU Hechao;CHANG Yilin;YUAN Hui;YANG Jiawei. No-reference video quality assessment over the IP network based on packet loss [J]. J4, 2012, 39(2): 29-34.
[8]	YANG Yanfei;ZHU Zhangming;ZHOU Duan;YANG Yintang. Delay-independent asynchronous dynamic priority arbiter for the network on chips [J]. J4, 2012, 39(1): 42-48+110.
[9]	LI Wei;YANG Fuzheng. No-reference quality assessment model for networked speech based on the content feature of packets [J]. J4, 2011, 38(2): 23-28+98.
[10]	TUO Yan-jun;LIU Yun;LIU Yong-jian. Link Load Based Multicast Packet Diffusion Algorithm in the LEO Satellite Constellation System [J]. J4, 2010, 37(2): 380-384.
[11]	ZHANG Yan;SHENG Min;LI Jian-dong;TIAN Ye;YAO Jun-liang. Novel strategy for dynamic partner choice [J]. J4, 2010, 37(1): 73-79.
[12]	HUANG Bo-hu;DUAN Zhen-hua. Application of the quantum genetic algorithm in web services selection [J]. J4, 2010, 37(1): 56-61+67.
[13]	WANG Jun-fang;ZHANG Si-dong. Study of the optimal packet length in the wireless network [J]. J4, 2010, 37(1): 152-157.
[14]	LI Jian-dong;CHEN Ting;DENG Shao-ping;LI Chang-le. Study of the cross-layer architecture for guarantee of QoS in WiMAX [J]. J4, 2009, 36(4): 573-582.
[15]	ZHANG Guo-peng;ZHANG Hai-lin. Non-cooperative game theoretical admission control algorithm for traffic flows in wireless LANs [J]. J4, 2008, 35(5): 805-810.

Packet-layer model for voice quality assessment using two-level temporal pooling scheme

PDF (PC)

Like

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Metrics

Comments

Recommended 0