PEMO-Q: A new method for objective audio quality assessment using a model of auditory perception

Cited by: 233

Authors:

Huber, Rainer [1]

Kollmeier, Birger

Affiliations:

[1] HorTech gGmbH, Kompetenzzentrum, D-26111 Oldenburg, Germany

[2] Carl von Ossietzky Univ Oldenburg, Fak 5, D-26111 Oldenburg, Germany

Source:

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2006 / Vol. 14 / No. 06

Keywords:

audio quality; auditory model; objective quality assessment;

DOI:

10.1109/TASL.2006.883259

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

A new method for the objective assessment and prediction of perceived audio quality is introduced. It represents an expansion of the speech quality measure q(C), introduced by Hansen and Kollmeier, and is based on a psychoacoustically validated, quantitative model of the "effective" peripheral auditory processing by Dau et al. To evaluate the audio quality of a given distorted signal relative to a corresponding high-quality reference signal, the auditory model is employed to compute "internal representations" of the signals, which are partly assimilated in order to account for assumed cognitive aspects. The linear cross-correlation coefficient of the assimilated internal representations represents the perceptual similarity measure (PSM). PSM shows good correlations with subjective quality ratings if different types of audio signals are considered separately, whereas a better accuracy of signal-independent quality prediction is achieved by a second quality measure, PSMt, represented by the fifth percentile of the sequence of instantaneous audio quality PSM(t). The new measures were evaluated using a large database of subjective listening tests that were originally carried out on behalf of the International Telecommunication Union (ITU) and the Moving Picture Experts Group (MPEG) for the evaluation of various low bit-rate audio codecs. Additional tests with data unknown in the development phase of the model were carried out. Except for linear distortions, the new method shows a higher prediction accuracy than the ITU-R recommendation BS.1387 ("PEAQ") for the tested data.
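The two measures named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the auditory model and the assimilation step are omitted, the internal representations are assumed to be plain NumPy arrays with time on the last axis, and the instantaneous PSM(t) is approximated here by a correlation over non-overlapping frames. The function names `psm` and `psm_t` and the parameter `frame_len` are hypothetical.

```python
import numpy as np


def psm(ref_rep, test_rep):
    """Linear (Pearson) cross-correlation coefficient of two representations.

    Assumption: both inputs are already "internal representations" of equal
    shape; the auditory model that would produce them is not included here.
    """
    x = np.asarray(ref_rep, dtype=float).ravel()
    y = np.asarray(test_rep, dtype=float).ravel()
    x = x - x.mean()
    y = y - y.mean()
    denom = np.sqrt(np.sum(x ** 2) * np.sum(y ** 2))
    # Guard against constant inputs, for which the coefficient is undefined.
    return float(np.sum(x * y) / denom) if denom > 0 else 0.0


def psm_t(ref_rep, test_rep, frame_len):
    """Fifth percentile of a sequence of frame-wise PSM values.

    Assumption: PSM(t) is approximated by computing psm() on consecutive,
    non-overlapping frames of `frame_len` samples along the last (time) axis.
    """
    ref = np.asarray(ref_rep, dtype=float)
    test = np.asarray(test_rep, dtype=float)
    n_frames = ref.shape[-1] // frame_len
    if n_frames == 0:
        raise ValueError("frame_len is longer than the representation")
    inst = [
        psm(ref[..., k * frame_len:(k + 1) * frame_len],
            test[..., k * frame_len:(k + 1) * frame_len])
        for k in range(n_frames)
    ]
    return float(np.percentile(inst, 5))
```

Taking a low percentile rather than the mean makes the second measure sensitive to the worst short segments of the signal, which is consistent with the abstract's description of PSMt as the fifth percentile of PSM(t).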


Pages: 1902-1911

Page count: 10

References: 37 in total

[1] [Anonymous], BS1387 ITUR

[2] [Anonymous], 2001, P862 ITUT

[3] [Anonymous], 1996, OBJ QUAL MEAS TEL BA

[4] Beerends J. G., 1994, WORKSH SPEECH QUAL A, P1

[5] BEERENDS JG, 1992, J AUDIO ENG SOC, V40, P963

[6] BERGER J, 1998, THESIS U KIEL KIEL G

[7] BRANDENBURG K, 1994, J AUDIO ENG SOC, V42, P780

[8] COLOMES C, 1995, J AUDIO ENG SOC, V43, P233

[9] A quantitative model of the "effective" signal processing in the auditory system. II. Simulations and measurements [J]. Dau, T; Puschel, D; Kohlrausch, A. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 99 (06): 3623-3631

[10] A quantitative model of the "effective" signal processing in the auditory system. I. Model structure [J]. Dau, T; Puschel, D; Kohlrausch, A. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 99 (06): 3615-3622
