12
1 Conversational Speech Quality - The Dominating Parameters in VoIP Systems H.W. Gierlich, F. Kettler HEAD acoustics GmbH Typical IP-Scenarios: components and their influence on speech quality testing techniques for speech quality parameters, selected examples Summary and future work

Conversational Speech Quality - The Dominating …hgs/papers/iptel2001/Gierlich.pdf1 Conversational Speech Quality - The Dominating Parameters in VoIP Systems H.W. Gierlich, F. Kettler

  • Upload
    others

  • View
    15

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Conversational Speech Quality - The Dominating …hgs/papers/iptel2001/Gierlich.pdf1 Conversational Speech Quality - The Dominating Parameters in VoIP Systems H.W. Gierlich, F. Kettler

1

Conversational Speech Quality -The Dominating Parameters in

VoIP Systems

H.W. Gierlich, F. Kettler

HEAD acoustics GmbH

• Typical IP-Scenarios: components and their influence on speech quality

• testing techniques for speech quality parameters, selected examples

• Summary and future work

Page 2: Conversational Speech Quality - The Dominating …hgs/papers/iptel2001/Gierlich.pdf1 Conversational Speech Quality - The Dominating Parameters in VoIP Systems H.W. Gierlich, F. Kettler

2

SpeechQuality

��� ���

Parameter Influencing Quality TGE_001

Sound Quality &Naturalness

Intelligibility

Listening & TalkingEffort

ConversationalEffort

Double TalkPerformance

Expectation

Transmission ofBackground Noise

SpeechCharacteristics

Individual Listening

NetworkConditions

EnvironmentalConditions

PABX PABXIP-Gateway

IP-Gateway

E1/T1 E1/T1

IP

PSTN - PSTN over IP

PSTN to IP

IP to IP

The different IP-Scenarios

Page 3: Conversational Speech Quality - The Dominating …hgs/papers/iptel2001/Gierlich.pdf1 Conversational Speech Quality - The Dominating Parameters in VoIP Systems H.W. Gierlich, F. Kettler

3

-H(f) Coder Buffer

Buffer Decoder

H(f)

IP

IP

Echo canceller Voice controlled attenuation/switching

- Comfort noise insertion

Background noise reduction

VAD

VAD

The IP-Scenario from the speech quality point of view (example)

•delay, time variant

•coding ( also with packet loss)

•(speech controlled) attenuation of signals (e.g. based on VAD)

•echo cancellation

•noise reduction and„comfort-noise“- injection

•interactivity•perception of echo impairments

•interactivity•clipping effects•„missing words“

•(sound)-quality•clipping effects•„missing words“

•noise modulation•masking e. g. of echo impairments

Signal processing and speech quality

packet loss

Page 4: Conversational Speech Quality - The Dominating …hgs/papers/iptel2001/Gierlich.pdf1 Conversational Speech Quality - The Dominating Parameters in VoIP Systems H.W. Gierlich, F. Kettler

4

Speech quality from the users perspective

speechquality

talkingsituation

listeningsituation

conversationalsituation

- delay- echo attenuation- echo-/sidetone- distortion- switching characteristics- time variant echo characteristics

- delay- echo attenuation- echo-/sidetone- distortion- proposals for psychoacoustically motivated tests, e.g. PESQM (perceptual echo and sidetone quality measure)- background noise transmission, e.g. Relative Approach

- “standard"- para- meters, e.g. Loudness Ratings- psychoacoustic motivated proced., e.g. PESQ, TOSQA- ext. measurements, e.g. switching characteristics- background noise transmission, e.g. Relative Approach

Relevant quality parameters in the three situations

listening situation talking situation conversational situation

Page 5: Conversational Speech Quality - The Dominating …hgs/papers/iptel2001/Gierlich.pdf1 Conversational Speech Quality - The Dominating Parameters in VoIP Systems H.W. Gierlich, F. Kettler

5

TGE_014

SLR CLR CLR RLR7dB 0dB 0dB 3dB

TELR a(Echo) a(Echo) TELR

RLR CLR CLR SLR3dB 0dB 0dB 7dB OLR = 10dB

Fig. 1 G.131

Delay and echo attenuation during single talk

Echo level offset single talk/double talk

0

4

8

12

16

1 1,5 2 2,5 3 3,5 4 4,5-> MOS

dB

MOS

1

2

3

4

5

0 3 6 9 12 16 20-> loudness variation single/double talk

Double talk, important performance limits

echo during double talk:the effect of echo level increase during double talk on subjects judgement

level variation betweensingle- and double talk:the effect on subjectsjudgement

Page 6: Conversational Speech Quality - The Dominating …hgs/papers/iptel2001/Gierlich.pdf1 Conversational Speech Quality - The Dominating Parameters in VoIP Systems H.W. Gierlich, F. Kettler

6

Measurement-technique - selected examples:

• Listening speech quality • Echo & switching during double talk• background noise transmission (Relative Approach)• „comfort-noise“ injection

Measurement for handsets and headsets

Page 7: Conversational Speech Quality - The Dominating …hgs/papers/iptel2001/Gierlich.pdf1 Conversational Speech Quality - The Dominating Parameters in VoIP Systems H.W. Gierlich, F. Kettler

7

Realizations:•spectral distance measures•PESQ (P.862)•TOSQA....

TG_e019

Instrumental Measures based on Hearing Models

Modeling the Results of Auditory Tests by Comparisonof Reference Speech Signal with Processed Speech Signals

Typical Processing Steps (Schematic):

:

Adaptation Hearing Model

Comparison,Reference,Reduction,Signal Value

Results ofListening Tests!

S indexQ

General construction of perceptual models

1.0

2.0

3.0

4.0

5.0

1.0 2.0 3.0 4.0 5.0

Auditory MOS

TM

OS

(T

OS

QA

)

single codecsmobile2*tandem3*tandemMNRU

ITU-T 8kbit/sec char. test, exp1ρ = 0.97

1.0

2.0

3.0

4.0

5.0

1.0 2.0 3.0 4.0 5.0

Auditory MOS

TM

OS

(T

OS

QA

)

real connectionssimulationssingle codecs4+tandemMNRU

Combinations of ADPCM Systems ρ = 0.97

1.0

2.0

3.0

4.0

5.0

1.0 2.0 3.0 4.0 5.0

Auditory MOS

TM

OS

(T

OS

QA

)

G.729G.728GSM-EFRsingle codecsMNRU

ETSI - TIPHON

ρ = 0.95

div. low bitrate codecs under packet loss

1.0

2.0

3.0

4.0

5.0

1.0 2.0 3.0 4.0 5.0

Auditory MOS

TM

OS

(T

OS

QA

)

G.723GSM-FR&G.723GSM-FR&G.729single codecsMNRU

ETSI - TIPHON

ρ = 0.95

div. low bitrate codecs under packet loss

TOSQA validation, some Results [Berger]

Page 8: Conversational Speech Quality - The Dominating …hgs/papers/iptel2001/Gierlich.pdf1 Conversational Speech Quality - The Dominating Parameters in VoIP Systems H.W. Gierlich, F. Kettler

8

( )( )

( ){ }S

S

ES

Hss

ft

t=

FF

sending

receiving(double talk)

transmitted signal insending, extractedduring double talk

original test signal insending

• test signal in sending • test signal in receiving• cut off from measured sending signal

• test signal

• transfer function• switching characteristics

Test signal combination for double talk situations

test signal in receiving

measured signal in receiving

switching duringdouble talk

echo duringdouble talk

Principle of double talk tests

Page 9: Conversational Speech Quality - The Dominating …hgs/papers/iptel2001/Gierlich.pdf1 Conversational Speech Quality - The Dominating Parameters in VoIP Systems H.W. Gierlich, F. Kettler

9

strong echo components

clipping

Example of a double talk test

strongclipping

comfort noise

Example of a double talk test

Page 10: Conversational Speech Quality - The Dominating …hgs/papers/iptel2001/Gierlich.pdf1 Conversational Speech Quality - The Dominating Parameters in VoIP Systems H.W. Gierlich, F. Kettler

10

echo components

desyncroni- sation

(delay variation)

Example of a double talk test

),())(()()1((),(24

1 11∑ ∑

= =

+⋅−−+=

i

T

nGGGG niFiiFwiFiFfSNfQ

]− + ⋅F i n w i F iG G( , ) ( , ( )) )1 2

Background noise transmission: Relative Approach [Genuit]

Basic principle of the Relative Approach:

Comparison between short term and long termaveraging of signal energies in critical bands (app. 2 svs. 2 ms) based on a hearing model [Sottek]

Page 11: Conversational Speech Quality - The Dominating …hgs/papers/iptel2001/Gierlich.pdf1 Conversational Speech Quality - The Dominating Parameters in VoIP Systems H.W. Gierlich, F. Kettler

11

Background noise transmission analysis (example)

onset peak, level variation(level = -35,1 dBm0, SLR 16,7 dB)

clear structures between 1,6 and 3,5 kHz

clearly visible structure intime and frequency

Background noise transmission analysis (example)

analysis of thebackground noisewhen call was puton hold(example from thebeginning)

Page 12: Conversational Speech Quality - The Dominating …hgs/papers/iptel2001/Gierlich.pdf1 Conversational Speech Quality - The Dominating Parameters in VoIP Systems H.W. Gierlich, F. Kettler

12

Background noise in sendingsimultaneously with receivingsignal (from loudspeaker)

transmitted signal insending

power densityspectrum

Original -34,6 dBm0comfort noise -36,2 dBm0

spectralanalysis

Comfort noise insertion analysis (example)

Relative Approach analysis

Summary

Summary and outlook:

- Speech quality influenced by * condition and load of the network * interaction of network components * interaction of terminal and network * environmental conditions at the users location

- Test methods available for various parameters

- Further investigations needed on noise transmission „overall quality“ - number