Speech and multimedia Transmission Quality (STQ); Transmission requirements for Super-Wideband / Fullband handset and headset terminals from a QoS perspective as perceived by the user

RTS/STQ-208-1

General Information

Status
Published
Publication Date
26-Mar-2018
Current Stage
12 - Completion
Due Date
30-Apr-2018
Completion Date
27-Mar-2018
Ref Project

Buy Standard

Standard
ETSI TS 102 924 V1.2.1 (2018-03) - Speech and multimedia Transmission Quality (STQ); Transmission requirements for Super-Wideband / Fullband handset and headset terminals from a QoS perspective as perceived by the user
English language
49 pages
sale 15% off
Preview
sale 15% off
Preview

Standards Content (Sample)

ETSI TS 102 924 V1.2.1 (2018-03)






TECHNICAL SPECIFICATION
Speech and multimedia Transmission Quality (STQ);
Transmission requirements for Super-Wideband / Fullband
handset and headset terminals from a QoS perspective
as perceived by the user

---------------------- Page: 1 ----------------------
2 ETSI TS 102 924 V1.2.1 (2018-03)



Reference
RTS/STQ-208-1
Keywords
QoS, terminal
ETSI
650 Route des Lucioles
F-06921 Sophia Antipolis Cedex - FRANCE

Tel.: +33 4 92 94 42 00  Fax: +33 4 93 65 47 16

Siret N° 348 623 562 00017 - NAF 742 C
Association à but non lucratif enregistrée à la
Sous-Préfecture de Grasse (06) N° 7803/88

Important notice
The present document can be downloaded from:
http://www.etsi.org/standards-search
The present document may be made available in electronic versions and/or in print. The content of any electronic and/or
print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any
existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the
print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat.
Users of the present document should be aware that the document may be subject to revision or change of status.
Information on the current status of this and other ETSI documents is available at
https://portal.etsi.org/TB/ETSIDeliverableStatus.aspx
If you find errors in the present document, please send your comment to one of the following services:
https://portal.etsi.org/People/CommiteeSupportStaff.aspx
Copyright Notification
No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying
and microfilm except as authorized by written permission of ETSI.
The content of the PDF version shall not be modified without the written authorization of ETSI.
The copyright and the foregoing restriction extend to reproduction in all media.

© ETSI 2018.
All rights reserved.

TM TM TM
DECT , PLUGTESTS , UMTS and the ETSI logo are trademarks of ETSI registered for the benefit of its Members.
TM TM
3GPP and LTE are trademarks of ETSI registered for the benefit of its Members and
of the 3GPP Organizational Partners.
oneM2M logo is protected for the benefit of its Members.
®
GSM and the GSM logo are trademarks registered and owned by the GSM Association.
ETSI

---------------------- Page: 2 ----------------------
3 ETSI TS 102 924 V1.2.1 (2018-03)
Contents
Intellectual Property Rights . 5
Foreword . 5
Modal verbs terminology . 5
Introduction . 5
1 Scope . 6
2 References . 6
2.1 Normative references . 6
2.2 Informative references . 8
3 Definitions and abbreviations . 8
3.1 Definitions . 8
3.2 Abbreviations . 9
4 Applications and coder considerations . 9
4.1 Applications . 9
4.2 Coder considerations . 10
4.2.0 Premise . 10
4.2.1 Super-wideband (SWB) . 10
4.2.2 Fullband (FB). 11
5 Test considerations and test equipment . 12
5.0 Introduction . 12
5.1 IP half channel measurement adaptor . 12
5.2 Environmental conditions for tests . 12
5.3 Accuracy of measurements and test signal generation . 13
5.4 Network impairment simulation . 13
5.5 Acoustic environment . 14
5.6 Verification of the environmental conditions . 15
5.7 Influence of terminal delay on measurements . 15
5.8 Specific test considerations . 15
5.8.0 Premise . 15
5.8.1 Loudness rating and Loudness . 16
5.8.1.1 Loudness Rating . 16
5.8.1.2 Loudness . 16
5.8.2 Binaural listening . 16
6 Requirements considerations and associated measurement Methodologies . 16
6.1 Considerations . 16
6.2 Test setup. 17
6.2.1 General . 17
6.2.2 Setup for handsets and headsets . 17
6.2.3 Position and calibration of HATS . 18
6.2.4 Test signal and test signal levels . 18
6.2.5 Setup of background noise simulation . 18
6.2.6 Setup of variable echo path . 19
6.3 Coding independent parameters . 20
6.3.1 Send frequency response . 20
6.3.2 Send Loudness Rating (SLR). 23
6.3.3 Mic mute . 23
6.3.4 Linearity range of SLR . 24
6.3.5 Send Distortion . 24
6.3.5.1 Signal to harmonic distortion versus frequency . 24
6.3.5.2 Signal to harmonic distortion for higher input level . 25
6.3.6 Send Noise . 26
6.3.7 Sidetone Masking Rating STMR (Mouth to ear) . 26
6.3.8 Sidetone delay . 27
ETSI

---------------------- Page: 3 ----------------------
4 ETSI TS 102 924 V1.2.1 (2018-03)
6.3.9 Terminal Coupling Loss (TCL) . 27
6.3.10 Stability loss. 28
6.4 Receive parameters. 29
6.4.1 Equalization . 29
6.4.2 Receive Frequency response . 29
6.4.3 Receive Loudness Rating (monaural reproduction) . 32
6.4.4 RLR for stereo/dichotic reproduction . 32
6.4.5 Loudness . 32
6.4.6 Receive Distortion . 32
6.4.7 Minimum activation level and sensitivity in Receive direction . 33
6.4.8 Receive Noise . 33
6.4.9 Automatic level control in receiving . 34
6.4.10 Double talk performance . 34
6.4.10.1 General . 34
6.4.10.2 Attenuation range in send direction during double talk A . 34
H,S,dt
6.4.10.3 Attenuation range in receive direction during double talk A . 35
H,R,dt
6.4.10.4 Detection of echo components during double talk . 36
6.4.11 Switching characteristics . 37
6.4.11.1 Note . 37
6.4.11.2 Activation in send direction . 38
6.4.11.3 Silence suppression and comfort noise generation . 38
6.4.12 Speech and audio quality in presence of noise. 38
6.4.12.1 Performance in send in the presence of background noise . 38
6.4.12.2 Speech quality in the presence of background noise . 39
6.4.12.3 Quality of background noise transmission (with far end speech). 40
6.4.13 Quality of echo cancellation . 41
6.4.13.1 Temporal echo effects . 41
6.4.13.2 Spectral echo attenuation . 41
6.4.13.3 Occurrence of artefacts . 42
6.4.13.4 Variable echo path. 42
6.4.14 Variant impairments; network dependant . 42
6.4.14.1 Clock accuracy send . 42
6.4.14.2 Clock accuracy receive . 43
6.4.14.3 Send packet delay variation. 43
6.4.15 Send and receive delay - round trip delay . 44
6.5 Other parameters . 45
6.5.1 Objective listening quality . 45
6.5.2 Quality of jitter buffer adjustment . 46
Annex A: Void . 48
History . 49

ETSI

---------------------- Page: 4 ----------------------
5 ETSI TS 102 924 V1.2.1 (2018-03)
Intellectual Property Rights
Essential patents
IPRs essential or potentially essential to normative deliverables may have been declared to ETSI. The information
pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found
in ETSI SR 000 314: "Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in
respect of ETSI standards", which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web
server (https://ipr.etsi.org/).
Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee
can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web
server) which are, or may be, or may become, essential to the present document.
Trademarks
The present document may include trademarks and/or tradenames which are asserted and/or registered by their owners.
ETSI claims no ownership of these except for any which are indicated as being the property of ETSI, and conveys no
right to use or reproduce any trademark and/or tradename. Mention of those trademarks in the present document does
not constitute an endorsement by ETSI of products, services or organizations associated with those trademarks.
Foreword
This Technical Specification (TS) has been produced by ETSI Technical Committee Speech and multimedia
Transmission Quality (STQ).
Modal verbs terminology
In the present document "shall", "shall not", "should", "should not", "may", "need not", "will", "will not", "can" and
"cannot" are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of
provisions).
"must" and "must not" are NOT allowed in ETSI deliverables except when used in direct citation.
Introduction
Speech terminals are currently implementing narrowband and wideband bandwidth. Terminal equipment may offer
wider bandwidth, due to features already available in these terminals. Such equipment may implement conversational
features that may benefit of the electroacoustic equipment already available in the terminal and may provide wider
quality for the end users.
.
The present document is intended to provide initial requirements and test methods for such type of equipment
ETSI

---------------------- Page: 5 ----------------------
6 ETSI TS 102 924 V1.2.1 (2018-03)
1 Scope
The present document provides speech & audio transmission performance requirements and measurement methods for
handset and headset functions of super-wideband/fullband terminals. The present document provides requirements in
order to optimize the end to end quality perceived by users.
Users become more sensitive to voice and music quality (for music used in conversational services) when using
ICT/terminal equipment and so are more demanding for further enhancement especially further extension of the audio
coded bandwidth.
For instance, this is the case for high quality conferencing services with music on hold, better background environment
rendering and longer duration than normal point to point calls.
Standardized super-wideband and fullband coders are now available, some being also compatible with wideband
coders.
The present document will consider only conversational services (that may be mixed with other services) and does not
cover the streaming-only services.
Such applications include:
• Speech and audio communication including conferencing.
• Bandwidth extension which may allow usage for some mixed content.
• Super-wideband enhancement coupled with stereo/dichotic.
The send path it can be characterized in two ways:
• The signal picked up by microphone may combine speech, music and every type of environmental signal.
• Direct insertion of any type of signal.
For receive path, signal may be combine two types:
• Communication signals such as described for send path.
• Signal coming from distributed applications (e.g. advertisement, music on hold, etc.).
2 References
2.1 Normative references
References are either specific (identified by date of publication and/or edition number or version number) or
non-specific. For specific references, only the cited version applies. For non-specific references, the latest version of the
referenced document (including any amendments) applies.
Referenced documents which are not found to be publicly available in the expected location might be found at
https://docbox.etsi.org/Reference/.
NOTE: While any hyperlinks included in this clause were valid at the time of publication, ETSI cannot guarantee
their long term validity.
The following referenced documents are necessary for the application of the present document.
[1] Recommendation ITU-T P.501: "Test signals for use in telephonometry".
[2] Recommendation ITU-T P.10/G.100: "Vocabulary for performance and quality of service".
[3] Recommendation ITU-T P.58: "Head and torso simulator for telephonometry".
ETSI

---------------------- Page: 6 ----------------------
7 ETSI TS 102 924 V1.2.1 (2018-03)
[4] Recommendation ITU-T P.581: "Use of head and torso simulator for hands-free and handset
terminal testing".
[5] Recommendation ITU-T P.79: "Calculation of loudness ratings for telephone sets".
[6] Recommendation G.711.1 (2008) Amendment 4 (11/10): "Wideband embedded extension for
G.711 pulse code modulation".
[7] Recommendation ITU-T G.722.1 (annex C): "Low-complexity coding at 24 and 32 kbit/s for
hands-free operation in systems with low frame loss".
[8] Recommendation G.729.1 (05/06): "G.729-based embedded variable bit-rate coder: An 8-32 kbit/s
scalable wideband coder bitstream interoperable with G.729".
[9] Recommendation ITU-T G.718 (06/08)": "Frame error robust narrow-band and wideband
embedded variable bit-rate coding of speech and audio from 8-32 kbit/s".
[10] Recommendation ITU-T G.719: "Low-complexity, full-band audio coding for high-quality,
conversational applications".
[11] ETSI TS 103 224: "Speech and multimedia Transmission Quality (STQ); A sound field
reproduction method for terminal testing including a background noise database".
[12] ETSI ES 202 739: "Speech and multimedia Transmission Quality (STQ);Transmission
requirements for wideband VoIP terminals (handset and headset) from a QoS perspective as
perceived by the user".
[13] ETSI TS 103 739: "Speech and multimedia Transmission Quality (STQ); Transmission
requirements for wideband wireless terminals (handset and headset) from a QoS perspective as
perceived by the user".
[14] Recommendation ITU-T P.863: "Perceptual objective listening quality assessment".
[15] Recommendation ITU-T P.380: "Electro-acoustic measurements on headsets".
[16] IEC 61260-1: "Electroacoustics - Octave-band and fractional-octave-band filters - Part 1:
Specifications".
[17] Void.
[18] Void.
[19] Recommendation ITU-T G.722: "7 kHz audio-coding within 64 kbit/s".
[20] Void.
[21] Recommendation ITU-T G.711.1 (annex F): "Wideband embedded extension for G.711 pulse code
modulation".
[22] Recommendation ITU-T P.57: "Artificial ears".
[23] Recommendation ITU-T P.64: "Determination of sensitivity/frequency characteristics of local
telephone systems".
[24] ISO 3745: "Acoustics -- Determination of sound power levels and sound energy levels of noise
sources using sound pressure -- Precision methods for anechoic rooms and hemi-anechoic rooms".
[25] ETSI TR 126 952: "Universal Mobile Telecommunications System (UMTS); LTE; Codec for
Enhanced Voice Services (EVS); Performance characterization (3GPP TR 26.952 version 12.2.0
Release 12)".
[26] ETSI TS 126 441: "Universal Mobile Telecommunications System (UMTS); LTE; Codec for
Enhanced Voice Services (EVS); General overview (3GPP TS 26.441)".
[27] Recommendation ITU-T P.56: "Objective measurement of active speech level".
ETSI

---------------------- Page: 7 ----------------------
8 ETSI TS 102 924 V1.2.1 (2018-03)
[28] ETSI TS 103 281: "Speech and multimedia Transmission Quality (STQ); Speech quality in the
presence of background noise: Objective test methods for super-wideband and fullband terminals".
[29] Recommendation ITU-T G.122: "Influence of national systems on stability and talker echo in
international connections".
[30] Recommendation ITU-T P.340: "Transmission characteristics and speech quality parameters of
hands-free terminals".
[31] Recommendation ITU-T P.502: "Objective test methods for speech communication systems using
complex test signals".
[32] Recommendation ITU-T P.863.1: "Application Guide for Recommendation ITU-T P.863".
[33] Recommendation ITU-T P.1010: "Fundamental voice transmission objectives for VoIP terminals
and gateways".
[34] IETF RFC 3550: "RTP: A Transport Protocol for Real-Time Applications".
[35] ETSI ES 202 737: "Speech and multimedia Transmission Quality (STQ); Transmission
requirements for narrowband VoIP terminals (handset and headset) from a QoS perspective as
perceived by the user".
[36] IETF RFC 6716: "Definition of the Opus Audio Codec".
2.2 Informative references
References are either specific (identified by date of publication and/or edition number or version number) or
non-specific. For specific references, only the cited version applies. For non-specific references, the latest version of the
referenced document (including any amendments) applies.
NOTE: While any hyperlinks included in this clause were valid at the time of publication, ETSI cannot guarantee
their long term validity.
The following referenced documents are not necessary for the application of the present document but they assist the
user with regard to a particular subject area.
[i.1] ISO 532: "Acoustics -- Method for calculating loudness level".
TM
[i.2] NIST Net .
NOTE: Available at https://www-x.antd.nist.gov/itg/nistnet/.
TM
[i.3] Netem .
NOTE: Available at http://www.linuxfoundation.org/en/Net:Netem.
[i.4] Trace Control for Netem (TCN) (2006): "Trace Control for Netem, Semester Thesis SA-2006-15",
ETH Zürich, A. Keller.
[i.5] ETSI EG 202 425: "Speech Processing, Transmission and Quality Aspects (STQ); Definition and
implementation of VoIP reference point".
[i.6] STQ(15)48-0309: "Objective Codec Evaluation of EVS. HEAD acoustics GmbH".
3 Definitions and abbreviations
3.1 Definitions
For the purposes of the present document, the following terms and definitions apply:
binaural listening: both ears are involved for the perception of sound
ETSI

---------------------- Page: 8 ----------------------
9 ETSI TS 102 924 V1.2.1 (2018-03)
dichotic: relating to or involving the presentation of a stimulus to one ear that differs in some respect (as pitch,
loudness, frequency, or energy) from a stimulus presented to the other ear
diotic: pertaining to or affecting both ears (same signal in both ears)
dual channel mode: audio mode, in which two audio channels with independent programme contents (e.g. bilingual)
are encoded within one audio bit stream
fullband bandwidth: transmission of speech with a nominal bandwidth of 20 Hz - 20 kHz
stereo mode: audio mode in which two channels forming a stereo pair (left and right) are encoded within one bit stream
and for which the coding process is the same as for the Dual channel mode
super-wideband: transmission with supre-wideband bandwith which may cover at least mono capabilities. Stereo
capabilities may be possible
super-wideband bandwidth: transmission of speech with a nominal pass-band wider than 100 Hz to 7 000 Hz, usually
understood to be 50 Hz - 14 000 Hz (definition from Recommendation ITU-T P.10 /G.100 [2])
3.2 Abbreviations
For the purposes of the present document, the following abbreviations apply:
ACR Absolute Category Rating
DRP ear Drum Reference Point
ERP Ear reference Point
EVS Enhanced Voice Services
FB FullBand
GAT Group Audio Terminal
G-MOS-LQO Overall Quality Mean Opinion Score, Listening Quality Objective, fullband
F
HATS Head and Torso Simulator
MCU Multiplexing Control Unit
MRP Mouth Reference Point
MS Mid-sized Stereo
N-MOS-LQO Noise Quality Mean Opinion Score, Listening Quality Objective, fullband
F
POI Point Of Interconnection
SLR Send Loudness Rating
S-MOS-LQO Speech Quality Mean Opinion Score, Listening Quality Objective, fullband
F
SWB Super-WideBand
TCL Terminal Echo Loss
4 Applications and coder considerations
4.1 Applications
The following applications are within the scope of the present document:
• Speech and audio communication including conferencing using high quality hands free systems, for which
super-wideband/f
...

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.