EN 60268-16:2011
(Main)Sound system equipment - Part 16: Objective rating of speech intelligibility by speech transmission index
Sound system equipment - Part 16: Objective rating of speech intelligibility by speech transmission index
IEC 60268-16:2011 specifies objective methods for rating the transmission quality of speech with respect to intelligibility. It provides a comprehensive manual for all types of users of the STI method in the fields of audio, communications and acoustics. Three methods are presented, which are closely related and are referred to as STI, STIPA, and STITEL. The first two methods are intended for rating speech transmission performance with or without sound systems. The STITEL method has more restricted uses. This fourth edition cancels and replaces the third edition, published in 2003, and constitutes a technical revision. It includes the following significant technical changes with respect to the previous edition: - development of more comprehensive, complete and unambiguous standardization of the STI methodology; - the term STIr is discontinued. A new function for the prediction of auditory masking effects is introduced; - the concept of 'speech level' and the setting of the level of the test signal have been introduced; - additional information has been included on prediction and measurement procedures. The French version of this standard has not been voted upon.
Elektroakustische Geräte - Teil 16: Objektive Bewertung der Sprachverständlichkeit durch den Sprachübertragungsindex
Equipements pour systèmes électroacoustiques - Partie 16: Evaluation objective de l'intelligibilité de la parole au moyen de l'indice de transmission de la parole
La CEI 60268-16:2011 spécifie des méthodes objectives d'évaluation de la qualité de transmission de la parole sous l'angle de l'intelligibilité. Elle met un manuel complet à la disposition de tous les types d'utilisateurs de la méthode STI dans les domaines de l'audio, des communications et de l'acoustique. Trois méthodes, étroitement liées, sont présentées sous le nom de STI, STIPA, et STITEL. Les deux premières méthodes sont destinées à l'évaluation des performances de la transmission de la parole avec ou sans systèmes électroacoustiques. La méthode STITEL a des utilisations plus restreintes. Cette quatrième édition annule et remplace la troisième édition parue en 2003, et constitue une révision technique. Elle inclut les modifications techniques importantes suivantes par rapport à l'édition précédente: - Le développement d'une normalisation plus aboutie, plus complète et moins ambiguë de la méthodologie STI; - le terme STIr n'est plus utilisé. Une nouvelle fonction pour la prédiction des effets du masquage auditif est introduite; - le concept de "niveau de parole" et le réglage du niveau du signal d'essai ont été introduits; - des informations supplémentaires relatives aux modes opératoires de prédiction et de mesures ont été incluses. La version française de cette norme n'a pas été soumise au vote.
Oprema za zvokovne sisteme - 16. del: Objektivno ocenjevanje govorne razumljivosti z uporabo indeksa prenosa govora (IEC 60268-16:2011)
Ta del IEC 60268 določa objektivne metode za ocenjevanje kakovosti prenosa govora z zvezi z razumljivostjo.
Cilj tega standarda je zagotoviti celosten priročnik za vse vrste uporabnikov
metode STI pri slušnosti komunikaciji in akustiki.
Ta standard ne zagotavlja kriterijev STI za certifikacijo kanalov prenosa (npr. kriterijev za sistem glas-alarm).
Predstavljene so tri metode, ki so tesno povezane in so omenjene kot STI, STIPA in STITEL. Prvi dve metodi sta namenjeni ocenjevanju delovanja prenosa govora
z zvokovnimi sistemi ali brez njih.
Uporabe metode STITEL so bolj omejene.
OPOMBA: Nobena izmed teh metod ni primerna za merjenje in ocenitev zasebnosti govora in sistemov za maskiranje govora, ker STI ni bil potrjen za zahteve, ki predstavljajo uporabe za zasebnost govora [1]1.
Vključene so naslednje informacije:
• tehnike merjenja;
• tehnike napovedovanja.
General Information
- Status
- Withdrawn
- Publication Date
- 25-Aug-2011
- Withdrawal Date
- 01-Aug-2014
- Technical Committee
- CLC/TC 100X - Audio, video and multimedia systems and equipment and related sub-systems
- Drafting Committee
- IEC/TC 100 - IEC_TC_100
- Parallel Committee
- IEC/TC 100 - IEC_TC_100
- Current Stage
- 9960 - Withdrawal effective - Withdrawal
- Start Date
- 30-Oct-2023
- Completion Date
- 30-Oct-2023
Relations
- Effective Date
- 28-Jan-2023
- Effective Date
- 09-Jan-2018
Frequently Asked Questions
EN 60268-16:2011 is a standard published by CLC. Its full title is "Sound system equipment - Part 16: Objective rating of speech intelligibility by speech transmission index". This standard covers: IEC 60268-16:2011 specifies objective methods for rating the transmission quality of speech with respect to intelligibility. It provides a comprehensive manual for all types of users of the STI method in the fields of audio, communications and acoustics. Three methods are presented, which are closely related and are referred to as STI, STIPA, and STITEL. The first two methods are intended for rating speech transmission performance with or without sound systems. The STITEL method has more restricted uses. This fourth edition cancels and replaces the third edition, published in 2003, and constitutes a technical revision. It includes the following significant technical changes with respect to the previous edition: - development of more comprehensive, complete and unambiguous standardization of the STI methodology; - the term STIr is discontinued. A new function for the prediction of auditory masking effects is introduced; - the concept of 'speech level' and the setting of the level of the test signal have been introduced; - additional information has been included on prediction and measurement procedures. The French version of this standard has not been voted upon.
IEC 60268-16:2011 specifies objective methods for rating the transmission quality of speech with respect to intelligibility. It provides a comprehensive manual for all types of users of the STI method in the fields of audio, communications and acoustics. Three methods are presented, which are closely related and are referred to as STI, STIPA, and STITEL. The first two methods are intended for rating speech transmission performance with or without sound systems. The STITEL method has more restricted uses. This fourth edition cancels and replaces the third edition, published in 2003, and constitutes a technical revision. It includes the following significant technical changes with respect to the previous edition: - development of more comprehensive, complete and unambiguous standardization of the STI methodology; - the term STIr is discontinued. A new function for the prediction of auditory masking effects is introduced; - the concept of 'speech level' and the setting of the level of the test signal have been introduced; - additional information has been included on prediction and measurement procedures. The French version of this standard has not been voted upon.
EN 60268-16:2011 is classified under the following ICS (International Classification for Standards) categories: 33.160.01 - Audio, video and audiovisual systems in general. The ICS classification helps identify the subject area and facilitates finding related standards.
EN 60268-16:2011 has the following relationships with other standards: It is inter standard links to EN 60268-16:2003, EN IEC 60268-16:2020. Understanding these relationships helps ensure you are using the most current and applicable version of the standard.
You can purchase EN 60268-16:2011 directly from iTeh Standards. The document is available in PDF format and is delivered instantly after payment. Add the standard to your cart and complete the secure checkout process. iTeh Standards is an authorized distributor of CLC standards.
Standards Content (Sample)
SLOVENSKI STANDARD
01-november-2011
1DGRPHãþD
SIST EN 60268-16:2003
Oprema za zvokovne sisteme - 16. del: Objektivno ocenjevanje govorne
razumljivosti z uporabo indeksa prenosa govora (IEC 60268-16:2011)
Sound system equipment - Part 16: Objective rating of speech intelligibility by speech
transmission index (IEC 60268-16:2011)
Elektroakustische Geräte - Teil 16: Objektive Bewertung der Sprachverständlichkeit
durch den Sprachübertragungsindex (IEC 60268-16:2011)
(TXLSHPHQWVSRXUV\VWqPHVpOHFWURDFRXVWLTXHV3DUWLH(YDOXDWLRQREMHFWLYHGH
O
LQWHOOLJLELOLWpGHODSDUROHDXPR\HQGHO
LQGLFHGHWUDQVPLVVLRQGHODSDUROH
&(,
Ta slovenski standard je istoveten z: EN 60268-16:2011
ICS:
33.160.30 Avdio sistemi Audio systems
2003-01.Slovenski inštitut za standardizacijo. Razmnoževanje celote ali delov tega standarda ni dovoljeno.
EUROPEAN STANDARD
EN 60268-16
NORME EUROPÉENNE
August 2011
EUROPÄISCHE NORM
ICS 33.160.01 Supersedes EN 60268-16:2003
English version
Sound system equipment -
Part 16: Objective rating of speech intelligibility by speech transmission
index
(IEC 60268-16:2011)
Equipements pour systèmes Elektroakustische Geräte -
électroacoustiques - Teil 16: Objektive Bewertung der
Partie 16: Evaluation objective de Sprachverständlichkeit durch den
l'intelligibilité de la parole au moyen de Sprachübertragungsindex
l'indice de transmission de la parole (IEC 60268-16:2011)
(CEI 60268-16:2011)
This European Standard was approved by CENELEC on 2011-08-02. CENELEC members are bound to comply
with the CEN/CENELEC Internal Regulations which stipulate the conditions for giving this European Standard
the status of a national standard without any alteration.
Up-to-date lists and bibliographical references concerning such national standards may be obtained on
application to the Central Secretariat or to any CENELEC member.
This European Standard exists in three official versions (English, French, German). A version in any other
language made by translation under the responsibility of a CENELEC member into its own language and notified
to the Central Secretariat has the same status as the official versions.
CENELEC members are the national electrotechnical committees of Austria, Belgium, Bulgaria, Croatia, Cyprus,
the Czech Republic, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Iceland, Ireland, Italy,
Latvia, Lithuania, Luxembourg, Malta, the Netherlands, Norway, Poland, Portugal, Romania, Slovakia, Slovenia,
Spain, Sweden, Switzerland and the United Kingdom.
CENELEC
European Committee for Electrotechnical Standardization
Comité Européen de Normalisation Electrotechnique
Europäisches Komitee für Elektrotechnische Normung
Management Centre: Avenue Marnix 17, B - 1000 Brussels
© 2011 CENELEC - All rights of exploitation in any form and by any means reserved worldwide for CENELEC members.
Ref. No. EN 60268-16:2011 E
Foreword
The text of document 100/1812/FDIS, future edition 4 of IEC 60268-16, prepared by IEC TC 100, Audio,
video and multimedia systems and equipment, was submitted to the IEC-CENELEC parallel vote and was
approved by CENELEC as EN 60268-16 on 2011-08-02.
This European Standard supersedes EN 60268-16:2003.
— development of more comprehensive, complete and unambiguous standardization of the STI
methodology;
— the term STI is discontinued. A new function for the prediction of auditory masking effects is
r
introduced;
— the concept of 'speech level' and the setting of the level of the test signal have been introduced;
— additional information has been included on prediction and measurement procedures.
Attention is drawn to the possibility that some of the elements of this document may be the subject of
patent rights. CEN and CENELEC shall not be held responsible for identifying any or all such patent
rights.
The following dates were fixed:
– latest date by which the EN has to be implemented
at national level by publication of an identical
(dop) 2012-05-02
national standard or by endorsement
– latest date by which the national standards conflicting
(dow) 2014-08-02
with the EN have to be withdrawn
Annex ZA has been added by CENELEC.
__________
Endorsement notice
The text of the International Standard IEC 60268-16:2011 was approved by CENELEC as a European
Standard without any modification.
In the official version, for Bibliography, the following notes have to be added for the standards indicated:
[2] IEC 60318-1:2009 NOTE Harmonized as EN 60318-1:2009 (not modified).
[3] IEC 61672 series NOTE Harmonized in EN 61672 series.
[37] ISO 7029:2000 NOTE Harmonized as EN ISO 7029:2000.
[44] ISO 3382-1:2009 NOTE Harmonized as EN ISO 3382-1:2009.
__________
- 3 - EN 60268-16:2011
Annex ZA
(normative)
Normative references to international publications
with their corresponding European publications
The following referenced documents are indispensable for the application of this document. For dated
references, only the edition cited applies. For undated references, the latest edition of the referenced
document (including any amendments) applies.
NOTE When an international publication has been modified by common modifications, indicated by (mod), the relevant EN/HD
applies.
Publication Year Title EN/HD Year
IEC 61260 1995 Electroacoustics - Octave-band EN 61260 1995
+ A1 2001 and fractional-octave-band filters + A1 2001
ISO 18233 2006 Acoustics - Application of new EN ISO 18233 2006
measurement methods in building and room
acoustics
IEC 60268-16 ®
Edition 4.0 2011-06
INTERNATIONAL
STANDARD
Sound system equipment –
Part 16: Objective rating of speech intelligibility by speech transmission index
INTERNATIONAL
ELECTROTECHNICAL
COMMISSION
PRICE CODE
XB
ICS 33.160.01 ISBN 978-2-88912-522-7
– 2 – 60268-16 IEC:2011(E)
CONTENTS
FOREWORD . 5
INTRODUCTION . 7
1 Scope . 9
2 Normative references . 9
3 Terms and definitions . 9
4 Description of the STI method . 15
4.1 General . 15
4.1.1 Rationale for the STI method . 15
4.1.2 Applicability of the STI method . 15
4.2 Background of the STI method . 16
4.2.1 General . 16
4.2.2 Theoretical overview . 17
4.2.3 Measurement of STI . 18
4.3 Applicability of STI test methods . 19
4.4 Use of direct and indirect methods . 20
4.5 Limitations of the STI method . 21
4.5.1 General . 21
4.5.2 Frequency shifts . 21
4.5.3 Centre clipping . 22
4.5.4 Drop outs . 22
4.5.5 Jitter . 22
4.5.6 Vocoders . 22
4.5.7 Overestimation of STI under low background noise conditions . 22
4.5.8 Frequency response . 22
4.5.9 Echoes . 23
4.5.10 Fast amplitude compression and expansion . 23
4.5.11 Non-linear distortion . 24
4.5.12 Impulsive and fluctuating noise . 24
4.5.13 Hearing impaired listeners . 24
4.6 Conclusion . 24
5 Direct method of measuring STI . 24
5.1 Overview . 24
5.2 STIPA . 25
5.3 Application . 26
5.4 Limitations . 26
6 Indirect method of measuring STI using the impulse response . 26
6.1 Overview . 26
6.2 Application . 27
6.3 Limitations (non-linear distortion) . 28
7 Measurement procedures, post-processing of data and applications . 28
7.1 General . 28
7.2 Acoustical input . 28
7.3 Acoustical output . 30
7.4 Electrical input . 30
7.5 Electrical output . 30
7.6 Examples of input/output combinations. 30
60268-16 IEC:2011(E) – 3 –
7.6.1 Acoustical input – Acoustical output . 30
7.6.2 Electrical input – Electrical output (e.g. assessment of wired and
wireless) communication systems) . 30
7.6.3 Acoustical input – Electrical output (e.g. assessment of microphones) . 31
7.6.4 Electrical input – Acoustical output (e.g. assessment of PA systems) . 31
7.7 Post-processing of measured MTF data . 31
7.8 Issues concerning noise . 31
7.8.1 General . 31
7.8.2 Measurement of background noise . 32
7.8.3 Fluctuating noise . 32
7.9 Analysis and interpretation of the results . 32
7.10 Binaural STI measurements . 33
8 Use of STI as a design prediction tool . 33
8.1 Overview . 33
8.2 Statistical predictions . 33
8.3 Prediction from simulated impulse response . 34
Annex A (normative) Speech transmission index (STI) and revised STI methods . 35
Annex B (normative) STIPA method . 47
Annex C (normative) STITEL method . 48
Annex D (informative) RASTI method (obsolete) . 49
Annex E (informative) Qualification of the STI and relationships with other speech
intelligibility measures. 51
Annex F (informative) Nominal qualification bands for STI . 53
Annex G (informative) Examples of STI qualification bands and typical applications . 54
Annex H (informative) Non-native listeners . 55
Annex I (informative) Effect of age-related hearing loss and hearing impairment on
speech intelligibility. 56
Annex J (normative) Calibration of STI test signal level . 57
Annex K (informative) Example test report sheet for STI measurements . 59
Annex L (normative) Prediction of STI using statistical methods . 61
Annex M (informative) Adjustments to measured STI and STIPA results for simulation
of occupancy noise and different speech levels . 63
Annex N (informative) Other methods of determining speech intelligibility . 67
Bibliography . 69
Figure 1 – Concept of the reduction in modulation due to a transmission channel . 16
Figure 2 – Modulation transfer function – Input/output comparison . 17
Figure A.1 – Envelope function (panel A) of a 10 s speech signal for the 250 Hz
octave band and corresponding envelope spectrum (panel B) . 35
Figure A.2 – Theoretical expression of the MTF . 36
Figure A.3 – Measurement system and frequencies for the STI method . 38
Figure A.4 – Auditory masking of octave band (k – 1) on octave band (k) . 39
Figure D.1 – Illustration of a practical RASTI test signal . 50
Figure E.1 – Relationships between some speech intelligibility measures . 51
Figure E.2 – Relationship between STI, speech intelligibility scores and listening
difficulty ratings [34], [35] . 52
Figure F.1 – STI qualification bands . 53
– 4 – 60268-16 IEC:2011(E)
Table 1 – Comparison of STI test methods for different types of distortion . 19
Table 2 – Applicability of test . 20
Table 3 – Choice of method . 21
Table A.1 – Auditory masking as a function of the octave band level. 40
Table A.2 – Absolute speech reception threshold level in octave bands . 41
Table A.3 – MTI octave band weighting factors . 41
Table A.4 – Octave band levels (dB) relative to the A-weighted speech level . 42
Table B.1 – Modulation frequencies for the STIPA method . 47
Table C.1 – Modulation frequencies for the STITEL method . 48
Table D.1 – Modulation frequencies for the RASTI method . 49
Table E.1 – Categories for listening difficulty . 52
Table G.1 – Examples between STI qualification bands and typical applications . 54
Table H.1 – Adjusted intelligibility qualification tables for non-native listeners . 55
Table I.1 – Adjusted intelligibility qualification tables for normal listeners and people
over 60 years old with hearing loss . 56
Table M.1 – Example calculation . 63
60268-16 IEC:2011(E) – 5 –
INTERNATIONAL ELECTROTECHNICAL COMMISSION
____________
SOUND SYSTEM EQUIPMENT –
Part 16: Objective rating of speech intelligibility
by speech transmission index
FOREWORD
1) The International Electrotechnical Commission (IEC) is a worldwide organization for standardization comprising
all national electrotechnical committees (IEC National Committees). The object of IEC is to promote
international co-operation on all questions concerning standardization in the electrical and electronic fields. To
this end and in addition to other activities, IEC publishes International Standards, Technical Specifications,
Technical Reports, Publicly Available Specifications (PAS) and Guides (hereafter referred to as “IEC
Publication(s)”). Their preparation is entrusted to technical committees; any IEC National Committee interested
in the subject dealt with may participate in this preparatory work. International, governmental and non-
governmental organizations liaising with the IEC also participate in this preparation. IEC collaborates closely
with the International Organization for Standardization (ISO) in accordance with conditions determined by
agreement between the two organizations.
2) The formal decisions or agreements of IEC on technical matters express, as nearly as possible, an international
consensus of opinion on the relevant subjects since each technical committee has representation from all
interested IEC National Committees.
3) IEC Publications have the form of recommendations for international use and are accepted by IEC National
Committees in that sense. While all reasonable efforts are made to ensure that the technical content of IEC
Publications is accurate, IEC cannot be held responsible for the way in which they are used or for any
misinterpretation by any end user.
4) In order to promote international uniformity, IEC National Committees undertake to apply IEC Publications
transparently to the maximum extent possible in their national and regional publications. Any divergence
between any IEC Publication and the corresponding national or regional publication shall be clearly indicated in
the latter.
5) IEC itself does not provide any attestation of conformity. Independent certification bodies provide conformity
assessment services and, in some areas, access to IEC marks of conformity. IEC is not responsible for any
services carried out by independent certification bodies.
6) All users should ensure that they have the latest edition of this publication.
7) No liability shall attach to IEC or its directors, employees, servants or agents including individual experts and
members of its technical committees and IEC National Committees for any personal injury, property damage or
other damage of any nature whatsoever, whether direct or indirect, or for costs (including legal fees) and
expenses arising out of the publication, use of, or reliance upon, this IEC Publication or any other IEC
Publications.
8) Attention is drawn to the Normative references cited in this publication. Use of the referenced publications is
indispensable for the correct application of this publication.
9) Attention is drawn to the possibility that some of the elements of this IEC Publication may be the subject of
patent rights. IEC shall not be held responsible for identifying any or all such patent rights.
International Standard IEC 60268-16 has been prepared by IEC technical committee 100:
Multimedia equipment and systems.
This fourth edition cancels and replaces the third edition, published in 2003, and constitutes a
technical revision.
This edition includes the following significant technical changes with respect to the previous
edition:
• development of more comprehensive, complete and unambiguous standardization of the
STI methodology;
• the term STI is discontinued. A new function for the prediction of auditory masking effects
r
is introduced;
• the concept of ‘speech level’ and the setting of the level of the test signal have been
introduced;
– 6 – 60268-16 IEC:2011(E)
• additional information has been included on prediction and measurement procedures.
NOTE See Introduction for a historical summary referring to the various changes from the first to the fourth
edition (current edition).
The text of this standard is based on the following documents:
FDIS Report on voting
100/1812/FDIS 100/1849/RVD
Full information on the voting for the approval of this standard can be found in the report on
voting indicated in the above table.
This publication has been drafted in accordance with the ISO/IEC Directives, Part 2.
A list of all the parts in the IEC 60268 series, published under the general title Sound system
equipment can be found on the IEC website.
The committee has decided that the contents of this publication will remain unchanged until
the stability date indicated on the IEC web site under "http://webstore.iec.ch" in the data
related to the specific publication. At this date, the publication will be
• reconfirmed,
• withdrawn,
• replaced by a revised edition, or
• amended.
A bilingual version of this publication may be issued at a later date.
60268-16 IEC:2011(E) – 7 –
INTRODUCTION
The Speech Transmission Index (STI) is an objective measure to predict the intelligibility of
speech transmitted from talker to listener by a transmission channel. The STI method has
been the subject of ongoing development and refinement since its introduction in the 1970s.
Major improvements of the STI have been consolidated by incorporating them in successive
revisions of IEC 60268-16.
The history of revisions is as follows.
• Revision 1: 1988. In the first version of the STI standard, a gender-independent test
signal spectrum was used.
• Revision 2: 1998. Gender specific test signals were introduced, for male and female
talkers, each gender relating to a specific set of weighting factors. In addition, weightings
were introduced for redundancy factors. The term STI was introduced to signify the use of
r
these redundancy factors.
• Revision 3: 2003. Important differences between Revision 2 and Revision 3 are the
introduction of
− level dependent masking functions,
− the STI derivative STIPA.
STIPA was specially developed as a fast measurement method that could deal with
electro-acoustic and acoustic effects while determining the speech transmission quality of
PA systems.
• Revision 4: 2010. The aim of Revision 4 (this revision) is to provide a more
comprehensive, complete and unambiguous standardization of the STI methodology. The
term STI is now discontinued. A new function for the prediction of auditory masking
r
effects is introduced.
Speech is considered to be the major method of communication between humans. In many
situations the speech signal is degraded by the signal path or the transmission channel
between talker and listener, resulting in a reduction of the intelligibility of the speech at the
listener’s location.
To quantify the deterioration of the speech intelligibility induced by the transmission channel,
a fast and objective measuring method was developed; the Speech Transmission Index (STI).
The STI method applies a specific test signal to the transmission channel and by analysing
the received test signal; the speech transmission quality of the channel is derived and
expressed in a value between 0 and 1, as the Speech Transmission Index (STI). Using the
obtained STI-value, the potential speech intelligibility can be determined.
Although there are limitations to the STI method, the use of STI has proved useful in many
situations and has gained international acceptance.
Items that have changed in this revision
Specific changes that have been incorporated in this revision are:
• refinement of the STI model with respect to the level dependent masking function;
• Room Acoustic Speech Transmission Index (RASTI) has become obsolete and should not
be used;
• calculations to add or remove the effects of background noise and to change the speech
level and a worked example;
• notes regarding limitations of the STI method;
• methods to predict the STI performance of transmission channels based on the predicted
(as distinct from measured) performance of parts or all of the transmission channel;
– 8 – 60268-16 IEC:2011(E)
• introduction of STI corrections for non-native language listeners;
• introduction of STI corrections for listeners with some specific forms of hearing loss;
• relationships between STI and ‘Listening Difficulty’ scale.
Potential applications of STI
STI may be used to measure the potential intelligibility of a wide range of electronic systems
and acoustic environments. Typical applications include:
• measurement of Public Address and Sound Reinforcement Systems;
• measurement and Certification of Voice Alarm and emergency sound systems;
• measurement of communication channels / systems such as intercoms and wireless
communication;
• measurement of potential speech intelligibility and communication in rooms and auditoria;
• evaluation of direct speech communication (situations without electronic amplification) in
rooms or acoustic spaces including vehicles;
• evaluation of the potential intelligibility of Assistive Hearing Systems;
NOTE The STI method is not validated for the measurement and evaluation of speech privacy or speech masking
systems.
Potential users of STI
The range of users of STI measurements is diverse. Among the users who may apply this
method are:
• certifiers of voice alarm and other types of emergency systems;
• certifiers of sound reinforcement and audio systems;
• audio and telecommunication equipment manufacturers;
• audio and communication engineers;
• acoustical and electro-acoustical engineers;
• sound system installers;
• researchers into STI methods and developers of instruments to measure STI.
To avoid misinterpretation of STI results, it is important that all users have an understanding
of the basic principles, the application domain and its limitations.
60268-16 IEC:2011(E) – 9 –
SOUND SYSTEM EQUIPMENT –
Part 16: Objective rating of speech intelligibility
by speech transmission index
1 Scope
This part of IEC 60268 specifies objective methods for rating the transmission quality of
speech with respect to intelligibility.
The objective of this standard is to provide a comprehensive manual for all types of users of
the STI method in the fields of audio, communications and acoustics.
This standard does not provide STI criteria for certification of transmission channels (e.g.
criteria for a voice-alarm system).
Three methods are presented, which are closely related and are referred to as STI, STIPA,
and STITEL. The first two methods are intended for rating speech transmission performance
with or without sound systems. The STITEL method has more restricted uses.
NOTE None of the methods are suitable for the measurement and assessment of speech privacy and speech
masking systems, as STI has not been validated for conditions that represent speech privacy applications [1] .
The following information is included:
• measurement techniques;
• prediction techniques.
2 Normative references
The following referenced documents are indispensable for the application of this document.
For dated references, only the edition cited applies. For undated references, the latest edition
of the referenced document (including any amendments) applies:
IEC 61260:1995, Electroacoustics – Octave-band and fractional-octave-band filters
Amendment 1 (2001)
ISO 18233:2006, Acoustics – Application of new measurement methods in building and room
acoustics
3 Terms and definitions
For the purpose of this document, the following terms and definitions apply.
3.1
speech intelligibility
rating of the proportion of speech that is understood
—————————
Figures in square brackets refer to the Bibliography.
– 10 – 60268-16 IEC:2011(E)
3.2
speech quality
rating of sound quality of a speech signal
3.3
speech transmission index
STI
metric ranging between 0 and 1 representing the transmission quality of speech with respect
to intelligibility by a speech transmission channel
3.4
speech intelligibility index
SII
objective method for prediction of speech intelligibility based on the Articulation Index
3.5
STI method
FULL STI
objective method for prediction and measurement of the speech transmission index that uses
14 modulation frequencies over a range of 7 octave bands
3.6
distortion
any unintentional and generally undesired change of the form of a signal occurring in a
speech transmission channel
NOTE Distortion can include both linear and non-linear effects in both frequency and time domain.
3.7
speech transmission index for public address systems
STIPA
method obtained by using a condensed version of the STI method but still responsive to
distortions found in room acoustics and/or public address systems
NOTE STIPA is applied as a direct method.
3.8
speech transmission index for telecommunication systems
STITEL
method obtained by using a condensed version of the STI method but still responsive to
distortions found in communication systems
NOTE STITEL is applied as a direct method.
3.9
room acoustical speech transmission index
RASTI
method obtained by using a condensed version of the STI method, to be used for screening
purposes only and focused on direct communication between persons without making use of
an electro-acoustic communication system
NOTE 1 RASTI accounts for noise interference and distortions in the time domain (echoes, reverberation).
NOTE 2 RASTI is now obsolete.
3.10
direct STI method
method using modulated (speech like) test signals to directly measure the modulation transfer
function
60268-16 IEC:2011(E) – 11 –
3.11
indirect STI method
method using the impulse response and forward energy integral (Schroeder integral) to derive
the modulation transfer function
3.12
speech transmission channel
acoustic or electro-acoustic signal path between a talker and a listener
3.13
public address system
PA
electronic sound distribution system, employing microphones, amplifiers and loudspeakers,
used to reinforce or amplify a given sound (such as an announcement or a pre-recorded
message) and distributing the sound within a building or a space
3.14
voice alarm system
VAS
sound distribution system that broadcasts speech messages or warning signals, or both, in an
emergency
3.15
real speech level
signal level of a speech signal in dB A where only the segments that contribute to the speech
signal are taken into account; pauses and silences between words and sentences are ignored
NOTE See also Annex J.
3.16
reference speech level
speech level equivalent to 60 dB A at 1 m distance in front of the talker’s mouth
3.17
vocal effort
exertion of the speaker, quantified objectively by the A-weighted speech level at 1 m distance
in front of the mouth and qualified subjectively by a description
3.18
artificial mouth
device consisting of a loudspeaker mounted in an enclosure and having a directivity and
radiation pattern similar to those of the average human mouth
NOTE The degree of similarity required cannot be easily specified and depends on the particular application. See
for example ITU-T P.50 [47].
3.19
non-native speaker
person speaking a language which is different from the language that was learned as primary
language during the childhood of the speaker
3.20
absolute speech reception threshold
absolute threshold of hearing increased by the minimal required dynamic range for the correct
recognition of speech
– 12 – 60268-16 IEC:2011(E)
3.21
auditory masking
process by which the threshold of hearing (audibility) for one sound is raised by the presence
of another (masking) sound
NOTE Within the STI method, auditory masking is also referred to as the upward spread of masking.
3.22
artificial ear
device with similar characteristics as the human ear for the reception of acoustic signals
NOTE See IEC 60318 [2].
3.23
intensity function
the squared amplitude signal as a function of time
3.24
envelope function
envelope of the intensity function
3.25
envelope spectrum
relative contribution of spectral components of the envelope function
3.26
modulation frequency
frequency of the sinusoidal variation of the envelope function
NOTE The modulation frequency f is expressed in Hertz (Hz).
m
3.27
modulation index
value between 0 and 1 that describes the depth of a sinusoidal modulation of the intensity
function
3.28
modulation transfer ratio
ratio between the modulation depth of the received and the original (transmitted) modulation
depth of the intensity function
3.29
modulation transfer function
MTF
modulation transfer ratios as a function of the modulation frequency
3.30
octave band weighting factor
α
relative contribution of each octave band to the speech transmission index
3.31
octave band redundancy factor
β
fraction of information overlap between two adjacent octave bands with respect to the speech
intelligibility
60268-16 IEC:2011(E) – 13 –
3.32
background noise
all sounds including noise remaining in the absence of the speech or test signal
3.33
fluctuating noise
continuous sound or noise whose sound pressure level varies significantly, but not in an
impulsive manner, during the observation period
3.34
impulsive noise
sound or noise characterized by brief bursts of sound pressure
3.35
signal-to-noise ratio
SNR
difference between the sound pressure level of the speech or test signal and the sound
pressure level of the background noise where the sound pressure levels are determined with
a standardized frequency weighting
NOTE The signal-to-noise ratio SNR is expressed in decibels (dB).
3.36
effective signal-to-noise ratio
SNR
eff
difference between the level of the intensity modulation and the level of the intensity of all the
distortions of a received STI test signal
NOTE 1 The effective signal-to-noise ratio is expressed in decibels (dB).
NOTE 2 Examples of distortions are reverberation field levels, ambient noise levels, non-linear distortion levels
and masking levels.
3.37
crest factor
difference between the peak and the RMS sound pressure levels during a given time-interval
NOTE The crest-factor is expressed in decibels (dB).
3.38
Lombard effect
spontaneous increase of the vocal effort induced by the increase of the ambient noise level at
the speaker’s ear
NOTE Voice pitch shift at higher talking levels is not accounted for here.
3.39
fractional-octave-band filter
bandpass filter for which the ratio of upper cut-off frequency f to lower cut-off frequency f is
2 1
two raised to an exponent equal to the fraction of an octave band
1/b
NOTE 1 In symbols, the ratio of the cut-off frequencies is f /f = 2 , with 1/b denoting the fraction of an octave.
2 1
1/2
EXAMPLE 1 For half-octave band filters, the frequency ratio is 2 = √2.
EXAMPLE 2 For octave band filters, the frequency ratio is 2.
NOTE 2 For further information, refer to IEC 61260.
– 14 – 60268-16 IEC:2011(E)
3.40
reference sound pressure
p
sound pressure, conventionally chosen to be equal to 20 μPa for airborne sound
3.41
sound pressure level
twenty times the logarithm to the base ten of the ratio of RMS sound pressure to the reference
sound pressure
NOTE The sound pressure level is expressed in decibels (dB). The notation is L .
p
3.42
equivalent continuous sound pressure level
ten-fold logarithm to the base ten of the ratio of the squared RMS sound pressure level for a
given time-interval to the squared reference sound pressure.
NOTE 1 The sound pressure level L is given by the following equation:
eq,T
t
p (t)dt
∫
T
t
L =10lg
eq,T
p
where
p(t) is the instantaneous sound pressure at time t;
t is the integration variable for time;
T = t – t , is the length of the time interval, for which the continuous sound pressure level is determined and
2 1
p the reference sound pressure (20 µPa).
The numerator in the argument of the logarithm in the given equation is the RMS sound pressure for the averaging
time T.
NOTE 2 As a matter of principle, no time-weighting is applied in the determination of the continuous sound
pressure level.
NOTE 3 For further definitions, see IEC 61672 [3].
3.43
percentile level
ten-fold decimal logarithm of the ratio of the squared RMS sound pressure level being
exceeded for a given part of the measurement time to the squared reference sound pressure
where the RMS sound pressure is determined with a standardized time and frequency
weighting, e.g., L or L
10 A10
NOTE 1 For application within the framework of this standard, the time-weighting “Fast” is to be applied for the
determination of the percentile level.
NOTE 2 L is the A and Fast-weighted sound pressure level being exceeded in 10 % of the measurement time.
A10
3.44
modulation transfer index
MTI
unweighted mean of the scaled effective signal to noise ratios for a given octave band
3.45
operational speech level
sound pressure level of speech signal that will be used or is found in the applicable situation
60268-16 IEC:2011(E) – 15 –
3.46
operational background noise level
sound pressure level of background that will be present or is found in the applicable situation
NOTE This level is used for predictions and post-processing of measurements.
4 Description of the STI method
4.1 General
4.1.1 Rationale for the STI method
The STI method was developed as a fast and objective test method for determining the
speech transmission quality of speech transmission channels. Using the speech transmission
index, the speech intelligibility can be predicted for different types of word and sentence
formats for a wide range of speech transmission systems.
In speech, the intensity of the signal varies with time producing a variation in the intensity
envelope of the speech. Slow fluctuations of the intensity envelope correspond with word and
sentence boundaries while fast fluctuations coincide with individual phonemes within words.
Within the STI concept, preservation of the intensity envelope is considered to be of the
utmost importance.
In contrast to the original approach of the articulation index [4], which is based on the signal
to noise ratios in different speech spectral bands, the STI measurement determines the
degree to which the intensity envelope of the speech signal is affected by a transmission
channel. A Modulation Transfer Function (MTF) is determined which quantifies how the
channel affects the intensity envelope of the speech signal.
The STI produces a metric on a scale of 0 to 1, based on weighted contributions from a range
of frequency bands present in speech.
The STI method and its derivatives (see below) can be used to determine the potential
intelligibility of a speech transmission channel at various locations and for various conditions.
In particular, the effect of changes in the acoustic properties of spaces can be assessed.
4.1.2 Applicability of the STI method
The STI method is an objective and validated measure of speech transmission quality for a
wide range of acoustic and electro-acoustical distortions that influence intelligibility. However,
as it is a simplification of human speech, the STI model can be limited in its applicability.
Users of the STI method that apply the method beyond its current limits may obtain inaccurate
intelligibility predictions. An overview of the applications and limitations is therefore given that
aims to help STI users decide on which STI method is most suitable for their application, so
as to obtain the most meaningful and accurate results.
The STI method was validated for an acoustic output using a single omnidirectional
microphone. The use of a directional microphone produces different and uncorrelatable
results and is not normally advised. Further information is given in clause 7.10.
If the situation or the transmission channel does not allow the use of STI methods, alternative
techniques for assessing intelligibility shall be used. Other methods exist to assess the quality
of speech communication, each with their advantages and disadvantages and therefore have
different users. Annex N describes a number of other measures of intelligibility.
---------------
...










Questions, Comments and Discussion
Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.
Loading comments...