ETSI TS 102 563 V2.1.1 (2017-01)
RTS/JTC-DAB-82
General Information
Standards Content (sample)
ETSI TS 102 563 V2.1.1 (2017-01)
TECHNICAL SPECIFICATION
Digital Audio Broadcasting (DAB);
DAB+ audio coding (MPEG HE-AACv2)
---------------------- Page: 1 ----------------------
2 ETSI TS 102 563 V2.1.1 (2017-01)
Reference
RTS/JTC-DAB-82
Keywords
audio, broadcasting, coding, DAB, digital
ETSI
650 Route des Lucioles
F-06921 Sophia Antipolis Cedex - FRANCE
Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16
Siret N° 348 623 562 00017 - NAF 742 C
Association à but non lucratif enregistrée à la
Sous-Préfecture de Grasse (06) N° 7803/88
Important notice
The present document can be downloaded from:
http://www.etsi.org/standards-search
The present document may be made available in electronic versions and/or in print. The content of any electronic and/or
print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any
existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the
print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat.
Users of the present document should be aware that the document may be subject to revision or change of status.
Information on the current status of this and other ETSI documents is available at
https://portal.etsi.org/TB/ETSIDeliverableStatus.aspxIf you find errors in the present document, please send your comment to one of the following services:
https://portal.etsi.org/People/CommiteeSupportStaff.aspxCopyright Notification
No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying
and microfilm except as authorized by written permission of ETSI.The content of the PDF version shall not be modified without the written authorization of ETSI.
The copyright and the foregoing restriction extend to reproduction in all media.© European Telecommunications Standards Institute 2017.
© European Broadcasting Union 2017.
All rights reserved.
TM TM TM
DECT , PLUGTESTS , UMTS and the ETSI logo are Trade Marks of ETSI registered for the benefit of its Members.
3GPP and LTE™ are Trade Marks of ETSI registered for the benefit of its Members and
of the 3GPP Organizational Partners.GSM® and the GSM logo are Trade Marks registered and owned by the GSM Association.
ETSI---------------------- Page: 2 ----------------------
3 ETSI TS 102 563 V2.1.1 (2017-01)
Contents
Intellectual Property Rights ................................................................................................................................ 5
Foreword ............................................................................................................................................................. 5
Modal verbs terminology .................................................................................................................................... 5
1 Scope ........................................................................................................................................................ 6
2 References ................................................................................................................................................ 6
2.1 Normative references ......................................................................................................................................... 6
2.2 Informative references ........................................................................................................................................ 6
3 Definitions, abbreviations and arithmetic operators ................................................................................. 6
3.1 Definitions .......................................................................................................................................................... 6
3.2 Abbreviations ..................................................................................................................................................... 7
3.3 Arithmetic operators ........................................................................................................................................... 7
4 Introduction .............................................................................................................................................. 7
5 Audio ........................................................................................................................................................ 8
5.1 HE-AACv2 audio coding ................................................................................................................................... 8
5.2 Audio super framing syntax ............................................................................................................................... 9
5.3 MPEG Surround ............................................................................................................................................... 12
5.3.1 Overview .................................................................................................................................................... 12
5.3.2 Requirements for MPEG Surround encoders and decoders ........................................................................ 13
5.3.3 Operational aspects of broadcasting ........................................................................................................... 13
5.3.4 Receiver implementation aspects ................................................................................................................ 13
5.4 Programme Associated Data (PAD) ................................................................................................................. 14
5.4.0 Introduction................................................................................................................................................. 14
5.4.1 PAD insertion ............................................................................................................................................. 14
5.4.2 Coding of F-PAD and X-PAD .................................................................................................................... 15
5.4.3 PAD extraction ........................................................................................................................................... 15
6 Transport error coding and interleaving ................................................................................................. 16
6.0 Introduction ...................................................................................................................................................... 16
6.1 RS coding ......................................................................................................................................................... 16
6.2 Formation of the coding array .......................................................................................................................... 17
6.3 Formation of the parity array ............................................................................................................................ 17
6.4 Formation of the output array ........................................................................................................................... 17
6.5 Order of data transmission................................................................................................................................ 17
7 Signalling ............................................................................................................................................... 17
7.1 FIC signalling ................................................................................................................................................... 17
7.2 Audio parameter signalling .............................................................................................................................. 17
8 Reconfiguration ...................................................................................................................................... 18
Annex A (normative): Error concealment ......................................................................................... 19
A.0 Introduction ............................................................................................................................................ 19
A.1 AAC error concealment.......................................................................................................................... 19
A.1.0 Introduction ...................................................................................................................................................... 19
A.1.1 Interpolation of one corrupt AU ....................................................................................................................... 19
A.1.2 Fade-out and fade-in ......................................................................................................................................... 20
A.2 SBR error concealment .......................................................................................................................... 20
A.3 Parametric stereo error concealment ...................................................................................................... 22
A.4 MPEG Surround error concealment ....................................................................................................... 22
Annex B (informative): Implementation tips for PAD insertion........................................................ 23
ETSI---------------------- Page: 3 ----------------------
4 ETSI TS 102 563 V2.1.1 (2017-01)
Annex C (informative): Synchronizing to the audio super frame structure ..................................... 24
Annex D (informative): Processing a super frame .............................................................................. 26
Annex E (informative): Bit-rate available for audio ........................................................................... 27
History .............................................................................................................................................................. 28
ETSI---------------------- Page: 4 ----------------------
5 ETSI TS 102 563 V2.1.1 (2017-01)
Intellectual Property Rights
IPRs essential or potentially essential to the present document may have been declared to ETSI. The information
pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found
in ETSI SR 000 314: "Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in
respect of ETSI standards", which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web
server (https://ipr.etsi.org/).Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee
can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web
server) which are, or may be, or may become, essential to the present document.Foreword
This Technical Specification (TS) has been produced by Joint Technical Committee (JTC) Broadcast of the European
Broadcasting Union (EBU), Comité Européen de Normalisation ELECtrotechnique (CENELEC) and the European
Telecommunications Standards Institute (ETSI).NOTE 1: The EBU/ETSI JTC Broadcast was established in 1990 to co-ordinate the drafting of standards in the
specific field of broadcasting and related fields. Since 1995 the JTC Broadcast became a tripartite body
by including in the Memorandum of Understanding also CENELEC, which is responsible for the
standardization of radio and television receivers. The EBU is a professional association of broadcasting
organizations whose work includes the co-ordination of its members' activities in the technical, legal,
programme-making and programme-exchange domains. The EBU has active members in about
60 countries in the European broadcasting area; its headquarters is in Geneva.European Broadcasting Union
CH-1218 GRAND SACONNEX (Geneva)
Switzerland
Tel: +41 22 717 21 11
Fax: +41 22 717 24 81
The Eureka Project 147 was established in 1987, with funding from the European Commission, to develop a system for
the broadcasting of audio and data to fixed, portable or mobile receivers. Their work resulted in the publication of
European Standard, ETSI EN 300 401 [1], for DAB (see note 2) which now has worldwide acceptance.
NOTE 2: DAB is a registered trademark owned by one of the Eureka Project 147 partners.
The DAB family of standards is supported by World DAB, an organization with members drawn from broadcasting
organizations and telecommunication providers together with companies from the professional and consumer
electronics industry.Modal verbs terminology
In the present document "shall", "shall not", "should", "should not", "may", "need not", "will", "will not", "can" and
"cannot" are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of
provisions)."must" and "must not" are NOT allowed in ETSI deliverables except when used in direct citation.
ETSI---------------------- Page: 5 ----------------------
6 ETSI TS 102 563 V2.1.1 (2017-01)
1 Scope
The present document defines the method to code and transmit audio services using the HE-AACv2 [2] audio coder for
Eureka-147 Digital Audio Broadcasting (DAB) (ETSI EN 300 401 [1]) and details the necessary mandatory
requirements for decoders. The permitted audio modes and the data protection and encapsulation are detailed. This
audio coding scheme permits the full use of the PAD channel for carrying dynamic labels and user applications.
2 References2.1 Normative references
References are either specific (identified by date of publication and/or edition number or version number) or
non-specific. For specific references, only the cited version applies. For non-specific references, the latest version of the
referenced document (including any amendments) applies.Referenced documents which are not found to be publicly available in the expected location might be found at
https://docbox.etsi.org/Reference.NOTE: While any hyperlinks included in this clause were valid at the time of publication, ETSI cannot guarantee
their long term validity.The following referenced documents are necessary for the application of the present document.
[1] ETSI EN 300 401 (V2.1.1): "Radio Broadcasting Systems; Digital Audio Broadcasting (DAB) to
mobile, portable and fixed receivers".[2] ISO/IEC 14496-3: "Information technology - Coding of audio-visual objects - Part 3: Audio".
2.2 Informative referencesReferences are either specific (identified by date of publication and/or edition number or version number) or
non-specific. For specific references, only the cited version applies. For non-specific references, the latest version of the
referenced document (including any amendments) applies.NOTE: While any hyperlinks included in this clause were valid at the time of publication, ETSI cannot guarantee
their long term validity.The following referenced documents are not necessary for the application of the present document but they assist the
user with regard to a particular subject area.[i.1] ISO/IEC 23003-1: "Information technology - MPEG audio technologies - Part 1: MPEG
Surround".3 Definitions, abbreviations and arithmetic operators
3.1 Definitions
For the purposes of the present document, the terms and definitions given in ETSI EN 300 401 [1] and the following
apply:access unit: access unit contains the audio samples for 20 ms, 30 ms, 40 ms or 60 ms of audio depending on the
sampling rate of the AAC core, respectively 48 kHz, 32 kHz, 24 kHz or 16 kHzaudio super frame: audio super frame contains a number of AUs which together contain the encoded audio for 120 ms
ETSI---------------------- Page: 6 ----------------------
7 ETSI TS 102 563 V2.1.1 (2017-01)
subchannel_index: subchannel_index is derived from the size of the sub-channel carrying the audio service and defines
the number of Reed-Solomon code words in each audio super frame3.2 Abbreviations
For the purposes of the present document, the abbreviations given in ETSI EN 300 401 [1] and the following apply:
AAC Advanced Audio CodingAU Access Unit
CRC Cyclic Redundancy Check
CU Capacity Unit
DAB Digital Audio Broadcasting
DAC Digital Analogue Converter
DMB Digital Multimedia Broadcasting
DRC Dynamic Range Control
DVB Digital Video Broadcasting
FIC Fast Information Channel
FIG Fast Information Group
GF Galois Field
HE-AAC High Efficiency AAC
MCI Multiplex Configuration Information
MPEG Moving Pictures Experts Group
MSC Main Service Channel
PAD Programme Associated Data
PS Parametric Stereo
RS Reed-Solomon
SAC Spatial Audio Coding
SBR Spectral Band Replication
3.3 Arithmetic operators
+ addition
- subtraction
× multiplication
÷ division
m DIV p denotes the quotient part of the division of m by p (m and p are positive integers)
m MOD p denotes the remainder of the division of m by p (m and p are positive integers)
f()i denotes the sum: f(p) + f(p + 1) + f(p + 2) ... + f(q)i= p
f()i denotes the product: f(p) × f(p + 1) × f(p + 2) ... × f(q)
i= p
4 Introduction
The DAB system standard [1] allows audio (programme) services to be carried using either DAB audio or DAB+ audio.
The present document defines the way that audio (programme) services are carried when using DAB+ audio (MPEG 4
HE-AACv2 audio coding).MPEG 4 HE-AACv2 specifies two transforms, but only the 960 transform is permitted for DAB+ audio, with core
sampling rates of 48 kHz, 32 kHz, 24 kHz and 16 kHz. Each AU (audio frame) contains samples for 20 ms, 30 ms,
40 ms or 60 ms respectively. AUs are built into audio super frames of 120 ms duration which are then carried in five
DAB logical frames. In order to provide additional error control, Reed Solomon coding and virtual interleaving is
applied. The overall scheme is shown in figure 1.ETSI
---------------------- Page: 7 ----------------------
8 ETSI TS 102 563 V2.1.1 (2017-01)
Reed-Solomon coder DAB main
AAC
Audio super and service
framing virtual interleaver channel
audio
multiplexer
coder
Scope of present document
Figure 1: Conceptual diagram of the outer coder and interleaver
5 Audio
5.1 HE-AACv2 audio coding
For generic audio coding, a subset of the MPEG-4 High Efficiency Advanced Audio Coding v2 (HE-AACv2) profile
chosen to best suit the DAB system environment is used. The HE-AACv2 Profile, Level 2 according to [2] shall apply
with the following additional restrictions for the DAB system:• Sampling rates: permitted output sampling rates of the HE-AACv2 decoder are 32 kHz and 48 kHz, i.e. when
SBR is enabled the AAC core shall be operated at 16 kHz or 24 kHz, respectively. If SBR is disabled then the
AAC core shall be operated at 32 kHz or 48 kHz respectively.• Transform length: the number of samples per channel per AU is 960. This is required to harmonize HE-AAC
AU lengths to allow the combination of an integer number of AUs to build an audio super frame of 120 ms
duration.• Audio bit rates are restricted to fit within a maximum sub-channel size of 192 kbps (approximately 175 kbps
for audio, assuming no PAD).• Audio super framing: AUs are composed into audio super frames, which always correspond to 120 ms in time.
The AUs in the audio super frames are encoded together such that each audio super frame is of constant
length, i.e. that bit exchange between AUs is only possible within an audio super frame. The number of AUs
per super frame are: two (16 kHz AAC core sampling rate with SBR enabled), three (24 kHz AAC core
sampling rate with SBR enabled), four (32 kHz AAC core sampling rate) or six (48 kHz AAC core sampling
rate).Each audio super frame is carried in five consecutive logical DAB frames (see clause 7) which enables simple
synchronization and management of reconfigurations. The size of the audio super frame is defined by the size of the
MSC sub-channel (see ETSI EN 300 401 [1], clause 6.2.1) which carries the audio super frame. Sub-channels are
multiples of 8 kbps in size. The size of the audio super frame in bytes is given by the expressions below:
subchannel_index = MSC sub-channel size (kbps) ÷ 8audio_super_frame_size (bytes) = subchannel_index × 110
The first byte of the audio super frame is byte 0 and the last byte is byte (audio_super_frame_size - 1).
NOTE: The subchannel_index parameter may take the values 1 to 24 due to the restriction limiting the maximum
sub-channel size to 192 kbps.ETSI
---------------------- Page: 8 ----------------------
9 ETSI TS 102 563 V2.1.1 (2017-01)
5.2 Audio super framing syntax
Table 1: Syntax of he_aac_super_frame()
Syntax No. of bits Note
he_aac_super_frame(subchannel_index)
he_aac_super_frame_header() determines num_aus
for (n = 0; n < num_aus; n++) {
au[n] 8 × au_size[n]
au_crc[n] 16
NOTE: au corresponds to one single access unit.
Each au is protected by one CRC.
The size of he_aac_super_frame() is equal to audio_super_frame_size.
he_aac_super_frame_header()
The header contains the audio parameters for the audio super frame and the respective start positions of each AU within
the audio super frame, along with an error protection word. The au_start values for the second and subsequent AUs are
stored consecutively in the header. Depending on the number of AUs, 4 padding bits are added to achieve byte-
alignment.num_aus
The number of AUs in the audio super frame is determined by the settings of the audio parameters. num_aus may take
the values 2, 3, 4 or 6 (see table 2).au[n]
The AU contains the audio samples for 20 ms, 30 ms, 40 ms or 60 ms of audio depending on the core sampling rate,
respectively 48 kHz, 32 kHz, 24 kHz or 16 kHz.au_size[n]
This is the size in bytes of the AU.
au_crc[n]
Each AU is protected by a 16-bit CRC.
The CRC shall be generated according to the procedure defined in ETSI EN 300 401 [1], annex E. The generation shall
be based on the polynomial:16 12 5
G(x) = x + x + x +1
The CRC word shall be complemented (1s complement) prior to transmission. At the beginning of each CRC word
calculation, all register stages shall be initialized to "1".ETSI
---------------------- Page: 9 ----------------------
10 ETSI TS 102 563 V2.1.1 (2017-01)
Table 2: Syntax of he_aac_super_frame_header()
Syntax No. of bits Note
he_aac_super_frame_header()
header_firecode 16
// start of audio parameters
rfa 1
dac_rate 1
sbr_flag 1
aac_channel_mode 1
ps_flag 1
mpeg_surround_config 3
// end of audio parameters
if ((dac_rate == 0) && (sbr_flag == 1)) num_aus = 2; AAC core sampling rate 16 kHz
if ((dac_rate == 1) && (sbr_flag == 1)) num_aus = 3; AAC core sampling rate 24 kHz
if ((dac_rate == 0) && (sbr_flag == 0)) num_aus = 4; AAC core sampling rate 32 kHz
if ((dac_rate == 1) && (sbr_flag == 0)) num_aus = 6; AAC core sampling rate 48 kHz
for (n = 1; n < num_aus; n++) {au_start[n]; 12 AU start position
if !((dac_rate == 1) && (sbr_flag == 1))
alignment 4 byte-alignment
NOTE: The au_start for the first AU in the audio super frame (au_start[0]) is not transmitted. The first AU always
starts immediately after the he_aac_super_frame_header().header_firecode
The header_firecode is a 16-bit field containing a Fire code capable of detecting and correcting most single error burst
of up to 6 bits. The error pattern 101111 (where "1" indicates "bit error", "0" indicates "no bit error") can be detected
but not corrected. The Fire code shall be generated using the polynomial:11 5 3 2 16 14 13 12 11 5 3 2
G(x) = (x +1)(x + x + x + x +1) = x + x + x + x + x + x + x + x + x +1
The Fire code word shall be calculated over the nine bytes from byte 2 to byte 10 of the audio super frame.
NOTE 1: Except in the case where num_aus = 6, the Fire code calculation will include some bytes from the first
AU.At the beginning of each Fire code word calculation, all register stages shall be initialized to "0".
NOTE 2: Other definitions call the above code an error-correcting CRC code.audio parameters
The audio parameters comprise the rfa, dac_rate, sbr_flag, aac_channel_mode, ps_flag and mpeg_surround_config
fields.NOTE 3: When the audio parameters are changed, some interruption to the audio output should be expected.
Broadcasters should therefore plan audio parameter changes carefully.rfa
The rfa is a 1-bit field reserved for future addition. This bit shall be set to zero for the currently specified application.
ETSI---------------------- Page: 10 ----------------------
11 ETSI TS 102 563 V2.1.1 (2017-01)
dac_rate
The dac_rate is a 1-bit field to signal the sampling rate of the DAC according to table 3.
Table 3: Definition of dac_rateDac_rate Meaning Note
0 DAC sampling rate 32 kHz
1 DAC sampling rate 48 kHz
sbr_flag
The sbr_flag is a 1-bit field to signal the use of SBR according to table 4.
Table 4: Definition of sbr_flag
sbr_flag Meaning Note
0 SBR not used The sampling rate of the AAC core is equal to the
sampling rate of the DAC
1 SBR used The sampling rate of the AAC core is half the
sampling rate of the DAC
aac_channel_mode
The aac_channel_mode is a 1-bit field according to table 5.
Table 5: Definition of aac_channel_mode
aac_channel_mode Meaning Note
0 AAC (core) coding is mono mono refers to a single_channel_element() see [2]
1 AAC (core) coding is stereo stereo refers to a channel_pair_element() see [2]
ps_flag
The ps_flag is a 1-bit field to signal the use of PS according to table 6.
Table 6: Definition of ps_flag
ps_flag Meaning Note
0 PS not used
1 PS used only permitted when sbr_flag == 1 &&
aac_channel_mode == 0
mpeg_surround_config
The mpeg_surround_config is a 3-bit field according to table 7.
Table 7: Definition of mpeg_surround_config
mpeg_surround_config Meaning Note
000 MPEG Surround is not used
001 MPEG Surround with 5.1 output
channels is used
010 MPEG Surround with 7.1 output
channels is used
011 to 110 reserved for future definition
111 other mode (the mode can be derived
from the MPEG Surround
SpatialSpecificConfig())
ETSI
---------------------- Page: 11 ----------------------
12 ETSI TS 102 563 V2.1.1 (2017-01)
The number of output channels provided by MPEG Surround is intended as an information for the listener only. The
MPEG Surround decoder shall use this information only to determine whether MPEG Surround is used. If MPEG
Surround is used (mpeg_surround_config ≠ 000), the MPEG Surround decoder shall exclusively rely on the information
contained within the SpatialSpecificConfig().A decoder that does not support MPEG Surround shall ignore this parameter.
au_start[n]
The au_start is an unsigned integer, most significant bit first, carried in a 12-bit field that defines the start position
within the audio super frame of the respective AU by giving the byte number of the first byte of the AU. The value of
au_start for the first AU is not transmitted but is given by table 8.Table 8: Definition of au_start for the first AU of the audio super frame
num_aus value of au_start[0] Note
2 5
3 6
4 8
6 11
The value of au_start for subsequent AUs is given by the expressions below:
• au_start[n] = au_start[n - 1] + au_size[n - 1] + 2;
• au_start[num_aus] = audio_super_frame_size.
The decoder can derive the value of au_size[n] from the received au_start[n] and au_start[n + 1].
alignmentThis 4-bit field, when present, shall be set to 0 0 0 0.
5.3 MPEG Surround
5.3.1 Overview
MPEG Surround is standardized in MPEG-D, Part-1 (ISO/IEC 23003-1 [i.1]). It describes:
• Coding of multichannel signals based on a downmixed signal of the original multichannel signal, and
associated spatial parameters. It offers lowest possible data rate for coding of multichannel signals, as well as
an inherent mono or stereo downmix signal included in the data stream. Hence, a mono or stereo signal can be
expanded to multi-channel by a very small additional data overhead.• Binaural decoding of the MPEG Surround stream, enabling a surround sound experience over headphones.
• An Enhanced Matrix Mode that enables a multi-channel upmix from a stereo signal without any spatial
parameters.Hence, MPEG Surround (Spatial Audio Coding, SAC) is capable of re-creating N channels based on M < N transmitted
channels, and additional control data. In the preferred modes of operating the spatial audio coding system, the
M channels can either be a single mono channel or a stereo channel pair. The control data represents a significantly
lower data rate than required for transmitting all N channels, making the coding very efficient while at the same time
ensuring compatibility with both M channel devices and N channel devices.The MPEG Surround standard incorporates a number of tools that provide features which enable broad application of
the standard. A key feature is the ability to scale the spatial image quality gradually from very low spatial overhead
towards transparency. Another key feature is that the decoder input can be made compatible to existing matrix surround
technologies. All tools are grouped to be covered by certain profiles.ETSI
---------------------- Page: 12 ----------------------
13 ETSI TS 1
...
Questions, Comments and Discussion
Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.