RTS/JTC-DAB-82

General Information

Status
Published
Publication Date
18-Jan-2017
Current Stage
12 - Completion
Due Date
18-Jan-2017
Completion Date
19-Jan-2017
Ref Project

Buy Standard

Standard
ETSI TS 102 563 V2.1.1 (2017-01) - Digital Audio Broadcasting (DAB); DAB+ audio coding (MPEG HE-AACv2)
English language
28 pages
world standards week 25% off
Preview
world standards week 25% off
Preview

Standards Content (sample)

ETSI TS 102 563 V2.1.1 (2017-01)
TECHNICAL SPECIFICATION
Digital Audio Broadcasting (DAB);
DAB+ audio coding (MPEG HE-AACv2)
---------------------- Page: 1 ----------------------
2 ETSI TS 102 563 V2.1.1 (2017-01)
Reference
RTS/JTC-DAB-82
Keywords
audio, broadcasting, coding, DAB, digital
ETSI
650 Route des Lucioles
F-06921 Sophia Antipolis Cedex - FRANCE
Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16
Siret N° 348 623 562 00017 - NAF 742 C
Association à but non lucratif enregistrée à la
Sous-Préfecture de Grasse (06) N° 7803/88
Important notice
The present document can be downloaded from:
http://www.etsi.org/standards-search

The present document may be made available in electronic versions and/or in print. The content of any electronic and/or

print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any

existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the

print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat.

Users of the present document should be aware that the document may be subject to revision or change of status.

Information on the current status of this and other ETSI documents is available at

https://portal.etsi.org/TB/ETSIDeliverableStatus.aspx

If you find errors in the present document, please send your comment to one of the following services:

https://portal.etsi.org/People/CommiteeSupportStaff.aspx
Copyright Notification

No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying

and microfilm except as authorized by written permission of ETSI.

The content of the PDF version shall not be modified without the written authorization of ETSI.

The copyright and the foregoing restriction extend to reproduction in all media.
© European Telecommunications Standards Institute 2017.
© European Broadcasting Union 2017.
All rights reserved.
TM TM TM

DECT , PLUGTESTS , UMTS and the ETSI logo are Trade Marks of ETSI registered for the benefit of its Members.

3GPP and LTE™ are Trade Marks of ETSI registered for the benefit of its Members and

of the 3GPP Organizational Partners.

GSM® and the GSM logo are Trade Marks registered and owned by the GSM Association.

ETSI
---------------------- Page: 2 ----------------------
3 ETSI TS 102 563 V2.1.1 (2017-01)
Contents

Intellectual Property Rights ................................................................................................................................ 5

Foreword ............................................................................................................................................................. 5

Modal verbs terminology .................................................................................................................................... 5

1 Scope ........................................................................................................................................................ 6

2 References ................................................................................................................................................ 6

2.1 Normative references ......................................................................................................................................... 6

2.2 Informative references ........................................................................................................................................ 6

3 Definitions, abbreviations and arithmetic operators ................................................................................. 6

3.1 Definitions .......................................................................................................................................................... 6

3.2 Abbreviations ..................................................................................................................................................... 7

3.3 Arithmetic operators ........................................................................................................................................... 7

4 Introduction .............................................................................................................................................. 7

5 Audio ........................................................................................................................................................ 8

5.1 HE-AACv2 audio coding ................................................................................................................................... 8

5.2 Audio super framing syntax ............................................................................................................................... 9

5.3 MPEG Surround ............................................................................................................................................... 12

5.3.1 Overview .................................................................................................................................................... 12

5.3.2 Requirements for MPEG Surround encoders and decoders ........................................................................ 13

5.3.3 Operational aspects of broadcasting ........................................................................................................... 13

5.3.4 Receiver implementation aspects ................................................................................................................ 13

5.4 Programme Associated Data (PAD) ................................................................................................................. 14

5.4.0 Introduction................................................................................................................................................. 14

5.4.1 PAD insertion ............................................................................................................................................. 14

5.4.2 Coding of F-PAD and X-PAD .................................................................................................................... 15

5.4.3 PAD extraction ........................................................................................................................................... 15

6 Transport error coding and interleaving ................................................................................................. 16

6.0 Introduction ...................................................................................................................................................... 16

6.1 RS coding ......................................................................................................................................................... 16

6.2 Formation of the coding array .......................................................................................................................... 17

6.3 Formation of the parity array ............................................................................................................................ 17

6.4 Formation of the output array ........................................................................................................................... 17

6.5 Order of data transmission................................................................................................................................ 17

7 Signalling ............................................................................................................................................... 17

7.1 FIC signalling ................................................................................................................................................... 17

7.2 Audio parameter signalling .............................................................................................................................. 17

8 Reconfiguration ...................................................................................................................................... 18

Annex A (normative): Error concealment ......................................................................................... 19

A.0 Introduction ............................................................................................................................................ 19

A.1 AAC error concealment.......................................................................................................................... 19

A.1.0 Introduction ...................................................................................................................................................... 19

A.1.1 Interpolation of one corrupt AU ....................................................................................................................... 19

A.1.2 Fade-out and fade-in ......................................................................................................................................... 20

A.2 SBR error concealment .......................................................................................................................... 20

A.3 Parametric stereo error concealment ...................................................................................................... 22

A.4 MPEG Surround error concealment ....................................................................................................... 22

Annex B (informative): Implementation tips for PAD insertion........................................................ 23

ETSI
---------------------- Page: 3 ----------------------
4 ETSI TS 102 563 V2.1.1 (2017-01)

Annex C (informative): Synchronizing to the audio super frame structure ..................................... 24

Annex D (informative): Processing a super frame .............................................................................. 26

Annex E (informative): Bit-rate available for audio ........................................................................... 27

History .............................................................................................................................................................. 28

ETSI
---------------------- Page: 4 ----------------------
5 ETSI TS 102 563 V2.1.1 (2017-01)
Intellectual Property Rights

IPRs essential or potentially essential to the present document may have been declared to ETSI. The information

pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found

in ETSI SR 000 314: "Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in

respect of ETSI standards", which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web

server (https://ipr.etsi.org/).

Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee

can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web

server) which are, or may be, or may become, essential to the present document.
Foreword

This Technical Specification (TS) has been produced by Joint Technical Committee (JTC) Broadcast of the European

Broadcasting Union (EBU), Comité Européen de Normalisation ELECtrotechnique (CENELEC) and the European

Telecommunications Standards Institute (ETSI).

NOTE 1: The EBU/ETSI JTC Broadcast was established in 1990 to co-ordinate the drafting of standards in the

specific field of broadcasting and related fields. Since 1995 the JTC Broadcast became a tripartite body

by including in the Memorandum of Understanding also CENELEC, which is responsible for the

standardization of radio and television receivers. The EBU is a professional association of broadcasting

organizations whose work includes the co-ordination of its members' activities in the technical, legal,

programme-making and programme-exchange domains. The EBU has active members in about

60 countries in the European broadcasting area; its headquarters is in Geneva.
European Broadcasting Union
CH-1218 GRAND SACONNEX (Geneva)
Switzerland
Tel: +41 22 717 21 11
Fax: +41 22 717 24 81

The Eureka Project 147 was established in 1987, with funding from the European Commission, to develop a system for

the broadcasting of audio and data to fixed, portable or mobile receivers. Their work resulted in the publication of

European Standard, ETSI EN 300 401 [1], for DAB (see note 2) which now has worldwide acceptance.

NOTE 2: DAB is a registered trademark owned by one of the Eureka Project 147 partners.

The DAB family of standards is supported by World DAB, an organization with members drawn from broadcasting

organizations and telecommunication providers together with companies from the professional and consumer

electronics industry.
Modal verbs terminology

In the present document "shall", "shall not", "should", "should not", "may", "need not", "will", "will not", "can" and

"cannot" are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of

provisions).

"must" and "must not" are NOT allowed in ETSI deliverables except when used in direct citation.

ETSI
---------------------- Page: 5 ----------------------
6 ETSI TS 102 563 V2.1.1 (2017-01)
1 Scope

The present document defines the method to code and transmit audio services using the HE-AACv2 [2] audio coder for

Eureka-147 Digital Audio Broadcasting (DAB) (ETSI EN 300 401 [1]) and details the necessary mandatory

requirements for decoders. The permitted audio modes and the data protection and encapsulation are detailed. This

audio coding scheme permits the full use of the PAD channel for carrying dynamic labels and user applications.

2 References
2.1 Normative references

References are either specific (identified by date of publication and/or edition number or version number) or

non-specific. For specific references, only the cited version applies. For non-specific references, the latest version of the

referenced document (including any amendments) applies.

Referenced documents which are not found to be publicly available in the expected location might be found at

https://docbox.etsi.org/Reference.

NOTE: While any hyperlinks included in this clause were valid at the time of publication, ETSI cannot guarantee

their long term validity.

The following referenced documents are necessary for the application of the present document.

[1] ETSI EN 300 401 (V2.1.1): "Radio Broadcasting Systems; Digital Audio Broadcasting (DAB) to

mobile, portable and fixed receivers".

[2] ISO/IEC 14496-3: "Information technology - Coding of audio-visual objects - Part 3: Audio".

2.2 Informative references

References are either specific (identified by date of publication and/or edition number or version number) or

non-specific. For specific references, only the cited version applies. For non-specific references, the latest version of the

referenced document (including any amendments) applies.

NOTE: While any hyperlinks included in this clause were valid at the time of publication, ETSI cannot guarantee

their long term validity.

The following referenced documents are not necessary for the application of the present document but they assist the

user with regard to a particular subject area.

[i.1] ISO/IEC 23003-1: "Information technology - MPEG audio technologies - Part 1: MPEG

Surround".
3 Definitions, abbreviations and arithmetic operators
3.1 Definitions

For the purposes of the present document, the terms and definitions given in ETSI EN 300 401 [1] and the following

apply:

access unit: access unit contains the audio samples for 20 ms, 30 ms, 40 ms or 60 ms of audio depending on the

sampling rate of the AAC core, respectively 48 kHz, 32 kHz, 24 kHz or 16 kHz

audio super frame: audio super frame contains a number of AUs which together contain the encoded audio for 120 ms

ETSI
---------------------- Page: 6 ----------------------
7 ETSI TS 102 563 V2.1.1 (2017-01)

subchannel_index: subchannel_index is derived from the size of the sub-channel carrying the audio service and defines

the number of Reed-Solomon code words in each audio super frame
3.2 Abbreviations

For the purposes of the present document, the abbreviations given in ETSI EN 300 401 [1] and the following apply:

AAC Advanced Audio Coding
AU Access Unit
CRC Cyclic Redundancy Check
CU Capacity Unit
DAB Digital Audio Broadcasting
DAC Digital Analogue Converter
DMB Digital Multimedia Broadcasting
DRC Dynamic Range Control
DVB Digital Video Broadcasting
FIC Fast Information Channel
FIG Fast Information Group
GF Galois Field
HE-AAC High Efficiency AAC
MCI Multiplex Configuration Information
MPEG Moving Pictures Experts Group
MSC Main Service Channel
PAD Programme Associated Data
PS Parametric Stereo
RS Reed-Solomon
SAC Spatial Audio Coding
SBR Spectral Band Replication
3.3 Arithmetic operators
+ addition
- subtraction
× multiplication
÷ division

m DIV p denotes the quotient part of the division of m by p (m and p are positive integers)

m MOD p denotes the remainder of the division of m by p (m and p are positive integers)

f()i denotes the sum: f(p) + f(p + 1) + f(p + 2) ... + f(q)
i= p
f()i denotes the product: f(p) × f(p + 1) × f(p + 2) ... × f(q)
i= p
4 Introduction

The DAB system standard [1] allows audio (programme) services to be carried using either DAB audio or DAB+ audio.

The present document defines the way that audio (programme) services are carried when using DAB+ audio (MPEG 4

HE-AACv2 audio coding).

MPEG 4 HE-AACv2 specifies two transforms, but only the 960 transform is permitted for DAB+ audio, with core

sampling rates of 48 kHz, 32 kHz, 24 kHz and 16 kHz. Each AU (audio frame) contains samples for 20 ms, 30 ms,

40 ms or 60 ms respectively. AUs are built into audio super frames of 120 ms duration which are then carried in five

DAB logical frames. In order to provide additional error control, Reed Solomon coding and virtual interleaving is

applied. The overall scheme is shown in figure 1.
ETSI
---------------------- Page: 7 ----------------------
8 ETSI TS 102 563 V2.1.1 (2017-01)
Reed-Solomon coder DAB main
AAC
Audio super and service
framing virtual interleaver channel
audio
multiplexer
coder
Scope of present document
Figure 1: Conceptual diagram of the outer coder and interleaver
5 Audio
5.1 HE-AACv2 audio coding

For generic audio coding, a subset of the MPEG-4 High Efficiency Advanced Audio Coding v2 (HE-AACv2) profile

chosen to best suit the DAB system environment is used. The HE-AACv2 Profile, Level 2 according to [2] shall apply

with the following additional restrictions for the DAB system:

• Sampling rates: permitted output sampling rates of the HE-AACv2 decoder are 32 kHz and 48 kHz, i.e. when

SBR is enabled the AAC core shall be operated at 16 kHz or 24 kHz, respectively. If SBR is disabled then the

AAC core shall be operated at 32 kHz or 48 kHz respectively.

• Transform length: the number of samples per channel per AU is 960. This is required to harmonize HE-AAC

AU lengths to allow the combination of an integer number of AUs to build an audio super frame of 120 ms

duration.

• Audio bit rates are restricted to fit within a maximum sub-channel size of 192 kbps (approximately 175 kbps

for audio, assuming no PAD).

• Audio super framing: AUs are composed into audio super frames, which always correspond to 120 ms in time.

The AUs in the audio super frames are encoded together such that each audio super frame is of constant

length, i.e. that bit exchange between AUs is only possible within an audio super frame. The number of AUs

per super frame are: two (16 kHz AAC core sampling rate with SBR enabled), three (24 kHz AAC core

sampling rate with SBR enabled), four (32 kHz AAC core sampling rate) or six (48 kHz AAC core sampling

rate).

Each audio super frame is carried in five consecutive logical DAB frames (see clause 7) which enables simple

synchronization and management of reconfigurations. The size of the audio super frame is defined by the size of the

MSC sub-channel (see ETSI EN 300 401 [1], clause 6.2.1) which carries the audio super frame. Sub-channels are

multiples of 8 kbps in size. The size of the audio super frame in bytes is given by the expressions below:

subchannel_index = MSC sub-channel size (kbps) ÷ 8
audio_super_frame_size (bytes) = subchannel_index × 110

The first byte of the audio super frame is byte 0 and the last byte is byte (audio_super_frame_size - 1).

NOTE: The subchannel_index parameter may take the values 1 to 24 due to the restriction limiting the maximum

sub-channel size to 192 kbps.
ETSI
---------------------- Page: 8 ----------------------
9 ETSI TS 102 563 V2.1.1 (2017-01)
5.2 Audio super framing syntax
Table 1: Syntax of he_aac_super_frame()
Syntax No. of bits Note
he_aac_super_frame(subchannel_index)
he_aac_super_frame_header() determines num_aus
for (n = 0; n < num_aus; n++) {
au[n] 8 × au_size[n]
au_crc[n] 16
NOTE: au corresponds to one single access unit.
Each au is protected by one CRC.
The size of he_aac_super_frame() is equal to audio_super_frame_size.
he_aac_super_frame_header()

The header contains the audio parameters for the audio super frame and the respective start positions of each AU within

the audio super frame, along with an error protection word. The au_start values for the second and subsequent AUs are

stored consecutively in the header. Depending on the number of AUs, 4 padding bits are added to achieve byte-

alignment.
num_aus

The number of AUs in the audio super frame is determined by the settings of the audio parameters. num_aus may take

the values 2, 3, 4 or 6 (see table 2).
au[n]

The AU contains the audio samples for 20 ms, 30 ms, 40 ms or 60 ms of audio depending on the core sampling rate,

respectively 48 kHz, 32 kHz, 24 kHz or 16 kHz.
au_size[n]
This is the size in bytes of the AU.
au_crc[n]
Each AU is protected by a 16-bit CRC.

The CRC shall be generated according to the procedure defined in ETSI EN 300 401 [1], annex E. The generation shall

be based on the polynomial:
16 12 5
G(x) = x + x + x +1

The CRC word shall be complemented (1s complement) prior to transmission. At the beginning of each CRC word

calculation, all register stages shall be initialized to "1".
ETSI
---------------------- Page: 9 ----------------------
10 ETSI TS 102 563 V2.1.1 (2017-01)
Table 2: Syntax of he_aac_super_frame_header()
Syntax No. of bits Note
he_aac_super_frame_header()
header_firecode 16
// start of audio parameters
rfa 1
dac_rate 1
sbr_flag 1
aac_channel_mode 1
ps_flag 1
mpeg_surround_config 3
// end of audio parameters

if ((dac_rate == 0) && (sbr_flag == 1)) num_aus = 2; AAC core sampling rate 16 kHz

if ((dac_rate == 1) && (sbr_flag == 1)) num_aus = 3; AAC core sampling rate 24 kHz

if ((dac_rate == 0) && (sbr_flag == 0)) num_aus = 4; AAC core sampling rate 32 kHz

if ((dac_rate == 1) && (sbr_flag == 0)) num_aus = 6; AAC core sampling rate 48 kHz

for (n = 1; n < num_aus; n++) {
au_start[n]; 12 AU start position
if !((dac_rate == 1) && (sbr_flag == 1))
alignment 4 byte-alignment

NOTE: The au_start for the first AU in the audio super frame (au_start[0]) is not transmitted. The first AU always

starts immediately after the he_aac_super_frame_header().
header_firecode

The header_firecode is a 16-bit field containing a Fire code capable of detecting and correcting most single error burst

of up to 6 bits. The error pattern 101111 (where "1" indicates "bit error", "0" indicates "no bit error") can be detected

but not corrected. The Fire code shall be generated using the polynomial:
11 5 3 2 16 14 13 12 11 5 3 2
G(x) = (x +1)(x + x + x + x +1) = x + x + x + x + x + x + x + x + x +1

The Fire code word shall be calculated over the nine bytes from byte 2 to byte 10 of the audio super frame.

NOTE 1: Except in the case where num_aus = 6, the Fire code calculation will include some bytes from the first

AU.

At the beginning of each Fire code word calculation, all register stages shall be initialized to "0".

NOTE 2: Other definitions call the above code an error-correcting CRC code.
audio parameters

The audio parameters comprise the rfa, dac_rate, sbr_flag, aac_channel_mode, ps_flag and mpeg_surround_config

fields.

NOTE 3: When the audio parameters are changed, some interruption to the audio output should be expected.

Broadcasters should therefore plan audio parameter changes carefully.
rfa

The rfa is a 1-bit field reserved for future addition. This bit shall be set to zero for the currently specified application.

ETSI
---------------------- Page: 10 ----------------------
11 ETSI TS 102 563 V2.1.1 (2017-01)
dac_rate

The dac_rate is a 1-bit field to signal the sampling rate of the DAC according to table 3.

Table 3: Definition of dac_rate
Dac_rate Meaning Note
0 DAC sampling rate 32 kHz
1 DAC sampling rate 48 kHz
sbr_flag
The sbr_flag is a 1-bit field to signal the use of SBR according to table 4.
Table 4: Definition of sbr_flag
sbr_flag Meaning Note
0 SBR not used The sampling rate of the AAC core is equal to the
sampling rate of the DAC
1 SBR used The sampling rate of the AAC core is half the
sampling rate of the DAC
aac_channel_mode
The aac_channel_mode is a 1-bit field according to table 5.
Table 5: Definition of aac_channel_mode
aac_channel_mode Meaning Note
0 AAC (core) coding is mono mono refers to a single_channel_element() see [2]
1 AAC (core) coding is stereo stereo refers to a channel_pair_element() see [2]
ps_flag
The ps_flag is a 1-bit field to signal the use of PS according to table 6.
Table 6: Definition of ps_flag
ps_flag Meaning Note
0 PS not used
1 PS used only permitted when sbr_flag == 1 &&
aac_channel_mode == 0
mpeg_surround_config
The mpeg_surround_config is a 3-bit field according to table 7.
Table 7: Definition of mpeg_surround_config
mpeg_surround_config Meaning Note
000 MPEG Surround is not used
001 MPEG Surround with 5.1 output
channels is used
010 MPEG Surround with 7.1 output
channels is used
011 to 110 reserved for future definition
111 other mode (the mode can be derived
from the MPEG Surround
SpatialSpecificConfig())
ETSI
---------------------- Page: 11 ----------------------
12 ETSI TS 102 563 V2.1.1 (2017-01)

The number of output channels provided by MPEG Surround is intended as an information for the listener only. The

MPEG Surround decoder shall use this information only to determine whether MPEG Surround is used. If MPEG

Surround is used (mpeg_surround_config ≠ 000), the MPEG Surround decoder shall exclusively rely on the information

contained within the SpatialSpecificConfig().
A decoder that does not support MPEG Surround shall ignore this parameter.
au_start[n]

The au_start is an unsigned integer, most significant bit first, carried in a 12-bit field that defines the start position

within the audio super frame of the respective AU by giving the byte number of the first byte of the AU. The value of

au_start for the first AU is not transmitted but is given by table 8.
Table 8: Definition of au_start for the first AU of the audio super frame
num_aus value of au_start[0] Note
2 5
3 6
4 8
6 11
The value of au_start for subsequent AUs is given by the expressions below:
• au_start[n] = au_start[n - 1] + au_size[n - 1] + 2;
• au_start[num_aus] = audio_super_frame_size.

The decoder can derive the value of au_size[n] from the received au_start[n] and au_start[n + 1].

alignment
This 4-bit field, when present, shall be set to 0 0 0 0.
5.3 MPEG Surround
5.3.1 Overview

MPEG Surround is standardized in MPEG-D, Part-1 (ISO/IEC 23003-1 [i.1]). It describes:

• Coding of multichannel signals based on a downmixed signal of the original multichannel signal, and

associated spatial parameters. It offers lowest possible data rate for coding of multichannel signals, as well as

an inherent mono or stereo downmix signal included in the data stream. Hence, a mono or stereo signal can be

expanded to multi-channel by a very small additional data overhead.

• Binaural decoding of the MPEG Surround stream, enabling a surround sound experience over headphones.

• An Enhanced Matrix Mode that enables a multi-channel upmix from a stereo signal without any spatial

parameters.

Hence, MPEG Surround (Spatial Audio Coding, SAC) is capable of re-creating N channels based on M < N transmitted

channels, and additional control data. In the preferred modes of operating the spatial audio coding system, the

M channels can either be a single mono channel or a stereo channel pair. The control data represents a significantly

lower data rate than required for transmitting all N channels, making the coding very efficient while at the same time

ensuring compatibility with both M channel devices and N channel devices.

The MPEG Surround standard incorporates a number of tools that provide features which enable broad application of

the standard. A key feature is the ability to scale the spatial image quality gradually from very low spatial overhead

towards transparency. Another key feature is that the decoder input can be made compatible to existing matrix surround

technologies. All tools are grouped to be covered by certain profiles.
ETSI
---------------------- Page: 12 ----------------------
13 ETSI TS 1
...

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.