Information technology — High efficiency coding and media delivery in heterogeneous environments — Part 3: 3D audio

This document specifies technology that supports the efficient transmission of immersive audio signals and flexible rendering for the playback of immersive audio in a wide variety of listening scenarios. These include home theatre setups with 3D loudspeaker configurations, 22.2 loudspeaker systems, automotive entertainment systems and playback over headphones connected to a tablet or smartphone.

Technologies de l'information — Codage à haute efficacité et livraison des medias dans des environnements hétérogènes — Partie 3: Audio 3D

General Information

Status: Published
Publication Date: 16-Aug-2022
Current Stage: 9092 - International Standard to be revised
Completion Date: 21-Jul-2024

Standard
ISO/IEC 23008-3:2022 - Information technology — High efficiency coding and media delivery in heterogeneous environments — Part 3: 3D audio
Released: 17.08.2022
English language
867 pages

Standards Content (Sample)


INTERNATIONAL STANDARD
ISO/IEC 23008-3
Third edition
2022-08
Information technology — High efficiency coding and media delivery in heterogeneous environments — Part 3: 3D audio
Technologies de l'information — Codage à haute efficacité et livraison des medias dans des environnements hétérogènes — Partie 3: Audio 3D
© ISO/IEC 2022
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting on the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland

Contents
Foreword
Introduction
1 Scope
2 Normative references
3 Terms, definitions, symbols, abbreviated terms and mnemonics
3.1 Terms, definitions, symbols and abbreviated terms
3.2 Mnemonics
4 Technical overview
4.1 Decoder block diagram
4.2 Overview over the codec building blocks
4.3 Efficient combination of decoder processing blocks in the time domain and QMF domain
4.4 Rule set for determining processing domains
4.4.1 Audio core codec processing domain
4.4.2 Mixing
4.4.3 DRC-1 Operation domains (DRC in rendering context)
4.4.4 Audio core codec interface domain to rendering
4.4.5 Rendering context
4.4.6 Post-processing context
4.4.7 End-of-chain context
4.5 Sample rate converter
4.6 Decoder delay
4.7 Contribution mode of MPEG-H 3D audio
4.8 MPEG-H 3D audio profiles and levels
4.8.1 General
4.8.2 Profiles
5 MPEG-H 3D audio core decoder
5.1 Definitions
5.1.1 Joint stereo
5.1.2 MPEG surround based stereo (MPS 212)
5.2 Syntax
5.2.1 General
5.2.2 Decoder configuration
5.2.3 MPEG-H 3D audio core bitstream payloads
5.3 Data structure
5.3.1 General
5.3.2 General configuration data elements
5.3.3 Loudspeaker configuration data elements
5.3.4 Core decoder configuration data elements
5.3.5 Downmix matrix data elements
5.3.6 HOA rendering matrix data elements
5.3.7 Signal group information elements
5.3.8 Low frequency enhancement (LFE) channel element, mpegh3daLfeElement()
5.3.9 Compatible profile and levels sets
5.4 Configuration element descriptions
5.4.1 General
5.4.2 Downmix configuration
5.4.3 HOA rendering matrix configuration
5.5 Tool descriptions
5.5.1 General
5.5.2 Quad channel element
5.5.3 Transform splitting
5.5.4 MPEG surround for mono to stereo upmixing
5.5.5 Enhanced noise filling
5.5.6 Audio pre-roll
5.5.7 Fullband LPD
5.5.8 Time-domain bandwidth extension
5.5.9 LPD stereo coding
5.5.10 Multichannel coding tool
5.5.11 Filterbank and block switching
5.5.12 Frequency domain prediction
5.5.13 Long-term postfilter
5.5.14 Tonal component coding
5.5.15 Internal channel on MPS212 for low complexity format conversion
5.5.16 High resolution envelope processing (HREP) tool
5.6 Buffer requirements
5.6.1 Minimum decoder input buffer
5.6.2 Bit reservoir
5.6.3 Maximum bit rate
5.7 Stream access point requirements and inter-frame dependency
6 Dynamic range control and loudness processing
6.1 General
6.2 Description
6.3 Syntax
6.3.1 Loudness metadata
6.3.2 Dynamic range control metadata
6.3.3 Data elements
6.4 Decoding process
6.4.1 General
6.4.2 Dynamic range control
6.4.3 Usage of downmixId in MPEG-H
6.4.4 DRC set selection process
6.4.5 DRC-1 for SAOC 3D Content
6.4.6 DRC-1 for HOA content
6.4.7 Loudness normalization
6.4.8 Peak limiter
6.4.9 Time-synchronization of DRC gains
6.4.10 Default parameters
...
