ISO/IEC 23008-9:2022
Information technology — High efficiency coding and media delivery in heterogeneous environments — Part 9: 3D Audio conformance testing
This document specifies conformance criteria for both bitstreams and decoders compliant with the MPEG-H 3D audio standard as defined in ISO/IEC 23008-3. This is done to assist implementers and to ensure interoperability.
Technologies de l'information — Codage à haut rendement et fourniture de supports dans les environnements hétérogènes — Partie 9: Essais de conformité 3D Audio
Standards Content (Sample)
INTERNATIONAL STANDARD
ISO/IEC 23008-9
Second edition
2022-03
Information technology — High
efficiency coding and media delivery
in heterogeneous environments —
Part 9:
3D Audio conformance testing
Technologies de l'information — Codage à haut rendement et
fourniture de supports dans les environnements hétérogènes —
Partie 9: Essais de conformité 3D Audio
Reference number
ISO/IEC 23008-9:2022(E)
© ISO/IEC 2022
COPYRIGHT PROTECTED DOCUMENT
© ISO/IEC 2022
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting on
the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below
or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
Contents Page
Foreword . vii
Introduction . viii
1 Scope . 1
2 Normative references . 1
3 Terms, definitions and abbreviated terms . 1
3.1 Terms and definitions . 1
3.2 Abbreviated terms . 2
4 MPEG-H 3D audio conformance testing . 3
4.1 General . 3
4.2 Profiles . 3
4.3 Test procedure . 3
4.3.1 General . 3
4.3.2 Naming convention . 4
4.3.3 Conformance test tools . 6
5 MPEG-H 3D audio bitstreams . 6
5.1 Characteristics, test procedure . 6
5.2 MPEG-H 3D audio general configuration . 7
5.2.1 mpegh3daConfig() . 7
5.2.2 FrameworkConfig3d() . 7
5.2.3 Signals3d() . 7
5.2.4 SpeakerConfig3d() . 7
5.2.5 mpegh3daFlexibleSpeakerConfig() . 7
5.2.6 mpegh3daSpeakerDescription() . 8
5.3 MPEG-H 3D core audio configuration . 8
5.3.1 mpegh3daDecoderConfig() . 8
5.3.2 mpegh3daSingleChannelElementConfig() . 8
5.3.3 mpegh3daChannelPairElementConfig() . 8
5.3.4 mpegh3daCoreConfig() . 9
5.3.5 mpegh3daLfeElementConfig() . 9
5.3.6 mpegh3daExtElementConfig() . 9
5.3.7 mpegh3daConfigExtension() . 10
5.3.8 SbrConfig() . 10
5.3.9 Mps212Config() . 10
5.4 MPEG-H 3D core audio frame . 10
5.4.1 mpegh3daFrame() . 10
5.4.2 mpegh3daSingleChannelElement() . 10
5.4.3 mpegh3daChannelPairElement() . 11
5.4.4 mpegh3daLfeElement() . 11
5.4.5 mpegh3daExtElement() . 11
5.4.6 ics_info() . 11
5.4.7 mpegh3daCoreCoderData() . 12
5.4.8 StereoCoreToolInfo() . 12
5.4.9 fd_channel_stream() . 12
5.4.10 lpd_channel_stream() . 13
5.4.11 acelp_coding() . 14
5.4.12 tcx_coding () . 14
5.4.13 lpd_stereo_stream() . 14
5.4.14 igf_stereo_pred_data() . 15
5.4.15 igf_data() . 15
5.4.16 tbe_data() . 15
5.4.17 tw_data() . 16
5.4.18 scale_factor_data() . 16
5.4.19 tns_data() . 16
5.4.20 ac_spectral_data() . 16
5.4.21 arith_data() . 16
5.4.22 fac_data() . 16
5.4.23 code_book_indices() . 16
5.4.24 UsacSbrData() . 16
5.4.25 Mps212Data() . 16
5.5 Fill element . 16
5.6 MPEG surround configuration, SpatialSpecificConfig() . 16
5.7 MPEG surround frame, SpatialFrame() . 17
5.8 SAOC configuration, SAOCSpecificConfig() . 17
5.9 SAOC frame, SAOCFrame() . 17
5.10 AudioPreRoll . 17
5.10.1 Recursive presence of AudioPreRoll extension payload . 17
5.10.2 AudioPreRoll() . 17
5.11 Dynamic range control configuration . 17
5.11.1 mpegh3daUniDrcConfig() . 17
5.11.2 mpegh3daUniDrcChannelLayout() . 17
5.11.3 drcCoefficientsUniDrc() . 18
5.11.4 drcInstructionsUniDrc() . 18
5.11.5 uniDrcConfigExtension() . 18
5.12 Dynamic range control frame, uniDrcGain() . 18
5.13 Object metadata configuration, ObjectMetadataConfig() . 18
5.14 Object metadata frame . 18
5.14.1 object_metadata_efficient() . 18
5.14.2 object_metadata() . 18
5.14.3 object_metadata_efficient() . 18
5.14.4 intracoded_object_metadata_efficient() . 18
5.14.5 differential_object_metadata() . 19
5.14.6 offset_data() . 21
5.14.7 object_metadata_low_delay() . 21
5.14.8 intracoded_object_metadata_low_delay() . 21
5.14.9 dynamic_object_metadata() . 22
5.14.10 single_dynamic_object_metadata() . 22
5.15 EnhancedObjectMetadataConfig() . 23
5.16 EnhancedObjectMetadataFrame() . 23
5.17 SAOC 3D Config . 24
5.17.1 SAOC3DSpecificConfig() . 24
5.17.2 SAOC3DgetNumChannels() . 25
5.17.3 SAOC3DExtensionConfig() . 25
5.17.4 SAOC3DExtensionConfigData() . 25
5.17.5 SAOCExtensionConfig() . 25
5.18 SAOC 3D frame . 25
5.18.1 Saoc3DFrame() . 25
5.18.2 SAOC3DFramingInfo() . 26
5.18.3 EcDataSaoc() . 26
5.18.4 ByteAlign() . 26
5.18.5 SAOC3DExtensionFrame() . 26
5.18.6 SAOC3DExtensionFrameData() . 26
5.18.7 SAOCExtensionFrame() . 26
5.18.8 HOAConfig() . 26
5.18.9 HOADecoderConfig() . 26
5.18.10 HOAEnhConfig() . 27
5.18.11 HOADecoderEnhConfig () . 27
5.18.12 getSubbandWidths () . 27
5.19 HOA frame . 27
5.19.1 HOAFrame() . 27
5.19.2 HOAEnhFrame () . 27
5.19.3 ChannelSideInfoData() . 28
5.19.4 AddAmbHoaInfoChannel() . 28
5.19.5 HOAGainCorrectionData() . 28
5.19.6 VVectorData() . 28
5.19.7 HOAPredictionInfo() . 29
5.19.8 HOADirectionalPredictionInfo() . 29
5.19.9 readDirPredDiffValues() . 29
5.19.10 HOAParInfo () . 29
5.19.11 readParDiffValues () . 30
5.20 FMT converter frame, FormatConverterFrame() . 30
5.21 Multi-channel coding tool config, MCTConfig () . 30
5.22 Multi-channel coding tool frame . 30
5.22.1 MultichannelCodingBoxRotation () . 30
5.22.2 MultichannelCodingBoxPrediction () . 31
5.22.3 MultichannelCodingFrame() . 31
5.23 Tonal component coding configuration, TccConfig () . 31
5.24 Tonal component coding frame . 31
5.24.1 General . 31
5.24.2 TccGroupOfSegments() . 31
5.25 HREP config, HREPConfig() . 32
5.26 HREP frame, HREPFrame() . 32
5.27 ICG config, ICGConfig () . 33
5.28 SignalGroupInformation Config, SignalGroupInformation () . 33
5.29 DownmixMatrix . 33
5.29.1 downmixConfig() . 33
5.29.2 DownmixMatrixSet() . 33
5.29.3 DownmixMatrix() . 33
5.29.4 DecoderGainValue() . 34
5.29.5 ReadRange() . 34
5.29.6 EqualizerConfig() . 34
5.30 Loudness info . 35
5.30.1 mpegh3daLoudnessInfoSet() . 35
5.30.2 loudnessInfo() . 35
5.30.3 loudnessInfoSetExtension() . 35
5.31 Audioscene info . 35
5.31.1 mae_AudioSceneInfo . 35
5.31.2 mae_Data() . 35
5.31.3 mae_GroupDefinition() . 36
5.31.4 mae_SwitchGroupDefinition() . 36
5.31.5 mae_Description() . 37
5.31.6 mae_ContentData() . 37
5.31.7 mae_CompositePair() . 37
5.31.8 mae_GroupPresetDefinition() . 37
5.31.9 mae_ProductionScreenSizeData() . 38
5.31.10 mae_LoudnessCompensationData () . 38
5.31.11 mae_ProductionScreenSizeDataExtension() . 39
5.31.12 mae_GroupPresetDefinitionExtension() . 39
5.31.13 mae_DrcUserInterfaceInfo() . 40
5.32 HOA matrix . 40
5.32.1 HoaRenderingMatrixSet() . 40
5.32.2 HoaRenderingMatrix() .
...
FINAL DRAFT INTERNATIONAL STANDARD
ISO/IEC FDIS 23008-9
ISO/IEC JTC 1/SC 29
Secretariat: JISC
Voting begins on: 2021-10-14
Voting terminates on: 2021-12-09
Information technology — High efficiency coding and media delivery in heterogeneous environments —
Part 9:
3D Audio conformance testing
Technologies de l'information — Codage à haut rendement et
fourniture de supports dans les environnements hétérogènes —
Partie 9: Essais de conformité 3D Audio
RECIPIENTS OF THIS DRAFT ARE INVITED TO SUBMIT, WITH THEIR COMMENTS, NOTIFICATION OF ANY RELEVANT PATENT RIGHTS OF WHICH THEY ARE AWARE AND TO PROVIDE SUPPORTING DOCUMENTATION.
IN ADDITION TO THEIR EVALUATION AS BEING ACCEPTABLE FOR INDUSTRIAL, TECHNOLOGICAL, COMMERCIAL AND USER PURPOSES, DRAFT INTERNATIONAL STANDARDS MAY ON OCCASION HAVE TO BE CONSIDERED IN THE LIGHT OF THEIR POTENTIAL TO BECOME STANDARDS TO WHICH REFERENCE MAY BE MADE IN NATIONAL REGULATIONS.
Reference number
ISO/IEC FDIS 23008-9:2021(E)
© ISO/IEC 2021
COPYRIGHT PROTECTED DOCUMENT
© ISO/IEC 2021
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting
on the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address
below or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
Contents Page
Foreword . vii
Introduction . viii
1 Scope . 1
2 Normative references . 1
3 Terms, definitions and abbreviated terms . 1
3.1 Terms and definitions . 1
3.2 Abbreviated terms . 2
4 MPEG-H 3D audio conformance testing . 3
4.1 General . 3
4.2 Profiles . 3
4.3 Test procedure . 3
4.3.1 General . 3
4.3.2 Naming convention . 4
4.3.3 Conformance test tools . 6
5 MPEG-H 3D audio bitstreams . 6
5.1 Characteristics, test procedure . 6
5.2 MPEG-H 3D audio general configuration . 7
5.2.1 mpegh3daConfig() . 7
5.2.2 FrameworkConfig3d() . 7
5.2.3 Signals3d() . 7
5.2.4 SpeakerConfig3d() . 7
5.2.5 mpegh3daFlexibleSpeakerConfig() . 8
5.2.6 mpegh3daSpeakerDescription() . 8
5.3 MPEG-H 3D core audio configuration . 8
5.3.1 mpegh3daDecoderConfig() . 8
5.3.2 mpegh3daSingleChannelElementConfig() . 8
5.3.3 mpegh3daChannelPairElementConfig() . 8
5.3.4 mpegh3daCoreConfig() . 9
5.3.5 mpegh3daLfeElementConfig() . 9
5.3.6 mpegh3daExtElementConfig() . 9
5.3.7 mpegh3daConfigExtension() . 10
5.3.8 SbrConfig() . 10
5.3.9 Mps212Config() . 10
5.4 MPEG-H 3D core audio frame . 10
5.4.1 mpegh3daFrame() . 10
5.4.2 mpegh3daSingleChannelElement() . 11
5.4.3 mpegh3daChannelPairElement() . 11
5.4.4 mpegh3daLfeElement() . 11
5.4.5 mpegh3daExtElement() . 11
5.4.6 ics_info() . 12
5.4.7 mpegh3daCoreCoderData() . 12
5.4.8 StereoCoreToolInfo() . 12
5.4.9 fd_channel_stream() . 13
5.4.10 lpd_channel_stream() . 13
5.4.11 acelp_coding() . 14
5.4.12 tcx_coding () . 14
5.4.13 lpd_stereo_stream() . 15
5.4.14 igf_stereo_pred_data() . 15
5.4.15 igf_data() . 15
5.4.16 tbe_data() . 15
5.4.17 tw_data() . 16
5.4.18 scale_factor_data() . 16
5.4.19 tns_data() . 16
5.4.20 ac_spectral_data() . 16
5.4.21 arith_data() . 16
5.4.22 fac_data() . 16
5.4.23 code_book_indices() . 16
5.4.24 UsacSbrData() . 16
5.4.25 Mps212Data() . 16
5.5 Fill element . 17
5.6 MPEG surround configuration, SpatialSpecificConfig() . 17
5.7 MPEG surround frame, SpatialFrame() . 17
5.8 SAOC configuration, SAOCSpecificConfig() . 17
5.9 SAOC frame, SAOCFrame() . 17
5.10 AudioPreRoll . 17
5.10.1 Recursive presence of AudioPreRoll extension payload . 17
5.10.2 AudioPreRoll() . 17
5.11 Dynamic range control configuration . 17
5.11.1 mpegh3daUniDrcConfig() . 17
5.11.2 mpegh3daUniDrcChannelLayout() . 18
5.11.3 drcCoefficientsUniDrc() . 18
5.11.4 drcInstructionsUniDrc() . 18
5.11.5 uniDrcConfigExtension() . 18
5.12 Dynamic range control frame, uniDrcGain() . 18
5.13 Object metadata configuration, ObjectMetadataConfig() . 18
5.14 Object metadata frame . 18
5.14.1 object_metadata_efficient() . 18
5.14.2 object_metadata() . 18
5.14.3 object_metadata_efficient() . 19
5.14.4 intracoded_object_metadata_efficient() . 19
5.14.5 differential_object_metadata() . 20
5.14.6 offset_data() . 21
5.14.7 object_metadata_low_delay() . 21
5.14.8 intracoded_object_metadata_low_delay() . 21
5.14.9 dynamic_object_metadata() . 22
5.14.10 single_dynamic_object_metadata() . 22
5.15 EnhancedObjectMetadataConfig() . 23
5.16 EnhancedObjectMetadataFrame() . 24
5.17 SAOC 3D Config . 25
5.17.1 SAOC3DSpecificConfig() . 25
5.17.2 SAOC3DgetNumChannels() . 25
5.17.3 SAOC3DExtensionConfig() . 25
5.17.4 SAOC3DExtensionConfigData() . 25
5.17.5 SAOCExtensionConfig() . 25
5.18 SAOC 3D frame . 26
5.18.1 Saoc3DFrame() . 26
5.18.2 SAOC3DFramingInfo() . 26
5.18.3 EcDataSaoc() . 26
5.18.4 ByteAlign() . 26
5.18.5 SAOC3DExtensionFrame() . 26
5.18.6 SAOC3DExtensionFrameData() . 26
5.18.7 SAOCExtensionFrame() . 26
5.18.8 HOAConfig() . 26
5.18.9 HOADecoderConfig() . 27
5.18.10 HOAEnhConfig() . 27
5.18.11 HOADecoderEnhConfig () . 27
5.18.12 getSubbandWidths () . 28
5.19 HOA frame . 28
5.19.1 HOAFrame() . 28
5.19.2 HOAEnhFrame () . 28
5.19.3 ChannelSideInfoData() . 28
5.19.4 AddAmbHoaInfoChannel() . 28
5.19.5 HOAGainCorrectionData() . 28
5.19.6 VVectorData() . 29
5.19.7 HOAPredictionInfo() . 29
5.19.8 HOADirectionalPredictionInfo() . 29
5.19.9 readDirPredDiffValues() . 30
5.19.10 HOAParInfo () . 30
5.19.11 readParDiffValues () . 30
5.20 FMT converter frame, FormatConverterFrame() . 30
5.21 Multi-channel coding tool config, MCTConfig () . 30
5.22 Multi-channel coding tool frame . 31
5.22.1 MultichannelCodingBoxRotation () . 31
5.22.2 MultichannelCodingBoxPrediction () . 31
5.22.3 MultichannelCodingFrame() . 32
5.23 Tonal component coding configuration, TccConfig () . 32
5.24 Tonal component coding frame . 32
5.24.1 General . 32
5.24.2 TccGroupOfSegments() . 32
5.25 HREP config, HREPConfig() . 33
5.26 HREP frame, HREPFrame() . 33
5.27 ICG config, ICGConfig () . 33
5.28 SignalGroupInformation Config, SignalGroupInformation () . 33
5.29 DownmixMatrix . 34
5.29.1 downmixConfig() . 34
5.29.2 DownmixMatrixSet() . 34
5.29.3 DownmixMatrix() . 34
5.29.4 DecoderGainValue() . 35
5.29.5 ReadRange() . 35
5.29.6 EqualizerConfig() . 35
5.30 Loudness info . 35
5.30.1 mpegh3daLoudnessInfoSet() . 35
5.30.2 loudnessInfo() . 36
5.30.3 loudnessInfoSetExtension() . 36
5.31 Audioscene info . 36
5.31.1 mae_AudioSceneInfo . 36
5.31.2 mae_Data() . 36
5.31.3 mae_GroupDefinition() . 36
5.31.4 mae_SwitchGroupDefinition() . 37
5.31.5 mae_Description() . 38
5.31.6 mae_ContentData() . 38
5.31.7 mae_CompositePair() . 38
5.31.8 mae_GroupPresetDefinition() . 38
5.31.9 mae_ProductionScreenSizeData() . 39
5.31.10 mae_LoudnessCompensationData () . 39
5.31.11 mae_ProductionScreenSizeDataExtension() . 40
5.31.12 mae_GroupPresetDefinitionExtension() . 40
5.31.13 mae_DrcUserInterfaceInfo() . 41
5.32 HOA matrix .
...