ISO/IEC 23090-2:2023
Information technology — Coded representation of immersive media — Part 2: Omnidirectional media format
This document specifies the omnidirectional media format for coding, storage, delivery and rendering of omnidirectional media, including video, images, audio and timed text. Omnidirectional image or video can contain graphics elements generated by computer graphics but encoded as image or video. Multiple viewpoints, each corresponding to an omnidirectional camera, are supported. The document also specifies storage and delivery of overlay images or video intended to be rendered over the omnidirectional background image or video.
Technologies de l'information — Représentation codée de média immersifs — Partie 2: Format de média omnidirectionnel
Standards Content (Sample)
INTERNATIONAL STANDARD
ISO/IEC 23090-2
Third edition
2023-06
Information technology — Coded representation of immersive media — Part 2: Omnidirectional media format
Technologies de l'information — Représentation codée de média immersifs — Partie 2: Format de média omnidirectionnel
Reference number: ISO/IEC 23090-2:2023(E)
© ISO/IEC 2023
COPYRIGHT PROTECTED DOCUMENT
© ISO/IEC 2023
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting on
the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below
or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
Contents
Foreword
Introduction
1 Scope
2 Normative references
3 Terms, definitions, abbreviated terms and symbols
3.1 Terms and definitions
3.2 Abbreviated terms
3.3 Symbols
3.3.1 Arithmetic operators and mathematical functions
3.3.2 Order of operation precedence
3.3.3 Range notation
3.3.4 Variables
3.3.5 Processes
3.3.6 Syntax structures
3.3.7 Conventions for indicating the number of boxes in tables
4 Overview
4.1 Overall architecture
4.2 Projected omnidirectional video/images
4.2.1 General
4.2.2 Stitching, rotation, projection, and region-wise packing
4.3 Fisheye omnidirectional video/images
4.4 Mesh omnidirectional video
4.5 Streaming methods for omnidirectional video
4.5.1 Overview
4.5.2 Tile-based streaming with viewport-specific author-driven binding
4.5.3 Tile-based streaming with free-viewport author-driven binding
4.5.4 Tile-based streaming with late binding
4.6 Additional functionalities
4.7 Conformance and interoperability
4.7.1 General
4.7.2 Media profiles
4.7.3 Presentation profiles
4.7.4 Toolset brands
4.7.5 Summary of referenceable code points
5 Omnidirectional video projection and region-wise packing
5.1 Coordinate system
5.2 Omnidirectional projection formats
5.2.1 General
5.2.2 Equirectangular projection for one sample location
5.2.3 Cubemap projection for one sample location
5.3 Conversion from the local coordinate axes to the global coordinate axes
5.4 Region-wise packing formats
5.4.1 General
5.4.2 Conversion of one sample location for rectangular region-wise packing
6 Fisheye omnidirectional video
6.1 General
6.2 The FisheyeVideoEssentialInfoStruct() syntax structure
6.2.1 Syntax
6.2.2 Semantics
6.3 The FisheyeVideoSupplementalInfoStruct() syntax structure
6.3.1 Syntax
6.3.2 Semantics
7 Omnidirectional media storage and metadata signalling in the ISOBMFF
7.1 Generic extensions to the ISOBMFF
7.1.1 Indication of a track not intended to be presented alone
7.1.2 Clarifications on the stereo video box
7.1.3 Generic sub-picture track grouping extensions
7.1.4 Media offset box
7.2 Generic extensions to ISO/IEC 14496-15
7.2.1 Containing of SpatialRelationship2DDescriptionBox for HEVC tile base track and HEVC tile tracks
7.3 OMAF-specific extensions to the ISOBMFF
7.3.1 Sync samples in timed metadata tracks
7.4 OMAF-specific extensions to ISO/IEC 14496-15
7.4.1 Coverage information box in an HEVC tile base track
7.5 Structures and semantics that are common for video tracks and image items
7.5.1 Semantics of sample locations within a decoded picture
7.5.2 Projection format structure
7.5.3 Region-wise packing structure
7.5.4 Rotation structure
7.5.5 Content coverage structure
7.5.6 Sphere region structure
7.6 Restricted video schemes for omnidirectional video
7.6.1 Scheme types
7.6.2 Projected omnidirectional video box
7.6.3 Fisheye omnidirectional video box
7.6.4 Region-wise packing box
7.6.5 Rotation box
7.6.6 Coverage information box
7.6.7 Mesh omnidirectional video box
7.6.8 Mesh box
7.7 Timed metadata for sphere regions
7.7.1 General
7.7.2 Sample entry
7.7.3 Sample format
7.7.4 Initial viewing orientation
7.7.5 Recommended viewport
7.7.6 Timed text sphere location metadata
7.8 Signalling of region-wise quality ranking
7.8.1 General
7.8.2 Spherical region-wise quality ranking
7.8.3 2D region-wise quality ranking
7.9 Storage of omnidirectional images
7.9.1 General
7.9.2 Frame packing item property
7.9.3 Projection format item property
7.9.4 Essential fisheye image item property
7.9.5 Supplemental fisheye image item property
7.9.6 Region-wise packing item property
7.9.7 Rotation item property
7.9.8 Coverage information item property
7.9.9 Initial viewing orientation item property
7.10 Storage of timed text for omnidirectional video
7.10.1 General
7.10.2 OMAF timed text configuration box
7.10.3 IMSC1 tracks
7.10.4 WebVTT tracks
7.11 ERP region timed metadata
7.11.1 General
7.11.2 Sample entry format
7.11.3 Semantics
7.11.4 Sample format
7.11.5 Generating ERP region metadata
7.12 Storage and signalling of viewpoints for omnidirectional video and images
7.12.1 Viewpoint information structures
7.12.2 Viewpoint entity grouping
7.12.3 Timed metadata for viewpoints
7.13 Storage of omnidirectional video in sub-picture tracks
7.13.1 General
7.13.2 Projected omnidirectional video
7.13.3 Indication of composition pictures being packed pictures or projected pictures
7.13.4 Fisheye omnidirectional video
7.14 Storage and signalling of overlays for omnidirectional video and images
7.14.1 General
7.14.2 Overlay structure
7.14.3 Overlay control structures
7.14.4 Overlay configuration box
7.14.5 Overlay item property
7.14.6 Overlay timed metadata track
7.14.7 Entity groups
7.14.8 Overlay alpha auxiliary image
7.15 Signalling of viewing space information
7.15.1 General
7.15.2 Viewing space structure
7.15.3 Viewing space box
7.15.4 Viewing space item property
7.15.5 Time varying immersive viewing space signalling
7.16 Mapping of rectangular regions to the 3D mesh
7.16.1 General
7.16.2 Tile mesh sample grouping
7.16.3 Rectangular region structure
7.16.4 Projection of a sample location onto the 3D mesh
8 Omnidirectional media encapsulation and signalling in DASH
8.1 Architecture of DASH delivery in OMAF
8.2 Usage of DASH in OMAF
8.2.1 General
8.2.2 Signalling of stereoscopic frame packing
8.2.3 Carriage of timed metadata
8.2.4 Associating Adaptation Sets or Representations with each other
8.3 DASH MPD descriptors for omnidirectional media in the namespace "urn:mpeg:mpegI:omaf:2017"
8.3.1 XML namespace and schema
8.3.2 Signalling of projection type information
8.3.3 Signalling of region-wise packing type
8.3.4 Signalling of content coverage
8.3.5 Signalling of spherical region-wise quality ranking
8.3.6 Signalling of 2D region-wise quality ranking
8.3.7 Signalling of fisheye omnidirectional video
8.4 Carriage of images
8.4.1 General
8.4.2 Format and constraints for Segments
8.5 DASH MPD descriptors for omnidirectional media in the namespace "urn:mpeg:mpegI:omaf:2020"
8.5.1 XML namespace and schema
8.5.2 Signalling of association
8.5.3 Signalling of viewpoints
8.5.4 Signalling of sub-picture composition iden
...
ISO/IEC FDIS 23090-2:2023(E)
ISO/IEC JTC 1/SC 29/WG 03
Secretariat: KATS
Information technology — Coded representation of immersive
media — Part 2: Omnidirectional media format
8.5.4 Signalling of sub-picture composition identifier and its attributes
8.5.5 Signalling of overlays
...
FINAL DRAFT INTERNATIONAL STANDARD
ISO/IEC FDIS 23090-2
ISO/IEC JTC 1/SC 29
Secretariat: JISC
Voting begins on: 2023-03-10
Voting terminates on: 2023-05-05
Information technology — Coded representation of immersive media — Part 2: Omnidirectional media format
Technologies de l'information — Représentation codée de média immersifs — Partie 2: Format de média omnidirectionnel
RECIPIENTS OF THIS DRAFT ARE INVITED TO SUBMIT, WITH THEIR COMMENTS, NOTIFICATION OF ANY RELEVANT PATENT RIGHTS OF WHICH THEY ARE AWARE AND TO PROVIDE SUPPORTING DOCUMENTATION.
IN ADDITION TO THEIR EVALUATION AS BEING ACCEPTABLE FOR INDUSTRIAL, TECHNOLOGICAL, COMMERCIAL AND USER PURPOSES, DRAFT INTERNATIONAL STANDARDS MAY ON OCCASION HAVE TO BE CONSIDERED IN THE LIGHT OF THEIR POTENTIAL TO BECOME STANDARDS TO WHICH REFERENCE MAY BE MADE IN NATIONAL REGULATIONS.
Reference number: ISO/IEC FDIS 23090-2:2023(E)
© ISO/IEC 2023