ISO/IEC PRF 19566-6
(Main)Information technologies - JPEG systems - Part 6: JPEG 360
Information technologies - JPEG systems - Part 6: JPEG 360
This document specifies omnidirectional/360-degree image and motion contents using Rec. ITU-T T.81 | ISO/IEC 10918‑1, Rec. ITU-T T.800 (11/2015) | ISO/IEC 15444‑1, and ISO/IEC 18477-3.
Technologies de l'information — Systèmes JPEG — Partie 6: JPEG 360
General Information
- Status
- Not Published
- Current Stage
- 5020 - FDIS ballot initiated: 2 months. Proof sent to secretariat
- Start Date
- 17-Dec-2025
- Completion Date
- 27-Dec-2025
Relations
- Effective Date
- 08-Feb-2025
- Effective Date
- 08-Feb-2025
- Effective Date
- 08-Feb-2025
Overview
ISO/IEC PRF 19566-6 (JPEG 360) defines a standardized way to store, describe and interoperate omnidirectional (360°) still and motion image content within the JPEG family. Built on existing JPEG box formats and compression technologies, JPEG 360 specifies image projections (default: equirectangular projection), metadata structures, and container boxes (notably the JPEG 360 Content Type JUMBF superbox) to support multi-sensor captures, stereoscopic 360 imagery, viewport rendering and secure/encrypted image components.
Key topics and technical requirements
- Equirectangular projection: Default mapping of spherical surface to 2D images; coordinates described by longitude (ϕ) and latitude (θ) with defined angle ranges and mapping rules.
- Orientation metadata: Support for gravity direction and compass heading (ω) to allow upright and nominally oriented presentation of scenes.
- Viewport and ROI support: Definitions for viewport representation and accelerated rendering of Regions of Interest (ROIs) to enable efficient extraction of perspective views from full-sphere images.
- Stereoscopic 360: Provisions for stereoscopic image content and associated parameters (improvements over previous edition).
- Metadata structuring: Use of JPEG Universal Metadata Box Format (JUMBF) with defined XML boxes and sub-box components for descriptive metadata, unstitched image elements, and encrypted parts.
- Serialization/box formats: Conformance to box-file layout and serialization rules compatible with JPEG, JPEG 2000 and ISO/IEC box formats.
- Interoperability constraints: Angle range constraints, coordinate conventions, and conformance language to ensure consistent interpretation across implementations.
Applications and who uses it
- 360° camera and smartphone manufacturers: embed standardized orientation, projection and unstitched-image metadata to improve playback and stitching workflows.
- VR/AR and HMD developers: reliably extract viewports and stereoscopic renderings with correct orientation and ROI acceleration.
- Media platforms and content distributors: ensure consistent playback across web, mobile and head-mounted displays by leveraging standardized metadata and container boxes.
- Imaging software vendors and SDK authors: implement parsing, remapping, encryption and ROI-accelerated decoding for efficient editing, tagging, augmentation and analytics.
- Archivists and standards bodies: preserve structure and metadata for long-term accessibility.
Related standards
- ISO/IEC 19566-5 - JPEG Universal Metadata Box Format (JUMBF)
- Rec. ITU‑T T.81 | ISO/IEC 10918‑1 - JPEG baseline still image
- Rec. ITU‑T T.800 | ISO/IEC 15444‑1 - JPEG 2000 core
- ISO/IEC 18477‑3 - Scalable compression box file format
This standard improves interoperability and practical workflows for 360° imaging by standardizing projection, metadata and container semantics for modern multi-sensor and stereoscopic use cases.
ISO/IEC PRF 19566-6 - Information technologies — JPEG systems — Part 6: JPEG 360 Released:17. 12. 2025
REDLINE ISO/IEC PRF 19566-6 - Information technologies — JPEG systems — Part 6: JPEG 360 Released:17. 12. 2025
Frequently Asked Questions
ISO/IEC PRF 19566-6 is a draft published by the International Organization for Standardization (ISO). Its full title is "Information technologies - JPEG systems - Part 6: JPEG 360". This standard covers: This document specifies omnidirectional/360-degree image and motion contents using Rec. ITU-T T.81 | ISO/IEC 10918‑1, Rec. ITU-T T.800 (11/2015) | ISO/IEC 15444‑1, and ISO/IEC 18477-3.
This document specifies omnidirectional/360-degree image and motion contents using Rec. ITU-T T.81 | ISO/IEC 10918‑1, Rec. ITU-T T.800 (11/2015) | ISO/IEC 15444‑1, and ISO/IEC 18477-3.
ISO/IEC PRF 19566-6 is classified under the following ICS (International Classification for Standards) categories: 35.040.30 - Coding of graphical and photographical information. The ICS classification helps identify the subject area and facilitates finding related standards.
ISO/IEC PRF 19566-6 has the following relationships with other standards: It is inter standard links to ISO/IEC 19566-6:2019, ISO/IEC 19566-6:2019/Amd 1:2021, ISO/IEC 19566-6:2019/Amd 2:2025. Understanding these relationships helps ensure you are using the most current and applicable version of the standard.
You can purchase ISO/IEC PRF 19566-6 directly from iTeh Standards. The document is available in PDF format and is delivered instantly after payment. Add the standard to your cart and complete the secure checkout process. iTeh Standards is an authorized distributor of ISO standards.
Standards Content (Sample)
International
Standard
ISO/IEC 19566-6
Second edition
Information technologies — JPEG
systems —
Part 6:
JPEG 360
Technologies de l'information — Systèmes JPEG —
Partie 6: JPEG 360
PROOF/ÉPREUVE
Reference number
ISO/IEC 19566-6:2025(en) © ISO/IEC 2025
ISO/IEC 19566-6:2025(en)
© ISO/IEC 2025
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting on
the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below
or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ii
ISO/IEC 19566-6:2025(en)
Contents Page
Foreword .iv
Introduction .v
1 Scope . 1
2 Normative references . 1
3 Terms, definitions and abbreviated terms . 1
3.1 Terms and definitions .1
3.2 Abbreviated terms .2
4 Conventions . 2
4.1 Conformance language .2
4.2 Typesetting .2
5 Description and definition of JPEG 360 images . 3
5.1 General .3
5.2 Relative orientation.5
5.3 Viewport representation .7
5.4 Stereoscopic 360 image .8
5.4.1 General .8
5.4.2 Stereoscopic formats .8
5.4.3 Viewport definition in the stereo vision model .10
5.4.4 Compatible with a conventional 360 image viewer .10
5.5 Basic set of general 360 image parameters .11
5.5.1 360 image parameters .11
5.5.2 Viewport parameters .11
6 Structuring of JPEG 360 Metadata . .11
6.1 General .11
6.2 Definition of JPEG 360 Content Type boxes . 12
Annex A (normative) JPEG 360 Content Type JUMBF box .13
Annex B (normative) XML box for JPEG 360 . 14
Annex C (normative) Accelerated ROIs rendering for JPEG (ISO/IEC 10918-1) images .29
Bibliography .38
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
iii
ISO/IEC 19566-6:2025(en)
Foreword
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical
Commission) form the specialized system for worldwide standardization. National bodies that are
members of ISO or IEC participate in the development of International Standards through technical
committees established by the respective organization to deal with particular fields of technical activity.
ISO and IEC technical committees collaborate in fields of mutual interest. Other international organizations,
governmental and non-governmental, in liaison with ISO and IEC, also take part in the work.
The procedures used to develop this document and those intended for its further maintenance are described
in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for the different types
of document should be noted. This document was drafted in accordance with the editorial rules of the ISO/
IEC Directives, Part 2 (see www.iso.org/directives or www.iec.ch/members_experts/refdocs).
ISO and IEC draw attention to the possibility that the implementation of this document may involve the
use of (a) patent(s). ISO and IEC take no position concerning the evidence, validity or applicability of any
claimed patent rights in respect thereof. As of the date of publication of this document, ISO and IEC had not
received notice of (a) patent(s) which may be required to implement this document. However, implementers
are cautioned that this may not represent the latest information, which may be obtained from the patent
database available at www.iso.org/patents and https://patents.iec.ch. ISO and IEC shall not be held
responsible for identifying any or all such patent rights.
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and expressions
related to conformity assessment, as well as information about ISO's adherence to the World Trade
Organization (WTO) principles in the Technical Barriers to Trade (TBT) see www.iso.org/iso/foreword.html.
In the IEC, see www.iec.ch/understanding-standards.
This document was prepared by Joint Technical Committee ISO/IEC JTC 1, Information technology,
Subcommittee SC 29, Coding of audio, picture, multimedia and hypermedia information.
This second edition cancels and replaces the first edition (ISO/IEC 19566-6:2019), which has been technically
revised. It also incorporates the Amendments ISO/IEC 19566-6:2019/Amd 1:2021 and ISO/IEC 19566-6:2019/
Amd 2:2025.
The main changes are as follows:
— incorporates the latest advancements in stereoscopic imaging as well as dealing with accelerated
rendering of region of interest viewports.
A list of all parts in the ISO/IEC 19566 series can be found on the ISO and IEC websites.
Any feedback or questions on this document should be directed to the user’s national standards
body. A complete listing of these bodies can be found at www.iso.org/members.html and
www.iec.ch/national-committees.
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
iv
ISO/IEC 19566-6:2025(en)
Introduction
The ISO/IEC 19566 series is designed primarily for format and metadata storage and protection method of
compressed continuous-tone photographic content.
There is increasing use of multi-sensor images from multiple image sensor devices, such as 360° capturing
cameras or dual-camera smartphones available to consumers. Images from these cameras are shown on
computers, smartphones, and head-mounted displays (HMD).
Because existing JPEG standards do not fully cover these new uses, incompatibilities have reduced the
interoperability of these images, and thus reducing the widespread ubiquity which consumers have come to
expect when using JPEG-based images.
Additionally, new modalities for interacting with images, such as computer-based augmentation, face-
tagging, and object classification require support for metadata that was not part of the original JPEG scope.
This document defines “JPEG 360”, building upon the features of JPEG Universal Metadata Box Format
(JUMBF) (see ISO/IEC 19566-5).
This document defines the use of the JPEG 360 Content Type JUMBF superbox with respect to the sub-box
components which include the definition of an XML box, the use of other boxes such as unstitched image
elements for omnidirectional captures together with the main image and descriptive metadata, and
encrypted parts of the image.
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
v
International Standard ISO/IEC 19566-6:2025(en)
Information technologies — JPEG systems —
Part 6:
JPEG 360
1 Scope
This document specifies omnidirectional/360° image and motion contents using
Rec. ITU-T T.81 | ISO/IEC 10918-1, Rec. ITU-T T.800 | ISO/IEC 15444-1 and ISO/IEC 18477-3.
2 Normative references
The following documents are referred to in the text in such a way that some or all of their content constitutes
requirements of this document. For dated references, only the edition cited applies. For undated references,
the latest edition of the referenced document (including any amendments) applies.
ISO/IEC 19566-5, Information technologies — JPEG systems — Part 5: JPEG universal metadata box format
(JUMBF)
Rec. ITU-T T.81 | ISO/IEC 10918-1, Information technology — Digital compression and coding of continuous-
tone still images: Requirements and guidelines
3 Terms, definitions and abbreviated terms
3.1 Terms and definitions
For the purposes of this document, the following terms and definitions apply.
ISO and IEC maintain terminological databases for use in standardization at the following addresses:
— ISO Online browsing platform: available at https:// www .iso .org/ obp
— IEC Electropedia: available at https:// www .electropedia .org/
3.1.1
box
binary structure that encapsulates an object embedded in a file
3.1.3
deserialization
extraction of data structure from a series of bytes
3.1.4
equirectangular projection
projection for mapping a portion of the surface of a sphere to a flat image
3.1.6
metadata
data that describes other data, including text, image, hypertext and combinations thereof
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
3.1.7
omnidirectional
(sub)spherical surface of an image of a scene as if observed from a single point of projection
3.1.8
serialization
translation of data structures into a series of bytes that can be stored and/or transmitted
3.1.9
JPEG 1
common image compression data format and means of reference to ISO/IEC 10918-1
3.2 Abbreviated terms
3D three dimensional
CTE centre-to-edge
ERP equirectangular projection
FOV field of view
JSON JavaScript object notation
JPEG joint photographic experts group
JUMBF JPEG universal metadata box format
MIME multipurpose internet mail extensions
URL uniform resource locator
XML eXtensible Markup Language
XMP eXtensible Metadata Platform
ROI region of Interest
RDF resource description framework
W3C World Wide Web Consortium
UMF universal metadata framework
URI uniform resource identifier
UUID universal unique identifier
4 Conventions
4.1 Conformance language
The keyword "reserved" indicates a provision that is not specified at this time, shall not be used, and may
be specified in the future. The keyword "forbidden" indicates "reserved" and in addition indicates that the
provision will never be specified in the future.
4.2 Typesetting
Regular face fonts as this text describe informative text that provides instructions, comments or details for
the reader.
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
Monospaced text as this paragraph indicates program input or output as necessary to either run
the software, or as generated by the software on the console.
5 Description and definition of JPEG 360 images
5.1 General
The equirectangular projection is a commonly used projection of omnidirectional cameras to a two-
dimensional rectangular image and is the default image projection for JPEG 360.
The equirectangular projection maps the image onto a spherical surface from a single projection point at the
centre of the sphere. The surface of the sphere is specified by two angular measures as shown in Figure 1;
for convenience, a unit sphere is assumed. The centre of the sphere is coincident with the origin of the three-
dimensional Cartesian coordinates. A point on the surface of the sphere is defined by the two angles ϕ and θ,
which are also referred to as “longitude” and “latitude” respectively.
a
Origin of 360 image (θ = ϕ = 0).
b
Direction of increasing ϕ.
c
Direction of increasing θ.
Figure 1 — Spherical and Cartesian coordinates
In Cartesian coordinates, the points on the sphere are defined as follows:
— x = cosϕ · cosθ
— y = sinϕ · sinθ
— z = sinθ
For an equirectangular projection the sphere’s surface, expressed in terms of the angles ϕ and θ, can be
mapped to a two-dimensional Cartesian surface. For example, Figure 2 presents the mapping for several
lines of constant longitude or constant latitude.
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
a
Prime meridian (commonly at ϕ = 0).
b
Equator (typically at θ = 0).
c
Origin of 360 image (θ = ϕ = 0).
d
Span of ϕ (commonly −180° to 180°).
e
Span of θ (commonly −90° to 90°).
Figure 2 — Description of mapping from spherical surface to equirectangular
In this document, a simplified representation is used. For example, an equirectangular projection of the full
spherical surface is shown in Figure 3.
Figure 3 — Simplified representation for equirectangular projection for full spherical surface
The generalized equirectangular projection is expressed in terms of ranges for ϕ and θ by defining four
values ϕ , ϕ , θ , and θ as shown in Figure 4.
min max min max
Figure 4 — Generalized description of equirectangular projection
ϕ , and ϕ are constrained by the two conditions:
min max
— (−360) ≤ ϕ ≤ ϕ ≤ 360, and
min max
— ϕ − ϕ ≤ 360.
max min
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
θ , and θ are constrained by the two conditions:
min max
— (−180 ) ≤ θ ≤ θ ≤ 180, and
min max
— θ − θ ≤ 180.
max min
The advantages of this generalized description are:
a) it defines equirectangular projections that are smaller than a full sphere, and
b) it can mathematically shift the origin (0, 0) of the equirectangular project in terms of the angle range
values.
For example, an “equatorial image” to be a band around the equator of the equirectangular projection or set
the origin at the left edge of the equirectangular projection.
5.2 Relative orientation
The full 360 camera captures equivalent scenes regardless of the exact position the user holds the camera
during image exposure. However, it is highly desirable to improve the viewer’s experience of the image
by displaying the image in a way that is nominally oriented so that the scene horizon is parallel to the
equirectangular projection’s centre line. Sensors in the camera can provide the camera orientation relative
to the local direction of the Earth’s gravity; using this information allows the image to be remapped to the
expected view.
Figure 5 shows the direction of Earth’s gravity with respect to the camera’s local coordinate systems in both
3D space and equirectangular projection for a full spherical surface as can be expected when the camera is
held in its upright position.
a
Centre of 360 image as projected on sphere (θ = ϕ = 0).
b
Direction of gravity as projected on sphere.
c
Centre of 360 image in equirectangular projection (θ = ϕ = 0).
d
Direction of gravity in equirectangular projection.
Figure 5 — Representation of the direction of Earth’s gravity for upright camera
When the camera is not held upright, the direction of Earth’s gravity is shifted relative to the camera’s
coordinate systems. In Figure 6, this is illustrated.
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
a
Centre of 360 image as projected on sphere (θ = ϕ = 0).
b
Direction of gravity as projected on sphere.
c
Point on surface of sphere along direction of gravity.
d
Direction of gravity in equirectangular projection.
Figure 6 — Representation for direction of gravity for non-upright camera
The direction of Earth’s gravity is defined to be the vector from the centre of the sphere to the point defined
by the pair ϕ and θ . It is possible for the equirectangular position of the Earth’s gravity to be
gravity gravity
outside the ranges defined by ϕ , ϕ , θ , and θ . The ϕ and θ are constrained shown
min max min max gravity gravity
below:
— ϕ −180 < ϕ < ϕ + 180, and
max gravity max
— θ -90 < θ < θ + 90.
min gravity max
The compass heading relative to the camera’s coordinates is very useful information. This is a one-
dimensional value given by the angle, ω, where 0 ≤ ω < 360, as shown in Figure 7.
NOTE In principle, the value of the compass heading can be less than ϕ or greater than ϕ .
min max
a
Centre of 360 image as projected on sphere (θ = ϕ = 0).
b
Compass heading (ω).
Figure 7 — Compass heading definition
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
5.3 Viewport representation
A common interaction modality is to present the viewer with only a limited view of the equirectangular
projection that corresponds to the perspective that is familiar to the human visual experience. This can
be further used to guide the viewer’s experience of the image by starting at an initial location in the
equirectangular projection. This is referred to a “viewport”. A viewport is defined as a rectangular sub-
region within the equirectangular projection.
In Figure 8, the blue box represents a viewport; while not required, it is recommended to render a view into
the equirectangular projection through a viewport.
a
Viewport centre.
b
Viewport horizontal field of view span.
c
Viewport vertical field of view span.
Figure 8 — Viewport as a subset of the equirectangular projection
In general, as shown in Figure 9, the edges of the viewport may not be parallel to the equirectangular
projection coordinates.
a
Viewport centre.
b
Direction of increasing viewport roll.
c
Line of constant ϕ.
Figure 9 — Non-parallel viewport
The viewport is characterized by:
— the centre of the viewport as defined by longitude (ϕ) and latitude values (θ),
— the span in ϕ and θ dimensions, and
— the viewport roll which is the rotation angle between a line-of-constant-φ through the viewport centre
point and the centre line of the viewport.
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
In addition, an accelerated viewport rendering to support an efficient and low latency viewport transmission
shall be done as defined in Annex C.
5.4 Stereoscopic 360 image
5.4.1 General
A stereoscopic 360 image consists of a pair of images that depict the same scene or object from a slightly
different angle or perspective for the left and right eyes. In addition to monoscopic 360 images, this document
presents three formats for representing stereoscopic 360 images.
5.4.2 Stereoscopic formats
The stereoscopic format indicates an arrangement of the left and right images. This subclause defines three
formats for organizing the pair of the stereoscopic 360 images as follows:
a) Side-by-side: left- and right-eye images are packed horizontally (see Figure 10).
b) Top-bottom: left- and right-eye images are packed vertically (see Figure 11).
c) Extended: one of the left- and right-eye images is stored in the form of a JPEG image, which contains the
metadata required to construct a stereoscopic scene. The metadata includes information that identifies
the location of the remaining image in both eyes. Figure 12 shows an example of the extended format.
Although this example shows the right-eye image is contained in a single JPEG 1 image file, the right-eye
image may be referenced externally as a different JPEG 1 image file, as per ISO/IEC 19566-5.
a) Structure
b) Example of the format
Figure 10 — Side-by-side format
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
a) Structure b) Example of the format
Figure 11 — Top-bottom format
Figure 12 — Example of the extended format
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
5.4.3 Viewport definition in the stereo vision model
a
Centre of the baseline.
b
Left eye.
c
Right eye.
d
Baseline.
e
Viewport.
f
Sphere.
g
Left 360° image.
h
Viewport mapped on the left image.
i
Viewport mapped on the right image.
j
Right 360° image.
Figure 13 — Viewport definition in the stereo vision model
Figure 13 illustrates the stereo vision model used for stereoscopic 360 images and how a viewport is
organized. The viewport is a rectangular sub-region of the sphere specified by the FOV span values from the
centre of the sphere that is supposed to be the centre of the two eyes in the model.
In this model, P, which is a point on the sphere with Cartesian coordinate, is mapped on to the left and right
images, P and P , differently depending on the configuration of the capture devices. This allows the human
L R
visual system to perceive the point P more realistically when left and right images are projected on both
eyes, respectively. In this case, the viewports for both eyes are not necessarily defined for the left and right
eyes separately but can be defined by the viewing direction from the centre of both eyes. Therefore, the
viewport is specified by the FOV span values at the centre of the sphere that is considered as the centre of
the two eyes in the model.
5.4.4 Compatible with a conventional 360 image viewer
An interaction modality, such as a head-mounted display, is able to present a stereoscopic scene while a
conventional 360 image viewer can parse only monoscopic JPEG 360 image and present a viewport. However,
when the conventional viewer loads the stereoscopic JPEG 360 image file, it is possible to extract the left- or
right-eye image from the file based on the metadata embedded in the file and then present it to the viewer.
Furthermore, when the stereoscopic JPEG 360 image is encoded based on the extended stereoscopic format,
the conventional viewer will display one of the left- and right-eye images without additional processing. This
follows the backward compatible mechanism, which allows that an image is decoded when the file is opened
by conventional JPEG viewing applications.
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
5.5 Basic set of general 360 image parameters
5.5.1 360 image parameters
— JPEG360VersionNumber: provides a way to identify parameter sets and associated definitions.
[1]
— MediaType: commonly referred to as MIME type, this identifies the codestream format.
— ProjectionType: identifies the spherical projection for the associated image.
— PhiMin: ϕ as specified in 5.1.
min
— PhiMax: ϕ as specified in 5.1.
max
— ThetaMax: θ as specified in 5.1.
max
— ThetaMin: θ as specified in 5.1.
min
— PhiGravity: ϕ as specified in 5.1.
gravity
— ThetaGravity: corresponding to θ as specified in 5.1.
gravity
— CompassPhi: the angle between the camera’s coordinate system and the Earth’s compass coordinates.
— BoxReference: indication if the equirectangular image position in the file is in the codestream position
expected by earlier versions of JPEG standards.
5.5.2 Viewport parameters
— JPEG360ViewportNumber: a numbered set of viewport descriptors.
— ViewportPhi: phi angle value for viewport centre: the longitude value for the centre of the rectangular
sub-region in the equirectangular projection.
— ViewportTheta: theta angle value for viewport centre: the latitude value for the centre of the rectangular
sub-region in the equirectangular projection.
— ViewportPhiFOV: the longitude span of the viewport with span centred at the viewport latitude centre.
— ViewportThetaFOV: the latitude span of the viewport with span centred at the viewport longitude
centre.
— ViewportRoll: the angular rotation of the viewport.
6 Structuring of JPEG 360 Metadata
6.1 General
This clause defines the JPEG 360 Content Type box, which is based on the JUMBF superbox defined by
ISO/IEC 19566-5. The sub-box components are defined, which include the definition of an XML box, the use of
other boxes such as unstitched image elements for omnidirectional captures together with the main image
and descriptive metadata, and encrypted parts of the image in order to construct a conformant JPEG 360
image file. An overview of the JPEG 360 Content Type box is shown in Figure 14.
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
Figure 14 — Overview of JPEG 360 Content type JUMBF box
6.2 Definition of JPEG 360 Content Type boxes
The JPEG 360 Content Type box shall conform to ISO/IEC 19566-5, and shall be defined in:
— Annex A, which contains the definition of the JUMBF box that shall be used for JPEG 360 (e.g. the structure
and definitions of required content of this box);
— Annex B, which defines the JPEG 360 metadata schema that shall be used.
The boxes in Annex A follow the box-based file construction as specified in ISO/IEC 19566-5.
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
Annex A
(normative)
JPEG 360 Content Type JUMBF box
A.1 General
This annex defines the use of JUMBF elements for JPEG 360; the elements are defined in ISO/IEC 19566-5.
A.2 JUMBF Description box: Type for JPEG 360
Table A.1 defines the TYPE field in the JUMBF Description box that corresponds to the Content Type of the
JPEG 360 JUMBF box.
Table A.1 — Definition of JUMBF Description box TYPE of a JPEG 360 JUMBF box
Parameter Value
TYPE 0x785f34b7-5d4b-474c-b89f-
1d99e0e3a8dd
NOTE The UUID above is per
recommendation ITU-T X.667 |
[2]
ISO/IEC 9834-8 .
A.3 JUMBF Description box: recommended/default label string for JPEG 360
Table A.2 defines the LABEL field in the JUMBF Description box that corresponds to the Content Type of the
JPEG 360 JUMBF box.
Table A.2 — Definition of JUMBF Description box LABEL of a JPEG 360 JUMBF box
Parameter Value
LABEL “JPEG360Metadata”
Each instance can have a distinct label or the same label if they are embedded in different files. If there are
multiple JUMBF boxes in the same file, the user is responsible to assign a unique label within the scope of the
file. It is recommended to use the label “JPEG360Metadata”, if this label does not already exist in the current
scope of the file (e.g. a newly created JPEG 360 image).
A.4 Description of the JPEG 360 Content Type JUMBF box
As shown in Figure 14, JPEG 360 metadata is contained within a JUMBF superbox. This JPEG 360 Content
Type JUMBF box is internally composed of the following:
— The JUMBF Description box with the UUID field set to 0x785f34b7-5d4b-474c-b89f-1d99e0e3a8dd.
— Exactly one XML box which contains the JPEG 360 metadata as specified in Annex B.
— Optional Codestream Content type JUMBF boxes.
— Optional Embedded File Content type JUMBF boxes.
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
Annex B
(normative)
XML box for JPEG 360
B.1 General
JPEG 360 is designed to balance the need to provide a timely standard while keeping flexibility to grow the
feature sets to more fully accommodate the identified use cases. To meet this need, a basic schema descriptor
and an empty metadata set are provided; which assign default values for the metadata elements.
However, without loss of generality, metadata elements can be set to support a variety of basic
equirectangular images, and a number of viewports.
B.2 Definition of JPEG 360 metadata
The schema elements of the JPEG 360 metadata contain basic properties, which are specified in Table B.1.
Table B.1 — Definition of JPEG 360 schema descriptor elements
JPEG 360 schema descriptor elements Meaning Data type
JPEG- Schema name string
360Meta-
data
JPEG360ImageMetadata Name of subschema for JPEG 360 image string
parameters
JPEG360Version Version # of JPEG image metadata schema; integer
provides a way to identify parameter sets and
associated definitions
MediaType Media type (formerly MIME type) for string
encoded 360 projection image codestream
ProjectionType 360 image projection type string
Stereoscopic Format Shall be as either ‘side-by-side’, ‘top-bottom’, string
or ‘extended’
PhiMin As shown in Clause 6, in units of degrees (°) real
PhiMax real
ThetaMax real
ThetaMin real
PhiGravity real
ThetaGravity real
CompassPhi real
BoxReference In the case of ‘extended’ Stereoscopic Format, string
it refers to a label for either a JUMBF Code-
stream Content type box or a JUMBF Embed-
ded File Content type box, as per ISO/IEC
19566-5. In any other case, the default value is
‘conventional’.
JPEG360ViewportMetadata Name of subschema for JPEG 360 viewport string
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
TTaabbllee BB.11 ((ccoonnttiinnueuedd))
JPEG 360 schema descriptor elements Meaning Data type
JPEG360Viewport As several viewports can be assigned, a integer
Number unique ID to further differentiate them.
The value ‘0’ is reserved for the default
viewport as specified in Table B.4.
ViewportPhi As shown in Clause 6, in units of degrees (°) real
ViewportTheta As shown in Clause 6, in units of degrees (°) real
ViewportPhiFOV real
ViewportThetaFOV real
ViewportRoll real
The JPEG 360 metadata elements are required to follow the definitions of the JPEG 360 schema descriptor.
B.3 Overview of metadata representation
[5]
This annex defines JPEG 360 schema based on XMP specification (according to ISO 16684-3). The data
source layer implements serialization, deserialization and embedding of metadata with using XMP with the
addition of new tags to express a schema descriptor, and to associate metadata elements with that schema
descriptor, as shown diagrammatically in Figure B.1.
Figure B.1 — Logical view of the metadata
The high-level structuring of the JPEG 360 XML box is shown in Table B.2.
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
Table B.2 — High-level structuring of the XML box in a JPEG 360 JUMBF box
XMP Purpose Top level structure and tags Second level structure and tags
framing
Internal counter u m f : ne x t -id
Schemas XMP arrays of schemas XMP array of schema names
description storage
umf : s c he m a s u m f : de s c r ip t or s
XMP array of fields/types
umf: fields
Metadata storage XMP array of metadata XMP array of metadata items
u m f : me t ad at a umf : s e t
XMP array of metadata field/values,
references
umf: fields, umf: refs
The top-level structure ordering is not critical as the structure in Table B.3 is equivalent to the structure in
Table B.2.
Table B.3 — Alternative high-level structuring of the XML box in a JPEG 360 JUMBF box
XMP Purpose Top level structure and tags Second level structure and tags
framing
Internal counter u m f : ne x t -id
Metadata storage XMP array of metadata XMP array of metadata items
u m f : me t ad at a umf : s e t
XMP array of metadata field/values,
references
umf: fields, umf: refs
Schemas XMP arrays of schemas XMP array of schema names
description storage
umf : s c he m a s u m f : de s c r ip t or s
XMP array of fields/types
umf: fields
B.4 Encoding JPEG 360 metadata syntax
The JPEG 360 schema is serialized and stored using a subset of the W3C Resource Description Framework
[3] [4]
(RDF) , expressed in XML . (XMP character encoding uses UTF-8, so the text characters shown in the
following monospaced examples can be different from their UTF-8 values.)
The XMP statement to frame a metadata stream is:
B.5 Reserved tags
Table B.4 defines tags reserved in the JPEG 360 XML Box.
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
Table B.4 — JPEG 360 reserved XML tags
Reserved tag Description
umf: schemas Tag to define start of schema descriptor. Multiple schema descriptors can be defined, each
with multiple sets of metadata elements.
umf: schema Tag to define a unique name for a schema.
umf: name Tag to define name of schema descriptor elements, or to label sets of metadata elements.
umf: type Tag to associate a data type to a name in the schema descriptor.
umf:id Unique id of a metadata set.
umf: next -id Used by implementation(s) to assign a unique id to metadata items.
umf: index Reserved tag. Index into an image sequence that contains the first image associated with
this metadata item (-1 in case of global metadata).
umf: nframes Reserved tag. Number of sequential frames the metadata item is associated with (0 in case
of global metadata).
umf: fields Tag for description of schema fields. Fields of metadata items provide a list of value/name
pairs (in case it is a structure) or an array of values (in case it is an array) or just a single
value.
umf: refs Array of references to other metadata items. Each reference is stored as metadata id.
umf: set Tag to associate fields of metadata elements.
B.6 Values for JPEG 360 metadata
The schema descriptor elements of the JPEG 360 metadata are shown in Table B.5.
Table B.5 — JPEG 360 schema descriptor elements
JPEG 360 schema descriptor Schema descriptor
JPEG360Metadata JPEG360Metadata
JPEG360ImageMetadata JPEG360ImageMetadata
JPEG360Version JPEG360Version
integer
MediaType MediaType
string
ProjectionType ProjectionType
string
Stereoscopic Format StereoscopicFormat
string
PhiMin PhiMin
real
PhiMax PhiMax
real
ThetaMax ThetaMax
real
ThetaMin ThetaMin
real
PhiGravity PhiGravity
real
ThetaGravity ThetaGravity
real
CompassPhi CompassPhi
real
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
TTaabbllee BB.55 ((ccoonnttiinnueuedd))
JPEG 360 schema descriptor Schema descriptor
BoxReference BoxReference
string
JPEG360ViewportMetadata JPEG360ViewportMetadata
JPEG 360Viewport- JPEG360ViewportNumber
Number
integer
ViewportPhi ViewportPhi
real
ViewportTheta ViewportTheta
real
ViewportPhiFOV ViewportPhiFOV
real
ViewportThetaFOV ViewportThetaiFOV
real
ViewportRoll ViewportRoll
real
B.7 XMP expression of minimum self-describing schema (without metadata
elements)
The JPEG 360 XML box contains data which is structured using XMP; the XMP expression of the schema
defines its properties.
The minimal XMP expression consists of the JPEG 360 schema descriptor and an unpopulated JPEG 360
metadata storage, as shown below. When this minimal expression is provided, a number of default values for
the JPEG 360 image description are assigned; these default values are detailed in Clause B.8.
xmlns:xmp="http://ns.adobe.com/xap/1.0/"
xmlns:umf="http://ns.intel.com/umf/2.0">
0
JPEG360Metadata
JPEG360ImageMetadata
JPEG360Version
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
integer
MediaType
string
ProjectionType
string
PhiMin
real
PhiMax
real
ThetaMax
real
ThetaMin
real
PhiGravity
real
ThetaGravity
real
CompassPhi
real
BoxReference
string
PROOF/ÉPREUVE
© ISO/IEC 2025 – All rights reserved
ISO/IEC 19566-6:2025(en)
JPEG360ViewportMetadata
JPEG360ViewportNumber
integer
ViewportPhi
real
ViewportTheta
real
ViewportPhiFOV
real
ViewportThetaFOV
<
...
ISO/IEC DISPRF 19566-6:2025(en)
ISO/IEC /JTC 1/SC 29/WG 1
Secretariat: JISC
Date: 2025-05-0212-15
Information technologytechnologies — JPEG systems —
Part 6:
JPEG 360
Technologies de l'information — Systèmes JPEG —
Partie 6: JPEG 360
FDIS stage
© ISO/IEC 2025
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication
may be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying,
or posting on the internet or an intranet, without prior written permission. Permission can be requested from either ISO
at the address below or ISO'sISO’s member body in the country of the requester.
ISO Copyright Officecopyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: + 41 22 749 01 11
Email: E-mail: copyright@iso.org
Website: www.iso.org
Published in Switzerland.
© ISO/IEC 2025 – All rights reserved
ii
ISO/IEC DISPRF 19566-6:2025(en)
Contents Page
Foreword . iv
Introduction . v
1 Scope . 1
2 Normative references . 1
3 Terms, definitions and abbreviated terms . 1
3.1 Terms and definitions . 1
3.2 Abbreviated terms . 2
4 Conventions . 3
4.1 Conformance language . 3
4.2 Typesetting . 3
5 Description and definition of JPEG 360 images . 3
5.1 General . 3
5.2 Relative orientation . 7
5.3 Viewport representation . 10
5.4 Stereoscopic 360 image . 12
5.5 Basic set of general 360 image parameters . 18
6 Structuring of JPEG 360 Metadata . 19
6.1 General . 19
6.2 Definition of JPEG 360 Content Type boxes . 20
Annex A (normative) JPEG 360 Content Type JUMBF box . 22
Annex B (normative) XML box for JPEG 360 . 23
Annex C (normative) Accelerated ROIs rendering for JPEG (ISO/IEC 10918-1) images . 47
Bibliography . 59
© ISO/IEC 2025 – All rights reserved
iii
Foreword
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical
Commission) form the specialized system for worldwide standardization. National bodies that are members
of ISO or IEC participate in the development of International Standards through technical committees
established by the respective organization to deal with particular fields of technical activity. ISO and IEC
technical committees collaborate in fields of mutual interest. Other international organizations, governmental
and non-governmental, in liaison with ISO and IEC, also take part in the work.
The procedures used to develop this document and those intended for its further maintenance are described
in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for the different types of
document should be noted. This document was drafted in accordance with the editorial rules of the ISO/IEC
Directives, Part 2 (see www.iso.org/directives or www.iec.ch/members_experts/refdocs).
Field Code Changed
ISO and IEC draw attention to the possibility that the implementation of this document may involve the use of
(a) patent(s). ISO and IEC take no position concerning the evidence, validity or applicability of any claimed
patent rights in respect thereof. As of the date of publication of this document, ISO and IEC had not received
notice of (a) patent(s) which may be required to implement this document. However, implementers are
cautioned that this may not represent the latest information, which may be obtained from the patent database
available at www.iso.org/patents and https://patents.iec.ch. ISO and IEC shall not be held responsible for
identifying any or all such patent rights.
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and expressions
related to conformity assessment, as well as information about ISO's adherence to the World Trade
Organization (WTO) principles in the Technical Barriers to Trade (TBT) see www.iso.org/iso/foreword.html.
In the IEC, see www.iec.ch/understanding-standards.
Field Code Changed
.
This document was prepared by Joint Technical Committee ISO/IEC JTC 1, Information technology,
Subcommittee SC 29, Coding of audio, picture, multimedia and hypermedia information.
This second edition cancels and replaces the first edition (ISO/IEC 19566-6:2019), which has been technically
revised. It also incorporates the Amendments ISO/IEC 19566-6:2019/Amd 1:2021 and ISO/IEC 19566-
6:2019/Amd 2:2025.
The main changes are as follows:
— — incorporates the latest advancements in stereoscopic imaging as well as dealing with accelerated
rendering of region of interest viewports.
A list of all parts in the ISO/IEC 19566 series can be found on the ISO and IEC websites.
Any feedback or questions on this document should be directed to the user’s national standards body. A
complete listing of these bodies can be found at www.iso.org/members.html and www.iec.ch/national-
committees.
© ISO/IEC 2025 – All rights reserved
iv
ISO/IEC DISPRF 19566-6:2025(en)
Introduction
The ISO/IEC 19566 series is designed primarily for format and metadata storage and protection method of
compressed continuous-tone photographic content.
There is increasing use of multi-sensor images from multiple image sensor devices, such as 360 degree°
capturing cameras or dual-camera smartphones available to consumers. Images from these cameras are
shown on computers, smartphones, and head-mounted displays (HMD).
Because existing JPEG standards do not fully cover these new uses, incompatibilities have reduced the
interoperability of these images, and thus reducing the widespread ubiquity which consumers have come to
expect when using JPEG-based images.
Additionally, new modalities for interacting with images, such as computer-based augmentation, face-tagging,
and object classification require support for metadata that was not part of the original JPEG scope.
This document defines “JPEG 360”, building upon the features of JPEG Universal Metadata Box Format
(JUMBF) (see ISO/IEC 19566--5).
This document defines the use of the JPEG 360 Content Type JUMBF superbox with respect to the sub-box
components which include the definition of an XML box, the use of other boxes such as unstitched image
elements for omnidirectional captures together with the main image and descriptive metadata, and encrypted
parts of the image.
© ISO/IEC 2025 – All rights reserved
v
DRAFT International Standard ISO/IEC DIS 19566-6:2025(en)
Information technologytechnologies — JPEG systems —
Part 6:
JPEG 360
1 Scope
This document specifies omnidirectional/360° image and motion contents using Rec. ITU-
T T.81 | ISO/IEC 10918--1, Rec. ITU-T T.800 | ISO/IEC 15444--1, and ISO/IEC 18477-3.
2 Normative references
The following documents are referred to in the text in such a way that some or all of their content constitutes
requirements of this document. For dated references, only the edition cited applies. For undated references,
the latest edition of the referenced document (including any amendments) applies.
ISO/IEC 19566-5, Information technologies — JPEG systems — Part 5: JPEG universal metadata box format
(JUMBF)
Rec. ITU-T T.81 | | ISO/IEC 10918-1, Information technology — Digital compression and coding of continuous-
tone still images: Requirements and guidelines
Rec. ITU-T T.800 | ISO/IEC 15444-1, Information technology — JPEG 2000 Image Coding System — Part 1: Core
coding system
ISO/IEC 18477-3, Information technology — Scalable compression and coding of continuous-tone still images
— Part 3: Box file format
ISO/IEC 19566-5, Information technologies — JPEG systems — Part 5: JPEG universal metadata box format
(JUMBF)
3 Terms, definitions and abbreviated terms
3.1 Terms and definitions
For the purposes of this document, the following terms and definitions apply.
ISO and IEC maintain terminological databases for use in standardization at the following addresses:
— — ISO Online browsing platform: available at https://www.iso.org/obp
— — IEC Electropedia: available at https://www.electropedia.org/
3.1.1 3.1.1
box
binary structure that encapsulates an object embedded in a file
© ISO/IEC 2025 – All rights reserved
3.1.2 3.1.3
deserialization
extraction of data structure from a series of bytes
3.1.3 3.1.4
equirectangular projection
projection for mapping a portion of the surface of a sphere to a flat image
3.1.4 3.1.6
metadata
data that describes other data, including text, image, hypertext and combinations thereof
3.1.5 3.1.7
omnidirectional
(sub)spherical surface of an image of a scene as if observed from a single point of projection
3.1.6 3.1.8
serialization
translation of data structures into a series of bytes that can be stored and/or transmitted
3.1.7 3.1.9
JPEG 1
common image compression data format and means of reference to ISO/IEC10918IEC 10918-1:1994
3.2 Abbreviated terms
3D three dimensional
CTE centre-to-edge
ERP equirectangular projection
FOV field of view
JSON JavaScript object notation
JPEG joint photographic experts group
JUMBF JPEG universal metadata box format
MIME multipurpose internet mail extensions
URL uniform resource locator
XML eXtensible Markup Language
XMP eXtensible Metadata Platform
ROI region of Interest
RDF resource description framework
W3C World Wide Web Consortium
UMF universal metadata framework
URI uniform resource identifier
UUID universal unique identifier
© ISO/IEC 2025 – All rights reserved
ISO/IEC DISPRF 19566-6:2025(en)
4 Conventions
4.1 Conformance language
The keyword "reserved" indicates a provision that is not specified at this time, shall not be used, and may be
specified in the future. The keyword "forbidden" indicates "reserved" and in addition indicates that the
provision will never be specified in the future.
4.2 Typesetting
Regular face fonts as this text describe informative text that provides instructions, comments or details for the
reader.
Monospaced text as this paragraph indicates program input or output as necessary to either run
the software, or as generated by the software on the console.
5 Description and definition of JPEG 360 images
5.1 General
The equirectangular projection is a commonly used projection of omnidirectional cameras to a two-
dimensional rectangular image and is the default image projection for JPEG 360.
The equirectangular projection maps the image onto a spherical surface from a single projection point at the
centre of the sphere. The surface of the sphere is specified by two angular measures as shown in
Figure 1Figure 1;; for convenience, a unit sphere is assumed. The centre of the sphere is coincident with the
origin of the three-dimensional Cartesian coordinates. A point on the surface of the sphere is defined by the
two angles ϕ and θ, which are also referred to as “longitude” and “latitude” respectively.
© ISO/IEC 2025 – All rights reserved
a
Origin of 360 image (θ = ϕ = 0).
b
Direction of increasing ϕ.
c
Direction of increasing θ.
Figure 1 — Spherical and Cartesian coordinates
In Cartesian coordinates, the points on the sphere are defined as follows:
— — x = cosϕ · cosθ
— — y = sinϕ · sinθ
— — z = sinθ
© ISO/IEC 2025 – All rights reserved
ISO/IEC DISPRF 19566-6:2025(en)
For an equirectangular projection the sphere’s surface, expressed in terms of the angles ϕ and θ, can be
mapped to a two-dimensional Cartesian surface. For example, Figure 2Figure 2 presents the mapping for
several lines of constant longitude or constant latitude.
a
Prime meridian (commonly at ϕ = 0)).
b
Equator (typically at θ = 0)).
c
Origin of 360 image (θ = ϕ = 0)).
d
Span of ϕ (commonly −180° to 180°)°).
e
Span of θ (commonly −90° to 90°)°).
Figure 2 — Description of mapping from spherical surface to equirectangular
In this document, a simplified representation will beis used. For example, an equirectangular projection of the
full spherical surface is shown in Figure 3Figure 3.
© ISO/IEC 2025 – All rights reserved
Figure 3 — Simplified representation for equirectangular projection for full spherical surface
The generalized equirectangular projection is expressed in terms of ranges for ϕ and θ by defining four values
ϕmin, ϕmax, θmin, and θmax as shown in Figure 4Figure 4.
Figure 4 — Generalized description of equirectangular projection
© ISO/IEC 2025 – All rights reserved
ISO/IEC DISPRF 19566-6:2025(en)
ϕ , and ϕ are constrained by the two conditions:
min max
— — (−360) ≤ ϕ ≤ ϕ ≤ 360, and
min max
— — ϕ − ϕ ≤ 360.
max min
θ , and θ are constrained by the two conditions:
min max
— — (−180 )≤) ≤ θ ≤ θ ≤ 180, and
min max
— — θ − θ ≤ 180.
max min
The advantages of this generalized description are:
a) a) it defines equirectangular projections that are smaller than a full sphere, and
b) b) it can mathematically shift the origin (0, 0) of the equirectangular project in terms of the angle
range values.
For example, an “equatorial image” to be a band around the equator of the equirectangular projection or set
the origin at the left edge of the equirectangular projection.
5.2 Relative orientation
The full 360 camera captures equivalent scenes regardless of the exact position the user holds the camera
during image exposure. However, it is highly desirable to improve the viewer’s experience of the image by
displaying the image in a way that is nominally oriented so that the scene horizon is parallel to the
equirectangular projection’s centre line. Sensors in the camera can provide the camera orientation relative to
the local direction of the Earth’s gravity; using this information allows the image to be remapped to the
expected view.
Figure 5Figure 5 shows the direction of Earth’s gravity with respect to the camera’s local coordinate systems
in both 3D space and equirectangular projection for a full spherical surface as can be expected when the
camera is held in its upright position.
© ISO/IEC 2025 – All rights reserved
a
Centre of 360 image as projected on sphere (θ = ϕ = 0)).
b
Direction of gravity as projected on sphere.
c
Centre of 360 image in equirectangular projection (θ = ϕ = 0)).
d
Direction of gravity in equirectangular projection.
Figure 5 — Representation of the direction of Earth’s gravity for upright camera
When the camera is not held upright, the direction of Earth’s gravity is shifted relative to the camera’s
coordinate systems. In Figure 6Figure 6,, this is illustrated.
© ISO/IEC 2025 – All rights reserved
ISO/IEC DISPRF 19566-6:2025(en)
a
Centre of 360 image as projected on sphere (θ = ϕ = 0)).
b
Direction of gravity as projected on sphere.
c
Point on surface of sphere along direction of gravity.
d
Direction of gravity in equirectangular projection.
Figure 6 — Representation for direction of gravity for non-upright camera
The direction of Earth’s gravity is defined to be the vector from the centre of the sphere to the point defined
by the pair ϕgravity and θgravity. It is possible for the equirectangular position of the Earth’s gravity to be outside
the ranges defined by ϕ , ϕ , θ , and θ . The ϕ and θ are constrained shown below:
min max min max gravity gravity
— — ϕ −180 < ϕ < ϕ + 180, and
max gravity max
— — θ -90 < θ < θ + 90.
min gravity max
The compass heading relative to the camera’s coordinates is very useful information. This is a one-dimensional
value given by the angle, ω, where 0 ≤ ω < 360, as shown in Figure 7Figure 7.
NOTE In principle, the value of the compass heading can be less than ϕmin or greater than ϕmax.
© ISO/IEC 2025 – All rights reserved
a
Centre of 360 image as projected on sphere (θ = ϕ = 0)).
b
Compass heading (ω)).
Figure 7 — Compass heading definition
5.3 Viewport representation
A common interaction modality is to present the viewer with only a limited view of the equirectangular
projection that corresponds to the perspective that is familiar to the human visual experience. This can be
further used to guide the viewer’s experience of the image by starting at an initial location in the
equirectangular projection. This is referred to a “viewport”. A viewport is defined as a rectangular sub-region
within the equirectangular projection.
In Figure 8Figure 8,, the blue box represents a viewport; while not required, it is recommended to render a
view into the equirectangular projection through a viewport.
© ISO/IEC 2025 – All rights reserved
ISO/IEC DISPRF 19566-6:2025(en)
a
Viewport centre.
b
Viewport horizontal field of view span.
c
Viewport vertical field of view span.
Figure 8 — Viewport as a subset of the equirectangular projection
In general, as shown in Figure 9Figure 9,, the edges of the viewport may not be parallel to the equirectangular
projection coordinates.
© ISO/IEC 2025 – All rights reserved
a
Viewport centre.
b
Direction of increasing viewport roll.
c
Line of constant ϕ.
Figure 9 — Non-parallel viewport
The viewport is characterized by:
— — Thethe centre of the viewport as defined by longitude (ϕ) and latitude values (θ),
— — Thethe span in ϕ and θ dimensions, and
— — Thethe viewport roll which is the rotation angle between a line-of-constant-φ through the viewport
centre point and the centre line of the viewport.
In addition, an accelerated viewport rendering to support an efficient and low latency viewport transmission
shall be done as defined in Annex CAnnex C.
5.4 Stereoscopic 360 image
5.4.1 General
A stereoscopic 360 image consists of a pair of images that depict the same scene or object from a slightly
different angle or perspective for the left and right eyes. In addition to monoscopic 360 images, this document
presents three formats for representing stereoscopic 360 images.
© ISO/IEC 2025 – All rights reserved
ISO/IEC DISPRF 19566-6:2025(en)
5.4.2 Stereoscopic formats
The stereoscopic format indicates an arrangement of the left and right images. This subclause defines three
formats for organizing the pair of the stereoscopic 360 images as follows:
a) a) Side-by-side: left- and right-eye images are packed horizontally (see Figure 10Figure 10).).
b) b) Top-bottom: left- and right-eye images are packed vertically (see Figure 11Figure 11).).
c) c) Extended: one of the left- and right-eye images is stored in the form of a JPEG image, which
contains the metadata required to construct a stereoscopic scene. The metadata includes information that
identifies the location of the remaining image in both eyes. Figure 12Figure 12 shows an example of the
extended format. Although this example shows the right-eye image is contained in a single JPEG 1 image
file, the right-eye image may be referenced externally as a different JPEG 1 image file, as per ISO/IEC
19566-5.
a) Structure
© ISO/IEC 2025 – All rights reserved
b) Example of the format
Figure 10 — Side-by-side format
© ISO/IEC 2025 – All rights reserved
ISO/IEC DISPRF 19566-6:2025(en)
a) Structure b) Example of the format
Figure 11 — Top-bottom format
—
© ISO/IEC 2025 – All rights reserved
Figure 12 — Example of the extended format
© ISO/IEC 2025 – All rights reserved
ISO/IEC DISPRF 19566-6:2025(en)
5.4.3 Viewport definition in the stereo vision model
a
CenterCentre of the baseline.
b
Left eye.
c
Right eye.
d
Baseline.
e
Viewport.
f
Sphere.
g
Left 360° image.
h
Viewport mapped on the left image.
i
Viewport mapped on the right image.
j
Right 360° image.
Figure 13 — Viewport definition in the stereo vision model
Figure 13Figure 13 illustrates the stereo vision model used for stereoscopic 360 images and how a viewport
is organized. The viewport is a rectangular sub-region of the sphere specified by the FOV span values from the
centre of the sphere that is supposed to be the centre of the two eyes in the model.
In this model, P, which is a point on the sphere with Cartesian coordinate, is mapped on to the left and right
images, P and P , differently depending on the configuration of the capture devices. This allows the human
L R
© ISO/IEC 2025 – All rights reserved
visual system to perceive the point P more realistically when left and right images are projected on both eyes,
respectively. In this case, the viewports for both eyes are not necessarily defined for the left and right eyes
separately but can be defined by the viewing direction from the centre of both eyes. Therefore, the viewport
is specified by the FOV span values at the centre of the sphere that is considered as the centre of the two eyes
in the model.
5.4.4 Compatible with a conventional 360 image viewer
An interaction modality, such as a head-mounted display, is able to present a stereoscopic scene while a
conventional 360 image viewer maycan parse only monoscopic JPEG 360 image and present a viewport.
However, when the conventional viewer loads the stereoscopic JPEG 360 image file, it is possible to extract
the left- or right-eye image from the file based on the metadata embedded in the file and then present it to the
viewer.
Furthermore, when the stereoscopic JPEG 360 image is encoded based on the extended stereoscopic format,
the conventional viewer will display one of the left- and right-eye images without additional processing. This
follows the backward compatible mechanism, which allows that an image is decoded when the file is opened
by conventional JPEG viewing applications.
5.5 Basic set of general 360 image parameters
5.5.1 360 image parameters
— — JPEG360VersionNumber: provides a way to identify parameter sets and associated definitions.
[1 ]
— — MediaType: commonly referred to as MIME [1] type, this identifies the codestream format.
— — ProjectionType: identifies the spherical projection for the associated image.
— — PhiMin: ϕ as specified in 5.1Subclause 5.1.
min
— — PhiMax: ϕ as specified in 5.1Subclause 5.1.
max
— — ThetaMax: θ as specified in 5.1Subclause 5.1.
max
— — ThetaMin: θ as specified in 5.1Subclause 5.1.
min
— — PhiGravity: ϕgravity as specified in 5.1Subclause 5.1.
— — ThetaGravity: corresponding to θgravity as specified in 5.1Subclause 5.1.
— — CompassPhi: the angle between the camera’s coordinate system and the Earth’s compass coordinates.
— — BoxReference: indication if the equirectangular image position in the file is in the codestream position
expected by earlier versions of JPEG standards.
5.5.2 Viewport parameters
— — JPEG360ViewportNumber: a numbered set of viewport descriptors.
— — ViewportPhi: phi angle value for viewport centre: the longitude value for the centre of the rectangular
sub-region in the equirectangular projection.
— — ViewportTheta: theta angle value for viewport centre: the latitude value for the centre of the
rectangular sub-region in the equirectangular projection.
© ISO/IEC 2025 – All rights reserved
ISO/IEC DISPRF 19566-6:2025(en)
— — ViewportPhiFOV: the longitude span of the viewport with span centred at the viewport latitude centre.
— — ViewportThetaFOV: the latitude span of the viewport with span centred at the viewport longitude
centre.
— — ViewportRoll: the angular rotation of the viewport.
6 Structuring of JPEG 360 Metadata
6.1 General
This Clauseclause defines the JPEG 360 Content Type box, which is based on the JUMBF superbox defined by
ISO/IEC 19566--5. The sub-box components are defined, which include the definition of an XML box, the use
of other boxes such as unstitched image elements for omnidirectional captures together with the main image
and descriptive metadata, and encrypted parts of the image in order to construct a conformant JPEG 360 image
file. An overview of the JPEG 360 Content Type box is shown in Figure 14Figure 14.
© ISO/IEC 2025 – All rights reserved
Figure 14 — Overview of JPEG 360 Content type JUMBF box
6.2 Definition of JPEG 360 Content Type boxes
With reference to ISO/IEC 19566-5, theThe JPEG 360 Content Type box shall conform to ISO/IEC 19566-5,
and shall be defined in:
— Annex A— Annex A,, which contains the definition of the JUMBF box that shall be used for JPEG 360 (e.g.
the structure and definitions of required content of this box);
— Annex B— Annex B,, which defines the JPEG 360 metadata schema that shall be used.
© ISO/IEC 2025 – All rights reserved
ISO/IEC DISPRF 19566-6:2025(en)
The boxes in Annex AAnnex A follow the box-based file construction as specified in ISO/IEC 19566-5.
© ISO/IEC 2025 – All rights reserved
Annex A
(normative)
JPEG 360 Content Type JUMBF box
A.1 General
This annex defines the use of JUMBF elements for JPEG 360; the elements are defined in ISO/IEC 19566--5.
A.2 JUMBF Description box: Type for JPEG 360
Table A.1Table A.1 defines the TYPE field in the JUMBF Description box that corresponds to the Content Type
of the JPEG 360 JUMBF box.
Table A.1 — Definition of JUMBF Description box TYPE of a JPEG 360 JUMBF box
Parameter Value
TYPE 0x785f34b7-5d4b-474c-b89f-
1d99e0e3a8dd
NOTE The UUID above is per
Recommendationrecommendation ITU-
T X.667 |
[2 ]
ISO/IEC 9834-8 [2]. .
A.3 JUMBF Description box: recommended/default label string for JPEG 360
Table A.2Table A.2 defines the LABEL field in the JUMBF Description box that corresponds to the Content Type
of the JPEG 360 JUMBF box.
Table A.2 — Definition of JUMBF Description box LABEL of a JPEG 360 JUMBF box
Parameter Value
LABEL “JPEG360Metadata”
Each instance can have a distinct label or the same label if they are embedded in different files. If there are
multiple JUMBF boxes in the same file, the user is responsible to assign a unique label within the scope of the
file. It is recommended to use the label “JPEG360Metadata”, if this label does not already exist in the current
scope of the file (e.g. a newly created JPEG 360 image).
A.4 Description of the JPEG 360 Content Type JUMBF box
As shown in Figure 14Figure 14,, JPEG 360 metadata is contained within a JUMBF superbox. This JPEG 360
Content Type JUMBF box is internally composed of the following:
— — The JUMBF Description box with the UUID field set to 0x785f34b7-5d4b-474c-b89f-1d99e0e3a8dd.
— — Exactly one XML box which contains the JPEG 360 metadata as specified in Annex BAnnex B.
— — Optional Codestream Content type JUMBF boxes.
— — Optional Embedded File Content type JUMBF boxes.
© ISO/IEC 2025 – All rights reserved
ISO/IEC DISPRF 19566-6:2025(en)
Annex B
(normative)
XML box for JPEG 360
B.1 General
JPEG 360 is designed to balance the need to provide a timely standard while keeping flexibility to grow the
feature sets to more fully accommodate the identified use cases. To meet this need, a basic schema descriptor
and an empty metadata set are provided; which assign default values for the metadata elements.
However, without loss of generality, metadata elements can be set to support a variety of basic
equirectangular images, and a number of viewports.
B.2 Definition of JPEG 360 metadata
The schema elements of the JPEG 360 metadata contain basic properties, which are specified in
Table B.1Table B.1.
Table B.1 — Definition of JPEG 360 schema descriptor elements
JPEG 360 schema descriptor elements Meaning Data type
JPEG360M Schema name string
Split Cells
etadata
JPEG360ImageMetadata Name of subschema for JPEG 360 image string
parameters
JPEG360Version Version # of JPEG image metadata schema; integer
provides a way to identify parameter sets and
associated definitions
MediaType Media type (formerly MIME type) for string
encoded 360 projection image codestream
ProjectionType 360 image projection type string
Stereoscopic Format Shall be as either ‘side-by-side’, ‘top-bottom’, string
or ‘extended’
PhiMin As shown in Clause 6Clause 6,, in units of real
degrees (°)
PhiMax real
ThetaMax real
ThetaMin real
PhiGravity real
ThetaGravity real
CompassPhi real
BoxReference In the case of ‘extended’ Stereoscopic Format, string
it refers to a label for either a JUMBF
Codestream Content type box or a JUMBF
Embedded File Content type box, as per
© ISO/IEC 2025 – All rights reserved
JPEG 360 schema descriptor elements Meaning Data type
ISO/IEC 19566-5. In any other case, the
default value is ‘conventional’.
JPEG360ViewportMetadata Name of subschema for JPEG 360 viewport string
JPEG360Viewport As several viewports can be assigned, a integer
Number unique ID to further differentiate them.
The value ‘0’ is reserved for the default
viewport as specified in Table B.4Table B.4.
ViewportPhi As shown in Clause 6Clause 6,, in units of real
degrees (°)
ViewportTheta As shown in Clause 6Clause 6,, in units of real
degrees (°)
ViewportPhiFOV real
ViewportThetaFOV real
ViewportRoll real
The JPEG 360 metadata elements are required to follow the definitions of the JPEG 360 schema descriptor.
B.3 Overview of metadata representation
[5 ]
This Annexannex defines JPEG 360 schema based on XMP specification (according to ISO 16684-3). ) [5]. The
data source layer implements serialization, deserialization and embedding of metadata with using XMP with
the addition of new tags to express a schema descriptor, and to associate metadata elements with that schema
descriptor, as shown diagrammatically in Figure B.1Figure B.1.
© ISO/IEC 2025 – All rights reserved
ISO/IEC DISPRF 19566-6:2025(en)
Figure B.1 — Logical view of the metadata
The high-level structuring of the JPEG 360 XML box is shown in Table B.2Table B.2.
Table B.2 — High-level structuring of the XML box in a JPEG 360 JUMBF box
XMP Purpose Top level structure and tags Second level structure and tags
Split Cells
framing
Internal counter umf:next-id
Schemas XMP arrays of schemas XMP array of schema names
description storage
umf:schemas umf:descriptors
XMP array of fields/types
umf:fields
Metadata storage XMP array of metadata XMP array of metadata items
umf:metadata umf:set
XMP array of metadata field/values,
references
© ISO/IEC 2025 – All rights reserved
umf:fields, umf:refs
It should be understood that theThe top-level structure ordering is not critical as the structure in
Table B.3Table B.3 is equivalent to the structure in Table B.2Table B.2.
Table B.3 — Alternative high-level structuring of the XML box in a JPEG 360 JUMBF box
XMP Purpose Top level structure and tags Second level structure and tags
Split Cells
framing
Internal counter umf:next-id
Metadata storage XMP array of metadata XMP array of metadata items
umf:metadata umf:set
XMP array of metadata field/values,
references
umf:fields, umf:refs
Schemas XMP arrays of schemas XMP array of schema names
description storage
umf:schemas umf:descriptors
XMP array of fields/types
umf:fields
B.4 Encoding JPEG 360 metadata syntax
The JPEG 360 schema is serialized and stored using a subset of the W3C Resource Description Framework
[3 [3]] [4 [4] ]
(RDF) ), , expressed in XML . (Please note that . (XMP character encoding uses UTF-8, so the text
characters shown in the following monospaced examples maycan be different from their UTF-8 values.)
The XMP statement to frame a metadata stream is:
#
">
">
B.5 Reserved tags
Table B.4Table B.4 defines tags reserved in the JPEG 360 XML Box.
Table B.4 — JPEG 360 reserved XML tags
Reserved tag Description
umf:schemas Tag to define start of schema descriptor. Multiple schema descriptors can be defined, each
with multiple sets of metadata elements.
umf:schema Tag to define a unique name for a schema.
umf:name Tag to define name of schema descriptor elements, or to label sets of metadata elements.
© ISO/IEC 2025 – All rights reserved
ISO/IEC DISPRF 19566-6:2025(en)
Reserved tag Description
umf:type Tag to associate a data type to a name in the schema descriptor.
umf:id Unique id of a metadata set.
umf:next-id Used by implementation(s) to assign a unique id to metadata items.
umf:index Reserved tag. Index into an image sequence that contains the first image associated with
this metadata item (-1 in case of global metadata).
umf:nframes Reserved tag. Number of sequential frames the metadata item is associated with (0 in case
of global metadata).
umf:fields Tag for description of schema fields. Fields of metadata items provide a list of value/name
pairs (in case it is a structure) or an array of values (in case it is an array) or just a single
value.
umf:refs Array of references to other metadata items. Each reference is stored as metadata id.
umf:set Tag to associate fields of metadata elements.
B.6 Values for JPEG 360 metadata
The schema descriptor elements of the JPEG 360 metadata are shown in Table B.5Table B.5.
Table B.5 — JPEG 360 schema descriptor elements
JPEG 360 schema descriptor Schema descriptor
JPEG360Metadata JPEG360Metadata
JPEG360ImageMetadata JPEG360ImageMetadata
JPEG360Version JPEG360Version
integer
MediaType MediaType
string
ProjectionType ProjectionType
string
Stereoscopic Format StereoscopicFormat
string
PhiMin PhiMin
real
PhiMax PhiMax
real
ThetaMax ThetaMax
real
ThetaMin ThetaMin
real
PhiGravity PhiGravity
real
ThetaGravity ThetaGravity
real
CompassPhi CompassPhi
© ISO/IEC 2025 – All rights reserved
JPEG 360 schema descriptor Schema descriptor
real
BoxReference BoxReference
string
JPEG360ViewportMetadata JPEG360ViewportMetadata
JPEG 360ViewportNu JPEG360ViewportNumber
mber
integer
ViewportPhi ViewportPhi
real
ViewportTheta ViewportTheta
real
ViewportPhiFOV ViewportPhiFOV
real
ViewportThetaFOV ViewportThetaiFOV
real
ViewportRoll ViewportRoll
real
B.7 XMP expression of minimum self-describing schema (without metadata
elements)
The JPEG 360 XML box contains data which is structured using XMP; the XMP expression of the schema defines
its properties.
The minimal XMP expression consists of the JPEG 360 schema descriptor and an unpopulated JPEG 360
metadata storage, as shown below. When this minimal expression is provided, a number of default values for
the JPEG 360 image description are assigned; these default values are detailed in Clause B.8Subclause B.8.
xmlns:xmp="http://ns.adobe.com/xap/1.0/"
xmlns:umf="http://ns.intel.com/umf/2.0">
0
JPEG360Metadata
JPEG360ImageMetadata
JPEG360Version
integer
© ISO/IEC 2025 – All rights reserved
ISO/IEC DISPRF 19566-6:2025(en)
MediaType
string
ProjectionType
string
PhiMin
real
PhiMax
real
ThetaMax
real
ThetaMin
real
PhiGravity
real
ThetaGravity
real
CompassPhi
real
BoxReference
string
JPEG360ViewportMetadata
JPEG360ViewportNumber
integer
ViewportPhi
real
© ISO/IEC 2025 – All rights reserved
ViewportTheta
...














Questions, Comments and Discussion
Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.
Loading comments...