Augmented Reality Framework (ARF); AR standards landscape

DGR/ARF-001

General Information

Status
Published
Publication Date
25-Apr-2019
Current Stage
12 - Completion
Due Date
25-Apr-2019
Completion Date
26-Apr-2019
Ref Project
Standard
ETSI GR ARF 001 V1.1.1 (2019-04) - Augmented Reality Framework (ARF); AR standards landscape
English language
41 pages
sale 15% off
Preview
sale 15% off
Preview

Standards Content (Sample)


GROUP REPORT
Augmented Reality Framework (ARF);
AR standards landscape
Disclaimer
The present document has been produced and approved by the Augmented Reality Framework (ARF) ETSI Industry
Specification Group (ISG) and represents the views of those members who participated in this ISG.
It does not necessarily represent the views of the entire ETSI membership.

2 ETSI GR ARF 001 V1.1.1 (2019-04)

Reference
DGR/ARF-001
Keywords
augmented reality, interoperability, platforms

ETSI
650 Route des Lucioles
F-06921 Sophia Antipolis Cedex - FRANCE

Tel.: +33 4 92 94 42 00  Fax: +33 4 93 65 47 16

Siret N° 348 623 562 00017 - NAF 742 C
Association à but non lucratif enregistrée à la
Sous-Préfecture de Grasse (06) N° 7803/88

Important notice
The present document can be downloaded from:
http://www.etsi.org/standards-search
The present document may be made available in electronic versions and/or in print. The content of any electronic and/or
print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any
existing or perceived difference in contents between such versions and/or in print, the prevailing version of an ETSI
deliverable is the one made publicly available in PDF format at www.etsi.org/deliver.
Users of the present document should be aware that the document may be subject to revision or change of status.
Information on the current status of this and other ETSI documents is available at
https://portal.etsi.org/TB/ETSIDeliverableStatus.aspx
If you find errors in the present document, please send your comment to one of the following services:
https://portal.etsi.org/People/CommiteeSupportStaff.aspx
Copyright Notification
No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying
and microfilm except as authorized by written permission of ETSI.
The content of the PDF version shall not be modified without the written authorization of ETSI.
The copyright and the foregoing restriction extend to reproduction in all media.

© ETSI 2019.
All rights reserved.
TM TM TM
DECT , PLUGTESTS , UMTS and the ETSI logo are trademarks of ETSI registered for the benefit of its Members.
TM TM
3GPP and LTE are trademarks of ETSI registered for the benefit of its Members and
of the 3GPP Organizational Partners.
oneM2M™ logo is a trademark of ETSI registered for the benefit of its Members and
of the oneM2M Partners. ®
GSM and the GSM logo are trademarks registered and owned by the GSM Association.
ETSI
3 ETSI GR ARF 001 V1.1.1 (2019-04)
Contents
Intellectual Property Rights . 5
Foreword . 5
Modal verbs terminology . 5
1 Scope . 6
2 References . 6
2.1 Normative references . 6
2.2 Informative references . 6
3 Definition of terms, symbols and abbreviations . 11
3.1 Terms . 11
3.2 Symbols . 11
3.3 Abbreviations . 11
4 Standards for augmented reality . 13
4.1 Introduction . 13
4.2 Augmented Reality Application Format (ARAF) . 13
4.3 Augmented Reality Markup Language (ARML 2.0) . 17
4.4 W3C WebXR . 19
4.5 Mixed and Augmented Reality Reference Model (MAR-RM) . 20
5 Standards for AR-related data representation . 20
5.1 Introduction . 20
5.2 Text . 21
5.3 Image . 21
5.4 Video . 21
5.4.1 MPEG Video standards . 21
5.4.2 HTML5 Video Element . 21
5.5 Audio . 21
5.5.1 MPEG Audio standards . 21
5.5.2 HTML5 Audio Element . 21
5.6 2D graphics objects . 22
5.6.1 Simple Vector Graphics . 22
5.7 3D graphics objects and scenes . 22
5.7.1 Non-compressed data formats. 22
5.7.2 Compressed data formats . 22
5.8 Scene description . 25
5.8.1 VRML . 25
5.8.2 X3D . 25
5.8.3 MPEG-4 Part 11 BIFS (ISO/IEC 14496-16) . 26
5.9 Standards related to sensors and actuators data . 26
5.9.1 MPEG-V . 26
5.9.2 W3C Media Capture and Streams . 26
5.10 Standards related to geographic data . 26
5.10.1 Geography Markup Language (GML) . 26
5.10.2 City Geography Markup Language (CityGML) . 27
5.10.3 Indoor Geography Markup Language (IndoorGML) . 27
5.10.4 Keyhole Markup Language (KML) . 27
5.10.5 Web Map Service (WMS) . 27
5.10.6 W3C GeoLocation . 28
5.10.7 IETF RFC 5870 . 28
5.10.8 IETF RFC 7946 . 28
6 Standards for communication protocols . 28
6.1 Introduction . 28
6.2 TCP, UDP. 28
6.3 RTP, RTCP, RTSP . 29
6.4 WebRTC. 29
ETSI
4 ETSI GR ARF 001 V1.1.1 (2019-04)
7 Standards for Hardware API . 29
7.1 Introduction . 29
7.2 OpenVX™ specifications . 30
7.3 OpenXR™ specifications . 31
7.4 OpenGL™ specifications . 32
7.5 WebGL™ specifications . 32
7.6 Vulkan™ specifications . 32
8 User interaction standards . 32
8.1 Gestures . 32
8.1.1 Introduction. 32
8.1.2 DIN SPEC 91333 . 32
8.1.3 ISO MPEG-U. 33
9 Domain-specific standards . 33
9.1 Building/construction . 33
9.2 Manufacturing and Installations . 34
9.2.1 Introduction. 34
9.2.2 Process sequence . 34
9.2.3 Enterprise asset management . 35
9.2.4 Computerized Maintenance Management System . 36
9.2.5 Other standards . 36
10 Standards for assessing AR quality . 37
10.1 User experience . 37
Annex A: Authors & contributors . 39
Annex B: Bibliography . 40
History . 41

ETSI
5 ETSI GR ARF 001 V1.1.1 (2019-04)
Intellectual Property Rights
Essential patents
IPRs essential or potentially essential to normative deliverables may have been declared to ETSI. The information
pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found
in ETSI SR 000 314: "Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in
respect of ETSI standards", which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web
server (https://ipr.etsi.org/).
Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee
can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web
server) which are, or may be, or may become, essential to the present document.
Trademarks
The present document may include trademarks and/or tradenames which are asserted and/or registered by their owners.
ETSI claims no ownership of these except for any which are indicated as being the property of ETSI, and conveys no
right to use or reproduce any trademark and/or tradename. Mention of those trademarks in the present document does
not constitute an endorsement by ETSI of products, services or organizations associated with those trademarks.
Foreword
This Group Report (GR) has been produced by ETSI Industry Specification Group (ISG) Augmented Reality
Framework (ARF).
The ISG ARF shares the following understanding for Augmented Reality: Augmented Reality (AR) is the ability to mix
in real-time spatially-registered digital content with the real world. The present report provides a snapshot of
standardization efforts conducted by various Standards Development Organizations (SDOs) and other fora, as available
at the time of publishing. While the goal of the present document is to provide an exhaustive list of relevant standards
for AR, this may not be the case with the first version of the present document and updates may be available in future
versions.
Modal verbs terminology
In the present document "should", "should not", "may", "need not", "will", "will not", "can" and "cannot" are to be
interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of provisions).
"must" and "must not" are NOT allowed in ETSI deliverables except when used in direct citation.

ETSI
6 ETSI GR ARF 001 V1.1.1 (2019-04)
1 Scope
The present document aims to identify the role of existing standards relevant to augmented reality and to contribute to
identify any interoperability gaps. The activity summarized in the present document consisted in analysing the
standardization work related to augmented reality in various standards setting organizations. While some of these
standards under review are directly addressing AR as a whole, others are addressing key technological components that
can be useful to increase interoperability of AR applications and services.
2 References
2.1 Normative references
Normative references are not applicable in the present document.
2.2 Informative references
References are either specific (identified by date of publication and/or edition number or version number) or
non-specific. For specific references, only the cited version applies. For non-specific references, the latest version of the
referenced document (including any amendments) applies.
NOTE: While any hyperlinks included in this clause were valid at the time of publication, ETSI cannot guarantee
their long term validity.
The following referenced documents are not necessary for the application of the present document but they assist the
user with regard to a particular subject area.
[i.1] ISO/IEC 18039:2019: "Mixed and augmented reality (MAR) reference model".
NOTE: Available at https://www.iso.org/standard/30824.html.
[i.2] ISO/IEC 23000-13:2017: "Multimedia application format (MPEG-A) -- Part 13: Augmented
reality application format" a.k.a ARAF.
NOTE: Available at https://www.iso.org/standard/69465.html.
[i.3] ISO/IEC 14496-1:2010: "Coding of audio-visual objects -- Part 1: Systems".
NOTE: Available at https://www.iso.org/standard/55688.html.
[i.4] ISO/IEC 14496-2:2004: "Coding of audio-visual objects -- Part 2: Visual".
NOTE: Available at https://www.iso.org/standard/39259.html.
[i.5] ISO/IEC 14496-3:2009: "Coding of audio-visual objects -- Part 3: Audio", a.k.a. AAC.
NOTE: Available at https://www.iso.org/standard/53943.html.
[i.6] ISO/IEC 14496-10:2014: "Coding of audio-visual objects -- Part 10: Advanced Video Coding",
a.k.a AVC.
NOTE: Available at https://www.iso.org/standard/66069.html.
[i.7] ISO/IEC 14496-11:2015: "Scene description and application engine", a.k.a. BIFS.
NOTE: Available at https://www.iso.org/standard/63548.html.
[i.8] ISO/IEC 14496-16 2011: "Coding of audio-visual objects -- Part 16: Animation Framework
eXtension (AFX)".
NOTE: Available at https://www.iso.org/standard/57367.html.
ETSI
7 ETSI GR ARF 001 V1.1.1 (2019-04)
[i.9] ISO/IEC 14772-1:1997: "The Virtual Reality Modeling Language", a.k.a VRML.
NOTE: Available at https://www.iso.org/standard/25508.html.
[i.10] ISO/IEC 19775:2013: "Extensible 3D (X3D)".
NOTE: Available at https://www.iso.org/standard/60760.html.
[i.11] ISO/IEC 23005-5:2016: "Media context and control", a.k.a. MPEG-V.
NOTE: Available at https://www.iso.org/standard/65398.html.
[i.12] ISO 19005-3:2012: "Document management -- Electronic document file format for long-term
preservation -- Part 3: Use of ISO 32000-1 with support for embedded files (PDF/A-3)", a.k.a
PDF.
NOTE: Available at https://www.iso.org/standard/57229.html.
[i.13] ISO/IEC 10918:1997: "Digital compression and coding of continuous-tone still images", a.k.a.
JPEG.
NOTE: Available at https://www.iso.org/standard/25037.html.
[i.14] ISO/IEC 15948 2004: "Portable Network Graphics (PNG)".
NOTE: Available at https://www.iso.org/standard/29581.html.
[i.15] ISO/IEC 15444-2:2004: "JPEG 2000 image coding system".
NOTE: Available at https://www.iso.org/standard/33160.html.
[i.16] ISO/IEC 23008-2:2017: "High efficiency coding and media delivery in heterogeneous
environments", a.k.a HEVC.
NOTE: Available at https://www.iso.org/standard/69668.html.
[i.17] ISO/IEC 23090:2019: "Coded representation of immersive media".
NOTE: Available at https://www.iso.org/standard/73310.html.
[i.18] ISO/IEC 11172-3:1993: "Coding of moving pictures and associated audio for digital storage media
at up to about 1,5 Mbit/s", a.k.a mp3.
NOTE: Available at https://www.iso.org/standard/22412.html.
[i.19] ISO/IEC 23003:2018: "MPEG audio technologies -- Part 2: Spatial Audio Object Coding
(SAOC)".
NOTE: Available at https://www.iso.org/standard/73122.html.
[i.20] ISO/IEC 23090-5 (under development): "Coded representation of immersive media -- Part 5:
Video-based Point Cloud Compression".
NOTE: Available at https://www.iso.org/standard/73025.html.
[i.21] ISO/IEC 23090-9 (under development): "Coded representation of immersive media -- Part 5:
Graphics-based Point Cloud Compression".
[i.22] ISO/IEC 30113-1:2015: "User interface -- Gesture-based interfaces across devices and methods".
NOTE: Available at https://www.iso.org/standard/53233.html.
[i.23] ISO/IEC 23007-2:2012: "Rich media user interfaces -- Part 2: Advanced user interaction (AUI)
interfaces".
NOTE: Available at https://www.iso.org/standard/59241.html.
ETSI
8 ETSI GR ARF 001 V1.1.1 (2019-04)
[i.24] ISO 55000:2014: "Asset management -- Overview, principles and terminology".
NOTE: Available at https://www.iso.org/standard/55088.html.
[i.25] ISO 55001:2014: "Asset management -- Management systems - Requirements".
NOTE: Available at https://www.iso.org/standard/55089.html.
[i.26] ISO 55002:2018: "Asset management -- Management systems -- Guidelines for the application of
ISO 55001".
NOTE: Available at https://www.iso.org/standard/70402.html.
[i.27] ISO 15926-1:2004: "Industrial automation systems and integration -- Integration of life-cycle data
for process plants including oil and gas production facilities -- Part 1: Overview and fundamental
principles".
NOTE: Available at https://www.iso.org/standard/29556.html.
[i.28] ISO 15926-2:2003: "Industrial automation systems and integration -- Integration of life-cycle data
for process plants including oil and gas production facilities -- Part 2: Data model".
NOTE: Available at https://www.iso.org/standard/29557.html.
[i.29] ISO 9241:1983: "Series of standards related to Ergonomics of human-system interaction".
NOTE: Available at https://www.iso.org/committee/53372.html.
[i.30] ISO 9126:2001: "Software engineering -- Product quality -- Part 1: Quality model".
NOTE: Available at https://www.iso.org/standard/22749.html.
[i.31] ISO/IEC 25060:2010: "Systems and software engineering -- Systems and software product Quality
Requirements and Evaluation (SQuaRE) -- Common Industry Format (CIF) for usability: General
framework for usability-related information".
NOTE: Available at https://www.iso.org/standard/35786.html.
[i.32] ISO/IEC 25062:2006: "Software engineering -- Software product Quality Requirements and
Evaluation (SQuaRE) -- Common Industry Format (CIF) for usability test reports".
NOTE: Available at https://www.iso.org/standard/43046.html.
[i.33] ISO/IEC 25063:2014: "Systems and software engineering -- Systems and software product Quality
Requirements and Evaluation (SQuaRE) -- Common Industry Format (CIF) for usability: Context
of use description".
NOTE: Available at https://www.iso.org/standard/35789.html.
[i.34] ISO/IEC 25064:2013: "Systems and software engineering -- Software product Quality
Requirements and Evaluation (SQuaRE) -- Common Industry Format (CIF) for usability: User
needs report".
NOTE: Available at https://www.iso.org/standard/35790.html.
[i.35] ISO 14915:2002: "Software ergonomics for multimedia user interfaces -- Part 1: Design principles
and framework".
NOTE: Available at, https://www.iso.org/standard/25578.html.
[i.36] ISO/IEC 40500:2012: "W3C Web Content Accessibility Guidelines (WCAG) 2.0".
NOTE: Available at https://www.iso.org/standard/58625.html.
[i.37] ISO/IEC 15938-13:2015: "Compact Descriptors for Visual Search".
NOTE: Available at https://www.iso.org/standard/65393.html.
ETSI
9 ETSI GR ARF 001 V1.1.1 (2019-04)
[i.38] ISO/IEC 16739:2018: "Industry Foundation Classes (IFC) for data sharing in the construction and
facility management industries -- Part 1: Data schema".
NOTE: Available at https://www.iso.org/standard/70303.html.
[i.39] Recommendation ITU-T H.264 (2017): "Advanced video coding for generic audiovisual services".
NOTE: Available at https://www.itu.int/rec/T-REC-H.264-201704-I/en.
[i.40] Recommendation ITU-T H.265 (2018): "High efficiency video coding".
NOTE: Available at https://www.itu.int/rec/T-REC-H.265-201802-I/en.
[i.41] IETF RFC 7159 (2014): "The JavaScript Object Notation (JSON) Data Interchange Format".
NOTE: Available at https://tools.ietf.org/html/rfc7159.
[i.42] IETF RFC 1889 (1996): "RTP: A Transport Protocol for Real-Time Applications".
NOTE: Available at https://tools.ietf.org/html/rfc1889.
[i.43] IETF RFC 3550 (2003): "RTP: A Transport Protocol for Real-Time Applications".
NOTE: Available at https://tools.ietf.org/html/rfc3550.
[i.44] IETF RFC 2326 (1998): "Real Time Streaming Protocol (RTSP)".
NOTE: Available at https://www.ietf.org/rfc/rfc2326.txt.
[i.45] IETF RFC 7826 (2016): "Real-Time Streaming Protocol Version 2.0".
NOTE: Available at https://tools.ietf.org/html/rfc7826.
[i.46] IETF RFC 5870 (2010): "A Uniform Resource Identifier for Geographic Locations ('geo' URI)".
NOTE: Available at https://tools.ietf.org/html/rfc5870.
[i.47] IETF RFC 7946 (2016): "The GeoJSON Format".
NOTE: Available at https://tools.ietf.org/html/rfc7946.
[i.48] IETF RFC 793 (1981): "Transmission Control Protocol".
NOTE: Available at https://tools.ietf.org/html/rfc793.
[i.49] OGC ARML (2015): "Augmented Reality Markup Language 2.0 (ARML 2.0)".
NOTE: Available at http://docs.opengeospatial.org/is/12-132r4/12-132r4.html.
[i.50] OGC GML (2007): "OpenGIS® Geography Markup Language (GML) Encoding standard".
NOTE: Available at https://www.opengeospatial.org/standards/gml.
[i.51] OGC CityGML (2012): "OGC City Geography Markup Language (CityGML) Encoding
Standard".
NOTE: Available at https://www.opengeospatial.org/standards/citygml.
[i.52] OGC IndoorGML (2018): "OGC® IndoorGML Encoding Standard".
NOTE: Available at https://www.opengeospatial.org/standards/indoorgml.
[i.53] OGC KML (2015): "OGC KML 2.3".
NOTE: Available at https://www.opengeospatial.org/standards/kml.
[i.54] OGC WMS (2006): "OpenGIS® Web Map Server Implementation Specification".
NOTE: Available at https://www.opengeospatial.org/standards/wms.
ETSI
10 ETSI GR ARF 001 V1.1.1 (2019-04)
[i.55] W3C WebXR (2019): "WebXR Device API".
NOTE: Available at https://www.w3.org/TR/webxr/.
[i.56] W3C Recommendation 16 August 2006: "Extensible Markup Language (XML) 1.1 (Second
Edition)".
NOTE: Available at https://www.w3.org/TR/2006/REC-xml11-20060816/.
[i.57] W3C Working Draft 18 October 2018: "HTML 5.3".
NOTE: Available at https://www.w3.org/TR/html53/.
[i.58] W3C Candidate Recommendation 27 September 2018: "WebRTC 1.0: Real-time Communication
Between Browsers".
NOTE: Available at https://www.w3.org/TR/webrtc/.
[i.59] W3C Recommendation 16 August 2011: "Scalable Vector Graphics (SVG) 1.1 (Second Edition)".
NOTE: Available at https://www.w3.org/TR/SVG11/.
[i.60] W3C Candidate Recommendation 3 October 2017: "Media Capture and Streams".
NOTE: Available at https://www.w3.org/TR/mediacapture-streams/.
[i.61] W3C Recommendation 8 November 2016: "Geolocation API Specification 2nd Edition".
NOTE: Available at https://www.w3.org/TR/geolocation-API/.
[i.62] Khronos, COLLADA (2008): "COLLADA Digital Asset Schema Release 1.4.1 Specification (2nd
Edition)".
NOTE: Available at https://www.khronos.org/files/collada_spec_1_4.pdf.
[i.63] Khronos, glTF (2017): "GL Transmission Format (glTF) (Version 2.0)".
NOTE: Available at https://github.com/KhronosGroup/glTF/tree/master/specification/2.0.
[i.64] Khronos, OpenVX (2017): "The OpenVX Specification".
NOTE: Available at https://www.khronos.org/registry/OpenVX/specs/1.2/html/index.html.
[i.65] Khronos, OpenXR, under development: "OpenXR Overview".
NOTE: Available at https://www.khronos.org/openxr.
[i.66] Khronos, OpenGL (2019): "The OpenGL© Graphics System, OpenGL 4.6 Core Profile".
NOTE: Available at https://www.khronos.org/registry/OpenGL/specs/gl/glspec46.core.pdf.
[i.67] Khronos, WebGL (2018): "WebGL 2.0 Specification".
NOTE: Available at https://www.khronos.org/registry/webgl/specs/latest/2.0/.
[i.68] Khronos, Vulkan (2019): "Vulkan 1.1 API Specifications".
NOTE: Available at https://www.khronos.org/registry/vulkan/#apispecs.
[i.69] DIN SPEC 91333 (2016): "Contactless gesture control for human-system interaction".
NOTE: Available at https://www.beuth.de/en/technical-rule/din-spec-91333/255376761.
[i.70] DIN EN ISO 9241-960 (2018): "Ergonomics of human-system interaction - Part 960: Framework
and guidance for gesture interactions (ISO 9241-960:2017)".
NOTE: Available at https://www.beuth.de/en/standard/din-en-iso-9241-960/272433976.
ETSI
11 ETSI GR ARF 001 V1.1.1 (2019-04)
[i.71] ASTM E2132 - 17 (2017): "Standard Practice for Inventory Verification: Electronic and Physical
Inventory of Assets".
NOTE: Available at https://www.astm.org/Standards/E2132.htm.
[i.72] ASTM E3035 - 15 2015: "Standard Classification for Facility Asset Component Tracking System
(FACTS)".
NOTE: Available at https://www.astm.org/Standards/E3035.htm.
[i.73] ASTM E2499 - 18 2018: "Standard Practice for Classification of Asset Location Information".
NOTE: Available at https://www.astm.org/Standards/E2499.htm.
[i.74] MIMOSA CCOM 4.0.0 2016: "Open Standards for Physical Asset Management".
NOTE: Available at http://www.mimosa.org/specifications/ccom-4-0-0/.
[i.75] MIMOSA OSA-EAI 2014: "Open Standards for Physical Asset Management".
NOTE: Available at http://www.mimosa.org/mimosa-osa-eai/.
[i.76] Recommendation ITU-T T.81: "Information technology - Digital compression and coding of
continuous-tone still images - Requirements and guidelines".
3 Definition of terms, symbols and abbreviations
3.1 Terms
Void.
3.2 Symbols
Void.
3.3 Abbreviations
For the purposes of the present document, the following abbreviations apply:
AAC Advanced Audio Coding
ADAS Advanced Driver Assistance Systems
AFX Animation Framework eXtention
API Application Programming Interface
APM Augmented Printed Material
AR Augmented Reality
ARAF Augmented Reality Applications Format
ARF Augmented Reality Framework
ARML Augmented Reality Markup Language
ASCII American Standard Code for Information Interchange
ASTM American Society for Testing and Materials
AUI Advanced User Interface
AVC Advanced Video Coding
BBA Bone Based Animation
BIFS Binary Format for Scenes
bpv bits per vertex
CAD Computer Aided Design
CAVE Cave Automatic Virtual Environment
CCOM Common Collaborative Object Model
CDVS Compact Descriptors for Visual Search
ETSI
12 ETSI GR ARF 001 V1.1.1 (2019-04)
CI Coordinate Interpolator
CIF Common Industry Format
CMMS Computerized Maintenance Management System
CNN Convolutional Neural Network
CPU Central Processing Unit
CUDA Compute Unified Device Architecture
DAE Digital Asset Exchange
DB Data Base
DIN Deutsches Institut für Normung
DOM Document Object Model
EAM Enterprise Asset Management
ERP Enterprise Resource Planning
FAMC Frame-based Animated Mesh Compression
FBA Face and Body Animation
FBX FilmBoX - data format
GIS Geographic Information System
glTF GL Transmission Format
GML Geography Markup Language
GPS Global Positioning System
GR Group Report
HEVC High Efficiency Video Coding
HMD Head Mounted Display
HTML HyperText Markup Language
HTTP Hypertext Transfer Protocol
IEC International Electrotechnical Commission
IETF Internet Engineering Task Force
IFC Industry Foundation Classes
IP Internet Protocol
ISO International Organization for Standardization
ITU-T International Telecommunication Union - Telecommunication Standardization Sector
JPEG Joint Photographic Experts Group
JSON JavaScript Object Notation
KARML Keyhole Augmented Reality Application Format
KML Keyhole Markup Language
LOD Level of Details
LUT Lookup Table
MAR Mixed and Augmented Reality
MAR-RM Mixed and Augmented Reality Reference Model
MPEG Motion Picture Expert Group
OGC Open Geographic Consortium
OI Orientation Interpolation
OPC Open Plateforme Communication
PAS Publicly Available Specification
PCC Point Cloud Compression
PFS Progressive Forest Split
PI Position Interpolation
PM Progression Mesh
PNG Portable Network Graphics
PROTO VRML Prototype
QR Quick Response
RFC Requests For Comments
RTCP Real Time Control Protocol
RTP Real Time Protocol
RTSP Real Time Streaming Protocol
SCTP Stream Control Transmission Protocol
SDK Software Development Kit
SLAM Synchronous Location and Mapping
SPEC Specification
SVG Scalable Vector Graphics
TCP Transmission Control Protocol
TFAN Triangle Fan
TS Topological Surgery
ETSI
13 ETSI GR ARF 001 V1.1.1 (2019-04)
UDP User Datagram Protocol
URI Uniform Resource Identifier
URL Uniform Resource Locator
VR Virtual Reality
VRML Virtual Reality Markup Language
WFS Web Feature Service
WMS Web Map Service
WSS Wavelet Subdivision Surface
XML eXtensible Markup Language
XR eXtended Reality
4 Standards for augmented reality
4.1 Introduction
This clause describes the existing technical specifications published by various SDOs and directly addressing AR
applications and services. At the time of publishing the present document, the following SDOs producing AR specific
standards were identified:
• International Organization for Standardization (ISO/IEC) JTC1 https://www.iso.org/isoiec-jtc-1.html.
• Open Geospatial Consortium (OGC) http://www.opengeospatial.org/.
• World Wide Web Consortium (W3C) https://www.w3.org/.
Table 1 indicates the AR related standards produced by these organizations.
Table 1: AR related standards
ISO/IEC ARAF - Augmented ISO IEC 23000-13 [i.2] https://www.iso.org/standard/69465.html
JTC1 Reality Application
Format
MAR-RM Mixed and ISO/IEC 18039 [i.1] https://www.iso.org/standard/30824.html
Augmented Reality
Reference Model
OGC ARML Augmented Reality https://www.opengeospatial.org/standards/arml
Markup Language [i.2]
W3C WebXR [i.2] https://www.w3.org/blog/tags/webxr/

In the following clauses, each of these standards are briefly introduced.
4.2 Augmented Reality Application Format (ARAF)
Augmented Reality Application Format (ISO/IEC 23000-13 [i.2]) focuses on the data format used to provide an
augmented reality presentation and not on the client or server procedures. ARAF specifies scene description elements
for representing AR content, mechanisms to connect to local and remote sensors and actuators, mechanisms to integrate
compressed media (image, audio, video, graphics), mechanisms to connect to remote resources such as maps and
compressed media. ARAF was developed by MPEG, the same technical committee that created mp3 for audio, AVC
and HEVC for video. MPEG already provided data type representations for all kinds of media, from static image, video,
audio to 3D graphics and complex dynamic scenes. Additionally, MPEG developed a set of standards related to sensors
and actuators. By bringing these two components together into an application format called ARAF, MPEG enables
interoperability when used to build AR applications and services.
Table 2 synthesizes the set of existing MPEG tools that are used for addressing AR applications.
ETSI
14 ETSI GR ARF 001 V1.1.1 (2019-04)
Table 2: MPEG tools addressing AR applications
Feature Standard
Image (video) & audio capturing MPEG-V published as ISO/IEC 23005-5 [i.11]
Capture real camera position and MPEG-V published as ISO/IEC 23005-5 [i.11]
orientation CDVS published as ISO/IEC 15938-13 [i.37]
Detection and tracking of visual objects CDVS published as ISO/IEC 15938-13 [i.37]
Representation and transmission of media MPEG-4 Systems published as ISO/IEC 14496-1 [i.3]
assets MPEG-4 Visual published as ISO/IEC 14496-2 [i.4]
MPEG-4 Audio published as ISO/IEC 14496-3 [i.5]
MPEG-4 AVC published as ISO/IEC 14496-10 [i.6]
MPEG-4 BIFS published as ISO/IEC 14496-11 [i.7]
MPEG-4 AFX published as ISO/IEC 14496-16 [i.8]
Image & video rendering as a background MPEG-4 Systems published as ISO/IEC 14496-1 [i.3]
Control the virtual camera parameters MPEG-4 BIFS published as ISO/IEC 14496-11 [i.7]
Synthetic content representation MPEG-4 BIFS published as ISO/IEC 14496-11 [i.7]
MPEG-4 AFX published as ISO/IEC 14496-16 [i.8]

Therefore, by using these MPEG technologies it is possible to envision a generic AR browser. Instead of downloading a
new and heavy application for every context, the users could simply point to an URL. The ARAF browser retrieves a
scenario from the Internet, starts the video acquisition, tracks images and objects, recognizes them from visual
signatures, computes the camera position, downloads 2D/3D graphics, composes a new scene, gets input from various
sensors and constantly adapts the interaction in order to offer an optimal ARAF experience. Instead of developing a new
application for each use case and smart phone/glasses platform, the industry could rely on MPEG-compliant Authoring
Tools and MPEG-compliant browsers to reach a maximum number of customers. The schema in figure 1 presents the
MPEG vision on AR.
Figure 1: MPEG strategy on addressing AR [i.2]
The creation of ARAF content is fast and easy because the AR experience creator does not necessarily need to have any
ARAF prior knowledge. Instead of designing the application from scratch, he can choose to create generic template-
based AR applications in a matter of hours by using authoring tools. The authoring tool can be a user-friendly web
interface where the user is allowed to design and define the behaviour of his AR experience. The generated result, an
XML representation, and the media which is linked into the application, are compressed together in a single file which
contains the full ARAF content. At the end of the chain, the content is downloaded and consumed by an ARAF
browser.
On the other hand, the ARAF model provides full control to the designers who want to create brand new AR
experiences. If one decides to create an AR experience from scratch, one can simply start writing the ARAF content by
oneself, as one would create an HTML page.
ETSI
15 ETSI GR ARF 001 V1.1.1 (2019-04)
Some key elements differentiating ARAF from other available technologies are the following:
1) it can be easily used by designers without a programming background;
2) it is an open standard provided by ISO; and
3) the software implementation of ARAF browser is available as open source.
Figure 2 presents a generic AR architecture including ARAF. The ARAF content, available as a file or stream, is
interpreted by a device, called ARAF device. The elements (nodes) of the ARAF scene point to different sources of
multimedia content such as 2D/3D image, audio, video and graphics, and sensor/sensory information sources/sinks that
are either local or remote.
Figure 2: The ARAF context
In order to design a multimedia scene ARAF extends MPEG-4 Part 11 BIFS (ISO/IEC 14496-11 [i.7]) which is based
on VRML97 (ISO/IEC 14772-1 [i.9]). About two hundred elements are standardized in MPEG-4 BIFS
) and VRML, giving the possibility of a content creator to create any AR experience that he can
(ISO/IEC 14496-11 [i.7]
imagine.
MPEG-4 Part 11 describes a scene with a hierarchical structure that can be represented as a graph. The nodes (elements)
of the graph build up various types of objects, such as audio, video, image, graphics and text. Furthermore, to ensure
flexibility, a new user-defined type of node can be defined on demand by using the prototyping (PROTO) method.
In general, the nodes expose a set of parameters, through which aspects of their appearance and behaviour can be
controlled. By setting these values, the content creators (designers) have the possibility to force a scene reconstruction
or scene update at clients' terminals to adhere to their intention in a predefined manner. In more advanced scenarios, the
structure of the BIFS nodes is not necessarily static; nodes can be added or removed from the scene graph arbitrarily
and dynamically.
Certain types of nodes called sensors can interact with users and trigger appropriate actions, which are transmitted to
other nodes through a routing mechanism, causing changes in the state of these receiving nodes. They are one of the
bases for dynamic behaviour of multimedia content supported by MPEG-4 Part 11 (ISO/IEC 14496-11 [i.7]).
The maximum flexibility in the programmable feature of MPEG-4 scene is carried out with the Script node. By using
the routing mechanism to a Script node, an associated function can be called. The implementation of this function is
designed by the MAR experience creator, i.e. the creator can freely process some computations, and then output the
result to the scene graph. In other words, by using the scripting feature of ARAF, the MAR experience creator can
achieve a set of functionalities which are not supported natively by the nodes that are already part of the technology.
This way the scene can be programmatically updated based on time events, user actions, or a mixture of the two. It is
therefore provided the possibility of manipulating everything that is defined in the scene graph, at any moment in time
and based on events that are triggered automatically or by the end user.
ETSI
16 ETSI GR ARF 001 V1.1.1 (2019-04)
ARAF supports the definition and reusability of complex elements by using the MPEG-4 PROTO mechanism. The
PROTO statement creates new nodes by defining a configurable object prototype; it can integrate any other node from
the scene graph and it basically provides a method of creating new functionality that can be re-used within the scene
graph.
Furthermore, a new functionality (a PROTO) that was designed by a MAR experience creator can be re-used in
different scene graphs by using the EXTERNPROTO mechanism. This feature allows an object prototype to be
described in a separate file and imported from there into any other scene graph.
The data that is captured from sensors or used to command actuators in ARAF is based on ISO/IEC 23005-5 [i.11], data
formats for interaction devices (MPEG-V published in ISO/IEC 23005-5 [i.11]). MPEG-V
...

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.

Loading comments...