Information technology — Multimedia content description interface — Part 3: Visual — Amendment 2: Perceptual 3D Shape Descriptor

Technologies de l'information — Interface de description du contenu multimédia — Partie 3: Visuel — Amendement 2: Descripteur sensoriel de forme 3D

General Information

Status
Published
Publication Date
28-Mar-2006
Current Stage
6060 - International Standard published
Due Date
21-Jan-2008
Completion Date
29-Mar-2006
Ref Project

Relations

Buy Standard

Standard
ISO/IEC 15938-3:2002/Amd 2:2006 - Perceptual 3D Shape Descriptor
English language
10 pages
sale 15% off
Preview
sale 15% off
Preview

Standards Content (Sample)

INTERNATIONAL ISO/IEC
STANDARD 15938-3
First edition
2002-02-15
AMENDMENT 2
2006-04-01


Information technology — Multimedia
content description interface —
Part 3:
Visual
AMENDMENT 2: Perceptual 3D Shape
Descriptor
Technologies de l'information — Interface de description du contenu
multimédia —
Partie 3: Visuel
AMENDEMENT 2: Descripteur sensoriel de forme 3D




Reference number
ISO/IEC 15938-3:2002/Amd.2:2006(E)
©
ISO/IEC 2006

---------------------- Page: 1 ----------------------
ISO/IEC 15938-3:2002/Amd.2:2006(E)
PDF disclaimer
This PDF file may contain embedded typefaces. In accordance with Adobe's licensing policy, this file may be printed or viewed but
shall not be edited unless the typefaces which are embedded are licensed to and installed on the computer performing the editing. In
downloading this file, parties accept therein the responsibility of not infringing Adobe's licensing policy. The ISO Central Secretariat
accepts no liability in this area.
Adobe is a trademark of Adobe Systems Incorporated.
Details of the software products used to create this PDF file can be found in the General Info relative to the file; the PDF-creation
parameters were optimized for printing. Every care has been taken to ensure that the file is suitable for use by ISO member bodies. In
the unlikely event that a problem relating to it is found, please inform the Central Secretariat at the address given below.


©  ISO/IEC 2006
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means,
electronic or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or
ISO's member body in the country of the requester.
ISO copyright office
Case postale 56 • CH-1211 Geneva 20
Tel. + 41 22 749 01 11
Fax + 41 22 749 09 47
E-mail copyright@iso.org
Web www.iso.org
Published in Switzerland

ii © ISO/IEC 2006 – All rights reserved

---------------------- Page: 2 ----------------------
ISO/IEC 15938-3:2002/Amd.2:2006(E)
Foreword
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical
Commission) form the specialized system for worldwide standardization. National bodies that are members of
ISO or IEC participate in the development of International Standards through technical committees
established by the respective organization to deal with particular fields of technical activity. ISO and IEC
technical committees collaborate in fields of mutual interest. Other international organizations, governmental
and non-governmental, in liaison with ISO and IEC, also take part in the work. In the field of information
technology, ISO and IEC have established a joint technical committee, ISO/IEC JTC 1.
International Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part 2.
The main task of the joint technical committee is to prepare International Standards. Draft International
Standards adopted by the joint technical committee are circulated to national bodies for voting. Publication as
an International Standard requires approval by at least 75 % of the national bodies casting a vote.
Attention is drawn to the possibility that some of the elements of this document may be the subject of patent
rights. ISO and IEC shall not be held responsible for identifying any or all such patent rights.
Amendment 2 to ISO/IEC 15938-3:2002 was prepared by Joint Technical Committee ISO/IEC JTC 1,
Information technology, Subcommittee SC 29, Coding of audio, picture, multimedia and hypermedia
information.
This document preserves the sectioning of ISO/IEC 15938-3. The text and figures given in this Amendment
are currently being considered as additions and/or modifications to those corresponding sections in
ISO/IEC 15938-3.

© ISO/IEC 2006 – All rights reserved iii

---------------------- Page: 3 ----------------------
ISO/IEC 15938-3:2002/Amd.2:2006(E)
Information technology — Multimedia content description
interface —
Part 3:
Visual
AMENDMENT 2: Perceptual 3D Shape Descriptor
Replace subclause 1.2 with:
1.2 Overview of Visual Description Tools
This part of ISO/IEC 15938 specifies tools for description of visual content, including still images, video and
3D models. These tools are defined by their syntax in DDL and binary representations and semantics
associated with the syntactic elements. They enable description of the visual features of the visual material,
such as color, texture, shape and motion, as well as localization of the described objects in the image or video
sequence. An overview of the visual description tools is shown in Figure 1.
The basic structure description tools include five supporting tools of visual descriptions defined in clauses 6-11.
They are categorized into two groups, descriptor containers and basic supporting tools. The former consists of
three datatypes, GridLayout providing efficient representations of visual features on grids, TimeSeries
representing temporal arrays of several descriptions, GofGopFeature describes representative descriptions
over video segment, and MultipleView describing a 3D object using several pictures captured from different
view angles. The latter contains two tools, Spatial2DcoordinateSystem used to specify the 2D coordinate
system and TemporalInterpolation indicating the interpolation method between two samples on a time axis.
The remaining description tools, except for the FaceRecognition descriptor, are associated with visual
features and are grouped into five feature categories: Color, Texture, Shape, Motion and Localization.
The color description tools include five color descriptors to represent different aspects of color features:
representative colors (DominantColor), color distribution (ScalableColor), spatial distribution of colors
(ColorLayout and ColorStructure) and perceptual feeling of illumination color (ColorTemperature). It also
contains three supporting tools, ColorSpace and ColorQuantization used in DominantColor and
IlluminationInvariantColor to extend four color descriptors, DominantColor, ScalableColor, ColorLayout and
ColorStructure, to support illumination invariant similarity matching. An extension of ScalableColor to a group
of frames or pictures (GoFGoPColor) is also included in this group. All the color descriptors can be extracted
from arbitrarily shaped regions.
The texture description tools facilitate browsing (TextureBrowsing) and similarity retrieval
(HomogeneousTexture and EdgeHistogram) using the texture of a still or moving image region. All the texture
descriptors can be extracted from arbitrarily shaped regions.
The shape description tools include two descriptors that characterize different shape features of a 2D object or
region. The RegionShape descriptor captures the distribution of all pixels within a region and the Contour
Shape descriptor characterizes the shape properties of the contour of an object. The extension of
RegionShape is also defined as ShapeVariation to describe temporal variation of shape over video segment.
The Shape3D and Perceptual 3D Shape descriptors provide 3-dimensional shape information; the former
represents an intrinsic shape characterization of 3D mesh models, and the latter represents part-based
representation of a 3D object.
The motion description tools include four descriptors that characterize various aspects of motion. The
CameraMotion descriptor specifies a set of basic camera operations such as, for example, panning and tilting.
© ISO/IEC 2006 – All rights reserved 1

---------------------- Page: 4 ----------------------
ISO/IEC 15938-3:2002/Amd.2:2006(E)
The motion of a key point (pixel) from a moving object or region can be characterized by the MotionTrajectory
descriptor. The ParametricMotion descriptor characterizes an evolution of an arbitrarily shaped region over
time in terms of a 2D geometric transformation. Finally, the MotionActivity descriptor captures the pace of the
motion in the sequence, as perceived by the viewer. All motion descriptors except for CameraMotion can be
extracted from arbitrarily shaped regions.
The localization description tools can be used to indicate regions of interest in the spatial (RegionLocator) and
spatio-temporal (SpatioTemporalLocator) domains.
The FaceRecognition descriptor and the Advance Face Recognition descriptor are not associated with any
particular visual feature and can be used to describe a human face for applications requiring the matching and
retrieval of face images.
Basic Structures
Descriptor Containers Basic Supporting Tools
GridLayout TemporalInterpolation
TimeSeries Spatial2DcoordinateSystem
GofGopFeature
MultipleView
Visual Features
Color
Color Feature Descriptors
Color Supporting Tools
DominantColor ColorSpace
ScalableColor ColorQuantization
ColorLayout IlluminationInvariantColor
ColorStructure
GofGopColor
ColorTemperature
Texture Shape Motion
HomogeneousTexture RegionShape CameraMotion
TextureBrowsing ContourShape MotionTrajectory
EdgeHistogram ParametricMotion
ShapeVariation
MotionActivity
Shape3D
Perceptual 3D Shape
Localization
RegionLocator
SpatioTemporalLocator
Other
FaceRecognition
AdvancedFaceRecognition

Figure 1 — Overview of Visual Description Tools
2 © ISO/IEC 2006 – All rights reserved

---------------------- Page: 5 ----------------------
ISO/IEC 15938-3:2002/Amd.2:2006(E)
Extend the definitions in clause 2:
2.4 adjacency matrix
matrix with rows and columns labeled by graph vertices, with a 1 or 0 in position (i, j) according to whether i-th
and j-th nodes are adjacent or not.

Replace subclause 4.2.2 with:
4.2.2 Generic binary representation
The use of the video-specific syntax is signalled using the codec configuration mechanism defined in
ISO/IEC 15938-1. And the following classification scheme is defined for this purpose.


 MPEG7CameraMotion
 ISO/IEC 15938-3 Binary Camera Motion
  Codec


 MPEG7ColorLayout
 ISO/IEC 15938-3 Binary Color Layout
  Codec


 MPEG7ColorQuantization
 ISO/IEC 15938-3 Binary Color Quantization
  Codec


 MPEG7ColorSpace
 ISO/IEC 15938-3 Binary Color Space
  Codec


 MPEG7ColorStructure
 ISO/IEC 15938-3 Binary Color Structure
  Codec


 MPEG7ContourShape
 ISO/IEC 15938-3 Binary Contour Shape
  Codec


 MPEG7DominantColor
 ISO/IEC 15938-3 Binary Dominant Color
  Codec


 MPEG7EdgeHistogram
 ISO/IEC 15938-3 Binary Edge Histogram
  Codec


 MPEG7FaceRecognition
© ISO/IEC 2006 – All rights reserved 3

---------------------- Page: 6 ----------------------
ISO/IEC 15938-3:2002/Amd.2:2006(E)
 ISO/IEC 15938-3 Binary Face Recognition
  Codec


 MPEG7FoFGoPColor
 ISO/IEC 15938-3 Binary GoFGoP Color
  Codec


 MPEG7GridLayout
 ISO/IEC 15938-3 Binary Grid Layout
  Codec


 MPEG7HomogeneousTexture
 ISO/IEC 15938-3 Binary Homogeneous Texture
  Codec


 MPEG7IrregularVisualTimeSeries
 ISO/IEC 15938-3 Binary Irregular Time Series
  Codec


 MPEG7MotionActivity
 ISO/IEC 15938-3 Binary Motion Activity
  Codec


 MPEG7MotionTrajectory
...

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.