Information technology — Multimedia content description interface — Part 5: Multimedia description schemes — Amendment 5: Quality metadata, multiple text encodings, extended classification metadata

Technologies de l'information — Interface de description du contenu multimédia — Partie 5: Schémas de description multimédia — Amendement 5: Métadonnées de qualité, encodages de texte multiples, métadonnées de classification étendues

General Information

Status: Published
Publication Date: 09-Nov-2015
Current Stage: 6060 - International Standard published
Start Date: 01-Aug-2015
Due Date: 07-Apr-2016
Completion Date: 10-Nov-2015

Standard
ISO/IEC 15938-5:2003/Amd 5:2015 - Quality metadata, multiple text encodings, extended classification metadata
English language
18 pages

Standards Content (Sample)

INTERNATIONAL STANDARD ISO/IEC 15938-5
First edition 2003-05-15
AMENDMENT 5 2015-11-15
Information technology — Multimedia
content description interface —
Part 5:
Multimedia description schemes
AMENDMENT 5: Quality metadata,
multiple text encodings, extended
classification metadata
Technologies de l’information — Interface de description du
contenu multimédia —
Partie 5: Schémas de description multimédia
AMENDEMENT 5: Métadonnées de qualité, encodages de texte
multiples, métadonnées de classification étendues
Reference number: ISO/IEC 15938-5:2003/Amd.5:2015(E)
© ISO/IEC 2015


COPYRIGHT PROTECTED DOCUMENT
© ISO/IEC 2015, Published in Switzerland
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized otherwise in any form
or by any means, electronic or mechanical, including photocopying, or posting on the internet or an intranet, without prior
written permission. Permission can be requested from either ISO at the address below or ISO’s member body in the country of
the requester.
ISO copyright office
Ch. de Blandonnet 8 • CP 401
CH-1214 Vernier, Geneva, Switzerland
Tel. +41 22 749 01 11
Fax +41 22 749 09 47
copyright@iso.org
www.iso.org

Contents
Foreword
Introduction
1 Scope
2 Multiple Text Encodings
7.2.2 Textual datatypes
3 Extended Classification Metadata
9.2.3 Classification DS
4 Quality Control Metadata
8.2.6 Media Quality D
8.2.6.1 Introduction
8.2.9 Extended Media Quality Description Tools
8.2.9.1 ExtendedMediaQuality D
8.2.9.2 QC Profile Tools
8.2.9.3 QC Item Result Tools

Foreword
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical
Commission) form the specialized system for worldwide standardization. National bodies that are members of
ISO or IEC participate in the development of International Standards through technical committees
established by the respective organization to deal with particular fields of technical activity. ISO and IEC
technical committees collaborate in fields of mutual interest. Other international organizations, governmental
and non-governmental, in liaison with ISO and IEC, also take part in the work. In the field of information
technology, ISO and IEC have established a joint technical committee, ISO/IEC JTC 1.
The procedures used to develop this document and those intended for its further maintenance are described
in the ISO/IEC Directives, Part 1. In particular the different approval criteria needed for the different types of
document should be noted. This document was drafted in accordance with the editorial rules of the ISO/IEC
Directives, Part 2 (see www.iso.org/directives).
Attention is drawn to the possibility that some of the elements of this document may be the subject of patent
rights. ISO and IEC shall not be held responsible for identifying any or all such patent rights.
Details of any patent rights identified during the development of the document will be in the Introduction and/or
on the ISO list of patent declarations received (see www.iso.org/patents).
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation on the meaning of ISO specific terms and expressions related to conformity assessment,
as well as information about ISO’s adherence to the WTO principles in the Technical Barriers to Trade (TBT)
see the following URL: Foreword - Supplementary information
The committee responsible for this document is ISO/IEC JTC 1, Information technology, Subcommittee SC 29,
Coding of audio, picture, multimedia and hypermedia information.

Introduction
The Multimedia Preservation Application Format (MP-AF, ISO/IEC 23000-15) makes use of MPEG-7 MDS for
descriptive and technical metadata. The following gaps in MPEG-7 MDS for preservation applications are
addressed by this amendment:
- Support for identifying the script of a string. This is useful in cases where a language can be
represented in multiple scripts (e.g., Japanese, Bulgarian). While the script identifier according to
ISO 15924 can be represented as part of the language tag, a separate attribute improves
the modularity of the metadata.
- Support for multiple string encodings in a single XML document. In some cases strings (e.g.,
identifiers, file names) need to be kept in their original character encoding, which differs from the encoding of
the metadata document. To enable this and avoid problems with XML parsing, the content of
the string is then base64 or percent encoded and the original character encoding is identified.
- Extended classification metadata, i.e., increasing the cardinality of some classification metadata
elements and adding the option to identify the version of the content.
- Metadata for quality control. Detailed quality descriptors are only available for audio (MPEG-7 part
4/AMD 2), while similar tools for format/wrapper and visual quality checks and defects are missing. A
representation compatible with the European Broadcasting Union’s (EBU) Quality Control data model
is needed.
This amendment defines solutions for filling these gaps in MDS. These extensions are expected to be useful
also for applications beyond preservation.
The extensions are all optional, thus descriptions conforming to a version of MPEG-7 MDS prior to this
amendment are valid against a version including this amendment.

FINAL DRAFT AMENDMENT ISO/IEC 15938-5:2003/Amd.5:2015(E)
Information Technology — Multimedia content description
interface — Part 5: Multimedia description schemes,
AMENDMENT 5: Quality metadata, multiple text encodings,
extended classification metadata
1 Scope
This amendment extends text and classification descriptors for descriptive metadata for audiovisual content
making use of existing datatypes and adding some elements and attributes, and adds support for quality
metadata. A first extension improves internationalization for text-based metadata by identifying the script(s)
used and enables carrying encoded strings represented in multiple character sets in the same XML document.
A second improvement extends classification metadata in order to: (a) improve interoperability with existing
metadata formats used in professional media production, (b) increase cardinality of some elements and (c)
add attributes and a version element. Finally a third extension improves the tools for describing profiles for
quality control of audiovisual media, as well as the results of quality analysis on wrapper, bitstream and
baseband level.
NOTE The XML schema implementing this amendment as well as a consolidated XML schema for MPEG-7 including
this amendment are contained in the electronic attachment.
2 Multiple Text Encodings
In 7.2.2, replace the entire text of the subclause with the following text:
7.2.2 Textual datatypes
7.2.2.1 Introduction
The TextualBase datatype provides a base from which other kinds of textual description tools are derived. It
can also be used in the definitions of tools that contain textual descriptions. An optional attribute, xml:lang, is
used to specify the language of the contained text.
The Textual datatype, derived from the TextualBase datatype, is a non-abstract type that is used for
simple textual descriptions.
In cases where textual elements need to be stored in their native character set (different from the encoding of
the XML document), the character set can be specified using the charset attribute. In order to allow for correct
parsing of the document, the content of the textual element must be encoded in this case (using base64 or
percent encoding).
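As a minimal sketch of the mechanism described above, the following Python fragment encodes a string in its native character set and then wraps the resulting bytes in base64 or percent encoding, so that they can be carried in an XML document that uses a different encoding. The attribute names charset and encoding come from this amendment. The literal attribute values "base64" and "percent", the element name passed by the caller, and the helper names encode_native, decode_native and make_textual are assumptions made for illustration only.

```python
import base64
import xml.etree.ElementTree as ET
from urllib.parse import quote_from_bytes, unquote_to_bytes


def encode_native(text: str, charset: str, encoding: str) -> str:
    """Encode text in its native charset, then make the bytes XML-safe."""
    raw = text.encode(charset)
    if encoding == "base64":
        return base64.b64encode(raw).decode("ascii")
    if encoding == "percent":
        # safe="" also percent-encodes "/", which quote_from_bytes keeps by default.
        return quote_from_bytes(raw, safe="")
    raise ValueError(f"unsupported encoding: {encoding}")


def decode_native(payload: str, charset: str, encoding: str) -> str:
    """Reverse encode_native: recover the bytes, then decode with the charset."""
    if encoding == "base64":
        raw = base64.b64decode(payload)
    elif encoding == "percent":
        raw = unquote_to_bytes(payload)
    else:
        raise ValueError(f"unsupported encoding: {encoding}")
    return raw.decode(charset)


def make_textual(tag: str, text: str, charset: str, encoding: str) -> ET.Element:
    """Build an element carrying the encoded string plus both attributes."""
    element = ET.Element(tag, {"charset": charset, "encoding": encoding})
    element.text = encode_native(text, charset, encoding)
    return element
```

A reader of such an element would call decode_native with the element text and the two attribute values. Because the payload is pure ASCII, it survives any XML document encoding unchanged.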
7.2.2.2 Textual datatypes syntax
[Schema listing not preserved in this sample. The only surviving fragment is the tail of an attribute declaration: use="optional" default="sampa"/>]
7.2.2.3 Textual datatypes semantics
Semantics of the TextualBaseType:
Name Definition
TextualBaseType
Abstract base type for textual descriptions. This type includes the
xml:lang attribute for identifying the language in which the
description is written and can contain a phonetic transcription of the
text
xml:lang Indicates the language in which the textual description is written
(optional). If no value is specified for this attribute, the value of the
xml:lang attribute is inherited from the closest containing element
that specifies a value for xml:lang.
phoneticTranscription Describes a phonetic transcription of the text using a phonetic
alphabet. The transcription is represented as a list of phones (i.e.
PhoneType). The definition for PhoneType is given in ISO/IEC
15938-4.
phoneticAlphabet
Describes the phonetic alphabet that is used for the transcription in
phoneticTranscription, e.g. IPA (International Phonetic Association
alphabet). The specification of the phoneticAlphabetType is
given in ISO/IEC 15938-4.
charset
Specifies the character set of the string, if it does not conform to the
character set of the XML document. In this case, the content of
TextualBaseType must be encoded, and the encoding attribute
must be present.
encoding The encoding used to represent a string in a charset different from the
global character set of the XML document. Recommended
encodings are base64 or percent.
script Specifies the script used for representing the string, using the
values defined in ISO 15924. This allows the correct identification of
the script being used for languages that can be represented in
multiple scripts. For modularity of representation, a separate
attribute is provided as an alternative to coding the script in the
xml:lang attribute. It is recommended to use the script attribute.
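The relationship between the script attribute and a script subtag inside xml:lang can be sketched as follows. This is an illustrative helper, not part of the amendment: it assumes the script subtag, when present, directly follows the primary language subtag, and it relies only on the fact that ISO 15924 alphabetic codes are four letters with an initial capital (for example Cyrl or Latn). The function names split_script and textual_attributes are hypothetical.

```python
import re

# ISO 15924 alphabetic codes: four letters, conventionally written like "Cyrl".
_SCRIPT_CODE = re.compile(r"^[A-Za-z]{4}$")


def split_script(lang_tag: str):
    """Return (language tag without script subtag, script code or None).

    Simplification: only the subtag right after the primary language is
    considered a script candidate.
    """
    subtags = lang_tag.split("-")
    if len(subtags) > 1 and _SCRIPT_CODE.match(subtags[1]):
        script = subtags[1].title()  # normalise case, e.g. "latn" -> "Latn"
        return "-".join(subtags[:1] + subtags[2:]), script
    return lang_tag, None


def textual_attributes(lang_tag: str) -> dict:
    """Build the attribute set with the script carried separately."""
    language, script = split_script(lang_tag)
    attributes = {"xml:lang": language}
    if script is not None:
        attributes["script"] = script
    return attributes
```

For instance, a combined tag such as sr-Cyrl-RS would be split into the language tag sr-RS and the script code Cyrl, which matches the recommendation to carry the script in its own attribute.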
Semantics of the TextualType:
Name Definition
TextualType Describes a textual description. TextualType extends the
abstract TextualBaseType. This type includes the xml:lang
attribute for identifying the language in which the description is written
and can contain a phonetic transcription of the text.
7.2.2.4 Textual datatypes example (informative)
The following example description excerpt shows the use of Textual datatype for describing the free text
in English: "The Giants win the pennant."

The Giants win the pennant.

The following example description excerpt shows the use of Textual datatype for describing the name of
the city of Moscow in Russian, using Cyrillic script.
Москва
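When reading descriptions like the two examples above, the effective language of a textual element follows the xml:lang inheritance rule given in 7.2.2.3: an element without its own xml:lang takes the value of the closest containing element that specifies one. The sketch below resolves this with Python's standard ElementTree module. The element names Description, Title and Place are hypothetical and are not taken from the MPEG-7 schema.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes xml:lang under the reserved XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def effective_languages(element, inherited=None):
    """Yield (element, effective language) pairs in document order."""
    # An explicit xml:lang wins; otherwise the ancestor's value is inherited.
    language = element.get(XML_LANG, inherited)
    yield element, language
    for child in element:
        yield from effective_languages(child, language)


document = ET.fromstring(
    '<Description xml:lang="en">'
    "<Title>The Giants win the pennant.</Title>"
    '<Place xml:lang="ru">Москва</Place>'
    "</Description>"
)
languages = {el.tag: lang for el, lang in effective_languages(document)}
```

Here Title has no xml:lang of its own and therefore resolves to the language of Description, while Place overrides it.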
3 Extended Classification Metadata
In 9.2.3, replace the entire text of the subclause with the following text:
9.2.3 Classification DS
9.2.3.1 Introduction
The Classification DS describes the classification of the multimedia content. The resulting descriptions
facilitate searching and filtering of multimedia content based on user preferences (e.g., language, style, genre,
and so forth) and service-oriented classifications (e.g., purpose, parental guidance, market segmentation,
media review, and so forth).
9.2.3.2 Classification DS syntax
[Schema listing not preserved in this sample. Only fragments of declarations survive, including minOccurs="0" maxOccurs="unbounded", minOccurs="0", type="mpeg7:zeroToOneType", use="optional", default="true", and use="optional" default="false".]
9.2.3.3 Classification DS semantics
Semantics of the ClassificationType:
Name Definition
ClassificationType Describes the classification of the multimedia content.
Form Describes the production type of the document, such as film, news
program, magazine, documentary, etc. (optional). An example of CS is
FormatCS.
Genre Describes what the multimedia content is about (broad classification),
such as sports, politics, economics, etc. (optional). An example of CS is
the GenreCS.
type
Indicates the type of the genre of the multimedia content. The types of
genres are defined as follows.
- main – The specified genre is the main, or primary. This is the
default value.
- secondary – The specified genre is a secondary genre, such
as a subgenre.
confidence
Confidence of determined genre (optional).
Subject
Describes the subjects (specific classifications) of the multimedia
content (optional). A subject allows a textual annotation to classify
the multimedia content.
Purpose Describes one or more purposes for which the multimedia content
was created (optional). An example of CS is IntentionCS.
Language
Describes one or more languages of the spoken audio of the program
(optional).
CaptionLanguage
Describes one or more languages of the caption information included
with the program (optional). The type of the caption information
associated with the program is denoted by the closed attribute. Closed
captions can be turned on or off by the user, while open captions (or
subtitles) are part of the picture itself and remain visible.
closed Indicates whether the specified caption is closed. Default value of the
attribute is true; if the attribute is set to false, then the provided caption
description refers to open captions/subtitles.
supplemental
Indicates whether the captions provide descriptions of the scene for the
benefit of the hearing or visually impaired, in addition to a direct translation
of the spoken words. Closed captions may include descriptive
information, such as speaker identification and non-speech sounds,
that would otherwise be missed. Default value of this attribute is false.
SignLanguage
Specifies the audio sign language provided for the multimedia content,
and, optionally, qualifies the use of signing as a primary language or as
a translation of the spoken dialogue (optional).
primary Indicates if the sign language is the primary language of the content or
not, i.e., if the content is produced specifically for the hearing impaired
or not (optional).
translation Indicates if the sign language is a translation of the spoken dialogue or
not (optional).
Release Describes the release date and region of the multimedia content
(optional).
Region
Indicates the countries or regions in which the multimedia content was
first released (optional). This locator may be different than the location
where it was created.
date Indicates the date on which the multimedia content was first released.
This date may be different than the date(s) when it was created
(optional).
Target Describes the target of the multimedia content in terms of market
classification, age and country or region (optional).
Market Describes targeted markets of the multimedia content (optional). An
example of CS is TargetGroupCS.
Age
Describes the targeted age range of the multimedia content (optional).
min Indicates the minimum age of the targeted age range (optional).
max Indicates the maximum age of the targeted age range (optional).
Region
Describes target countries or regions for the multimedia content
(optional).
ParentalGuidance Describes parental guidance classification of the multimedia content
(optional).
MediaReview Describes media reviews about the multimedia content (optional).
Version
The version of the content (optional).
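The default values stated above for the closed and supplemental attributes of CaptionLanguage matter when an instance omits them. The following sketch shows one way a reader might apply those defaults. It assumes that both attributes use the xs:boolean lexical forms (true, false, 1, 0) and that the language code is the text content of the element. Neither assumption can be confirmed from this sample, since the schema listing is not available here, and the function names are hypothetical.

```python
import xml.etree.ElementTree as ET


def parse_xs_boolean(value, default):
    """Interpret an xs:boolean lexical value, using the default when absent."""
    if value is None:
        return default
    value = value.strip()
    if value in ("true", "1"):
        return True
    if value in ("false", "0"):
        return False
    raise ValueError(f"not an xs:boolean value: {value!r}")


def describe_caption(element):
    """Summarise a CaptionLanguage element with the documented defaults."""
    closed = parse_xs_boolean(element.get("closed"), default=True)
    supplemental = parse_xs_boolean(element.get("supplemental"), default=False)
    return {
        "language": (element.text or "").strip(),
        # closed="false" means open captions or subtitles burnt into the picture.
        "kind": "closed" if closed else "open",
        "supplemental": supplemental,
    }
```

With this reading, an element that carries only a language code is treated as closed captions without supplemental descriptions.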
Semantics of the ExtendedLanguageType:
Name Definition
ExtendedLanguageType Specifies the language of spoken audio of the multimedia
content.
type Indicates whether the spoken language is the original language
of the content, or has been dubbed. The following values are
allowed.
- original – The specified language is an original language
of the spoken content. This is the default value.
- dubbed – The specified language is a dubbed language
of the spoken content.
- background – The specified language is the original
language heard in the background, while another
(dubbed) language is heard in the foreground.
Other values that are datatype-valid with respect to
mpeg7:termReferenceType are reserved.
supplemental Indicates whether the specified language exclusively provides an
audio description of the multimedia content, for the benefit of the
visually impaired. Default value of this attribute is false.
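The three allowed type values and the two defaults in the table above can be modelled compactly. The sketch below is an illustration under assumptions: the class and field names are hypothetical, and rejecting any other type value is one possible way to treat values that the table describes as reserved.

```python
from dataclasses import dataclass

# Values listed in the semantics table; all other values are reserved.
LANGUAGE_TYPES = ("original", "dubbed", "background")


@dataclass(frozen=True)
class SpokenLanguage:
    code: str
    type: str = "original"      # default value per the semantics table
    supplemental: bool = False  # default value per the semantics table

    def __post_init__(self):
        if self.type not in LANGUAGE_TYPES:
            raise ValueError(f"reserved or unknown language type: {self.type!r}")


def original_languages(languages):
    """Codes of languages in which the content was originally spoken.

    "background" counts as original, because the table defines it as the
    original language heard behind a dubbed foreground language.
    """
    return [lang.code for lang in languages if lang.type in ("original", "background")]
```

A track list containing English as background and German as dubbed would therefore report English as the only original language.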
Semantics of the ParentalGuidanceType:
Name Definition
ParentalGuidanceType
Describes parental guidance classification.
ParentalRating Indicates the rating value according to the rating scheme.
Examples of CSs are JapanRatingCS,
ICRAParentalRatingCS, MPAAParentalRatingCS,
ICRAParentalRatingNudityCS, RIAAParentalRatingCS,
ICRAParentalRatingSexCS, MPAAParentalRatingTVCS, and
IRCAParentalRatingViolenceCS.
MinimumAge Indicates the minimum recommended age in years of the end user.
Region
Indicates the country or region for which the parental guidance
description is specified (optional). If not specified, the parental
guidance applies worldwide.
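The rule that a ParentalGuidance description without a Region applies worldwide suggests a simple lookup when several descriptions are present. The sketch below prefers region-specific entries and falls back to worldwide ones. That precedence is an assumption made for illustration, as the text above does not state how overlapping entries should be resolved. The class, the function names and the region codes used in the usage note are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class ParentalGuidance:
    minimum_age: int
    region: Optional[str] = None  # None: the guidance applies worldwide


def applicable_guidance(entries: List[ParentalGuidance], region: str) -> List[ParentalGuidance]:
    """Entries for the given region, or the worldwide entries if none match."""
    regional = [entry for entry in entries if entry.region == region]
    if regional:
        return regional
    return [entry for entry in entries if entry.region is None]


def is_allowed(entries: List[ParentalGuidance], region: str, viewer_age: int) -> bool:
    """True if the viewer meets every applicable minimum age."""
    return all(viewer_age >= entry.minimum_age
               for entry in applicable_guidance(entries, region))
```

Given a worldwide entry with minimum age 12 and an entry for region "de" with minimum age 16, a 14-year-old viewer would be allowed in region "fr" but not in region "de".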
Semantics of the MediaReviewType:
Name Definition
MediaReviewType Describes the Media Review about the multimedia content.
Rating Specifies the rating value and criterion used in the review (optional).
FreeTextReview
Describes a free-text review of the multimedia content without reference
to a rating scheme (optional). There can be multiple instances of the
review in different languages.
ReviewReference
Describes the Media Locator of the material from where the review may
have been extracted or quoted (optional). For example, the TV magazine
that published the review, the Films Guide from where the review was
quoted, an interview from where the review was transcribed.
Reviewer
Describes the reviewer/critic of the multimedia content (optional). Since it
is defined as AgentType, this field can describe a person, a group of
persons or an organization.
4 Quality Control Metadata
Replace subclause 8.2.6 with the following text (allowing QualityRating to be optional):
8.2.6 Media Quality D
8.2.6.1 Introduction
The MediaQuality D describes quality rating information of multimedia content. The quality of AV content
may decrease when the signal is compressed, transmitted, or converted. The MediaQuality D describes both
subjective and objective quality ratings.
8.2.6.2 MediaQuality D syntax
[Schema listing not preserved in this sample. The only surviving fragments are occurrence constraints: minOccurs="0" maxOccurs="unbounded".]
8.2.6.3 MediaQuality D semantics
Semantics of the MediaQualityType:
Name Definition
MediaQualityType
Describes the media quality of the Media Profile.
QualityRating
Describes the rating values and the criterion used to create the
media quality ratings (optional).
type
Describes the type of media quality rating. The types of quality rating
are defined as follows.
- subjective – The rating is subjective, that is, it is provided by
human viewers.
- objective – The rating is objective, that is, it is acquired using
computational means.
RatingSource
Describes the source that provides the ratings (optional).
RatingInformationLocator
Indicates the locator for additional information about the quality
rating method (optional).
PerceptibleDefects
Describes defects that are perceived in the media (optional).
VisualDefects
Describes the visual errors perceived in the media (optional). Errors
are listed in the order of descending severity. An example of CS is
VisualDefectsCS.
AudioDefects
Describes the audio errors perceived in the media (optional). Errors
are listed in the order of descending severity. An example of CS is
AudioDefectsCS.
Semantics of the RatingType:
Name
...
