Digital publishing — EPUB3 preservation — Part 2: Metadata requirements

The ISO/IEC TS 22424 series supports long-term preservation of EPUB publications via a dual strategy. This document makes EPUB compliant with current practices of Open Archival Information Systems (OAIS) archives and technical requirements of repository systems. The former tend to rely on OAIS in their operations; the latter prefer to ingest electronic documents only in containers conforming to standards such as METS (Metadata Encoding and Transmission Standard). ISO/IEC TS 22424-1 considers EPUB features from a long-term preservation point of view.

Publications numériques — EPUB3 preservation — Partie 2: Titre manque

General Information

Status
Published
Publication Date
28-Jan-2020
Current Stage
9092 - International Standard to be revised
Completion Date
16-Sep-2024
Ref Project

Buy Standard

Technical specification
ISO/IEC TS 22424-2:2020 - Digital publishing — EPUB3 preservation — Part 2: Metadata requirements Released:1/29/2020
English language
35 pages
sale 15% off
Preview
sale 15% off
Preview
Technical specification
ISO/IEC TS 22424-2:2020 - Digital publishing -- EPUB3 preservation
English language
35 pages
sale 15% off
Preview
sale 15% off
Preview

Standards Content (Sample)


TECHNICAL ISO/IEC TS
SPECIFICATION 22424-2
First edition
2020-01
Digital publishing — EPUB3
preservation —
Part 2:
Metadata requirements
Reference number
©
ISO/IEC 2020
© ISO/IEC 2020
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting
on the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address
below or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Fax: +41 22 749 09 47
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
ii © ISO/IEC 2020 – All rights reserved

Contents Page
Foreword .iv
Introduction .v
1 Scope . 1
2 Normative references . 1
3 Terms and definitions . 1
4 Abbreviated terms . 2
5 Syntax . 2
6 Packaging metadata . 4
6.1 General . 4
6.2 Package creator / submitter information . 4
6.3 Package status . 5
6.4 Package identifier . 5
6.5 Work and publication identifiers . 6
6.6 Core media type resource identifiers . 8
6.7 Foreign resource identifiers . 9
6.8 Identifiers for metadata records .10
6.9 Dates .11
6.9.1 General.11
6.9.2 Creation date of a submission information package .12
6.9.3 Modification date of a submission information package .12
6.9.4 Creation/modification date of an EPUB publication .12
6.9.5 Creation/modification of a metadata record .13
6.10 Metadata format and its versions .13
7 Administrative metadata .15
7.1 General .15
7.2 Technical metadata .16
7.2.1 File formats and their versions .16
7.2.2 Digital signatures and checksums.19
7.3 Rights metadata .20
7.3.1 General.20
7.3.2 Preservation related rights .21
7.4 Structural metadata .22
7.5 Preservation metadata .24
8 Structure of submission information packages .26
9 Content of submission information packages .27
Annex A (informative) Digital signature .29
Annex B (informative) Events .31
Bibliography .35
© ISO/IEC 2020 – All rights reserved iii

Foreword
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical
Commission) form the specialized system for worldwide standardization. National bodies that
are members of ISO or IEC participate in the development of International Standards through
technical committees established by the respective organization to deal with particular fields of
technical activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other
international organizations, governmental and non-governmental, in liaison with ISO and IEC, also
take part in the work.
The procedures used to develop this document and those intended for its further maintenance are
described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for
the different types of document should be noted. This document was drafted in accordance with the
editorial rules of the ISO/IEC Directives, Part 2 (see www .iso .org/ directives).
Attention is drawn to the possibility that some of the elements of this document may be the subject
of patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent
rights. Details of any patent rights identified during the development of the document will be in the
Introduction and/or on the ISO list of patent declarations received (see www .iso .org/ patents) or the IEC
list of patent declarations received (see http:// patents .iec .ch).
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and
expressions related to conformity assessment, as well as information about ISO's adherence to the
World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT) see www .iso .org/
iso/ foreword .html.
This document was prepared by Joint Technical Committee ISO/IEC JTC 1, Information technology,
Subcommittee SC 34, Document description and processing languages.
A list of all parts in the ISO/IEC TS 22424 series can be found on the ISO website.
Any feedback or questions on this document should be directed to the user’s national standards body. A
complete listing of these bodies can be found at www .iso .org/ members .html.
iv © ISO/IEC 2020 – All rights reserved

Introduction
This document facilitates the long-term preservation of EPUB publications by specifying metadata
elements which are required or recommended for long-term preservation (such as identifiers) and the
ways in which the EPUB publication and related metadata can be packaged. EPUB versions 3 and 3.0.1
are covered; if necessary, the EPUB version applicable is specified.
Long-term preservation in general requires two things:
— making the object such as EPUB publication fit for preservation – including features to be used and
feature to avoid;
— packaging the object (and any metadata related to it) together with any additional data such as
other versions of the object and other documentation into an Open Archival Information System
(OAIS) submission information package (SIP).
ISO/IEC TS 22424-1 concentrates on the archivability of EPUB documents.
The background to this document comes from the Open Archival Information System, which is
described in ISO/IEC TS 22424-1.
When a submission information package (SIP) is formed, mandatory preservation metadata need to
be present in the package. Depending on the agreements made between the producer and the archive,
metadata elements are stored either in the container document or the EPUB publication itself, or both.
Usually an archive would expect to find all relevant metadata in the container, unless the submission
agreement allows embedding of metadata into EPUB publications.
This document does not require any changes to be made to the current of future EPUB standards.
However, when an EPUB publication is created or modified for submission to an archive, there are some
EPUB features that should be used and others that should be avoided. ISO/IEC TS 22424-1 describes
how the EPUB format should be applied. This document concentrates on mandatory and recommended
metadata elements needed for the long-term preservation of EPUB publications and their METS
encoding. ISO/IEC TS 22424-1 recommends the usage of METS but allows also other container standards;
this document concentrates on preservation metadata and its METS encoding in SIPs. Future editions
1)
of these documents may specify other encodings such as BITS (Book Interchange Tag Suite) .
In order to guarantee access to documents, OAIS archives may migrate documents into new file formats
when the original formats are no longer supported by commonly used rendering tools. If the document
to be migrated is an e-book in an outdated EPUB format, migration can be made to a more modern
version of EPUB or, at least in principle, to another e-book format.
Generally, migration into another file format should be straightforward if the current and new format
are compatible and there are efficient and reliable migration tools available. If the target format is a
more modern version of the current format, compatibility should not be a problem. But if a format is
rich, migration tools may not be able to render all the properties of a resource.
This document applies to EPUB versions 3 and 3.0.1. Earlier versions (EPUB 2 and 2.0.1) are not covered.
Since there are no implementations of version 3.1, it is not covered in this document either. EPUB 3.2
2)
was published in May 2019 . It will be taken into account in the next edition of this document.
This document does not cover issues related to migration between EPUB versions or from EPUB to other
e-book formats. Migration to other formats is often lossy; this applies to e-book formats as well, since
there are EPUB features which are not supported in other e-book formats, and vice versa. Moreover,
even if the same feature is supported, technical implementations can be incompatible. For instance, if
an EPUB 3 publication using fixed layout is migrated to Amazon’s KF8 format, preserving fixed layout
properties requires special attention since there are significant technical differences between these
formats in how this feature has been implemented.
1) https:// www .loc .gov/
...


TECHNICAL ISO/IEC TS
SPECIFICATION 22424-2
First edition
2020-01
Digital publishing — EPUB3
preservation —
Part 2:
Metadata requirements
Reference number
©
ISO/IEC 2020
© ISO/IEC 2020
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting
on the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address
below or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Fax: +41 22 749 09 47
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
ii © ISO/IEC 2020 – All rights reserved

Contents Page
Foreword .iv
Introduction .v
1 Scope . 1
2 Normative references . 1
3 Terms and definitions . 1
4 Abbreviated terms . 2
5 Syntax . 2
6 Packaging metadata . 4
6.1 General . 4
6.2 Package creator / submitter information . 4
6.3 Package status . 5
6.4 Package identifier . 5
6.5 Work and publication identifiers . 6
6.6 Core media type resource identifiers . 8
6.7 Foreign resource identifiers . 9
6.8 Identifiers for metadata records .10
6.9 Dates .11
6.9.1 General.11
6.9.2 Creation date of a submission information package .12
6.9.3 Modification date of a submission information package .12
6.9.4 Creation/modification date of an EPUB publication .12
6.9.5 Creation/modification of a metadata record .13
6.10 Metadata format and its versions .13
7 Administrative metadata .15
7.1 General .15
7.2 Technical metadata .16
7.2.1 File formats and their versions .16
7.2.2 Digital signatures and checksums.19
7.3 Rights metadata .20
7.3.1 General.20
7.3.2 Preservation related rights .21
7.4 Structural metadata .22
7.5 Preservation metadata .24
8 Structure of submission information packages .26
9 Content of submission information packages .27
Annex A (informative) Digital signature .29
Annex B (informative) Events .31
Bibliography .35
© ISO/IEC 2020 – All rights reserved iii

Foreword
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical
Commission) form the specialized system for worldwide standardization. National bodies that
are members of ISO or IEC participate in the development of International Standards through
technical committees established by the respective organization to deal with particular fields of
technical activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other
international organizations, governmental and non-governmental, in liaison with ISO and IEC, also
take part in the work.
The procedures used to develop this document and those intended for its further maintenance are
described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for
the different types of document should be noted. This document was drafted in accordance with the
editorial rules of the ISO/IEC Directives, Part 2 (see www .iso .org/ directives).
Attention is drawn to the possibility that some of the elements of this document may be the subject
of patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent
rights. Details of any patent rights identified during the development of the document will be in the
Introduction and/or on the ISO list of patent declarations received (see www .iso .org/ patents) or the IEC
list of patent declarations received (see http:// patents .iec .ch).
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and
expressions related to conformity assessment, as well as information about ISO's adherence to the
World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT) see www .iso .org/
iso/ foreword .html.
This document was prepared by Joint Technical Committee ISO/IEC JTC 1, Information technology,
Subcommittee SC 34, Document description and processing languages.
A list of all parts in the ISO/IEC TS 22424 series can be found on the ISO website.
Any feedback or questions on this document should be directed to the user’s national standards body. A
complete listing of these bodies can be found at www .iso .org/ members .html.
iv © ISO/IEC 2020 – All rights reserved

Introduction
This document facilitates the long-term preservation of EPUB publications by specifying metadata
elements which are required or recommended for long-term preservation (such as identifiers) and the
ways in which the EPUB publication and related metadata can be packaged. EPUB versions 3 and 3.0.1
are covered; if necessary, the EPUB version applicable is specified.
Long-term preservation in general requires two things:
— making the object such as EPUB publication fit for preservation – including features to be used and
feature to avoid;
— packaging the object (and any metadata related to it) together with any additional data such as
other versions of the object and other documentation into an Open Archival Information System
(OAIS) submission information package (SIP).
ISO/IEC TS 22424-1 concentrates on the archivability of EPUB documents.
The background to this document comes from the Open Archival Information System, which is
described in ISO/IEC TS 22424-1.
When a submission information package (SIP) is formed, mandatory preservation metadata need to
be present in the package. Depending on the agreements made between the producer and the archive,
metadata elements are stored either in the container document or the EPUB publication itself, or both.
Usually an archive would expect to find all relevant metadata in the container, unless the submission
agreement allows embedding of metadata into EPUB publications.
This document does not require any changes to be made to the current of future EPUB standards.
However, when an EPUB publication is created or modified for submission to an archive, there are some
EPUB features that should be used and others that should be avoided. ISO/IEC TS 22424-1 describes
how the EPUB format should be applied. This document concentrates on mandatory and recommended
metadata elements needed for the long-term preservation of EPUB publications and their METS
encoding. ISO/IEC TS 22424-1 recommends the usage of METS but allows also other container standards;
this document concentrates on preservation metadata and its METS encoding in SIPs. Future editions
1)
of these documents may specify other encodings such as BITS (Book Interchange Tag Suite) .
In order to guarantee access to documents, OAIS archives may migrate documents into new file formats
when the original formats are no longer supported by commonly used rendering tools. If the document
to be migrated is an e-book in an outdated EPUB format, migration can be made to a more modern
version of EPUB or, at least in principle, to another e-book format.
Generally, migration into another file format should be straightforward if the current and new format
are compatible and there are efficient and reliable migration tools available. If the target format is a
more modern version of the current format, compatibility should not be a problem. But if a format is
rich, migration tools may not be able to render all the properties of a resource.
This document applies to EPUB versions 3 and 3.0.1. Earlier versions (EPUB 2 and 2.0.1) are not covered.
Since there are no implementations of version 3.1, it is not covered in this document either. EPUB 3.2
2)
was published in May 2019 . It will be taken into account in the next edition of this document.
This document does not cover issues related to migration between EPUB versions or from EPUB to other
e-book formats. Migration to other formats is often lossy; this applies to e-book formats as well, since
there are EPUB features which are not supported in other e-book formats, and vice versa. Moreover,
even if the same feature is supported, technical implementations can be incompatible. For instance, if
an EPUB 3 publication using fixed layout is migrated to Amazon’s KF8 format, preserving fixed layout
properties requires special attention since there are significant technical differences between these
formats in how this feature has been implemented.
1) https:// www .loc .gov/
...

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.