Data quality — Part 2: Vocabulary

This document defines terms relating to data quality. These terms are used by the parts in the ISO 8000 series.

Qualité des données — Partie 2: Vocabulaire

General Information

Status
Published
Publication Date
21-Sep-2022
Current Stage
9092 - International Standard to be revised
Start Date
16-May-2025
Completion Date
17-May-2025
Ref Project

Relations

Buy Standard

Standard
ISO 8000-2:2022 - Data quality — Part 2: Vocabulary Released:22. 09. 2022
English language
25 pages
sale 15% off
Preview
sale 15% off
Preview

Standards Content (Sample)


INTERNATIONAL ISO
STANDARD 8000-2
Fifth edition
2022-09
Data quality —
Part 2:
Vocabulary
Qualité des données —
Partie 2: Vocabulaire
Reference number
© ISO 2022
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting on
the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below
or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
ii
Contents Page
Foreword .iv
Introduction .v
1 Scope . 1
2 Normative references . 1
3 Terms and definitions . 1
3.1 Terms relating to quality . 1
3.2 Terms relating to data and information . 3
3.3 Terms relating to identifier . 4
3.4 Terms relating to measurement . 5
3.5 Terms relating to industrial data . 6
3.6 Terms relating to data dictionary. 7
3.7 Terms relating to characteristic data . 8
3.8 Terms relating to data quality . 8
3.9 Terms relating to syntax and semantics.12
3.10 Terms relating to transaction data . 13
3.11 Terms relating to master data .13
3.12 Terms relating to product data . 13
3.13 Terms relating to item of production and item of supply . 15
3.14 Terms relating to data quality role . 16
3.15 Terms relating to process assessment. 17
3.16 Terms relating to data governance . 20
Annex A (informative) Document identification .21
Bibliography .22
Index .24
iii
Foreword
ISO (the International Organization for Standardization) is a worldwide federation of national standards
bodies (ISO member bodies). The work of preparing International Standards is normally carried out
through ISO technical committees. Each member body interested in a subject for which a technical
committee has been established has the right to be represented on that committee. International
organizations, governmental and non-governmental, in liaison with ISO, also take part in the work.
ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of
electrotechnical standardization.
The procedures used to develop this document and those intended for its further maintenance are
described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for the
different types of ISO documents should be noted. This document was drafted in accordance with the
editorial rules of the ISO/IEC Directives, Part 2 (see www.iso.org/directives).
Attention is drawn to the possibility that some of the elements of this document may be the subject of
patent rights. ISO shall not be held responsible for identifying any or all such patent rights. Details of
any patent rights identified during the development of the document will be in the Introduction and/or
on the ISO list of patent declarations received (see www.iso.org/patents).
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and
expressions related to conformity assessment, as well as information about ISO's adherence to
the World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT), see
www.iso.org/iso/foreword.html.
This document was prepared by Technical Committee ISO/TC 184, Automation systems and integration,
Subcommittee SC 4, Industrial data.
This fifth edition cancels and replaces the fourth edition (ISO 8000-2:2020), which has been technically
revised. It also incorporates the Amendment ISO 8000-2:2020/Amd 1:2021.
The main changes are as follows:
— additional terminological entries to align the ISO 8000 series further with ISO 9000;
— updates where the updates originate from a new edition of ISO 8000-110;
— updates where the updates originate from converting ISO 8000-150 to an International Standard
from a Technical Specification;
— updates where the updates originate from a new edition of ISO 10303-59;
— other minor improvements to entries to improve consistency and readability of entries.
A list of all parts in the ISO 8000 series can be found on the ISO website.
Any feedback or questions on this document should be directed to the user’s national standards body. A
complete listing of these bodies can be found at www.iso.org/members.html.
iv
Introduction
0.1 Foundations of the ISO 8000 series
Digital data deliver value by enhancing all aspects of organizational performance including:
— operational effectiveness and efficiency;
— safety and security;
— reputation with customers and the wider public;
— compliance with statutory regulations;
— innovation;
— consumer costs, revenues and stock prices.
In addition, many organizations are now addressing these considerations with reference to the United
1)
Nations Sustainable Development Goals .
The influence on performance originates from data being the formalized representation of
2)
information . This information enables organizations to make reliable decisions. Such decision making
can be performed by human beings directly and also by automated data processing including artificial
intelligence systems.
Through widespread adoption of digital computing and associated communication technologies,
organizations become dependent on digital data. This dependency amplifies the negative consequences
of lack of quality in these data. These consequences are the decrease of organizational performance.
The biggest impact of digital data comes from two key factors:
— the data having a structure that reflects the nature of the subject matter;
EXAMPLE 1 A research scientist writes a report using a software application for word processing. This report
includes a table that uses a clear, logical layout to show results from an experiment. These results indicate how
material properties vary with temperature. The report is read by a designer, who uses the results to create a
product that works in a range of different operating temperatures.
— the data being computer processable (machine readable) rather than just being for a person to read
and understand.
EXAMPLE 2 A research scientist uses a database system to store the results of experiments on a material.
This system controls the format of different values in the data set. The system generates an output file of digital
data. This file is processed by a software application for engineering analysis. The application determines the
optimum geometry when using the material to make a product.
ISO 9000 explains that quality is not an abstract concept of absolute perfection. Quality is actually
the conformance of characteristics to requirements. This actuality means that any item of data can
be of high quality for one purpose but not for a different purpose. The quality is different because the
requirements are different between the two purposes.
EXAMPLE 3 Time data are processed by calendar applications and also by control systems for propulsion
units on spacecraft. These data include start times for meetings in a calendar application and activation times in
a control system. These start times require less precision than the activation times.
1) https://sdgs.un.org/goals
2) This document defines information as “knowledge concerning objects, such as facts, events, things, processes,
or ideas, including concepts, that within a certain context has a particular meaning”.
v
The nature of digital data is fundamental to establishing requirements that are relevant to the specific
decisions made by an organization.
EXAMPLE 4 ISO 8000-1 identifies that data have syntactic (format), semantic (meaning) and pragmatic
(usefulness) characteristics.
To support the delivery of high-quality data, the ISO 8000 series addresses:
— data governance, data quality management and maturity assessment;
EXAMPLE 5 ISO 8000-61 specifies a process reference model for data quality management.
— creating and applying requirements for data and information;
EXAMPLE 6 ISO 8000-110 specifies how to exchange characteristic data that are master data.
— monitoring and measuring information and data quality;
EXAMPLE 7 ISO 8000-8 specifies approaches to measuring information and data quality.
— improving data and, consequently, information quality;
EXAMPLE 8 ISO/TS 8000-81 specifies an approach to data profiling, which identifies opportunities to improve
data quality.
— issues that are specific to the type of content in a data set.
EXAMPLE 9 ISO/TS 8000-311 specifies how to address quality considerations for product shape data.
Data quality management covers all aspects of data processing, including creating, collecting, storing,
maintaining, transferring, exploiting and presenting data to deliver information.
Effective data quality management is systemic and systematic, requiring an understanding of the
root causes of data quality issues. This understanding is the basis for not just correcting existing
nonconformities but also implementing solutions that prevent future reoccurrence of those
nonconformities.
EXAMPLE 10 If a data set includes dates in multiple formats including “yyyy-mm-dd”, “mm-dd-yy” and
“dd-mm-yy”, then data cleansing can correct the consistency of the values. Such cleansing requires additional
information, however, to resolve ambiguous entries (such as, “04-05-20”). The cleansing also cannot address any
process issues and people issues, including training, that have caused the inconsistency.
0.2 Understanding more about the ISO 8000 series
ISO 8000-1 provides a detailed explanation of the structure and scope of the whole ISO 8000 series.
3)
ISO has identified this document, ISO 8000-1 and ISO 8000-8 as horizontal deliverables .
0.3 Role of this document
As a contribution to the capability of the ISO 8000 series, this document specifies the single, common
vocabulary for the ISO 8000 series. This vocabulary is ideal reading material by which to understand
the overall subject matter of data quality. This document presents the vocabulary structured by a series
of topic areas (for example, terms relating to quality and terms relating to data and information).
This document supports activities that affect:
— one or more information systems;
— data flows within the organization and with external organizations;
— any phase of the data life cycle.
3) A horizontal deliverable is a deliverable dealing with a subject relevant to a n
...

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.