Artificial intelligence - Data quality for analytics and machine learning (ML) - Part 4: Data quality process framework (ISO/IEC 5259-4:2024)

This document establishes general common organizational approaches, regardless of the type, size or nature of the applying organization, to ensure data quality for training and evaluation in analytics and machine learning (ML). It includes guidance on the data quality process for:
—     supervised ML with regard to the labelling of data used for training ML systems, including common organizational approaches for training data labelling;
—     unsupervised ML;
—     semi-supervised ML;
—     reinforcement learning;
—     analytics.
This document is applicable to training and evaluation data that come from different sources, including data acquisition and data composition, data preparation, data labelling, evaluation and data use. This document does not define specific services, platforms or tools.

Künstliche Intelligenz - Datenqualität für Analytik und maschinelles Lernen (ML) - Teil 4: Rahmen für Datenqualitätsprozesse (ISO/IEC 5259-4:2024)

Intelligence artificielle - Qualité des données pour les analyses de données et l’apprentissage automatique - Partie 4: Cadre pour le processus de qualité des données (ISO/IEC 5259-4:2024)

Le présent document établit des approches organisationnelles communes générales, indépendamment du type, de la taille ou de la nature de l’organisme demandeur, afin de garantir la qualité des données pour l’entraînement et l’évaluation dans le cadre de l’analyse de données et de l’apprentissage automatique (AA). Il comprend des recommandations relatives au processus de qualité des données pour:
—     l’AA supervisé en ce qui concerne l’étiquetage des données utilisées pour entraîner les systèmes d’AA, y compris les approches organisationnelles communes pour l’étiquetage des données d’entraînement;
—     l’AA non supervisé;
—     l’AA semi-supervisé;
—     l’apprentissage par renforcement;
—     l’analyse de données.
Le présent document s’applique aux données d’entraînement et d’évaluation provenant de différentes sources, y compris l’acquisition et la composition des données, la préparation des données, l’étiquetage des données, l’évaluation et l’utilisation des données. Le présent document ne définit pas de services, plateformes ou outils spécifiques.

Umetna inteligenca - Kakovost podatkov za analizo in strojno učenje - 4. del: Procesni okvir zagotavljanja kakovosti podatkov (ISO/IEC 5259-4:2024)

General Information

Status
Published
Publication Date
20-May-2025
Current Stage
6060 - Definitive text made available (DAV) - Publishing
Start Date
21-May-2025
Due Date
28-Jan-2027
Completion Date
21-May-2025

Buy Standard

Standard
EN ISO/IEC 5259-4:2025
English language
37 pages
sale 10% off
Preview
sale 10% off
Preview
e-Library read for
1 day

Standards Content (Sample)


SLOVENSKI STANDARD
01-julij-2025
Umetna inteligenca - Kakovost podatkov za analizo in strojno učenje - 4. del:
Procesni okvir zagotavljanja kakovosti podatkov (ISO/IEC 5259-4:2024)
Artificial intelligence - Data quality for analytics and machine learning (ML) - Part 4: Data
quality process framework (ISO/IEC 5259-4:2024)
Künstliche Intelligenz - Datenqualität für Analytik und maschinelles Lernen (ML) - Teil 4:
Rahmen für Datenqualitätsprozesse (ISO/IEC 5259-4:2024)
Intelligence artificielle - Qualité des données pour les analyses de données et
l’apprentissage automatique - Partie 4: Cadre pour le processus de qualité des données
(ISO/IEC 5259-4:2024)
Ta slovenski standard je istoveten z: EN ISO/IEC 5259-4:2025
ICS:
35.020 Informacijska tehnika in Information technology (IT) in
tehnologija na splošno general
2003-01.Slovenski inštitut za standardizacijo. Razmnoževanje celote ali delov tega standarda ni dovoljeno.

EUROPEAN STANDARD EN ISO/IEC 5259-4

NORME EUROPÉENNE
EUROPÄISCHE NORM
May 2025
ICS 35.020
English version
Artificial intelligence - Data quality for analytics and
machine learning (ML) - Part 4: Data quality process
framework (ISO/IEC 5259-4:2024)
Intelligence artificielle - Qualité des données pour les Künstliche Intelligenz - Datenqualität für Analytik und
analyses de données et l'apprentissage automatique - maschinelles Lernen (ML) - Teil 4: Rahmen für
Partie 4: Cadre pour le processus de qualité des Datenqualitätsprozesse (ISO/IEC 5259-4:2024)
données (ISO/IEC 5259-4:2024)
This European Standard was approved by CEN on 18 May 2025.

CEN and CENELEC members are bound to comply with the CEN/CENELEC Internal Regulations which stipulate the conditions for
giving this European Standard the status of a national standard without any alteration. Up-to-date lists and bibliographical
references concerning such national standards may be obtained on application to the CEN-CENELEC Management Centre or to
any CEN and CENELEC member.
This European Standard exists in three official versions (English, French, German). A version in any other language made by
translation under the responsibility of a CEN and CENELEC member into its own language and notified to the CEN-CENELEC
Management Centre has the same status as the official versions.

CEN and CENELEC members are the national standards bodies and national electrotechnical committees of Austria, Belgium,
Bulgaria, Croatia, Cyprus, Czech Republic, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Iceland, Ireland, Italy,
Latvia, Lithuania, Luxembourg, Malta, Netherlands, Norway, Poland, Portugal, Republic of North Macedonia, Romania, Serbia,
Slovakia, Slovenia, Spain, Sweden, Switzerland, Türkiye and United Kingdom.

CEN-CENELEC Management Centre:
Rue de la Science 23, B-1040 Brussels
© 2025 CEN/CENELEC All rights of exploitation in any form and by any means
Ref. No. EN ISO/IEC 5259-4:2025 E
reserved worldwide for CEN national Members and for
CENELEC Members.
Contents Page
European foreword . 3

European foreword
The text of ISO/IEC 5259-4:2024 has been prepared by Technical Committee ISO/IEC JTC 1
"Information technology” of the International Organization for Standardization (ISO) and has been
taken over as EN ISO/IEC 5259-4:2025 by Technical Committee CEN-CENELEC/ JTC 21 “Artificial
Intelligence” the secretariat of which is held by DS.
This European Standard shall be given the status of a national standard, either by publication of an
identical text or by endorsement, at the latest by November 2025, and conflicting national standards
shall be withdrawn at the latest by November 2025.
Attention is drawn to the possibility that some of the elements of this document may be the subject of
patent rights. CEN-CENELEC shall not be held responsible for identifying any or all such patent rights.
Any feedback and questions on this document should be directed to the users’ national standards body.
A complete listing of these bodies can be found on the CEN and CENELEC websites.
According to the CEN-CENELEC Internal Regulations, the national standards organizations of the
following countries are bound to implement this European Standard: Austria, Belgium, Bulgaria,
Croatia, Cyprus, Czech Republic, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Iceland,
Ireland, Italy, Latvia, Lithuania, Luxembourg, Malta, Netherlands, Norway, Poland, Portugal, Republic of
North Macedonia, Romania, Serbia, Slovakia, Slovenia, Spain, Sweden, Switzerland, Türkiye and the
United Kingdom.
Endorsement notice
The text of ISO/IEC 5259-4:2024 has been approved by CEN-CENELEC as EN ISO/IEC 5259-4:2025
without any modification.
International
Standard
ISO/IEC 5259-4
First edition
Artificial intelligence — Data
2024-07
quality for analytics and machine
learning (ML) —
Part 4:
Data quality process framework
Intelligence artificielle — Qualité des données pour les analyses
de données et l’apprentissage automatique —
Partie 4: Cadre pour le processus de qualité des données
Reference number
ISO/IEC 5259-4:2024(en) © ISO/IEC 2024

ISO/IEC 5259-4:2024(en)
© ISO/IEC 2024
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting on
the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below
or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
© ISO/IEC 2024 – All rights reserved
ii
ISO/IEC 5259-4:2024(en)
Contents Page
Foreword .v
Introduction .vi
1 Scope .1
2 Normative references .1
3 Terms and definitions .1
4 Symbols and abbreviated terms.3
5 Data quality process principles .3
6 Data quality process framework .3
6.1 General .3
6.2 Data quality planning .5
6.3 Data quality evaluation .6
6.4 Data quality improvement .6
6.5 Data quality process validation .6
6.6 Using the DQPF .7
7 Data quality process for ML .7
7.1 General .7
7.2 Data requirements .8
7.3 Data planning . .9
7.4 Data acquisition .9
7.5 Data preparation .10
7.5.1 General .10
7.5.2 Supervised ML .10
7.5.3 Unsupervised ML .10
7.5.4 Semi-supervised ML .10
7.5.5 Dataset composition .11
7.5.6 Data labelling .11
7.5.7 Data annotation .11
7.5.8 Data quality assessment . 12
7.5.9 Data quality improvement . 13
7.5.10 Data de-identification . 15
7.5.11 Data encoding. .16
7.6 Data provisioning .16
7.6.1 General .16
7.6.2 Supervised ML .16
7.6.3 Unsupervised ML .16
7.6.4 Semi-supervised ML .16
7.7 Data decommissioning .16
8 Data labelling methods and process .17
8.1 General .
...

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.