Information technology -- Artificial intelligence -- Overview of trustworthiness in artificial intelligence

This document surveys topics related to trustworthiness in AI systems, including the following:
— approaches to establish trust in AI systems through transparency, explainability, controllability, etc.;
— engineering pitfalls and typical associated threats and risks to AI systems, along with possible mitigation techniques and methods; and
— approaches to assess and achieve availability, resiliency, reliability, accuracy, safety, security and privacy of AI systems.
The specification of levels of trustworthiness for AI systems is out of the scope of this document.

Technologies de l'information -- Intelligence artificielle -- Examen d'ensemble de la fiabilité en matière d'intelligence artificielle

General Information

Status: Published
Publication date: 27-May-2020
Current stage: 60.60 (International Standard published)
Stage start date: 28-May-2020
Buy Standard

Technical report: ISO/IEC TR 24028:2020 - Information technology -- Artificial intelligence -- Overview of trustworthiness in artificial intelligence (English, 43 pages)

Standards Content (sample)

TECHNICAL REPORT ISO/IEC TR 24028
First edition
2020-05

Information technology — Artificial intelligence — Overview of trustworthiness in artificial intelligence

Technologies de l'information — Intelligence artificielle — Examen d'ensemble de la fiabilité en matière d'intelligence artificielle

Reference number: ISO/IEC TR 24028:2020(E)
© ISO/IEC 2020
COPYRIGHT PROTECTED DOCUMENT
© ISO/IEC 2020

All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting on the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below or ISO’s member body in the country of the requester.

ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Fax: +41 22 749 09 47
Email: copyright@iso.org
Website: www.iso.org

Published in Switzerland
Contents

Foreword
Introduction
1 Scope
2 Normative references
3 Terms and definitions
4 Overview
5 Existing frameworks applicable to trustworthiness
5.1 Background
5.2 Recognition of layers of trust
5.3 Application of software and data quality standards
5.4 Application of risk management
5.5 Hardware-assisted approaches
6 Stakeholders
6.1 General concepts
6.2 Types
6.3 Assets
6.4 Values
7 Recognition of high-level concerns
7.1 Responsibility, accountability and governance
7.2 Safety
8 Vulnerabilities, threats and challenges
8.1 General
8.2 AI specific security threats
8.2.1 General
8.2.2 Data poisoning
8.2.3 Adversarial attacks
8.2.4 Model stealing
8.2.5 Hardware-focused threats to confidentiality and integrity
8.3 AI specific privacy threats
8.3.1 General
8.3.2 Data acquisition
8.3.3 Data pre-processing and modelling
8.3.4 Model query
8.4 Bias
8.5 Unpredictability
8.6 Opaqueness
8.7 Challenges related to the specification of AI systems
8.8 Challenges related to the implementation of AI systems
8.8.1 Data acquisition and preparation
8.8.2 Modelling
8.8.3 Model updates
8.8.4 Software defects
8.9 Challenges related to the use of AI systems
8.9.1 Human-computer interaction (HCI) factors
8.9.2 Misapplication of AI systems that demonstrate realistic human behaviour
8.10 System hardware faults
9 Mitigation measures
9.1 General
9.2 Transparency
9.3 Explainability
9.3.1 General
9.3.2 Aims of explanation
9.3.3 Ex-ante vs ex-post explanation
9.3.4 Approaches to explainability
9.3.5 Modes of ex-post explanation
9.3.6 Levels of explainability
9.3.7 Evaluation of the explanations
9.4 Controllability
9.4.1 General
9.4.2 Human-in-the-loop control points
9.5 Strategies for reducing bias
9.6 Privacy
9.7 Reliability, resilience and robustness
9.8 Mitigating system hardware faults
9.9 Functional safety
9.10 Testing and evaluation
9.10.1 General
9.10.2 Software validation and verification methods
9.10.3 Robustness considerations
9.10.4 Privacy-related considerations
9.10.5 System predictability considerations
9.11 Use and applicability
9.11.1 Compliance
9.11.2 Managing expectations
9.11.3 Product labelling
9.11.4 Cognitive science research
10 Conclusions
Annex A (informative) Related work on societal issues
Bibliography
Foreword

ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission) form the specialized system for worldwide standardization. National bodies that are members of ISO or IEC participate in the development of International Standards through technical committees established by the respective organization to deal with particular fields of technical activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other international organizations, governmental and non-governmental, in liaison with ISO and IEC, also take part in the work.

The procedures used to develop this document and those intended for its further maintenance are described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for the different types of document should be noted. This document was drafted in accordance with the editorial rules of the ISO/IEC Directives, Part 2 (see www.iso.org/directives).

Attention is drawn to the possibility that some of the elements of this document may be the subject of patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent rights. Details of any patent rights identified during the development of the document will be in the Introduction and/or on the ISO list of patent declarations received (see www.iso.org/patents) or the IEC list of patent declarations received (see http://patents.iec.ch).

Any trade name used in this document is information given for the convenience of users and does not constitute an endorsement.

For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and expressions related to conformity assessment, as well as information about ISO's adherence to the World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT), see www.iso.org/iso/foreword.html.

This document was prepared by Joint Technical Committee ISO/IEC JTC 1, Information Technology, Subcommittee SC 42, Artificial Intelligence.

Any feedback or questions on this document should be directed to the user’s national standards body. A complete listing of these bodies can be found at www.iso.org/members.html.
Introduction

The goal of this document is to analyse the factors that can impact the trustworthiness of systems providing or using AI, hereafter called artificial intelligence (AI) systems. The document briefly surveys the existing approaches that can support or improve trustworthiness in technical systems and discusses their potential application to AI systems. It also discusses possible approaches to mitigating AI system vulnerabilities that relate to trustworthiness, and to improving the trustworthiness of AI systems.
1 Scope

This document surveys topics related to trustworthiness in AI systems, including the following:

— approaches to establish trust in AI systems through transparency, explainability, controllability, etc.;

— engineering pitfalls and typical associated threats and risks to AI systems, along with possible mitigation techniques and methods; and

— approaches to assess and achieve availability, resiliency, reliability, accuracy, safety, security and privacy of AI systems.

The specification of levels of trustworthiness for AI systems is out of the scope of this document.

2 Normative references
There are no normative references in this document.
3 Terms and definitions
For the purposes of this document, the following terms and definitions apply.

ISO and IEC maintain terminological databases for use in standardization at the following addresses:

— ISO Online browsing platform: available at https://www.iso.org/obp
— IEC Electropedia: available at http://www.electropedia.org/
3.1
accountability
property that ensures that the actions of an entity (3.16) may be traced uniquely to that entity

[SOURCE: ISO/IEC 2382:2015, 2126250, modified — The Notes to entry have been removed.]

3.2
actor
entity (3.16) that communicates and interacts

[SOURCE: ISO/IEC TR 22417:2017, 3.1]

3.3
algorithm
set of rules for transforming the logical representation of data (3.11)

[SOURCE: ISO/IEC 11557:1992, 4.3]

3.4
artificial intelligence
capability of an engineered system (3.38) to acquire, process and apply knowledge and skills

Note 1 to entry: Knowledge consists of facts, information (3.20) and skills acquired through experience or education.
3.5
asset
anything that has value (3.46) to a stakeholder (3.37)

Note 1 to entry: There are many types of assets, including:
a) information (3.20);
b) software, such as a computer program;
c) physical, such as a computer;
d) services;
e) people and their qualifications, skills and experience; and
f) intangibles, such as reputation and image.

[SOURCE: ISO/IEC 21827:2008, 3.4, modified — In the definition, “the organization” has been changed to “a stakeholder”. Note 1 to entry has been removed.]

3.6
attribute
property or characteristic of an object that can be distinguished quantitatively or qualitatively by human or automated means

[SOURCE: ISO/IEC/IEEE 15939:2017, 3.2]

3.7
autonomy
autonomous
characteristic of a system (3.38) governed by its own rules as the result of self-learning

Note 1 to entry: Such systems are not subject to external control (3.10) or oversight.

3.8
bias
favouritism towards some things, people or groups over others

3.9
consistency
degree of uniformity, standardization and freedom from contradiction among the documents or parts of a system (3.38) or component

[SOURCE: ISO/IEC 21827:2008, 3.14]

3.10
control
purposeful action on or in a process (3.29) to meet specified objectives

[SOURCE: IEC 61800-7-1:2015, 3.2.6]

3.11
data
re-interpretable representation of information (3.20) in a formalized manner suitable for communication, interpretation or processing

Note 1 to entry: Data can be processed by human or automatic means.

[SOURCE: ISO/IEC 2382:2015, 2121272, modified — Notes 2 and 3 to entry have been removed.]
3.12
data subject
individual about whom personal data (3.27) are recorded

[SOURCE: ISO 5127:2017, 3.13.4.01, modified — Note 1 to entry has been removed.]

3.13
decision tree
supervised-learning model for which inference can be represented by traversing one or more tree-like structures

3.14
effectiveness
extent to which planned activities are realized and planned results achieved

[SOURCE: ISO 9000:2015, 3.7.11, modified — Note 1 to entry has been removed.]

3.15
efficiency
relationship between the results achieved and the resources used

[SOURCE: ISO 9000:2015, 3.7.10]

3.16
entity
any concrete or abstract thing of interest

[SOURCE: ISO/IEC 10746-2:2009, 6.1]

3.17
harm
injury or damage to the health of people or damage to property or the environment

[SOURCE: ISO/IEC Guide 51:2014, 3.1]

3.18
hazard
potential source of harm (3.17)

[SOURCE: ISO/IEC Guide 51:2014, 3.2]

3.19
human factors
environmental, organizational and job factors, in conjunction with cognitive human characteristics, which influence the behaviour of persons or organizations

3.20
information
meaningful data (3.11)

[SOURCE: ISO 9000:2015, 3.8.2]

3.21
integrity
property of protecting the accuracy and completeness of assets (3.5)

[SOURCE: ISO/IEC 27000:2018, 3.36, modified — In the definition, "protecting the" has been added before "accuracy" and "of assets" has been added after "completeness".]
3.22
intended use
use in accordance with information (3.20) provided with a product or system (3.38) or, in the absence of such information, by generally understood patterns (3.26) of usage

[SOURCE: ISO/IEC Guide 51:2014, 3.6]

3.23
machine learning
process (3.29) by which a functional unit improves its performance by acquiring new knowledge or skills or by reorganizing existing knowledge or skills

[SOURCE: ISO/IEC 2382:2015, 2123789]

3.24
machine learning model
mathematical construct that generates an inference or prediction, based on input data (3.11)

3.25
neural network
computational model utilizing distributed, parallel local processing and consisting of a network of simple processing elements called artificial neurons, which can exhibit complex global behaviour

[SOURCE: ISO 18115-1:2013, 8.1]

3.26
pattern
set of features and their relationships used to recognize an entity (3.16) within a given context

[SOURCE: ISO/IEC 2382:2015, 2123798]

3.27
personal data
data (3.11) relating to an identified or identifiable individual

[SOURCE: ISO 5127:2017, 3.1.10.14, modified — The admitted terms and Notes 1 and 2 to entry have been removed.]

3.28
privacy
freedom from intrusion into the private life or affairs of an individual when that intrusion results from undue or illegal gathering and use of data (3.11) about that individual

[SOURCE: ISO/IEC 2382:2015, 2126263, modified — Notes 1 and 2 to entry have been removed.]

3.29
process
set of interrelated or interacting activities that use inputs to deliver an intended result

[SOURCE: ISO 9000:2015, 3.4.1, modified — The notes to entry have been omitted.]

3.30
reliability
property of consistent intended behaviour and results

[SOURCE: ISO/IEC 27000:2018, 3.55]
3.31
risk
effect of uncertainty on objectives

Note 1 to entry: An effect is a deviation from the expected. It can be positive, negative or both and can address, create or result in opportunities and threats (3.39).

Note 2 to entry: Objectives can have different aspects and categories and can be applied at different levels.

Note 3 to entry: Risk is usually expressed in terms of risk sources, potential events, their consequences and their likelihood.

[SOURCE: ISO 31000:2018, 3.1]

3.32
robot
programmed actuated mechanism with a degree of autonomy (3.7), moving within its environment, to perform intended tasks

Note 1 to entry: A robot includes the control (3.10) system and interface of the control system (3.38).

Note 2 to entry: The classification of a robot into industrial robot or service robot is done according to its intended application.

[SOURCE: ISO 18646-2:2019, 3.1]

3.33
robotics
science and practice of designing, manufacturing and applying robots (3.32)

[SOURCE: ISO 8373:2012, 2.16]

3.34
safety
freedom from risk (3.31) which is not tolerable

[SOURCE: ISO/IEC Guide 51:2014, 3.14]

3.35
security
degree to which a product or system (3.38) protects information (3.20) and data (3.11) so that persons or other products or systems have the degree of data access appropriate to their types and levels of authorization

[SOURCE: ISO/IEC 25010:2011, 4.2.6]

3.36
sensitive data
data (3.11) with potentially harmful effects in the event of disclosure or misuse

[SOURCE: ISO 5127:2017, 3.1.10.16]

3.37
stakeholder
any individual, group or organization that can affect, be affected by or perceive itself to be affected by a decision or activity

[SOURCE: ISO/IEC 38500:2015, 2.24]
3.38
system
combination of interacting elements organized to achieve one or more stated purposes

Note 1 to entry: A system is sometimes considered as a product or as the services it provides.

[SOURCE: ISO/IEC/IEEE 15288:2015, 3.38]

3.39
threat
potential cause of an unwanted incident, which may result in harm (3.17) to systems (3.38), organizations or individuals

3.40
training
process (3.29) to establish or to improve the parameters of a machine learning model (3.24) based on a machine learning algorithm (3.3) by using training data (3.11)

3.41
trust
degree to which a user (3.43) or other stakeholder (3.37) has confidence that a product or system (3.38) will behave as intended

[SOURCE: ISO/IEC 25010:2011, 4.1.3.2]

3.42
trustworthiness
ability to meet stakeholders' (3.37) expectations in a verifiable way

Note 1 to entry: Depending on the context or sector, and also on the specific product or service, data (3.11) and technology used, different characteristics apply and need verification (3.47) to ensure stakeholders' expectations are met.

Note 2 to entry: Characteristics of trustworthiness include, for instance, reliability (3.30), availability, resilience, security (3.35), privacy (3.28), safety (3.34), accountability (3.1), transparency, integrity (3.21), authenticity, quality and usability.

Note 3 to entry: Trustworthiness is an attribute (3.6) that can be applied to services, products, technology, data and information (3.20) as well as, in the context of governance, to organizations.

3.43
user
individual or group that interacts with a system (3.38) or benefits from a system during its utilization

[SOURCE: ISO/IEC/IEEE 15288:2015, 4.1.52, modified — Note 1 to entry has been removed.]

3.44
validation
confirmation, through the provision of objective evidence, that the requirements for a specific intended use (3.22) or application have been fulfilled

Note 1 to entry: The right system (3.38) was built.
...

TECHNICAL ISO/IEC TR
REPORT 24028
First edition
Information technology —
Artificial intelligence — Overview
of trustworthiness in artificial
intelligence
Technologies de l'information — Intelligence artificielle — Examen
d'ensemble de la fiabilité en matière d'intelligence artificielle
PROOF/ÉPREUVE
Reference number
ISO/IEC TR 24028:2020(E)
ISO/IEC 2020
---------------------- Page: 1 ----------------------
ISO/IEC TR 24028:2020(E)
COPYRIGHT PROTECTED DOCUMENT
© ISO/IEC 2020

All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may

be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting

on the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address

below or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Fax: +41 22 749 09 47
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
Contents

Foreword
Introduction
1 Scope
2 Normative references
3 Terms and definitions
4 Overview
5 Existing frameworks applicable to trustworthiness
5.1 Background
5.2 Recognition of layers of trust
5.3 Application of software and data quality standards
5.4 Application of risk management
5.5 Hardware-assisted approaches
6 Stakeholders
6.1 General concepts
6.2 Types
6.3 Assets
6.4 Values
7 Recognition of high-level concerns
7.1 Responsibility, accountability and governance
7.2 Safety
8 Vulnerabilities, threats and challenges
8.1 General
8.2 AI specific security threats
8.2.1 General
8.2.2 Data poisoning
8.2.3 Adversarial attacks
8.2.4 Model stealing
8.2.5 Hardware-focused threats to confidentiality and integrity
8.3 AI specific privacy threats
8.3.1 General
8.3.2 Data acquisition
8.3.3 Data pre-processing and modelling
8.3.4 Model query
8.4 Bias
8.5 Unpredictability
8.6 Opaqueness
8.7 Challenges related to the specification of AI systems
8.8 Challenges related to the implementation of AI systems
8.8.1 Data acquisition and preparation
8.8.2 Modelling
8.8.3 Model updates
8.8.4 Software defects
8.9 Challenges related to the use of AI systems
8.9.1 Human-computer interaction (HCI) factors
8.9.2 Misapplication of AI systems that demonstrate realistic human behaviour
8.10 System hardware faults
9 Mitigation measures
9.1 General
9.2 Transparency
9.3 Explainability
9.3.1 General
9.3.2 Aims of explanation
9.3.3 Ex-ante vs ex-post explanation
9.3.4 Approaches to explainability
9.3.5 Modes of ex-post explanation
9.3.6 Levels of explainability
9.3.7 Evaluation of the explanations
9.4 Controllability
9.4.1 General
9.4.2 Human-in-the-loop control points
9.5 Strategies for reducing bias
9.6 Privacy
9.7 Reliability, resilience and robustness
9.8 Mitigating system hardware faults
9.9 Functional safety
9.10 Testing and evaluation
9.10.1 General
9.10.2 Software validation and verification methods
9.10.3 Robustness considerations
9.10.4 Privacy-related considerations
9.10.5 System predictability considerations
9.11 Use and applicability
9.11.1 Compliance
9.11.2 Managing expectations
9.11.3 Product labelling
9.11.4 Cognitive science research
10 Conclusions
Annex A (informative) Related work on societal issues
Bibliography

Foreword

ISO (the International Organization for Standardization) and IEC (the International Electrotechnical

Commission) form the specialized system for worldwide standardization. National bodies that

are members of ISO or IEC participate in the development of International Standards through

technical committees established by the respective organization to deal with particular fields of

technical activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other

international organizations, governmental and non-governmental, in liaison with ISO and IEC, also

take part in the work.

The procedures used to develop this document and those intended for its further maintenance are

described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for

the different types of document should be noted. This document was drafted in accordance with the

editorial rules of the ISO/IEC Directives, Part 2 (see www.iso.org/directives).

Attention is drawn to the possibility that some of the elements of this document may be the subject

of patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent

rights. Details of any patent rights identified during the development of the document will be in the

Introduction and/or on the ISO list of patent declarations received (see www.iso.org/patents) or the IEC

list of patent declarations received (see http://patents.iec.ch).

Any trade name used in this document is information given for the convenience of users and does not

constitute an endorsement.

For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and

expressions related to conformity assessment, as well as information about ISO's adherence to the

World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT), see www.iso.org/iso/foreword.html.

This document was prepared by Joint Technical Committee ISO/IEC JTC 1, Information Technology,

Subcommittee SC 42, Artificial Intelligence.

Any feedback or questions on this document should be directed to the user’s national standards body. A

complete listing of these bodies can be found at www.iso.org/members.html.
Introduction

The goal of this document is to analyse the factors that can impact the trustworthiness of systems providing or using AI, hereafter called artificial intelligence (AI) systems. The document briefly surveys existing approaches that can support or improve trustworthiness in technical systems and discusses their potential application to AI systems. It also discusses possible approaches to mitigating AI system vulnerabilities that relate to trustworthiness and to improving the trustworthiness of AI systems.
TECHNICAL REPORT ISO/IEC TR 24028:2020(E)
Information technology — Artificial intelligence —
Overview of trustworthiness in artificial intelligence
1 Scope

This document surveys topics related to trustworthiness in AI systems, including the following:

— approaches to establish trust in AI systems through transparency, explainability, controllability, etc.;

— engineering pitfalls and typical associated threats and risks to AI systems, along with possible

mitigation techniques and methods; and

— approaches to assess and achieve availability, resiliency, reliability, accuracy, safety, security and

privacy of AI systems.

The specification of levels of trustworthiness for AI systems is out of the scope of this document.

2 Normative references
There are no normative references in this document.
3 Terms and definitions
For the purposes of this document, the following terms and definitions apply.

ISO and IEC maintain terminological databases for use in standardization at the following addresses:

— ISO Online browsing platform: available at https://www.iso.org/obp
— IEC Electropedia: available at http://www.electropedia.org/
3.1
accountability

property that ensures that the actions of an entity (3.16) may be traced uniquely to that entity

[SOURCE: ISO/IEC 2382:2015, 2126250, modified — The Notes to entry have been removed.]

3.2
actor
entity (3.16) that communicates and interacts
[SOURCE: ISO/IEC TR 22417:2017, 3.1]
3.3
algorithm
set of rules for transforming the logical representation of data (3.11)
[SOURCE: ISO/IEC 11557:1992, 4.3]
3.4
artificial intelligence

capability of an engineered system (3.38) to acquire, process and apply knowledge and skills

Note 1 to entry: Knowledge consists of facts, information (3.20) and skills acquired through experience or education.

3.5
asset
anything that has value (3.46) to a stakeholder (3.37)
Note 1 to entry: There are many types of assets, including:
a) information (3.20);
b) software, such as a computer program;
c) physical, such as computer;
d) services;
e) people and their qualifications, skills and experience; and
f) intangibles, such as reputation and image.

[SOURCE: ISO/IEC 27000:2016, 2.4, modified — "the organization" has been replaced with "a stakeholder".]

3.6
attribute

property or characteristic of an object that can be distinguished quantitatively or qualitatively by

human or automated means
[SOURCE: ISO/IEC 15939:2016, 3.2]
3.7
autonomy
autonomous

characteristic of a system (3.38) governed by its own rules as the result of self-learning

Note 1 to entry: Such systems are not subject to external control (3.10) or oversight.

3.8
bias
favouritism towards some things, people or groups over others
3.9
consistency

degree of uniformity, standardization and freedom from contradiction among the documents or parts

of a system (3.38) or component
[SOURCE: ISO/IEC 21827:2008, 3.14]
3.10
control
purposeful action on or in a process (3.29) to meet specified objectives
[SOURCE: IEC 61800-7-1:2015, 3.2.6]
3.11
data

re-interpretable representation of information (3.20) in a formalized manner suitable for

communication, interpretation or processing
Note 1 to entry: Data can be processed by human or automatic means.

[SOURCE: ISO/IEC 2382:2015, 2121272, modified — Notes 2 and 3 to entry have been removed.]

3.12
data subject
individual about whom personal data (3.27) are recorded
[SOURCE: ISO 5127:2017, 3.13.4.01, modified — Note 1 to entry has been removed.]
3.13
decision tree

supervised-learning model for which inference can be represented by traversing one or more tree-like

structures
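As an informal illustration (a sketch, not part of the standard's text): inference with a decision tree amounts to traversing a tree-like structure from the root to a leaf. The tree, feature names and thresholds below are hypothetical.

```python
# Hypothetical decision tree, represented as nested dictionaries;
# leaves are plain label strings.
tree = {
    "feature": "temperature_c", "threshold": 0.0,
    "left": "frost_warning",          # branch taken when value <= threshold
    "right": {
        "feature": "humidity_pct", "threshold": 90.0,
        "left": "no_warning",
        "right": "fog_possible",
    },
}

def predict(node, sample):
    """Traverse the tree until a leaf (a plain label) is reached."""
    while isinstance(node, dict):
        branch = "left" if sample[node["feature"]] <= node["threshold"] else "right"
        node = node[branch]
    return node

print(predict(tree, {"temperature_c": -2.0, "humidity_pct": 50.0}))  # frost_warning
```

In supervised learning, the features, thresholds and leaf labels of such a tree are established from labelled training data rather than written by hand.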
3.14
effectiveness
extent to which planned activities are realized and planned results achieved
[SOURCE: ISO 9000:2005, 3.2.14]
3.15
efficiency
relationship between the results achieved and the resources used
[SOURCE: ISO 9000:2005]
3.16
entity
any concrete or abstract thing of interest
[SOURCE: ISO/IEC 10746-2]
3.17
harm

injury or damage to the health of people or damage to property or the environment

[SOURCE: ISO/IEC Guide 51:2014, 3.1]
3.18
hazard
potential source of harm (3.17)
[SOURCE: ISO/IEC Guide 51:2014, 3.2]
3.19
human factors

environmental, organizational and job factors, in conjunction with cognitive human characteristics,

which influence the behaviour of persons or organizations
3.20
information
meaningful data (3.11)
[SOURCE: ISO 9000:2015, 3.8.2]
3.21
integrity
property of protecting the accuracy and completeness of assets (3.5)
[SOURCE: ISO/IEC 27000:2016, 2.36]
3.22
intended use

use in accordance with information (3.20) provided with a product or system (3.38) or, in the absence of

such information, by generally understood patterns (3.26) of usage.
[SOURCE: ISO/IEC Guide 51:2014, 3.6]
3.23
machine learning

process (3.29) by which a functional unit improves its performance by acquiring new knowledge or

skills or by reorganizing existing knowledge or skills
[SOURCE: ISO/IEC 2382:2015, 2123789]
3.24
machine learning model

mathematical construct that generates an inference or prediction, based on input data (3.11)
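As an informal sketch (not part of the standard's text): in this sense, a machine learning model is simply a set of parameters together with a rule that maps input data to an inference or prediction. The weights below are hypothetical, as if produced by a prior training process.

```python
# Hypothetical parameters, assumed to have been established by training (3.40).
weights = [0.8, -0.4]
bias = 0.1

def model(x):
    """Linear model: prediction = w · x + b."""
    return sum(w * xi for w, xi in zip(weights, x)) + bias

print(model([1.0, 2.0]))  # 0.8*1.0 + (-0.4)*2.0 + 0.1 = 0.1
```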

3.25
neural network

computational model utilizing distributed, parallel local processing and consisting of a network of

simple processing elements called artificial neurons, which can exhibit complex global behaviour

[SOURCE: ISO 18115-1:2013, 8.1]
3.26
pattern

set of features and their relationships used to recognize an entity (3.16) within a given context

[SOURCE: ISO/IEC 2382:2015, 2123798]
3.27
personal data
data (3.11) relating to an identified or identifiable individual

[SOURCE: ISO 5127:2017, 3.1.10.14, modified — The admitted terms and Notes 1 and 2 to entry have

been removed.]
3.28
privacy

freedom from intrusion into the private life or affairs of an individual when that intrusion results from

undue or illegal gathering and use of data (3.11) about that individual
[SOURCE: ISO/IEC 2382-8:1998, 08.01.23]
3.29
process

set of interrelated or interacting activities that use inputs to deliver an intended result

[SOURCE: ISO 9001:2015]
3.30
reliability
property of consistent intended behaviour and results
[SOURCE: ISO/IEC 27000:2016, 2.56]
3.31
risk
effect of uncertainty on objectives

Note 1 to entry: An effect is a deviation from the expected. It can be positive, negative or both and can address,

create or result in opportunities and threats (3.39).

Note 2 to entry: Objectives can have different aspects and categories and can be applied at different levels.

Note 3 to entry: Risk is usually expressed in terms of risk sources, potential events, their consequences and their

likelihood.
[SOURCE: ISO 31000:2018, 3.1]
3.32
robot

programmed actuated mechanism with a degree of autonomy (3.7), moving within its environment, to

perform intended tasks

Note 1 to entry: A robot includes the control (3.10) system and interface of the control system (3.38).

Note 2 to entry: The classification of robot into industrial robot or service robot is done according to its intended

application.
[SOURCE: ISO 18646-2:2019, 3.1]
3.33
robotics
science and practice of designing, manufacturing and applying robots (3.32)
[SOURCE: ISO 8373:2012, 2.16]
3.34
safety
freedom from risk (3.31) which is not tolerable
[SOURCE: ISO/IEC Guide 51:2014]
3.35
security

degree to which a product or system (3.38) protects information (3.20) and data (3.11) so that persons

or other products or systems have the degree of data access appropriate to their types and levels of

authorization
[SOURCE: ISO/IEC 25010:2011]
3.36
sensitive data

data (3.11) with potentially harmful effects in the event of disclosure or misuse

[SOURCE: ISO 5127:2017, 3.1.10.16]
3.37
stakeholder

any individual, group or organization that can affect, be affected by or perceive itself to be affected by

a decision or activity
[SOURCE: ISO/IEC 38500:2015, 2.24]
3.38
system

combination of interacting elements organized to achieve one or more stated purposes

Note 1 to entry: A system is sometimes considered as a product or as the services it provides.

[SOURCE: ISO/IEC/IEEE 15288:2015, 3.38]
3.39
threat

potential cause of an unwanted incident, which may result in harm (3.17) to systems (3.38), organizations

or individuals
3.40
training

process (3.29) to establish or to improve the parameters of a machine learning model (3.24) based on a

machine learning algorithm (3.3) by using training data (3.11)
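As an informal sketch (not part of the standard's text): training establishes the parameters of a machine learning model (3.24) from training data. Below, gradient descent on a mean-squared-error objective fits the parameters w and b of a linear model y = w*x + b; the data and learning rate are hypothetical.

```python
# Hypothetical training data, generated by y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]

w, b = 0.0, 0.0   # model parameters before training
lr = 0.05         # learning rate
n = len(xs)

for _ in range(2000):  # training loop: gradient descent on mean squared error
    grad_w = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
    grad_b = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
    w -= lr * grad_w
    b -= lr * grad_b

print(round(w, 2), round(b, 2))  # parameters approach 2.0 and 1.0
```

The same loop structure underlies much of machine learning (3.23): an algorithm (3.3), here gradient descent, adjusts the parameters of a model so that its predictions fit the training data.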
3.41
trust

degree to which a user (3.43) or other stakeholder (3.37) has confidence that a product or system (3.38)

will behave as intended
[SOURCE: ISO/IEC 25010:2011, 4.1.3.2]
3.42
trustworthiness
ability to meet stakeholders' (3.37) expectations in a verifiable way

Note 1 to entry: Depending on the context or sector and also on the specific product or service, data (3.11) and

technology used, different characteristics apply and need verification (3.47) to ensure stakeholders' expectations

are met.

Note 2 to entry: Characteristics of trustworthiness include, for instance, reliability (3.30), availability, resilience,

security (3.35), privacy (3.28), safety (3.34), accountability (3.1), transparency, integrity (3.21), authenticity,

quality, usability.

Note 3 to entry: Trustworthiness is an attribute (3.6) that can be applied to services, products, technology, data

and information (3.20) as well as, in the context of governance, to organizations.

3.43
user

individual or group that interacts with a system (3.38) or benefits from a system during its utilization

[SOURCE: ISO/IEC/IEEE 15288:2015, 4.1.52, modified — Note 1 to entry has been removed.]
3.44
validation

confirmation, through the provision of objective evidence, that the requirements for a specific intended

use (3.22) or application have been fulfilled
Note 1 to entry: The right system (3.38) was built.

[SOURCE: ISO/IEC TR 29110-1:2016, 3.73, modified — Only the last sentence of Note 1 to entry has been

retained and Note 2 to entry has been removed.]
3.45
value
unit of data (3.11)
[SOURCE: ISO/IEC 15939:2016, 3.41]
...
