Information technology — International string ordering and comparison — Method for comparing character strings and description of the common template tailorable ordering

This document defines the following. — A reference comparison method. This method is applicable to two character strings to determine their collating order in a sorted list. The method can be applied to strings containing characters from the full repertoire of ISO/IEC 10646. This method is also applicable to subsets of that repertoire, such as those of the different ISO/IEC 8-bit standard character sets, or any other character set, standardized or not, to produce ordering results valid (after tailoring) for a given set of languages for each script. This method uses collation tables derived either from the Common Template Table defined in this document or from one of its tailorings. This method provides a reference format. The format is described using the Backus-Naur Form (BNF). This format is used to describe the Common Template Table. The format is used normatively within this document. — A Common Template Table. A given tailoring of the Common Template Table is used by the reference comparison method. The Common Template Table describes an order for all characters encoded in the Unicode 13.0 standard,[27] included in ISO/IEC 10646:2020. It allows for a specification of a fully deterministic ordering. This table enables the specification of a string ordering adapted to local ordering rules, without requiring an implementer to have knowledge of all the different scripts already encoded in the Universal Coded Character Set (UCS). NOTE 1 This Common Template Table is to be modified to suit the needs of a local environment. The main worldwide benefit is that, for other scripts, often no modification is required and the order will remain as consistent as possible and predictable from an international point of view. NOTE 2 The character repertoire used in this document is equivalent to that of the Unicode Standard version 13.0[27]. — A reference name. The reference name refers to this particular version of the Common Template Table, for use as a reference when tailoring. In particular, this name implies that the table is linked to a particular stage of development of the ISO/IEC 10646 Universal coded character set. — Requirements for a declaration of the differences (delta) between the collation table and the Common Template Table. This document does not mandate the following. — A specific comparison method; any equivalent method giving the same results is acceptable. — A specific format for describing or tailoring tables in a given implementation. — Specific symbols to be used by implementations, except for the name of the Common Template Table. — Any specific user interface for choosing options. — Any specific internal format for intermediate keys used when comparing, nor for the table used. The use of numeric keys is not mandated either. — A context-dependent ordering. — Any particular preparation of character strings prior to comparison. NOTE 1 It is normally necessary to do preparation of character strings prior to comparison even if it is not prescribed by this document (see Annex C). NOTE 2 Annex D describes problems that gave way to this International Standard with their anticipated solutions.

Technologies de l'information — Classement international et comparaison de chaînes de caractères — Méthode de comparaison de chaînes de caractères et description du modèle commun et adaptable d'ordre de classement

Le présent document définit ce qui suit. — Une méthode de référence pour la comparaison de deux chaînes de caractères ayant pour but de déterminer leur ordre de classement dans une liste triée. La méthode s'applique à des chaînes utilisant le répertoire complet de l'ISO/IEC 10646, des sous-répertoires tels que ceux des divers jeux normalisés ISO/IEC à 8 bits ou tout autre jeu de caractères, normalisé ou non, et permet de produire des résultats de tri valables (après adaptation) pour un ensemble de langues de chaque système d'écriture. Cette méthode de référence utilise des tables de tri dérivées soit de la table-modèle commune de classement définie dans le présent document, soit d'une de ses adaptations. La méthode procure un format de référence de la table-modèle commune. Ce format est décrit en notation BNF (forme de Backus-Naur, Backus-Naur Form). Son emploi est normatif dans le présent document. — Une table-modèle commune de classement utilisée par la méthode de référence. Cette table décrit un ordre de base pour tous les caractères du standard Unicode 13.027 compris dans l'ISO/IEC 10646:2020. Tout cela permet de spécifier un ordre complètement déterministe. Cette table constitue le point de départ permettant de préciser un ordre de classement adapté aux règles de classement locales, sans qu'il soit nécessaire de connaître tous les systèmes d'écriture repris dans le jeu universel de caractères codés (JUC). NOTE 1 Cette table-modèle commune de classement est destinée à être modifiée pour satisfaire aux besoins d'environnements locaux. L'avantage principal de cette pratique, sur le plan mondial, réside dans le fait que, pour d'autres systèmes d'écriture que celui de l'utilisateur, aucune modification n'est nécessaire et cet ordre demeurera aussi cohérent que possible et prévisible dans un contexte international. NOTE 2 Le répertoire de caractères utilisé dans le présent document est équivalent à celui du standard Unicode, version 13.0[27]. — Un nom de référence représentant cette version particulière de la table-modèle commune, à utiliser comme point de départ à toute adaptation. Ce nom implique notamment que la table est liée à un stade de développement particulier du jeu universel de caractères codés (ISO/IEC 10646). — Des exigences pour la déclaration de différences (delta) entre une table de tri et la table-modèle commune. Le présent document ne spécifie pas ce qui suit. — Une méthode particulière de comparaison; toute méthode équivalente conduisant aux mêmes résultats est acceptable. — Un format précis pour décrire ou pour adapter les tables dans une mise en œuvre donnée. — Des symboles précis à utiliser par les mises en œuvre, sauf pour ce qui est du nom de la table-modèle commune de classement. — Une interface utilisateur particulière destinée à choisir les options. — Un format interne particulier pour les clés intermédiaires utilisées dans les comparaisons ou pour la table de tri. L'utilisation de clés numériques n'est pas spécifiée non plus. — Un ordre dépendant du contexte. — Un prétraitement particulier des chaînes de caractères avant comparaison. NOTE 1 Bien que ceci ne soit pas spécifié par le présent document, il s'avère souvent nécessaire de préparer les chaînes de caractères avant leur comparaison (cf. l'Annexe C). NOTE 2 L'Annexe D décrit les problèmes qui ont donné lieu à la présente Norme internationale avec leurs solutions anticipées.

General Information

Status: Withdrawn
Publication Date: 20-Dec-2020

ICS: 35.040.10 - Coding of character sets

Technical Committee: ISO/IEC JTC 1/SC 2 - Coded character sets
Drafting Committee: ISO/IEC JTC 1/SC 2 - Coded character sets

Current Stage: 9599 - Withdrawal of International Standard
Start Date: 22-Jul-2025
Completion Date: 12-Feb-2026

Relations

Revised: ISO/IEC 14651:2025 - Information technology — International string ordering and comparison — Method for comparing character strings and description of the common template tailorable ordering
Effective Date: 01-Jul-2023

Revises: ISO/IEC 14651:2019 - Information technology — International string ordering and comparison — Method for comparing character strings and description of the common template tailorable ordering
Effective Date: 23-Apr-2020

Overview

ISO/IEC 14651:2020 - "Information technology - International string ordering and comparison" - specifies a reference method for comparing character strings and defines a Common Template Table that can be tailored for local collation needs. The standard applies to strings containing characters from the full repertoire of ISO/IEC 10646 (equivalent to Unicode 13.0) and provides a normative format (described in Backus‑Naur Form, BNF) for collation tables and their tailorings.

Key technical topics and requirements

Reference comparison method: A method to determine collating order for two strings; implementations may use any equivalent method that produces identical results.
Common Template Table: A deterministic baseline ordering for all Unicode 13.0 characters that implementers can tailor to local languages and scripts.
Tailoring and deltas: Differences between a local collation table and the Common Template Table must be declared as a delta; this supports predictable cross‑locale behavior.
BNF format: The document defines a normative format (BNF) to describe the Common Template Table and its tailorings.
Conformance requirements: A conformant process must produce results identical to the reference method and declare:
- the number of collation levels supported (minimum three);
- support for forward/backward processing parameters;
- the tailoring delta and number of levels in it;
- any preparation method used for input strings.
Non‑mandated aspects: ISO/IEC 14651 does not require a specific internal key format, UI, context‑dependent ordering, or particular string preparation method (though preparation is normally necessary - see Annex C).
Scope coverage: Applicable to full Unicode repertoire and to subsets or other character sets mapped to ISO/IEC 10646.

Practical applications and users

Who benefits:

Software internationalization (i18n) engineers implementing locale‑aware sorting and searching
Database and search engine developers needing consistent collation across scripts
Standards bodies and vendors creating localized collation tables
Libraries and toolkits (programming languages, OSs) that provide sort/collation APIs

Typical uses:

Implementing deterministic, language‑aware sorting of names, dictionaries, indexes
Creating and publishing tailored collation tables (deltas) for national/local standards
Ensuring interoperability where consistent ordering across scripts and systems is required

Related standards

ISO/IEC 10646:2020 (Universal Coded Character Set - UCS) / Unicode 13.0 - character repertoire referenced by ISO/IEC 14651
ISO/IEC TR 30112 - informative guidance on ordering keywords that complement 14651

Keywords: ISO/IEC 14651, string ordering, collation, Common Template Table, Unicode 13.0, ISO/IEC 10646, tailoring, delta, BNF, internationalization, locale‑aware sorting.

ISO/IEC 14651:2020 - Information technology -- International string ordering and comparison -- Method for comparing character strings and description of the common template tailorable ordering - Page 1 preview

Standard

ISO/IEC 14651:2020 - Information technology -- International string ordering and comparison -- Method for comparing character strings and description of the common template tailorable ordering

English language

52 pages

sale 15% off

Preview

sale 15% off

Preview

ISO/IEC 14651:2020 - Technologies de l'information -- Classement international et comparaison de chaînes de caractères -- Méthode de comparaison de chaînes de caractères et description du modèle commun et adaptable d'ordre de classement - Page 1 preview

Standard

ISO/IEC 14651:2020 - Technologies de l'information -- Classement international et comparaison de chaînes de caractères -- Méthode de comparaison de chaînes de caractères et description du modèle commun et adaptable d'ordre de classement

French language

54 pages

sale 15% off

Preview

sale 15% off

Preview

Get Certified

Connect with accredited certification bodies for this standard

BSI Group

BSI (British Standards Institution) is the business standards company that helps organizations make excellence a habit.

UKAS United Kingdom Verified

Visit Website

NYCE

Mexican standards and certification body.

EMA Mexico Verified

Visit Website

Frequently Asked Questions

What is ISO/IEC 14651:2020?

ISO/IEC 14651:2020 is a standard published by the International Organization for Standardization (ISO). Its full title is "Information technology — International string ordering and comparison — Method for comparing character strings and description of the common template tailorable ordering". This standard covers: This document defines the following. — A reference comparison method. This method is applicable to two character strings to determine their collating order in a sorted list. The method can be applied to strings containing characters from the full repertoire of ISO/IEC 10646. This method is also applicable to subsets of that repertoire, such as those of the different ISO/IEC 8-bit standard character sets, or any other character set, standardized or not, to produce ordering results valid (after tailoring) for a given set of languages for each script. This method uses collation tables derived either from the Common Template Table defined in this document or from one of its tailorings. This method provides a reference format. The format is described using the Backus-Naur Form (BNF). This format is used to describe the Common Template Table. The format is used normatively within this document. — A Common Template Table. A given tailoring of the Common Template Table is used by the reference comparison method. The Common Template Table describes an order for all characters encoded in the Unicode 13.0 standard,[27] included in ISO/IEC 10646:2020. It allows for a specification of a fully deterministic ordering. This table enables the specification of a string ordering adapted to local ordering rules, without requiring an implementer to have knowledge of all the different scripts already encoded in the Universal Coded Character Set (UCS). NOTE 1 This Common Template Table is to be modified to suit the needs of a local environment. The main worldwide benefit is that, for other scripts, often no modification is required and the order will remain as consistent as possible and predictable from an international point of view. NOTE 2 The character repertoire used in this document is equivalent to that of the Unicode Standard version 13.0[27]. — A reference name. The reference name refers to this particular version of the Common Template Table, for use as a reference when tailoring. In particular, this name implies that the table is linked to a particular stage of development of the ISO/IEC 10646 Universal coded character set. — Requirements for a declaration of the differences (delta) between the collation table and the Common Template Table. This document does not mandate the following. — A specific comparison method; any equivalent method giving the same results is acceptable. — A specific format for describing or tailoring tables in a given implementation. — Specific symbols to be used by implementations, except for the name of the Common Template Table. — Any specific user interface for choosing options. — Any specific internal format for intermediate keys used when comparing, nor for the table used. The use of numeric keys is not mandated either. — A context-dependent ordering. — Any particular preparation of character strings prior to comparison. NOTE 1 It is normally necessary to do preparation of character strings prior to comparison even if it is not prescribed by this document (see Annex C). NOTE 2 Annex D describes problems that gave way to this International Standard with their anticipated solutions.

What is the scope of ISO/IEC 14651:2020?

What ICS categories does ISO/IEC 14651:2020 belong to?

ISO/IEC 14651:2020 is classified under the following ICS (International Classification for Standards) categories: 35.040.10 - Coding of character sets. The ICS classification helps identify the subject area and facilitates finding related standards.

What standards are related to ISO/IEC 14651:2020?

ISO/IEC 14651:2020 has the following relationships with other standards: It is inter standard links to ISO/IEC 14651:2025, ISO/IEC 14651:2019. Understanding these relationships helps ensure you are using the most current and applicable version of the standard.

How can I access ISO/IEC 14651:2020?

ISO/IEC 14651:2020 is available in PDF format for immediate download after purchase. The document can be added to your cart and obtained through the secure checkout process. Digital delivery ensures instant access to the complete standard document.

Standards Content (Sample)

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.

Loading comments...

Information technology — International string ordering and comparison — Method for comparing character strings and description of the common template tailorable ordering

Technologies de l'information — Classement international et comparaison de chaînes de caractères — Méthode de comparaison de chaînes de caractères et description du modèle commun et adaptable d'ordre de classement

General Information

Relations

Overview

Key technical topics and requirements

Practical applications and users

Related standards

ISO/IEC 14651:2020 - Information technology -- International string ordering and comparison -- Method for comparing character strings and description of the common template tailorable ordering

ISO/IEC 14651:2020 - Technologies de l'information -- Classement international et comparaison de chaînes de caractères -- Méthode de comparaison de chaînes de caractères et description du modèle commun et adaptable d'ordre de classement

Get Certified

BSI Group

NYCE

Frequently Asked Questions

Standards Content (Sample)

Questions, Comments and Discussion

This May Also Interest You