ISO/IEC JTC 1/SC 2 - Coded character sets
Standardization of graphic character sets and their characteristics, including string ordering, associated control functions, their coded representation for information interchange and code extension techniques. Excluded: audio and picture coding.
Jeux de caractères codés
Normalisation des jeux de caractères graphiques et de leurs caractéristiques, y compris l'ordre de classement des chaînes, les fonctions de commande associées, leur représentation codée pour l'échange d'information et les techniques d'extension de code. Sont exclus: le codage audio et d'image.
General Information
This document defines the following. — A reference comparison method. This method is applicable to two character strings to determine their collating order in a sorted list. The method can be applied to strings containing characters from the full repertoire of ISO/IEC 10646. This method is also applicable to subsets of that repertoire, such as those of the different ISO/IEC 8-bit standard character sets, or any other character set, standardized or not, to produce ordering results valid (after tailoring) for a given set of languages for each script. This method uses collation tables derived either from the Common Template Table defined in this document or from one of its tailorings. This method provides a reference format. The format is described using the Backus-Naur Form (BNF). This format is used to describe the Common Template Table. The format is used normatively within this document. — A Common Template Table. A given tailoring of the Common Template Table is used by the reference comparison method. The Common Template Table describes an order for all characters encoded in the Unicode 13.0 standard,[27] included in ISO/IEC 10646:2020. It allows for a specification of a fully deterministic ordering. This table enables the specification of a string ordering adapted to local ordering rules, without requiring an implementer to have knowledge of all the different scripts already encoded in the Universal Coded Character Set (UCS). NOTE 1 This Common Template Table is to be modified to suit the needs of a local environment. The main worldwide benefit is that, for other scripts, often no modification is required and the order will remain as consistent as possible and predictable from an international point of view. NOTE 2 The character repertoire used in this document is equivalent to that of the Unicode Standard version 13.0[27]. — A reference name. The reference name refers to this particular version of the Common Template Table, for use as a reference when tailoring. In particular, this name implies that the table is linked to a particular stage of development of the ISO/IEC 10646 Universal coded character set. — Requirements for a declaration of the differences (delta) between the collation table and the Common Template Table. This document does not mandate the following. — A specific comparison method; any equivalent method giving the same results is acceptable. — A specific format for describing or tailoring tables in a given implementation. — Specific symbols to be used by implementations, except for the name of the Common Template Table. — Any specific user interface for choosing options. — Any specific internal format for intermediate keys used when comparing, nor for the table used. The use of numeric keys is not mandated either. — A context-dependent ordering. — Any particular preparation of character strings prior to comparison. NOTE 1 It is normally necessary to do preparation of character strings prior to comparison even if it is not prescribed by this document (see Annex C). NOTE 2 Annex D describes problems that gave way to this International Standard with their anticipated solutions.
- Standard52 pagesEnglish languagesale 15% off
- Standard54 pagesFrench languagesale 15% off
- Draft52 pagesEnglish languagesale 15% off
This document ? specifies the architecture of the UCS; ? defines terms used for the UCS; ? describes the general structure of the UCS codespace; ? specifies the assigned planes of the UCS: the Basic Multilingual Plane (BMP) of the UCS, the Supplementary Multilingual Plane (SMP), the Supplementary Ideographic Plane (SIP), the Tertiary Ideographic Plane (TIP), and the Supplementary Special-purpose Plane (SSP); ? defines a set of graphic characters used in scripts and the written form of languages on a world-wide scale; ? specifies the names for the graphic characters and format characters of the BMP, SMP, SIP, TIP, SSP and their coded representations within the UCS codespace; ? specifies the coded representations for control characters and private use characters; ? specifies three encoding forms of the UCS: UTF-8, UTF-16, and UTF-32; ? specifies seven encoding schemes of the UCS: UTF-8, UTF-16, UTF-16BE, UTF-16LE, UTF-32, UTF-32BE, and UTF-32LE; ? specifies the management of future additions to this coded character set. NOTE The determination of suitability of these characters for use as identifiers in programming languages is not specified by this document but can be found in an external reference. See Annex U.
- Standard2804 pagesEnglish languagesale 15% off
- Draft2804 pagesEnglish languagesale 15% off
ISO/IEC 8859 consists of several parts. Each part specifies a set of up to 191 graphic characters and the coded representation of these characters by means of a single 8-bit byte. Each set is intended for use for a particular group of languages. ISO/IEC 8859-7:2003 specifies a set of 188 coded graphic characters identified as Latin/Greek alphabet. This set of coded graphic characters is intended for use in data and text processing applications and also for information interchange. The set contains graphic characters used for general purpose applications in typical office environments in at least the following languages: English, Greek, and Latin. This set of coded graphic characters may be regarded as a version of an 8-bit code according to ISO/IEC 2022 or ISO/IEC 4873 at level 1. ISO/IEC 8859-7:2003 may not be used in conjunction with any other parts of ISO/IEC 8859. If coded characters from more than one part are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC 10367, or their corresponding G1 sets from the ISO International Register of Coded Character Sets to be Used with Escape Sequences, should be used instead within a version of ISO/IEC 4873 at level 2 or level 3. The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC 6429. However, control functions are not used to create composite graphic symbols from two or more graphic characters (see Clause 6). NOTE ISO/IEC 8859 is not intended for use with Telematic services defined by ITU-T. If information coded according to ISO/IEC 8859 is to be transferred to such services, it will have to conform to the requirements of those services at the access-point.
- Standard9 pagesEnglish languagesale 15% off
ISO/IEC 2375:2003 specifies the procedures to be followed for preparing, maintaining, and publishing a register of escape sequences and of the coded character sets they identify. The registration process specified in ISO/IEC 2375:2003 is not a procedure for standardization of characters or coded character sets. Organizations that wish ISO and/or IEC to create an international standard for a coded character set or that wish ISO and/or IEC to code additional characters into ISO/IEC 10646 need to follow the ISO/IEC procedures for doing so. In particular, Registration of a coded character set according to the procedures specified by ISO/IEC 2375:2003 implies no commitment by ISO and/or IEC to adopt the coded character set as an ISO/IEC standard. The existence of a character in an approved registration does not imply a commitment by ISO and/or IEC to encode that character into ISO/IEC 10646. ISO/IEC 2022 describes the escape sequences referenced in ISO/IEC 2375:2003, except for escape sequences reserved in ISO/IEC 2022 for private use. The use of these escape sequences includes code extension, that is, the provision of additional sets of characters, or of additional control functions, in accordance with ISO/IEC 2022. An escape sequence registered in accordance with ISO/IEC 2375:2003 serves as an identification of the character, the set of characters, or the control function associated with it in the register. The registration itself does not specify the rules in accordance with which a character or character set identified by an escape sequence is to be used. Rather, the registration identifies the documents (for example, standards) which specify such rules.
- Standard25 pagesEnglish languagesale 15% off
This International Standard a) specifies the coded representation of the characters; b) specifies a repertoire of the Latin alphabetic and non-alphabetic characters for the communication of text in many European languages using the Latin script; c) specifies rules for the definitions and use of graphic character subrepertoires, i.e. subsets of the specified character repertoire.
- Standard40 pagesEnglish languagesale 15% off
- Standard45 pagesEnglish languagesale 10% offe-Library read for1 day
This part of ISO/IEC 8859 specifies a set of 183 coded graphic characters identified as Latin/Thai alphabet. This set of coded graphic characters is intended for use in data and text processing applications and also for information interchange. The set contains graphic characters used for general purpose applications in typical office environments in at least the following languages: Thai, English and Latin. Some of the characters in this set are combining characters (see clause 6). This set of coded graphic characters may be regarded as a version of an 8-bit code according to ISO/IEC 2022 or ISO/IEC 4873 at level 1. This part of ISO/IEC 8859 may not be used in conjunction with any other parts of ISO/IEC 8859. If coded characters from more than one part are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC 10367 or their corresponding G1 sets from ?ISO International Register of Coded Character Sets to be used with escape sequences', should be used instead within a version of ISO/IEC 4873 at level 2 or level 3. The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC 6429. However, control functions are not used to create composite graphic symbols from two or more graphic characters (see clause 6). NOTE ? ISO/IEC 8859 is not intended for use with Telematic services defined by ITU-T. If information coded according to ISO/IEC 8859 is to be transferred to such services, it will have to conform to the requirements of those services at the access-point.
- Standard10 pagesEnglish languagesale 15% off
This part of ISO/IEC 8859 specifies a set of 191 coded graphic characters identified as Latin alphabet No. 10. This set of coded graphic characters is intended for use in data and text processing applications and also for information interchange. The set contains graphic characters used for general purpose applications in typical office environments in at least the following languages: Albanian, Croatian, English, Finnish, French, German, Hungarian, Irish Gaelic (new orthography), Italian, Latin, Polish, Romanian, and Slovenian. This set of coded graphic characters may be regarded as a version of an 8-bit code according to ISO/IEC 2022 or ISO/IEC 4873 at level 1. This part of ISO/IEC 8859 may not be used in conjunction with any other parts of ISO/IEC 8859. If coded characters from more than one part are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC 10367, or their corresponding G1 sets from the ISO International Register of Coded Character Sets to be Used with Escape Sequences, should be used instead within a version of ISO/IEC 4873 at level 2 or level 3. The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC 6429. However, control functions are not used to create composite graphic symbols from two or more graphic characters (see clause 6). NOTE ? ISO/IEC 8859 is not intended for use with Telematic services defined by ITU-T. If information coded according to ISO/IEC 8859 is to be transferred to such services, it will have to conform to the requirements of those services at the access-point.
- Standard9 pagesEnglish languagesale 15% off
- Standard10 pagesFrench languagesale 15% off
This part of ISO/IEC 8859 specifies a set of 191 coded graphic characters identified as Latin alphabet No. 9. This set of coded graphic characters is intended for use in data and text processing applications and also for information interchange. The set contains graphic characters used for general purpose applications in typical office environments in at least the following languages: Albanian, Basque, Breton, Catalan, Danish, Dutch, English, Estonian, Faroese, Finnish, French, Frisian, Galician, German, Greenlandic, Icelandic, Irish Gaelic (new orthography), Italian, Latin, Luxemburgish, Norwegian, Portuguese, RhaetoRomanic, Scottish Gaelic, Spanish, and Swedish. This set of coded graphic characters may be regarded as a version of an 8-bit code according to ISO/IEC 2022 or ISO/IEC 4873 at level 1. This part of ISO/IEC 8859 may not be used in conjunction with any other parts of ISO/IEC 8859. If coded characters from more than one part are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC 10367 should be used instead within a version of ISO/IEC 4873 at level 2 or level 3. The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC 6429. However, control functions are not used to create composite graphic symbols from two or more graphic characters (see clause 6). NOTE – ISO/IEC 8859 is not intended for use with Telematic services defined by ITU-T. If information coded according to ISO/IEC 8859 is to be transferred to such services, it will have to conform to the requirements of those services at the access-point.
- Standard9 pagesEnglish languagesale 15% off
- Standard10 pagesFrench languagesale 15% off
- Standard12 pagesEnglish languagesale 15% off
This part of ISO/IEC 8859 specifies a set of 146 coded graphic characters identified as Latin/Arabic alphabet. This set of coded graphic characters is intended for use in data and text processing applications and also for information interchange. The set contains graphic characters used for general purpose applications in typical office environments in at least the following languages: Arabic, English and Latin. Some of the characters in this set are combining characters (see clause 6). This set of coded graphic characters may be regarded as a version of an 8-bit code according to ISO/IEC 2022 or ISO/IEC 4873 at level 1. This part of ISO/IEC 8859 may not be used in conjunction with any other parts of ISO/IEC 8859. If coded characters from more than one part are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC 10367 should be used instead within a version of ISO/IEC 4873 at level 2 or level 3. The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC 6429. However, control functions are not used to create composite graphic symbols from two or more graphic characters (see clause 6). NOTE ? ISO/IEC 8859 is not intended for use with Telematic services defined by ITU-T. If information coded according to ISO/IEC 8859 is to be transferred to such services, it will have to conform to the requirements of those services at the access-point.
- Standard11 pagesEnglish languagesale 15% off
This part of ISO/IEC 8859 specifies a set of 191 coded graphic characters identified as Latin alphabet No. 2. This set of coded graphic characters is intended for use in data and text processing applications and also for information interchange. The set contains graphic characters used for general purpose applications in typical office environments in at least the following languages: Albanian, Croat, Czech, English, German, Hungarian, Latin, Polish, Romanian, Slovak, Slovene and Sorbian. This set of coded graphic characters may be regarded as a version of an 8-bit code according to ISO/IEC 2022 or ISO/IEC 4873 at level 1. This part of ISO/IEC 8859 may not be used in conjunction with any other parts of ISO/IEC 8859. If coded characters from more than one part are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC 10367 should be used instead within a version of ISO/IEC 4873 at level 2 or level 3. The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC 6429. However, control functions are not used to create composite graphic symbols from two or more graphic characters (see clause 6). NOTE ? ISO/IEC 8859 is not intended for use with Telematic services defined by ITU-T. If information coded according to ISO/IEC 8859 is to be transferred to such services, it will have to conform to the requirements of those services at the access-point.
- Standard10 pagesEnglish languagesale 15% off
- Standard14 pagesEnglish languagesale 10% offe-Library read for1 day
This part of ISO/IEC 8859 specifies a set of 191 coded graphic characters identified as Latin alphabet No. 5. This set of coded graphic characters is intended for use in data and text processing applications and also for information interchange. The set contains graphic characters used for general purpose applications in typical office environments in at least the following languages: Albanian, Basque, Breton, Catalan, Danish, Dutch, English, Faroese, Finnish, French (with restrictions, see Annex A.1, Notes), Frisian, Galician, German, Greenlandic, Irish Gaelic (new orthography), Italian, Latin, Luxemburgish, Norwegian, Portuguese, Rhaeto-Romanic, Scottish Gaelic, Spanish, Swedish and Turkish. This set of coded graphic characters may be regarded as a version of an 8-bit code according to ISO/IEC 2022 or ISO/IEC 4873 at level 1. This part of ISO/IEC 8859 may not be used in conjunction with any other parts of ISO/IEC 8859. If coded characters from more than one part are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC 10367 should be used instead within a version of ISO/IEC 4873 at level 2 or level 3. The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC 6429. However, control functions are not used to create composite graphic symbols from two or more graphic characters (see clause 6). NOTE ? ISO/IEC 8859 is not intended for use with Telematic services defined by ITU-T. If information coded according to ISO/IEC 8859 is to be transferred to such services, it will have to conform to the requirements of those services at the access-point.
- Standard10 pagesEnglish languagesale 15% off
This part of ISO/IEC 8859 specifies a set of 191 coded graphic characters identified as the Latin/ Cyrillic alphabet. This set of coded graphic characters is intended for use in data and text processing applications and also for information interchange. The set contains graphic characters used for general purpose applications in typical office environments in at least the following languages: Bulgarian, Byelorussian, English, Latin, (Slavic) Macedonian, Russian, Serbian and Ukrainian. NOTE ? Two letters recently added to the Ukrainian official alphabet are not included in the character set of this part. For a background the CEN/CENELEC/PT004 Report may be consulted (in Bibliography). This set of coded graphic characters may be regarded as a version of an 8-bit code according to ISO/IEC 2022 or ISO/IEC 4873 at level 1. This part of ISO/IEC 8859 may not be used in conjunction with any other parts of ISO/IEC 8859. If coded characters from more than one part are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC 10367 should be used instead within a version of ISO/IEC 4873 at level 2 or level 3. The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC 6429. However, control functions are not used to create composite graphic symbols from two or more graphic characters (see clause 6). NOTE ? ISO/IEC 8859 is not intended for use with Telematic services defined by ITU-T. If information coded according to ISO/IEC 8859 is to be transferred to such services, it will have to conform to the requirements of those services at the access-point.
- Standard10 pagesEnglish languagesale 15% off
- Standard10 pagesEnglish languagesale 15% off
This part of ISO/IEC 8859 specifies a set of 184 coded graphic characters identified as Latin alphabet No. 3. This set of coded graphic characters is intended for use in data and text processing applications and also for information interchange. The set contains graphic characters used for general purpose applications in typical office environments in at least the following languages: Esperanto and Maltese, and if needed in conjunction with these, English, French (with restrictions, see Annex A.1, Notes), German, Italian, Latin and Portuguese. Coding of Turkish characters using this part is deprecated, that specified in part 9 is to be used. This set of coded graphic characters may be regarded as a version of an 8-bit code according to ISO/IEC 2022 or ISO/IEC 4873 at level 1. This part of ISO/IEC 8859 may not be used in conjunction with any other parts of ISO/IEC 8859. If coded characters from more than one part are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC 10367 should be used instead within a version of ISO/IEC 4873 at level 2 or level 3. The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC 6429. However, control functions are not used to create composite graphic symbols from two or more graphic characters (see clause 6). NOTE ? ISO/IEC 8859 is not intended for use with Telematic services defined by ITU-T. If information coded according to ISO/IEC 8859 is to be transferred to such services, it will have to conform to the requirements of those services at the access-point.
- Standard10 pagesEnglish languagesale 15% off
This part of ISO/IEC 8859 specifies a set of 191 coded graphic characters identified as Latin alphabet No. 8 (Celtic). This set of coded graphic characters is intended for use in data and text processing applications and also for information interchange. The set contains graphic characters used for general purpose applications in typical office environments in at least the following languages: Albanian, Basque, Breton, Catalan, Cornish, Danish, Dutch, English, French (with restrictions, see Annex A.1, Notes), Frisian, Galician, German, Greenlandic, Irish Gaelic (old and new orthographies), Italian, Latin, Luxemburgish, Manx Gaelic, Norwegian, Portuguese, Rhaeto-Romanic, Scottish Gaelic, Spanish, Swedish, and Welsh. This set of coded graphic characters may be regarded as a version of an 8-bit code according to ISO/IEC 2022 or ISO/IEC 4873 at level 1. This part of ISO/IEC 8859 may not be used in conjunction with any other parts of ISO/IEC 8859. If coded characters from more than one part are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC 10367 should be used instead within a version of ISO/IEC 4873 at level 2 or level 3. The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC 6429. However, control functions are not used to create composite graphic symbols from two or more graphic characters (see clause 6). NOTE ? ISO/IEC 8859 is not intended for use with Telematic services defined by ITU-T. If information coded according to ISO/IEC 8859 is to be transferred to such services, it will have to conform to the requirements of those services at the access-point.
- Standard9 pagesEnglish languagesale 15% off
- Standard10 pagesFrench languagesale 15% off
- Standard10 pagesEnglish languagesale 15% off
- Standard10 pagesEnglish languagesale 15% off
- Standard10 pagesEnglish languagesale 15% off
- Standard10 pagesEnglish languagesale 15% off
Specifies a set of 191 coded graphic characters identified as Latin alphabet No. 1. This set of coded graphic chacracters is intended for use in data and text processing applications and also for information interchange.
- Standard10 pagesEnglish languagesale 15% off
- Standard14 pagesEnglish languagesale 10% offe-Library read for1 day
Specifies a set of 83 graphic characters of the Armenian alphabet. Consists of a code table and a legend with explanatory notes containing each graphic, its use and its name. The character set is primarily intended for the interchange of information among data processing systems and within message transmission systems.
- Standard8 pagesEnglish languagesale 15% off
Specifies two coded Hebrew character sets. Set 1 contains 78 characters including 74 basic characters needed in Hebrew texts. Set 2 contains 51 additional characters used in combination with basic characters in some older Hebrew texts. The sets may be used in a 7-bit or an 8-bit environment in accordance with ISO/IEC 2022.
- Standard9 pagesEnglish languagesale 15% off
Specifies a set of 70 graphic characters. Consists of a code table and a legend with explanatory notes containing each graphic, its use and its name. The character set is primarily intended for the interchange of information among data processing systems and within message transmission systems. These characters form a supplement to the extended Latin alphabet coded character set.
- Standard7 pagesEnglish languagesale 15% off
- Standard7 pagesEnglish languagesale 10% offe-Library read for1 day
Gives a coded character set for the Glagolitic alphabet for use in bibliographic information interchange. Is intended for use with the Old Slavonic language. Contains code tables showing each graphic character with its Cyrillic alphabet equivalent, its position and name in the code table and comments on its use.
- Standard17 pagesEnglish languagesale 15% off
Specifies a set of 42 graphic characters in the modern Georgian script. Consists of a code table and a legend with explanatory notes containing each graphic, its use and its name. The character set is primarily intended for the interchange of information among data processing systems and within message transmission systems.
- Standard6 pagesEnglish languagesale 15% off
Specifies a set of 90 graphic characters. Consists of a code table and a legend with explanatory notes containing character codes, graphics and character names. The character set is primarily intended for the interchange of information among data processing systems and within message transmission systems. The sets may be used in a 7-bit or an 8-bit environment in accordance with ISO/IEC 2022.
- Standard9 pagesEnglish languagesale 15% off
- Standard14 pagesEnglish languagesale 15% off
Cancels and replaces the third edition (1986). Specifies the structure of 8-bit codes and 7-bit codes which provide for the coding of character sets. The codes specified here are designed to be used for data that is processed sequentially in a forward direction. Use of these codes in strings of data which are processed in some other way, or which are included in data formatted for fixed-length record processing, may have undesirable results or may require additional special treatment to ensure correct interpretation.
- Standard47 pagesEnglish languagesale 15% off
- Standard54 pagesEnglish languagesale 10% offe-Library read for1 day
Includes control functions and their coded representations for a 7-bit code, an extended 7-bit code, an 8-bit code or an extended 8-bit code if these correspond to ISO 2022.
- Standard89 pagesEnglish languagesale 15% off
- Standard97 pagesEnglish languagesale 10% offe-Library read for1 day
Specifies a unique coded character set for use as G0 set and a series of coded character sets of up to 96 characters for use as G1, G2 and G3 sets in versions of ISO/IEC 4873.
- Standard49 pagesEnglish languagesale 15% off
Specifies an 8-bit code which is derived from, and compatible with, the 7-bit coded character set specified in ISO/IEC 646. The normative Annex A gives restrictions applicable to the C0 and C1 sets.
- Standard19 pagesEnglish languagesale 15% off
Specifies a set of 128 control and graphic characters such as letters, digits and symbols with their coded representation. Applies to alphabets of the Latin script.
- Standard15 pagesEnglish languagesale 15% off
Defines the control functions and their coded representations. Applies only to text made up of characters. Does not define any control functions required for controlling the process of communication. Annexes A, B and C are for information only.
- Standard51 pagesEnglish languagesale 15% off
Lays down rules for converting between 58 characteristics of CCITT International Telegraph Alphabet No. 2 (Recommendation F.1) and the characters according to the ISO 646 and 6937-2 coded sets. Serves for interaction between international telex service and terminals in data networks if telex character repertoire is sufficient.
- Standard5 pagesEnglish languagesale 15% off
- Standard5 pagesFrench languagesale 15% off
- Standard5 pagesFrench languagesale 15% off
A set of mandatory 120 characters is described with the coded representation. This set is intended for interchange of information using Arabic language and includes control characters for code extensions. Procedures for using these control charactres are specified in ISO 2022. References: ISO 646; ISO 2022; Arab Standard ASMO 449.
- Standard11 pagesEnglish languagesale 15% off
- Standard11 pagesFrench languagesale 15% off
- Standard11 pagesFrench languagesale 15% off
Contains a set of 73 graphic characters with their coded representations. Consists of a code table and a legend showing each graphic, its use, and its name. Explanatory notes are also included. These characters, together with characters in the international reference version of ISO 646, constitute a character set for the international interchange of bibliographic citations, including their annotations, in the Greek alphabet.
- Standard5 pagesEnglish languagesale 15% off
- Standard5 pagesEnglish languagesale 10% offe-Library read for1 day
- Standard5 pagesFrench languagesale 15% off
- Standard5 pagesFrench languagesale 15% off
Specifies a set of 42 graphic characters with their coded representations. Consists of a code table and a legend showing each graphic, its use, and its name. Explanatory notes are also included. These characters, together with characters in the basic Cyrillic set, constitute a character set for the international interchange of bibliographic citations, including their annotations, in the Cyrillic alphabet.
- Standard4 pagesEnglish languagesale 15% off
- Standard4 pagesEnglish languagesale 10% offe-Library read for1 day
- Standard4 pagesFrench languagesale 15% off
- Standard4 pagesFrench languagesale 15% off
Contains a set of 76 graphic characters with coded representations. Consists of a code table and a legend showing each graphic, its use, and its name. The character set is primarily intended for the interchange of information among data processing systems and within message transmission systems.
- Standard6 pagesEnglish languagesale 15% off
- Standard6 pagesEnglish languagesale 10% offe-Library read for1 day
- Standard6 pagesFrench languagesale 15% off
- Standard6 pagesFrench languagesale 15% off
Contains a set of 60 African graphic characters with their coded representations. Consists of a code table, a legend giving for each character its name and its use, and explanatory notes. The list of African languages in Annex A is for information only and is not part of this document.
- Standard6 pagesEnglish languagesale 15% off
- Standard6 pagesEnglish languagesale 10% offe-Library read for1 day
- Standard6 pagesFrench languagesale 15% off
- Standard6 pagesFrench languagesale 15% off
Defines the coded representation of printed characters recognized by reading equipment. Includes the fonts E 13 B; CMC 7; OCR-A; OCR-B. Assigns bit-patterns to characters recognized by reading equipment. This information is then given to the recipient by different media and can be used by printing devices. Single-font reader and multiple-font reader are considered as applications. References: ISO 646; 1004; 1073; 2022.
- Standard11 pagesEnglish languagesale 15% off
- Standard11 pagesFrench languagesale 15% off
- Standard11 pagesFrench languagesale 15% off
Defines implementation of ISO 7-bit and 8-bit coded character sets on punched cards as well as the representation of 7-bit and 8-bit combinations on 12-row punched cards. This representation is derived from, and compatible with, the Hollorith Code. Ensures widely compatibility with existing punched card files. Intended for general interchange of information among data processing systems.
- Standard5 pagesEnglish languagesale 15% off
- Standard5 pagesFrench languagesale 15% off
- Standard5 pagesFrench languagesale 15% off
Provides representation of 7-bit coded character set on punched tape 25,4 mm (1 in) in width. Applicable in conjunction with ISO 646, 1154, 1729.
- Standard2 pagesEnglish languagesale 15% off
- Standard2 pagesFrench languagesale 15% off
- Standard2 pagesFrench languagesale 15% off
Includes 2 modes of graphical representations of the control characters of columns 0 and 1 of the ISO 7-bit coded character set for information interchange (ISO 646): a) a set of specific symbols requiring a single character for graphical representation of each of these normally non-printing characters; b) a set of alphanumeric representations comprising two characters derived from the abreviations used in the ISO 7-bit table. Also provides graphical representations for the normally non-printing characters SPACE and DELETE.
- Standard6 pagesEnglish languagesale 15% off
- Standard6 pagesFrench languagesale 15% off
- Standard6 pagesFrench languagesale 15% off
Establishes dimensions and location of feed holes in punched paper tape for data interchange. Applies to punched paper tape with a nominal width of 25,4 mm with fully punched round holes.
- Standard2 pagesEnglish languagesale 15% off
- Standard2 pagesFrench languagesale 15% off
- Standard2 pagesFrench languagesale 15% off
Defines the implementation of the 7-bit coded character set and of its 7-bit and 8-bit extentions for the interchange of data on 3.81 mm magnetic tape cassette. References: ISO 646; 2022; 3407.
- Standard1 pageEnglish languagesale 15% off
- Standard1 pageFrench languagesale 15% off
- Standard1 pageFrench languagesale 15% off
Specifies the representation of the 7-bit-code and its 7-bit and 8-bit extensions on an 9-track magnetic tape with a width of 12,7 mm (0,5 in).
- Standard3 pagesEnglish languagesale 15% off
- Standard3 pagesFrench languagesale 15% off
- Standard3 pagesFrench languagesale 15% off
Defines the characteristics of general-purpose unpunched paper cards to be used information interchange between different punched card devices. References: ISO/R534; ISO/R 536; ISO/R 1974; ISO/R 2144; ISO 2493; ISO 2494; ISO 2758.
- Standard10 pagesEnglish languagesale 15% off
- Standard10 pagesFrench languagesale 15% off
- Standard10 pagesFrench languagesale 15% off
- Standard106 pagesEnglish languagesale 15% off
This document defines the following. — A reference comparison method. This method is applicable to two character strings to determine their collating order in a sorted list. The method can be applied to strings containing characters from the full repertoire of ISO/IEC 10646. This method is also applicable to subsets of that repertoire, such as those of the different ISO/IEC 8-bit standard character sets, or any other character set, standardized or not, to produce ordering results valid (after tailoring) for a given set of languages for each script. This method uses collation tables derived either from the Common Template Table defined in this document or from one of its tailorings. This method provides a reference format. The format is described using the Backus-Naur Form (BNF). This format is used to describe the Common Template Table. The format is used normatively within this document. — A Common Template Table. A given tailoring of the Common Template Table is used by the reference comparison method. The Common Template Table describes an order for all characters encoded in the Unicode 10.0 standard[27], included in ISO/IEC 10646:2017. It allows for a specification of a fully deterministic ordering. This table enables the specification of a string ordering adapted to local ordering rules, without requiring an implementer to have knowledge of all the different scripts already encoded in the Universal Coded Character Set (UCS). NOTE 1 This Common Template Table is to be modified to suit the needs of a local environment. The main worldwide benefit is that, for other scripts, often no modification is required and the order will remain as consistent as possible and predictable from an international point of view. NOTE 2 The character repertoire used in this document is equivalent to that of the Unicode Standard version 10.0[27]. — A reference name. The reference name refers to this particular version of the Common Template Table, for use as a reference when tailoring. In particular, this name implies that the table is linked to a particular stage of development of the ISO/IEC 10646 Universal coded character set. — Requirements for a declaration of the differences (delta) between the collation table and the Common Template Table. This document does not mandate the following. — A specific comparison method; any equivalent method giving the same results is acceptable. — A specific format for describing or tailoring tables in a given implementation. — Specific symbols to be used by implementations, except for the name of the Common Template Table. — Any specific user interface for choosing options. — Any specific internal format for intermediate keys used when comparing, nor for the table used. The use of numeric keys is not mandated either. — A context-dependent ordering. — Any particular preparation of character strings prior to comparison. NOTE 1 It is normally necessary to do preparation of character strings prior to comparison even if it is not prescribed by this document (see Annex C). NOTE 2 Annex D describes problems that gave way to this International Standard with their anticipated solutions.
- Standard53 pagesEnglish languagesale 15% off
- Standard54 pagesFrench languagesale 15% off
ISO/IEC 10646:2017 specifies the Universal Coded Character Set (UCS). It is applicable to the represen-tation, transmission, interchange, processing, storage, input, and presentation of the written form of the lan-guages of the world as well as of additional symbols. - specifies the architecture of this International Standard, - defines terms used in this International Standard, - describes the general structure of the UCS codespace, - specifies the Basic Multilingual Plane (BMP) of the UCS, - specifies supplementary planes of the UCS: the Supplementary Multilingual Plane (SMP), the Supplemen-tary Ideographic Plane (SIP), the Tertiary Ideographic Plane (TIP), and the Supplementary Special-purpose Plane (SSP), - defines a set of graphic characters used in scripts and the written form of languages on a world-wide scale, - specifies the names for the graphic characters and format characters of the BMP, SMP, SIP, TIP, SSP and their coded representations within the UCS codespace, - specifies the coded representations for control characters and private use characters, - specifies three encoding forms of the UCS: UTF-8, UTF-16, and UTF-32, - specifies seven encoding schemes of the UCS: UTF-8, UTF-16, UTF-16BE, UTF-16LE, UTF-32, UTF-32BE, and UTF-32LE, - specifies the management of future additions to this coded character set. The UCS is an encoding system different from that specified in ISO/IEC 2022. The method to designate UCS from ISO/IEC 2022 is specified in 12.2. A graphic character will be assigned only one code point in the standard, located either in the BMP or in one of the supplementary planes.
- Standard2693 pagesEnglish languagesale 15% off