Information technology — Universal Coded Character Set (UCS) — Amendment 2: Bhaiksuki, Marchen, Tangut and other characters

Technologies de l'information — Jeu universel de caractères codés (JUC) — Amendement 2: Bhaiksuki, Marchen, Tangut et autres caractères

General Information

Status: Withdrawn
Publication Date: 20-Apr-2016
Withdrawal Date: 20-Apr-2016

ICS: 35.040 - Information coding
: 35.040.10 - Coding of character sets

Technical Committee: ISO/IEC JTC 1/SC 2 - Coded character sets
Drafting Committee: ISO/IEC JTC 1/SC 2/WG 2 - Universal coded character set

Current Stage: 9599 - Withdrawal of International Standard
Start Date: 20-Dec-2017
Completion Date: 12-Feb-2026

Relations

Amends: ISO/IEC 10646:2014 - Information technology — Universal Coded Character Set (UCS)
Effective Date: 09-May-2020

Revised: ISO/IEC 10646:2017 - Information technology — Universal Coded Character Set (UCS)
Effective Date: 30-Dec-2017

ISO/IEC 10646:2014/Amd 2:2016 - Bhaiksuki, Marchen, Tangut and other characters - Page 1 preview

ISO/IEC 10646:2014/Amd 2:2016 - Bhaiksuki, Marchen, Tangut and other characters - Page 2 preview

ISO/IEC 10646:2014/Amd 2:2016 - Bhaiksuki, Marchen, Tangut and other characters - Page 3 preview

Standard

ISO/IEC 10646:2014/Amd 2:2016 - Bhaiksuki, Marchen, Tangut and other characters

English language

165 pages

sale 15% off

Preview

sale 15% off

Preview

Get Certified

Connect with accredited certification bodies for this standard

BSI Group

BSI (British Standards Institution) is the business standards company that helps organizations make excellence a habit.

UKAS United Kingdom Verified

Visit Website

NYCE

Mexican standards and certification body.

EMA Mexico Verified

Visit Website

Frequently Asked Questions

What is ISO/IEC 10646:2014/Amd 2:2016?

ISO/IEC 10646:2014/Amd 2:2016 is a standard published by the International Organization for Standardization (ISO). Its full title is "Information technology — Universal Coded Character Set (UCS) — Amendment 2: Bhaiksuki, Marchen, Tangut and other characters". This standard covers: Information technology — Universal Coded Character Set (UCS) — Amendment 2: Bhaiksuki, Marchen, Tangut and other characters

What is the scope of ISO/IEC 10646:2014/Amd 2:2016?

Information technology — Universal Coded Character Set (UCS) — Amendment 2: Bhaiksuki, Marchen, Tangut and other characters

What ICS categories does ISO/IEC 10646:2014/Amd 2:2016 belong to?

ISO/IEC 10646:2014/Amd 2:2016 is classified under the following ICS (International Classification for Standards) categories: 35.040 - Information coding; 35.040.10 - Coding of character sets. The ICS classification helps identify the subject area and facilitates finding related standards.

What standards are related to ISO/IEC 10646:2014/Amd 2:2016?

ISO/IEC 10646:2014/Amd 2:2016 has the following relationships with other standards: It is inter standard links to ISO/IEC 10646:2014, ISO/IEC 10646:2017. Understanding these relationships helps ensure you are using the most current and applicable version of the standard.

How can I access ISO/IEC 10646:2014/Amd 2:2016?

ISO/IEC 10646:2014/Amd 2:2016 is available in PDF format for immediate download after purchase. The document can be added to your cart and obtained through the secure checkout process. Digital delivery ensures instant access to the complete standard document.

Standards Content (Sample)

ISO/IEC 10646:2014/Amd 2:2016 ...

INTERNATIONAL ISO/IEC
STANDARD 10646
Fourth edition
2014-09-01
AMENDMENT 2
2016-05-01
Information technology — Universal
Coded Character Set (UCS)
AMENDMENT 2: Bhaiksuki, Marchen,
Tangut and other characters
Technologies de l’information — Jeu universel de caractères codés (JUC)
AMENDEMENT 2: Bhaiksuki, Marchen, Tangut et autres caractères
Reference number
ISO/IEC 10646:2014/Amd.2:2016(E)
©
ISO/IEC 2016
ISO/IEC 10646:2014/Amd.2:2016(E)

© ISO/IEC 2016, Published in Switzerland
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized otherwise in any form
or by any means, electronic or mechanical, including photocopying, or posting on the internet or an intranet, without prior
written permission. Permission can be requested from either ISO at the address below or ISO’s member body in the country of
the requester.
ISO copyright office
Ch. de Blandonnet 8 • CP 401
CH-1214 Vernier, Geneva, Switzerland
Tel. +41 22 749 01 11
Fax +41 22 749 09 47
copyright@iso.org
www.iso.org
ii © ISO/IEC 2016 – All rights reserved

ISO/IEC 10646:2014/Amd.2:2016 (E)

Foreword
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical
Commission) form the specialized system for worldwide standardization. National bodies that are
members of ISO or IEC participate in the development of International Standards through technical
committees estab-lished by the respective organization to deal with particular fields of technical activity.
ISO and IEC technical committees collaborate in fields of mutual interest. Other international
organizations, governmental and non-governmental, in liaison with ISO and IEC, also take part in the
work. In the field of information technology, ISO and IEC have established a joint technical committee,
ISO/IEC JTC 1.
The procedures used to develop this document and those intended for its further maintenance are
described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for the
different types of document should be noted. This document was drafted in accordance with the editorial
rules of the ISO/IEC Directives, Part 2 (see www.iso.org/directives).
Attention is drawn to the possibility that some of the elements of this document may be the subject of
patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent rights.
Details of any patent rights identified during the development of the document will be in the Introduction
and/or on the ISO list of patent declarations received (see www.iso.org/patents).
Any trade name used in this document is information given for the convenience of users and does
not constitute an endorsement.
For an explanation on the meaning of ISO specific terms and expressions related to conformity
assessment, as well as information about ISO's adherence to the WTO principles in the Technical
Barriers to Trade (TBT) see the following URL: Foreword - Supplementary information.
Amendment 2 to ISO/IEC 10646:2014 was prepared by Joint Technical Committee ISO/IEC JTC1,
Information technology, Subcommittee SC 2, Coded character sets.
© ISO/IEC 2016 – All rights reserved iii

ISO/IEC 10646:2014/Amd.2:2016 (E)
iv © ISO/IEC 2016 – All rights reserved

ISO/IEC 10646:2014/Amd.2:2016 (E)

Information technology — Universal Coded
Character Set (UCS)
AMENDMENT 2: Bhaiksuki, Marchen, Tangut and
other characters
Page 5 Sub-clause 4.25 extended collection
In the Note 3, insert ‘typically’ between ‘are’ and ‘referenced’.
Page 12 Sub-clause 6.4 Naming of character
In the list, insert the following new items after c) and adjust the next list item numbers accordingly:
d) follows the rule given in 25.7 for Tangut ideographs, or
Page 29 Sub-clause 23.1 List of source references
In the list of Hanzi G sources, insert the following new sources:
GFC Modern Chinese Standard Dictionary (现代汉语规范词典第二版。主编：李行健。北京：外语
教学与研究出版社) 2010, ISBN：978-7-5600-9518-9
GGFZ Tongyong Guifan Hanzi Zidian (通用规范汉字字典)
In the list of Kanji J sources, replace ‘2000’ with ‘2004’ for the J3 and J4 sources. Modify the J3A entry to read:
J3A JIS X 0213:2004 level-3 addendum from JIS X 0213:2000 level-3
Add the following entries:
J13 JIS X 0213:2004 level-3 characters replacing J1 characters
J13A JIS X 0213:2004 level-3 addendum from JIS X 0213:2000 level-3 replacing J1 characters
JA3 JIS X 0213:2004 level-3 characters replacing JA characters
J14 JIS X 0213:2004 level-4 characters replacing J1 characters
JA4 JIS X 0213:2004 level-4 characters replacing JA characters
Replace all J1 and JA sources that are also included in JIS X 0213:2004 by the values defined in JIS X
0213:2004 using the prefixes described above (J13, J13A, JA3, J14, and JA4). This represents 85 original JA
sources and 2743 original J1 sources.
After the list of J sources, add the following notes:
NOTE 3 – When a source such as JIS X 0213:2004 includes encoded characters already
included in older referenced sources, the more recent source references are typically used
to reflect contemporary usage. This also reflects the fact that some of these more recent
references were accompanied by minor adjustment for the graphic representation of the
related characters.
© ISO/IEC 2016 – All rights reserved 1

ISO/IEC 10646:2014/Amd.2:2016 (E)
NOTE 4 – J Sources that have been partially superseded (J1 and JA) are described in An-
nex A (see A.4.3 and A4.4). They can also be found as provisional sources in the Unihan
Data base in http://www.unicode.org/reports/tr38/ fully described as kJis1 for J1: JIS X
0212-1990 and kJA for the JA source.
In Table 5: Format details of the tags used in the source reference file for CJK Ideographs,
in the line for kIRG_GSource, insert the additional third field format values:
GFC-ddd, and GGFZ-dddddd.
in the line for kIRG_JSource, insert the additional third field format values:
J13-hhhh, J13A-hhhh, JA3-hhhh, J14-hhhh, and JA4-hhhh.
In the same Table 5, in the last row, remove the last sentence starting by “Only the first Radical-Stroke” from
the fourth paragraph.
Insert new source references for the new 9 characters part of CJK Unified Ideographs.
Perform the additional updates to the source references:
 Move U+3D1D H source to U+2A3ED,
 Remove U+4CA4 H source (already moved to U+9FD0),
 Replace U+2AD12 U source with T source: TE-353F,
 Remove U+2B08F U source,
 Replace U+2F949 T source with U source: UCI-01199.
Provide a new reference file with updated references.
(The following text is identical to ISO/IEC 10646:2014 but is linked to the new file)
Click on this highlighted text to access the reference file.
NOTE – The content is also available as a separate viewable file in the same file directory as this document. The file is named:
“CJKSrc.txt”.
Page 32 Sub-clause 23.2 Source references file for CJK Ideographs
In the third paragraph starting with ‘The Format definition’, replace ‘LINE TABULATION control character (000B)’
by ‘CHARACTER TABULATION control character (0009)’.
Page 37 Before Clause 24 Character names and annotations
Insert new clause, table, and figure as follows, renumbering the following clauses, tables, and figures:
24 Source references for Tangut Ideographs
24.1 List of source references
A Tangut ideograph is always referenced by one source reference. These source references are provided
in a machine-readable format that is accessible as a link from this document. The content pointed by
these links is also normative.
The source reference information establishes the character identity for Tangut ideographs. A source ref-
erence is established by associating a Tangut ideograph code point with one value in the source stand-
ards listed below. The Tangut sources are:
H2004-A Hán Xiǎománg 2004 (PhD dissertation on the correct form of Tangut ideographs), H1
sub-division
2 © ISO/IEC 2016 – All rights reserved

ISO/IEC 10646:2014/Amd.2:2016 (E)

H2004-B Hán Xiǎománg 2004 (PhD dissertation on the correct form of Tangut ideographs), H2
sub-division
L1986 Lǐ Fànwén 1986 (Study of the Homophones)
st
L1997 Lǐ Fànwén 1997 (Tangut-Chinese Dictionary, 1 edition)
L2006 Lǐ Fànwén 2006 (Comparative Study of Wuyin Qieyun and Wenhai Baoyun)
nd
L2008 Lǐ Fànwén 2008 (Tangut-Chinese Dictionary, 2 edition)
N1966 Nishida 1966 (Little Dictionary of Tangut)
S1968 Sofronov 1968 (Grammar of the Tangut Language)

24.2 Source reference file for Tangut Ideographs
The content linked to is a plain text file, using ISO/IEC 646-IRV characters with CARRIAGE RETURN/
LINE FEED as end of line mark. The file specifies the sources references data for all Tangut Ideographs.
The file also contains information concerning Radical-Stroke index. Each line in the text file contains the
following information organized in three fields:
st
 1 field: UCS code point in the format (U+hhhhh)
nd
 2 field: Tag indicating the type of information in the third field (kTGT_MergedSrc, kRSTUnicode)
rd
 3 field: Information corresponding to the tag value specified by the second field. The Table 6 pro-
vides the format details.
The format definition for the first field uses ‘h’ as a hexadecimal unit. The three fields are delimited by a
CHARACTER TABULATION control character (0009). Comment lines, starting with a NUMBER SIGN (‘#’)
are informational only. Comment lines and blank lines in the text file should be ignored by any automatic
process which parses the data file to extract the source reference information.
Table 6: Format details of the tags used in the source reference file for Tangut ideographs
Tag Value Tag Third field format
description
kTGT_MergedSrc Tangut (H2004-A-dddd), (H2004-B-dddd), (L1986-dddd), (L1997-dddd),
source (l2006-dddd), (L2008-dddd), (L2008-dddd-dddd), (N1966-ddd-ddx),
(N1966-ddd-ddxx), or (S1968-dddd)
kRSTUnicode (d{1,3}.d{1,2})
Radical
Stroke index
The format definition uses ‘d’ as a decimal unit, and ‘x’ as an alphanumerical unit (0 to 9 and A to Z). Uppercase characters, digits
and all other symbols between parentheses appear as shown.
The Radical-Stroke index (d{1,3}.d{1,2}) is an informative field contains one construct: a radical index (one to three digits),
followed by a full stop, and ending by one or two digits for the stroke count. The stroke count includes all strokes for the character,
including the radical. The radical index is an entry on the Tangut Components block at 18800-18AFF, with the index ‘1’ represent-
ing the code point 18800.
NOTE – Concerning Lǐ Fànwén 2008 entries, the notation (L2008-dddd) denotes a character mapped to a single entry
while the notation (L2008-dddd-dddd) denotes a character mapped to two entries.
The following examples shows file entries for the Tangut ideographs 17000, 17001, 17A7A, and 17E91.
EXAMPLES
U+17000 kTGT_MergedSrc L2008-0008
U+17000 kRSTUnicode 1.6
U+17001 kTGT_MergedSrc L2008-0030
U+17001 kRSTUnicode 1.7
U+17A7A kTGT_MergedSrc N1966-204-10GG
U+17A7A kRSTUnicode 141.14
© ISO/IEC 2016 – All rights reserved 3

ISO/IEC 10646:2014/Amd.2:2016 (E)

U+17E91 kTGT_MergedSrc L2008-2027-2350
U+17E91 kRSTUnicode 259.10
Click on this highlighted text to access the reference file.
NOTE – The content is also available as a separate viewable file in the same file directory as this document. The file is named:
“TangutSrc.txt”.
24.3 Source reference presentation for Tangut Ideographs
The code charts for Tangut ideographs show one graphic symbol per character, along with additional in-
formation related to their identity such as radical, stroke count, and numeric value of the various source
references.
The graphic representation for the radical is shown immediately below the code point, along with the radi-
cal number and the stroke count. That stroke count includes the radical itself.
NOTE – The radicals shown in this representation are part of the Tangut Components block at 18800-18AFF.
The figure 7 shows an example for characters 177D0-177D2, 177E4-177E6, 177F8-177FA, 1780C-1780E,
and 17820-17822
Figure 7 – Chart presentation for TANGUT IDEOGRAPHS
Page 37 Clause 24 Character names and annotations
The clause is now renumbered 25. Insert one new sub-clause after 25.6 Character names for CJK Ideographs
and renumber the following sub-clause accordingly.
25.7 Character names for Tangut Ideographs
For Tangut ideographs the names are algorithmically constructed by appending their coded representa-
tion in their five hexadecimal digit notation to “TANGUT IDEOGRAPH-”. For example, the first Tangut
Ideograph has the name “TANGUT IDEOGRAPH-17000”.
Page 47 Sub-Clause 31.2 Code chart
The sub-clause is now renumbered 32.2. Replace the two last sentences starting with “Code charts” with:
Code charts for CJK ideographs and Tangut ideographs have different formats. See Clause 23 and 24.
Page 47-2380 Clause 31, Code charts and list of character names
The clause is now renumbered 32.
1. Modifications to existing blocks
Insert the additional character glyphs and names at the indicated positions in the blocks given below.
4 © ISO/IEC 2016 – All rights reserved

ISO/IEC 10646:2014/Amd.2:2016 (E)

Plane 00
Arabic Extended-A
Kannada
Malayalam
Currency Symbols
Miscellaneous Technical
Supplemental Punctuation
CJK Unified Ideographs
Latin Extended-D
Saurashtra
Plane 01
Ancient Greek Numbers
Khojki
Miscellaneous Symbols and Pictographs
Emoticons
Transport and Map Symbols
These blocks contain new characters and names at the following code positions:
08B6-08BD 08D4-08E2 0C80 0D4F 0D54-0D56 0D58-0D5E 0D76-0D78 20BE 23FB-23FE 2E43-
2E44 9FCD-9FD5 A7AE A8C5 1018D-1018E 1123E 1F32D-1F32F 1F37E-1F37F 1F3CF-1F3D3
1F3F8-1F3FF 1F4FF 1F54B-1F54E 1F643-1F644 1F6D0
updated graphic representation at the following code positions:
19B0-19C0, 19C8-19C9, 301C, 3127, 3D1D, 4CA4, 7921, 2AD12, and 2B08F.
2. New blocks
Insert the following additional blocks.
Plane 00
Cyrillic Extended-C
Plane 01
Osage
Mongolian Supplement
Bhaiksuki
Marchen
Ideographic Symbols and Punctuation
Tangut
Tangut Components
Glagolitic Supplement
Supplemental Symbols and Pictographs
These blocks add new characters and names at the following code positions:
1C80-1C88 104B0-104D3 104D8-104FB 11660-1166C 11C00-11C08 11C0A-11C36 11C38-
11C45 11C50-11C6C 11C70-11C8F 11C92-11CA7 11CA9-11CB6 16FE0 17000-187EC 18800-
18AF2 1E000-1E006 1E008-1E018 1E01B-1E021 1E023-1E024 1E026-1E02A 1F910-1F918
1F980-1F984 1F9C0
© ISO/IEC 2016 – All rights reserved 5

ISO/IEC 10646:2014/Amd.2:2016 (E)

Page 2381, Annex A.1
Add a ‘*’ (for fixed collection) for the following collection:
1046 EMOTICONS
In the list of collection numbers and names, after
166 CHEROKEE SUPPLEMENT
insert a new entry as follows:
167 CYRILLIC EXTENDED-C 1C80-1C8F
In the list of collection numbers and names, after
1092 SUTTON SIGNWRITING
insert new entries as follows:
1093 OSAGE 104B0-104FF
1094 MONGOLIAN SUPPLEMENT 11660-1167F
1095 BHAIKSUKI 11C00-11C6F
1096 MARCHEN 11C70-11CBF
1097 IDEOGRAPHIC SYMBOLS AND PUNCTUATION
16FE0-16FFF
1098 TANGUT 17000-187EF
1199 TANGUT COMPONENTS 18800-18AFF
1100 GLAGOLITIC SUPPLEMENT 1E000-1E02F
1101 SUPPLEMENTAL SYMBOLS AND PICTOGRAPHS
1F900-1F9FF
In the list of collection numbers and names, after
314 UNICODE 7.0
insert a new entry as follows:
315 UNICODE 8.0 see A.6.14 *
In the list of collection numbers and names, after
372 JAPANESE IDEOGRAPHICS SUPPLEMENT
insert new entries as follows:
373 JAPANESE IT VENDORS CONTEMPORARY IDEOGRAPHS-1993 see A.4.4 *
374 JAPANESE JIS X 0213:2004 IDEOGRAPHS FROM PREVIOUS JIS STANDARDS see A.4.5 *
Page 2385, annex A.1
In the alphabetical list of keywords in Note 3, add the value 167 to the Cyrillic entry, the value 1094 to the Mon-
golian, the value 1097 to the Tamil entry, the value 1098 to the Punctuation entry and the Symbols entry re-
spectively, the value 1101 to the Glagolitic entry, the value 1102 to the Symbols entry, the values 2003, 2004,
and 2005 to the CJK entry; and insert the following entries:
Bhaiksuki 1095
Marchen 1096
Osage 1093
Tangut 1098 1099
6 © ISO/IEC 2016 – All rights reserved

ISO/IEC 10646:2014/Amd.2:2016 (E)

Page 2387, Annex A.2.1
In the list of blocks in the BMP, insert the following new entry:
CYRILLIC EXTENDED-C 1C80-1C8F
Page 2388, Annex A.2.2
In the list of blocks in the SMP, insert the following new entries:
OSAGE 104B0-104FF
MONGOLIAN SUPPLEMENT 11660-1667F
BHAIKSUKI 11C00-11C6F
MARCHEN 11C70-11CBF
IDEOGRAPHIC SYMBOLS AND PUNCTUATION
16FE0-16FFF
TANGUT 17000-187EF
TANGUT COMPONENTS 18800-18AFF
GLAGOLITIC SUPPLEMENT 1E000-1E02F
SUPPLEMENTAL SYMBOLS AND PICTOGRAPHS
1F900-1F9FF
Page 2393, Annex A.4
Replace Annex A.4.2 and A.4.3 and insert new A.4.4 and A.4.6 sub-clauses as follows:
A.4.2 371 JIS2004 IDEOGRAPHICS EXTENSION
The fixed collection 371 JIS2004 IDEOGRAPHICS EXTENSION consists of all level 3 and level 4 CJK
characters defined in JIS X 0213:2004. It contains 3 695 characters.
The code points of this collection are identified by J3 Kanji J sources (J3, J3A, J13, J13A, and JA3) and
J4 Kanji J sources (J4, J14, and JA4) in the Source Reference file for CJK Unified Ideographs
(CJKSrc.txt). See 23.1 and 23.2 for details.
A.4.3 372 JAPANESE IDEOGRAPHICS SUPPLEMENT
The fixed collection 372 JAPANESE IDEOGRAPHICS SUPPLEMENT consists of all CJK characters de-
fined in JIS X 0212:1990. It contains 5 801 characters.
NOTE – 2 743 characters are common between the collections 371 and 372.
The code points of this collection are identified by the Kanji J sources: J1, J13, J13A, and J14 in the
Source Reference file for CJK Unified Ideographs (CJKSrc.txt). See 23.1 for further details.
A.4.4 373 JAPANESE IT VENDORS CONTEMPORARY IDEOGRAPHS-1993
The fixed collection 373 JAPANESE IT VENDORS CONTEMPORARY IDEOGRAPHS-1993 consists of all
CJK characters defined in the Unified Japanese IT Vendors Contemporary Ideographs, 1993 set. It con-
tains 660 characters.
The code points of this collection are identified by the Kanji J sources: JA, JA3, and JA4 in the Source
Reference file for CJK Unified Ideographs (CJKSrc.txt). See 23.1 for further details.
A.4.5 374 JAPANESE JIS X 0213:2004 IDEOGRAPHS FROM PREVIOUS JIS STANDARDS
The fixed collection 374 JAPANESE JIS X 0213:2004 IDEOGRAPHS FROM PREVIOUS JIS
STANDARDS The set consists of 2828 characters, of which 2743 were part of JIS X 0212-1990 (referred
as J1 sources in the previous version of this International Standard) and 85 were part of the ‘Unified Jap-
anese IT Vendors Contemporary Ideographs, 1993’ (referred as JA in the previous version of this Interna-
tional Standard). Among these 2828 characters, 205 had their glyph representation slightly modified.
© ISO/IEC 2016 – All rights reserved 7

ISO/IEC 10646:2014/Amd.2:2016 (E)

These 2828 characters use the ‘J13, J13A, J14, JA3, and JA4’ notations in their prefix four source identifi-
cation as described in sub-clause 23.1.
NOTE 1 – Given its large size and the large number of sparse ranges, the collection is not specified by code point ranges in
this document but instead by a linked content.
The content linked to is a plain text file, using ISO/IEC 646-IRV characters with CARRIAGE RETURN/
LINE FEED as end of line mark, which specifies, after a 7-lines header, as many lines as characters in the
collection; each containing the following information in fixed length field:
 1st field: BMP UCS code point (hhhh), normative.
nd
 2 field: J Source reference in previous editions of this standard.
rd
 3 field: J Source reference in this standard
The fields are delimited by a SPACE character; the format definition for the code point uses ‘h’ as a hexa-
decimal unit. The J Source references use the format specified in 23.2.
Click on this highlighted text to access the reference file.
NOTE 2 – The content is also available as a separate viewable file in the same file directory as this document. The file is
named: “JIS-X-0213-FromPrevious”.
Page 2409, Annex A.6
Add a new Annex A.6.14 as follows:
A.6.14 315 UNICODE 8.0
The fixed collection 315 UNICODE 8.0 is arranged by planes as follows.
Plane 00
11 00-FF
Row Values within row
12 00-48 4A-4D 50-56 58 5A-5D 60-88 8A-8D
00 20-7E A0-FF
90-B0 B2-B5 B8-BE C0 C2-C5 C8-D6 D8-FF
01-02 0100-02FF
13 00-10 12-15 18-5A 5D-7C 80-99 A0-F5 F8-FD
03 00-77 7A-7F 84-8A 8C 8E-A1 A3-FF
14-15 1400-15FF
04 00-FF
16 00-9C A0-F8
05 00-2F 31-56 59-5F 61-87 89-8A 8D-8F 91-C7
17 00-0C 0E-14 20-36 40-53 60-6C 6E-70 72-73
D0-EA F0-F4
80-DD E0-E9 F0-F9
06 00-1C 1E-FF
18 00-0E 10-19 20-77 80-AA B0-F5
07 00-0D 0F-4A 4D-B1 C0-FA
19 00-1E 20-2B 30-3B 40 44-6D 70-74 80-AB B0-
08 00-2D 30-3E 40-5B 5E A0-B4 E3-FF
C9 D0-DA DE-FF
09 00-83 85-8C 8F-90 93-A8 AA-B0 B2 B6-B9 BC-
1A 00-1B 1E-5E 60-7C 7F-89 90-99 A0-AD B0-BE
C4 C7-C8 CB-CE D7 DC-DD DF-E3 E6-FB
1B 00-4B 50-7C 80-F3 FC-FF
0A 01-03 05-0A 0F-10 13-28 2A-30 32-33 35-36
1C 00-37 3B-49 4D-7F C0-C7 D0-F6 F8-F9
38-39 3C 3E-42 47-48 4B-4D 51 59-5C 5E 66-
1D 00-F5 FC-FF
75 81-83 85-8D 8F-91 93-A8 AA-B0 B2-B3 B5-
1E 00-FF
B9 BC-C5 C7-C9 CB-CD D0 E0-E3 E6-F1 F9
1F 00-15 18-1D 20-45 48-4D 50-57 59 5B 5D 5F-
0B 01-03 05-0C 0F-10 13-28 2A-30 32-33 35-39
7D 80-B4 B6-C4 C6-D3 D6-DB DD-EF F2-F4
3C-44 47-48 4B-4D 56-57 5C-5D 5F-63 66-77
F6-FE
82-83 85-8A 8E-90 92-95 99-9A 9C 9E-9F A3-
20 00-64 66-71 74-8E 90-9C A0-BE D0-F0
A4 A8-AA AE-B9 BE-C2 C6-C8 CA-CD D0 D7
21 00-8B 90-FF
E6-FA
22 00-FF
0C 00-03 05-0C 0E-10 12-28 2A-39 3D-44 46-48
23 00-FA
4A-4D 55-56 58-5A 60-63 66-6F 78-7F 81-83
24 00-26 40-4A 60-FF
85-8C 8E-90 92-A8 AA-B3 B5-B9 BC-C4 C6-C8
25 00-FF
CA-CD D5-D6 DE E0-E3 E6-EF F1-F2
26 00-FF
0D 01-03 05-0C 0E-10 12-3A 3D-44 46-48 4A-4E
27 00-FF
57 5F-63 66-75 79-7F 82-83 85-96 9A-B1 B3-
28-2A 2800-2AFF
BB BD C0-C6 CA CF-D4 D6 D8-DF E6-EF F2-F4
2B 00-73 76-95 98-B9 BD-C8 CA-D1 EC-EF
0E 01-3A 3F-5B 81-82 84 87-88 8A 8D 94-97 99-
2C 00-2E 30-5E 60-F3 F9-FF
9F A1-A3 A5 A7 AA-A8 AD-B9 BB-BD C0-C4 C6
2D 00-25 27 2D 30-67 6F-70 7F-96 A0-A6 A8-AE
C8-CD D0-D9 DC-DF
B0-B6 B8-BE C0-C6 C8-CE D0-D6 D8-DE E0-FF
0F 00-47 49-6C 71-97 99-BC BE-CC CE-DA
2E 00-42 80-99 9B-F3
10 00-C5 C7 CD D0-FF
8 © ISO/IEC 2016 – All rights reserved

ISO/IEC 10646:2014/Amd.2:2016 (E)

2F 00-D5 F0-FB AA 00-36 40-4D 50-59 5C-C2 DB-F6
30 00-3F 41-96 99-FF AB 01-06 09-0E 11-16 20-26 28-2E 30-65 70-ED
31 05-2D 31-8E 90-BA C0-E3 F0-FF F0-F9
32 00-1E 20-FE AC-D6 AC00-D6FF
33 00-FF D7 00-A3 B0-C6 CB-FB
34-4C 3400-4CFF E0-F8 E000-F8FF
4D 00-B5 C0-FF F9 00-FF
4E-9F 4E00-9FD5 FA 00-6D 70-D9
A0-A3 A000-A3FF FB 00-06 13-17 1D-36 38-3C 3E 40-41 43-44 46-
A4 00-8C 90-C6 D0-FF C1 D3-FF
A5 00-FF FC 00-FF
A6 00-2B 40-F7 FD 00-3F 50-8F 92-C7 F0-FD
A7 00-AD B0-B7 F7-FF FE 00-19 20-52 54-66 68-6B 70-74 76-FC FF
A8 00-2B 30-39 40-77 80-C4 CE-D9 E0-FD FF 01-BE C2-C7 CA-CF D2-D7 DA-DC E0-E6 E8-
A9 00-53 5F-7C 80-CD CF-D9 DE-FE EE F9-FD

Plane 01
44-46 4400-4646
Row Values within row
68-69 6800-69FF
00 00-0B 0D-26 28-3A 3C-3D 3F-4D 50-5D 80-FA
6A 00-38 40-5E 60-69 6E-6F D0-ED F0-F5
01 00-02 07-33 37-8C 90-9B A0 D0-FD
6B 00-45 50-59 5B-61 63-77 7D-8F
02 80-9C A0-D0 E0-FB
6F 00-44 50-7E 8F-9F
03 00-23 30-4A 50-7A 80-9D 9F-C3 C8-D5
B0 00-01
04 00-9D A0-A9
BC 00-6A 70-7C 80-88 90-99 9C-A3
05 00-27 30-63 6F
D0 00-F5
06 00-FF
D1 00-26 29-E8
07 00-36 40-55 60-67
D2 00-45
08 00-05 08 0A-35 37-38 3C 3F-55 57-9E A7-AF
D3 00-56 60-71
E0-F2 F4-F5 FB-FF
D4 00-54 56-9C 9E-9F A2 A5-A6 A9-AC AE-B9 BB
09 00-1B 1F-39 3F 80-B7 BC-CF D2-FF
BD-C3 C5-FF
0A 00-03 05-06 0C-13 15-17 19-33 38-3A 3F-47
D5 00-05 07-0A 0D-14 16-1C 1E-39 3B-3E 40-44
50-58 60-9F C0-E6 EB-F6
46 4A-50 52-FF
0B 00-35 39-55 58-72 78-91 99-9C A9-AF
D6 00-A5 A8-FF
0C 00-48 80-B2 C0-F2 FA-FF
D7 00-CB CE-FF
0E 60-7E
D8-D9 D800-D9FF
10 00-4D 52-6F 7F-C1 D0-E8 F0-F9
DA 00-8B 9B-AF
11 00-34 36-43 50-76 80-CD D0-DF E1-F4
E8 00-C4 C7-D6
12 00-11 13-3D 80-86 88 8A-8D 8F-9D 9F-A9 B0-
EE 00-03 05-1F 21-22 24 27 29-32 34-37 39 3B
EA F0-F9
42 47 49 4B 4D-4F 51-52 54 57 59 5B 5D 5F
13 00-03 05-0C 0F-10 13-28 2A-30 32-33 35-39
61-62 64 67-6A 6C-72 74-77 79-7C 7E 80-89
3C-44 47-48 4B-4D 50 57 5D-63 66-6C 70-74
8B-9B A1-A3 A5-A9 AB-BB F0-F1
14 80-C7 D0-D9
F0 00-2B 30-93 A0-AE B1-BF C1-CF D1-F5
15 80-B5 B8-DD
F1 00-0C 10-2E 30-6B 70-9A E6-FF
16 00-44 50-59 80-B7 C0-C9
F2 00-02 10-3A 40-48 50-51
17 00-19 1D-2B 30-3F
F3 00-FF
18 A0-F2 FF
F4 00-FF
1A C0-F8
F5 00-79 7B-A3 A5-FF
20-22 2000-22FF
F6 00-D0 E0-EC F0-F3
23 00-99
F7 00-73 80-D4
24 00-6E 70-74 80-FF
F8 00-0B 10-47 50-59 60-87 90-AD
25 00-43
F9 10-18 80-84 C0
30-34 3000-342E
Plane 02
Row Values within row
00-A6 0000-A6D6
A7-B6 A700-B6FF
B7 00-34 40-FF
B8 00-1D 20
B9-CD B900-CDFF
CE 00-A1
F8-FA F800-FA1D
Plane 0E
Row Values within row
00 01 20-7F
01 00-EF
© ISO/IEC 2016 – All rights reserved 9

ISO/IEC 10646:2014/Amd.2:2016 (E)
Plane 0F
Row Values within row
00-FF 0000-FFFD
Plane 10
Row Values within row
00-FF 0000-FFFD
NOTE – The collection 315 UNICODE 8.0 can also be determined by using another fixed collection from Error! Reference
source not found. and several ranges of code points.
Plane 00-10
Collection number and name
314 UNICODE 7.0
Plane 00
Row Values within row
2B EC-EF
08 B3-B4 E3
9F CD-D5
0A F9 A6 9E
0C 5A A7 8F B2-B7
0D 5F
A8 FC-FD
13 F5 F8-FD
AB 60-63 70-BF
20 BE FE 2E-2F
21 8A-8B
Plane 01
Row Values within row
25 00-43
08 E0-F2 F4-F5 FB-FF 44-46 4400-4646
09 BC-BD C0-CF D2-FF D1 DE-E8
0C 80-B2 C0-F2 FA-FF
D8-D9 D800-D9FF
11 C9-CC DB-DF
DA 00-8B 9B-AF
12 80-86 88 8A-8D 8F-9D 9F-A9 F3 2D-2F 7E-7F CF-D3 F8-FF
13 00 50 F4 FF
15 CA-DD
F5 4B-4F
17 00-19 1D-2B 30-3F F6 D0
23 99 F9 10-18 80-84 C0
24 80-FF
Plane 02
Row Values within row
B8 20-FF
B9-CD B900-CDFF
CE 00-A1
Page 2421, Annex G
In first paragraph, replace ‘and CJK ideographs’ with ‘, CJK ideographs, and Tangut ideographs’.
In the list of blocks, insert the following new block:
TANGUT
Insert each of the new character name entries at the appropriate position, ordered alphabetically by the charac-
ter name, in the list of character names in Annex G. These new names are provided in a machine-readable
format that is accessible as a link to this document.
Click on this highlighted text to access the file containing the new names.
NOTE – The content is also available as a separate viewable file in the same file directory as this docu-
ment. The file is named: “Am2names.txt”.
10 © ISO/IEC 2016 – All rights reserved

ISO/IEC 10646:2014/Amd.2:2016 (E)

Page 2423, Annex I.2
Replace the text from the sub-clause, starting from the first paragraph until the first Note with the following text:
An IDS consist of an IDC followed by a fixed number of Description Component (DC) organized in
subgroups corresponding to script category, such as CJK ideographs or Tangut ideographs. An IDS
should only use items belonging to a single subgroup, to clarify the script that it belongs to. The sub-
groups and their contents are as follows:
 CJK IDS, including DCs which may be of any one of the following:
• a coded CJK ideographs, which consists of any coded character from the CJK
UNIFIED IDEOGRAPHS block or the CJK COMPATIBILITY IDEOGRAPHS
blocks,
• a coded CJK radical, which consists of any coded character from the CJK
RADICALS blocks or the KANGXI RADICALS block,
• a coded stroke, which consists of any coded character from the CJK
STROKES block,
• the character FF1F FULL WIDTH QUESTION MARK to represent an otherwise
un-described DC,
• a private use character (as long as the interchanging parties have agreed that
the particular private use character represents a particular CJK ideograph or
component of a CJK ideograph),
• another CJK IDS.
 Tangut IDS, including DCs which may be of any one of the following:
• a Tangut Ideograph, which consists of any character from the TANGUT block,
• a Tangut component, which consists of any coded character from the TANGUT
COMPONENT block,
• a coded stroke, which consists of any coded character from the CJK
STROKES block,
• another Tangut IDS.
NOTE 1 – The above description implies that any IDS may be nested within another IDS in same subgroup.
Page 2453, Annex P
Insert new text after Table P.1:
The Japanese Industrial Standard JIS X 0213:2004 includes a set of characters which were part of
previous JIS standards. The set consists of 2828 characters, of which 2743 were part of JIS X 0212-
1990 (referred as J1 sources in the previous version of this International Standard) and 85 were part
of the ‘Unified Japanese IT Vendors Contemporary Ideographs, 1993’ (referred as JA in the previous
version of this International Standard). Among these 2828 characters, 205 had their glyph represen-
tation slightly modified. These 2828 characters use the ‘J13, J13A, J14, JA3, and JA4’ notations in
their prefix four source identification as described in sub-clause 23.1. Their glyph representation also
reflects the one shown in JIS X 0213:2004. The set of these 2828 characters is further defined as the
fixed collection 374 (see A.4.5).
Page 2459, Annex S.1.6
Insert a new note before the first note, adjusting the consequent note numbers accordingly:
NOTE 1 – The characters from the J source: JIS X 0212-1990 encoded in this International Standard are
listed in the collection 372 JAPANESE IDEOGRAPHICS SUPPLEMENT.
© ISO/IEC 2016 – All rights reserved 11

ISO/IEC 10646:2014/Amd.2 (E)
Page 2460, Annex S.3
Insert a new note after the first note:
NOTE 2 – Seven pairs described below have J sources which were originally part of the JIS X 212-1990 lev-
el-3, therefore covered by the Source separation rule, which are also part of the JIS X 213:2004 with source
identified as J13. The pairs (J13 character code points emphasized) are: U+5861-U+586B, U+6483-U+64CA,
U+75E9-U+7626, U+83D1-U+8458, U+848B-U+8523, U+91A4-U+91AC, and U+985A-U+985B.
12 © ISO/IEC 2016 – All rights reserved

ISO/IEC 10646:2014/Amd.2:2016 (E)
08A0 Arabic Extended-A 08FF
08A 08B 08C 08D 08E 08F
0  ࢰ  
08A0 08B0 08E0 08F0
1 ࢡ ࢱ  
08A1 08B1 08E1 08F1
2  ࢲ  
08A2 08B2 08E2 08F2
3    
08A3 08B3 08E3 08F3
4     
08A4 08B4 08D4 08E4 08F4
5    
08A5 08D5 08E5 08F5
6     
08A6 08B6 08D6 08E6 08F6
7     
08A7 08B7 08D7 08E7 08F7
8     
08A8 08B8 08D8 08E8 08F8
9     
08A9 08B9 08D9 08E9 08F9
A     
08AA 08BA 08DA 08EA 08FA
B     
08AB 08BB 08DB 08EB 08FB
C     
08AC 08BC 08DC 08EC 08FC
D ࢭ    
08AD 08BD 08DD 08ED 08FD
E ࢮ   
08AE 08DE 08EE 08FE
F ࢯ   $ࣿ
08AF 08DF 08EF 08FF
© ISO/IEC 2016 – All rights reserved 13

ISO/IEC 10646:2014/Amd.2:2016 (E)
08A0 Arabic Extended-A 08FD
Arabic letters for African languages Arabic letters for West Africa
08A0  ARABIC LETTER BEH WITH SMALL V BELOW 08BB  ARABIC LETTER AFRICAN FEH
08A1 ࢡ ARABIC LETTER BEH WITH HAMZA ABOVE 08BC  ARABIC LETTER AFRICAN QAF
• Adamawa Fulfulde (Cameroon) 08BD  ARABIC LETTER AFRICAN NOON
• used for the implosive bilabial stop
Pakistani Quranic marks
→ 0253 ɓ latin small letter b with hook
08D4  ARABIC SMALL HIGH WORD AR-RUB
08A2  ARABIC LETTER JEEM WITH TWO DOTS ABOVE
08D5  ARABIC SMALL HIGH SAD
08A3  ARABIC LETTER TAH WITH TWO DOTS ABOVE
08D6  ARABIC SMALL HIGH AIN
08A4  ARABIC LETTER FEH WITH DOT BELOW AND
08D7  ARABIC SMALL HIGH QAF
THREE DOTS ABOVE
08D8  ARABIC SMALL HIGH NOON WITH KASRA
08A5  ARABIC LETTER QAF WITH DOT BELOW
08D9  ARABIC SMALL LOW NOON WITH KASRA
08A6  ARABIC LETTER LAM WITH DOUBLE BAR
08DA  ARABIC SMALL HIGH WORD ATH-THALATHA
08A7  ARABIC LETTER MEEM WITH THREE DOTS
08DB  ARABIC SMALL HIGH WORD AS-SAJDA
ABOVE
08DC  ARABIC SMALL HIGH WORD AN-NISF
08A8  ARABIC LETTER YEH WITH TWO DOTS BELOW
AND HAMZA ABOVE 08DD  ARABIC SMALL HIGH WORD SAKTA
• Adamawa Fulfulde 08DE  ARABIC SMALL HIGH WORD QIF
• used for the implosive palatal approximant, 08DF  ARABIC SMALL HIGH WORD WAQFA
realized as pharyngealization of the
08E0  ARABIC SMALL HIGH FOOTNOTE MARKER
approximant
08E1  ARABIC SMALL HIGH SIGN SAFHA
→ 01B4 ƴ latin small letter y with hook
08E2  ARABIC DISPUTED END OF AYAH
08A9  ARABIC LETTER YEH WITH TWO DOTS BELOW
Extended vowel sign for Arwi
AND DOT ABOVE
• Adamawa Fulfulde 08E3  ARABIC TURNED DAMMA BELOW
• used for the voiced palatal nasal
Extended vowel signs for Rohingya
→ 0272 ɲ latin small letter n with left hook
08E4  ARABIC CURLY FATHA
Dependent consonants for Rohingya
08E5  ARABIC CURLY DAMMA
08AA  ARABIC LETTER REH WITH LOOP
08E6  ARABIC CURLY KASRA
= bottya-reh
08E7  ARABIC CURLY FATHATAN
08AB  ARABIC LETTER WAW WITH DOT WITHIN
08E8  ARABIC CURLY DAMMATAN
= nota-wa
08E9  ARABIC CURLY KASRATAN
08AC  ARABIC LETTER ROHINGYA YEH
Tone marks for Rohingya
= bottya-yeh
08EA  ARABIC TONE ONE DOT ABOVE
Arabic letters for European and Central Asian
08EB  ARABIC TONE TWO DOTS ABOVE
languages
08EC  ARABIC TONE LOOP ABOVE
08AD ࢭ ARABIC LETTER LOW ALEF
08ED  ARABIC TONE ONE DOT BELOW
• Bashkir, Tatar
08EE  ARABIC TONE TWO DOTS BELOW
08AE ࢮ ARABIC LETTER DAL WITH THREE DOTS BELOW
08EF  ARABIC TONE LOOP BELOW
• Belarusian
Quranic annotation signs
08AF ࢯ ARABIC LETTER SAD WITH THREE DOTS BELOW
08F0  ARABIC OPEN FATHATAN
• Belarusian
= successive fathatan
08B0 ࢰ ARABIC LETTER GAF WITH INVERTED STROKE
08F1  ARABIC OPEN DAMMATAN
• Crimean Tatar, Chechen, Lak
= successive dammatan
08B1 ࢱ ARABIC LETTER STRAIGHT WAW
08F2  ARABIC OPEN KASRATAN
• Tatar
= successive kasratan
Arabic letter for Berber
08F3  ARABIC SMALL HIGH WAW
08B2 ࢲ ARABIC LETTER ZAIN WITH INVERTED V ABOVE
Extended vowel signs for African languages
Arabic letters for Arwi
08F4  ARABIC FATHA WITH RING
08B3  ARABIC LETTER AIN WITH THREE DOTS BELOW
08F5  ARABIC FATHA WITH DOT ABOVE
08B4  ARABIC LETTER KAF WITH DOT BELOW
08F6  ARABIC KASRA WITH DOT BELOW
• also used in Philippine languages
Arabic letters for Bravanese
08F7  ARABIC LEFT ARROWHEAD ABOVE
08B6  ARABIC LETTER BEH WITH SMALL MEEM
08F8  ARABIC RIGHT ARROWHEAD ABOVE
ABOVE
08F9  ARABIC LEFT ARROWHEAD BELOW
08B7  ARABIC LETTER PEH WITH SMALL MEEM
08FA  ARABIC RIGHT ARROWHEAD BELOW
ABOVE
08B8  ARABIC LETTER TEH WITH SMALL TEH ABOVE 08FB  ARABIC DOUBLE RIGHT ARROWHEAD ABOVE
08FC  ARABIC DOUBLE RIGHT ARROWHEAD ABOVE
08B9  ARABIC LETTER REH WITH SMALL NOON
WITH DOT
ABOVE
08FD  ARABIC RIGHT ARROWHEAD ABOVE WITH DOT
08BA  ARABIC LETTER YEH WITH TWO DOTS BELOW
AND SMALL NOON ABOVE
14 © ISO/IEC 2016 – All rights reserved

ISO/IEC 10646:2014/Amd.2:2016 (E)
08FE Arabic Extended-A 08FF
Extended vowel signs
08FE  ARABIC DAMMA WITH DOT
• Philippine languages
08FF $ࣿARABIC MARK SIDEWAYS NOON GHUNNA
© ISO/IEC 2016 – All rights reserved 15

ISO/IEC 10646:2014/Amd.2:2016 (E)
0C80 Kannada 0CFF
0C8 0C9 0CA 0CB 0CC 0CD 0CE 0CF
ಠ ರ
0 ಀ ಐ $ೀ ೠ
0C80 0C90 0CA0 0CB0 0CC0 0CE0
1 $ಁಡ ಱ $ು ೡ X
0C81 0CA1 0CB1 0CC1 0CE1 0CF1
2 $ಂ ಒ ಢ ಲ $ೂ $ Y
ೢ
0C82 0C92 0CA2 0CB2 0CC2 0CE2 0CF2
3 $ಃ ಓ ಣ ಳ $ೃ $
ೣ
0C83 0C93 0CA3 0CB3 0CC3 0CE3
4 ಔ ತ $ೄ
0C94 0CA4 0CC4
ಥ ವ
5 ಅ ಕ $ೕ
0C85 0C95 0CA5 0CB5 0CD5
6 ಆ ಖ ದ ಶ $ೆ$ೖ ೦
0C86 0C96 0CA6 0CB6 0CC6 0CD6 0CE6
7 ಇ ಗ ಧ ಷ $ೇ ೧
0C87 0C97 0CA7 0CB7 0CC7 0CE7
8 ಈ ಘ ನ ಸ $ೈ ೨
0C88 0C98 0CA8 0CB8 0CC8 0CE8
9 ಉ ಙ ಹ ೩
0C89 0C99 0CB9 0CE9
ಚ ಪ
Aಊ $ೊ ೪
0C8A 0C9A 0CAA 0CCA 0CEA
B ಋ ಛ ಫ $ೋ ೫
0C8B 0C9B 0CAB 0CCB 0CEB
C ಌ ಜ ಬ $಼$ೌ ೬
0C8C 0C9C 0CAC 0CBC 0CCC 0CEC
D ಝ ಭ ಽ $್ ೭
0C9D 0CAD 0CBD 0CCD 0CED
E ಎ ಞ ಮ $ಾ ೞ ೮
0C8E 0C9E 0CAE 0CBE 0CDE 0CEE
ಟ ಯ
F ಏ $ಿ ೯
0C8F 0C9F 0CAF 0CBF 0CEF
16 © ISO/IEC 2016 – All rights reserved

ISO/IEC 10646:2014/Amd.2:2016 (E)
0C80 Kannada 0CEF
Various signs Various signs
0C80 ಀ KANNADA SIGN SPACING CANDRABINDU 0CBC $಼KANNADA SIGN NUKTA
0C81 $ಁKANNADA SIGN CANDRABINDU 0CBD ಽ KANNADA SIGN AVAGRAHA
0C82 $ಂ KANNADA SIGN ANUSVARA
Dependent vowel signs
0C83 $ಃ KANNADA SIGN VISARGA
0CBE $ಾ KANNADA VOWEL SIGN AA
Independent vowels
0CBF $ಿ KANNADA VOWEL SIGN I
0C85 ಅ KANNADA LETTER A 0CC0 $ೀ KANNADA VOWEL SIGN II
0C86 ಆ KANNADA LETTER AA ≡ 0CBF $ಿ 0CD5 $ೕ
0C87 ಇ KANNADA LETTER I 0CC1 $ು KANNADA VOWEL SIGN U
0C88 ಈ KANNADA LETTER II 0CC2 $ೂ KANNADA VOWEL SIGN UU
0CC3 $ೃ KANNADA VOWEL SIGN VOCALIC R
0C89 ಉ KANNADA LETTER U
0C8A ಊ KANNADA LETTER UU 0CC4 $ೄ KANNADA VOWEL SIGN VOCALIC RR
0C8B ಋ KANNADA LETTER VOCALIC R 0CC5 "
0C8C ಌ KANNADA LETTER VOCALIC L 0CC6 $ೆKANNADA VOWEL SIGN E
0C8D " 0CC7 $ೇ KANNADA VOWEL SIGN EE
≡ 0CC6 $ೆ 0CD5 $ೕ
0C8E ಎ KANNADA LETTER E
0C8F ಏ KANNADA LETTER EE 0CC8 $ೈ KANNADA VOWEL SIGN AI
≡ 0CC6 $ೆ 0CD6 $ೖ
0C90 ಐ KANNADA LETTER AI
0C91 " 0CC9 "
0CCA $ೊ KANNADA VOWEL SIGN O
0C92 ಒ KANNADA LETTER O
≡ 0CC6 $ೆ 0CC2 $ೂ
0C93 ಓ KANNADA LETTER OO
0CCB$ೋ KANNADA VOWEL SIGN OO
0C94 ಔ KANNADA LETTER AU
≡ 0CCA $ೊ 0CD5 $ೕ
Consonants
0CCC $ೌ KANNADA VOWEL SIGN AU
0C95 ಕ KANNADA LETTER KA
Virama
0C96 ಖ KANNADA LETTER KHA
0CCD $್ KANNADA SIGN VIRAMA
0C97 ಗ KANNADA LETTER GA
• preferred name is halant
0C98 ಘ KANNADA LETTER GHA
0C99 ಙ KANNADA LETTER NGA
Various signs
0C9A ಚ KANNADA LETTER CA
0CD5 $ೕ KANNADA LENGTH MARK
0C9B ಛ KANNADA LETTER CHA
0CD6 $ೖ KANNADA AI LENGTH MARK
0C9C ಜ KANNADA LETTER JA
Additional consonants
0C9D ಝ KANNADA LETTER JHA
0CDE ೞ KANNADA LETTER FA
0C9E ಞ KANNADA LETTER NYA
※ KANNADA LETTER LLLA
0C9F ಟ KANNADA LETTER TTA
• obsolete historic letter
0CA0 ಠ KANNADA LETTER TTHA
• name is a mistake for LLLA
0CA1 ಡ KANNADA LETTER DDA
0CA2 ಢ KANNADA LETTER DDHA Additional vowels for Sanskrit
0CA3 ಣ KANNADA LETTER NNA
0CE0 ೠ KANNADA LETTER VOCALIC RR
0CA4 ತ KANNADA LETTER TA
0CE1 ೡ KANNADA LETTER VOCALIC LL
0CA5 ಥ KANNADA LETTER THA
Dependent vowels
0CA6 ದ KANNADA LETTER DA
0CE2 $ KANNADA VOWEL SIGN VOCALIC L
ೢ
0CA7 ಧ KANNADA LETTER DHA
0CE3 $ KANNADA VOWEL SIGN VOCALIC LL
ೣ
0CA8 ನ KANNADA LETTER NA
0CA9 " Reserved
0CAA ಪ KANNADA LETTER PA
For viram punctuation, use the generic Indic 0964 and 0965.
0CAB ಫ KANNADA LETTER PHA
0CE4 "
0CAC ಬ KANNADA LETTER BA
→ 0964 । devanagari danda
0CAD ಭ KANNADA LETTER BHA
0CE5 "
0CAE ಮ KANNADA LETTER MA
→ 0965 ॥ devanagari double danda
0CAF ಯ KANNADA LETTER YA
Digits
0CB0 ರ KANNADA LETTER RA
0CE6 ೦ KANNADA DIGIT ZERO
0CB1 ಱ KANNADA LETTER RRA
0CE7 ೧ KANNADA DIGIT ONE
0CB2 ಲ KANNADA LETTER LA
0CE8 ೨ KANNADA DIGIT TWO
0CB3 ಳ KANNADA LETTER LLA
0CE9 ೩ KANNADA DIGIT THREE
0CB4 "
0CEA ೪ KANNADA DIGIT FOUR
0CB5 ವ KANNADA LETTER VA
0CEB ೫ KANNADA DIGIT FIVE
0CB6 ಶ KANNADA LETTER SHA
0CEC ೬ KANNADA DIGIT SIX
0CB7 ಷ KANNADA LETTER SSA
0CED ೭ KANNADA DIGIT SEVEN
0CB8 ಸ KANNADA LETTER SA
0CEE ೮ KANNADA DIGIT EIGHT
0CB9 ಹ KANNADA LETTER HA
0CEF ೯ KANNADA DIGIT NINE
© ISO/IEC 2016 – All rights reserved 17

ISO/IEC 10646:2014/Amd.2:2016 (E)
0CF1 Kannada 0CF2
Signs used in Sanskrit
0CF1 X KANNADA SIGN JIHVAMULIYA
→ 1CF5 ᳵ vedic sign jihvamuliya
0CF2 Y KANNADA SIGN UPADHMANIYA
→ 1CF6 ᳶ vedic sign upadhmaniya
18 © ISO/IEC 2016 – All rights reserved

ISO/IEC 10646:2014/Amd.2:2016 (E)
0D00 Malayalam 0D7F
0D0 0D1 0D2 0D3 0D4 0D5 0D6 0D7
0 ഐ ഠ ര $ീ ൠ ൰
0D10 0D20 0D30 0D40 0D60 0D70
1 $ഁ ഡ റ $ു ൡ ൱
0D01 0D21 0D31 0D41 0D61 0D71
2 $ം ഒ ഢ ല $ൂ $ൢ൲
0D02 0D12 0D22 0D32 0D42 0D62 0D72
$ൣ
3 $ഃ ഓ ണ ള $ൃ ൳
0D03 0D13 0D23 0D33 0D43 0D63 0D73
4 ഔ ത ഴ $ൄ ൔ ൴
0D14 0D24 0D34 0D44 0D54 0D74
5 അ ക ഥ വ ൕ ൵
0D05 0D15 0D25 0D35 0D55 0D75
6 ആ ഖ ദ ശ $െൖ ൦ ൶
0D06 0D16 0D26 0D36 0D46 0D56 0D66 0D76
7 ഇ ഗ ധ ഷ $േ$ൗ ൧ ൷
0D07 0D17 0D27 0D37 0D47 0D57 0D67 0D77
8ഈഘ ന സ $ൈ൘ ൨൸
0D08 0D18 0D28 0D38 0D48 0D58 0D68 0D78
9 ഉ ങ ഩ ഹ ൙ ൩ ൹
0D09 0D19 0D29 0D39 0D59 0D69 0D79
A ഊ ച പ ഺ $ൊ ൚ ൪ ൺ
0D0A 0D1A 0D2A 0D3A 0D4A 0D5A 0D6A 0D7A
B ഋ ഛ ഫ $ോ ൛ ൫ ൻ
0D0B 0D1B 0D2B 0D4B 0D5B 0D6B 0D7B
C ഌ ജ ബ $ൌ൜ ൬ ർ
0D0C 0D1C 0D2C 0D4C 0D5C 0D6C 0D7C
D ഝ ഭ ഽ $് ൝ ൭ ൽ
0D1D 0D2D 0D3D 0D4D 0D5D 0D6D 0D7D
E എ ഞ മ $ാ ൎ ൞ ൮ ൾ
0D0E 0D1E 0D2E 0D3E 0D4E 0D5E 0D6E 0D7E
F ഏ ട യ $ി ൥ ൟ ൯ ൿ
0D0F 0D1F 0D2F 0D3F 0D4F 0D5F 0D6F 0D7F
© ISO/IEC 2016 – All rights reserved 19

ISO/IEC 10646:2014/Amd.2:2016 (E)
0D01 Malayalam 0D4E
Various signs 0D2D ഭ MALAYALAM LETTER BHA
0D2E മ MALAYALAM LETTER MA
0D01 $ഁ MALAYALAM SIGN CANDRABINDU
• also used to denote the fraction one eightieth
0D02 $ം MALAYALAM SIGN ANUSVARA
(kaani)
0D03 $ഃ MALAYALAM SIGN VISARGA
0D2F യ MALAYALAM LETTER YA
Independent vowels
0D30 ര MALAYALAM LETTER RA
0D05 അ MALAYALAM LETTER A
0D31 റ MALAYALAM LETTER RRA
0D06 ആ MALAYALAM LETTER AA
0D32 ല MALAYALAM LETTER LA
0D07 ഇ MALAYALAM LETTER I
0D33 ള MALAYALAM LETTER LLA
0D08 ഈ MALAYALAM LETTER II
0D34 ഴ MALAYALAM LETTER LLLA
0D09 ഉ MALAYALAM LETTER U
= zha
0D0A ഊ MALAYALAM LETTER UU 0D35 വ MALAYALAM LETTER VA
0D0B ഋ MALAYALAM LETTER VOCALIC R
0D36 ശ MALAYALAM LETTER SHA
= soft sha
0D0C ഌ MALAYALAM LETTER VOCALIC L
0D0D " 0D37 ഷ MALAYALAM LETTER SSA
= sha
0D0E എ MALAYALAM LETTER E
0D38 സ MALAYALAM LETTER SA
0D0F ഏ MALAYALAM LETTER EE
0D39 ഹ MALAYALAM LETTER HA
0D10 ഐ MALAYALAM LETTER AI
0D3A ഺ MALAYALAM LETTER TTTA
0D11 "
• historic use only
0D12 ഒ MALAYALAM LETTER O
0D13 ഓ MALAYALAM LETTER OO
Addition for Sanskrit
0D14 ഔ MALAYALAM LETTER AU
0D3D ഽ MALAYALAM SIGN AVAGRAHA
= praslesham
Consonants
Dependent vowel signs
Alternate romanizations are shown as aliases for some letters
to clarify their identity.
0D3E $ാ MALAYALAM VOWEL SIGN AA
0D15 ക MALAYALAM LETTER KA 0D3F $ി MALAYALAM VOWEL SIGN I
0D16 ഖ MALAYALAM LETTER KHA 0D40 $ീ MALAYALAM VOWEL SIGN II
0D41 $ു MALAYALAM VOWEL SIGN U
0D17 ഗ MALAYALAM LETTER GA
0D18 ഘ MALAYALAM LETTER GHA 0D42 $ൂ MALAYALAM VOWEL SIGN UU
0D43 $ൃ MALAYALAM VOWEL SIGN VOCALIC R
0D19 ങ MALAYALAM LETTER NGA
0D1A ച MALAYALAM LETTER CA 0D44 $ൄ MALAYALAM VOWEL SIGN VOCALIC RR
= cha
0D45 "
0D1B ഛ MALAYALAM LETTER CHA
0D46 $െMALAYALAM VOWEL SIGN E
= chha
• stands to the left of the consonant
0D1C ജ MALAYALAM LETTER JA
0D47 $േMALAYALAM VOWEL SIGN EE
0D1D ഝ MALAYALAM LETTER JHA
• stands to the left of the consonant
0D1E ഞ MALAYALAM LETTER NYA
0D48 $ൈMALAYALAM VOWEL SIGN AI
= nha
• stands to the left of the consonant
0D1F ട MALAYALAM LETTER TTA
Two-part dependent vowel signs
= ta
These vowel signs have glyph pieces which stand on both sides
0D20 ഠ MALAYALAM LETTER TTHA
= tta of the consonant; they follow the consonant in logical order,
0D21 ഡ MALAYALAM LETTER DDA and should be handled as a unit for most processing.
= hard da
0D4A $ൊ MALAYALAM VOWEL SIGN O
0D22 ഢ MALAYALAM LETTER DDHA
≡ 0D46 $െ 0D3E $ാ
= hard dda
0D4B $ോ MALAYALAM VOWEL SIGN OO
0D23 ണ MALAYALAM LETTER NNA
≡ 0D47 $േ 0D3E $ാ
= hard na
0D4C $ൌ MALAYALAM VOWEL SIGN AU
0D24 ത MALAYALAM LETTER TA
• archaic form of the /au/ dependent vowel
= tha
→ 0D57 $ൗ malayalam au length mark
0D25 ഥ MALAYALAM LETTER THA
≡ 0D46 $െ 0D57 $ൗ
= ttha
Virama
0D26 ദ MALAYALAM LETTER DA
= soft da
0D4D $് MALAYALAM SIGN VIRAMA
= candrakkala (the preferred name)
0D27 ധ MALAYALAM LETTER DHA
= soft dda = vowel half-u
0D28 ന MALAYALAM LETTER NA
Dot reph
0D29 ഩ MALAYALAM LETTER NNNA
0D4E ൎ MALAYALAM LETTER DOT REPH
• historic use only
• not used in reformed modern Malayalam
0D2A പ MALAYALAM LETTER PA
orthography
0D2B ഫ MALAYALAM LETTER PHA
0D2C ബ MALAYALAM LETTER BA
20 © ISO/IEC 2016 – All rights reserved

ISO/IEC 10646:2014/Amd.2:2016 (E)
0D4F Malayalam 0D7F
Measurement symbol 0D71 ൱ MALAYALAM NUMBER ONE HUNDRED
0D72 ൲ MALAYALAM NUMBER ONE THOUSAND
0D4F ൥ MALAYALAM SIGN PARA
• used historically to measure rice
Fractions
0D73 ൳ MALAYALAM FRACTION ONE QUARTER
Chillu letters
= kaal
0D54 ൔ MALAYALAM LETTER CHILLU M
0D74 ൴ MALAYALAM FRACTION ONE HALF
0D55 ൕ MALAYALAM LETTER CHILLU Y
= a
...

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.

Loading comments...

Information technology — Universal Coded Character Set (UCS) — Amendment 2: Bhaiksuki, Marchen, Tangut and other characters

Technologies de l'information — Jeu universel de caractères codés (JUC) — Amendement 2: Bhaiksuki, Marchen, Tangut et autres caractères

General Information

Relations

ISO/IEC 10646:2014/Amd 2:2016 - Bhaiksuki, Marchen, Tangut and other characters

Get Certified

BSI Group

NYCE

Frequently Asked Questions

Standards Content (Sample)

Questions, Comments and Discussion

This May Also Interest You