Information technology — Universal Coded Character Set (UCS) — Amendment 2: Bhaiksuki, Marchen, Tangut and other characters

Technologies de l'information — Jeu universel de caractères codés (JUC) — Amendement 2: Bhaiksuki, Marchen, Tangut et autres caractères

General Information

Status
Withdrawn
Publication Date
20-Apr-2016
Withdrawal Date
20-Apr-2016
Current Stage
9599 - Withdrawal of International Standard
Completion Date
20-Dec-2017
Ref Project

Relations

Buy Standard

Standard
ISO/IEC 10646:2014/Amd 2:2016 - Bhaiksuki, Marchen, Tangut and other characters
English language
165 pages
sale 15% off
Preview
sale 15% off
Preview
Standard
ISO/IEC 10646:2014/Amd 2:2016 - Bhaiksuki, Marchen, Tangut and other characters
English language
165 pages
sale 15% off
Preview
sale 15% off
Preview

Standards Content (Sample)

INTERNATIONAL ISO/IEC
STANDARD 10646
Fourth edition
2014-09-01
AMENDMENT 2
2016-05-01
Information technology — Universal
Coded Character Set (UCS)
AMENDMENT 2: Bhaiksuki, Marchen,
Tangut and other characters
Technologies de l’information — Jeu universel de caractères codés (JUC)
AMENDEMENT 2: Bhaiksuki, Marchen, Tangut et autres caractères
Reference number
ISO/IEC 10646:2014/Amd.2:2016(E)
©
ISO/IEC 2016

---------------------- Page: 1 ----------------------
ISO/IEC 10646:2014/Amd.2:2016(E)

COPYRIGHT PROTECTED DOCUMENT
© ISO/IEC 2016, Published in Switzerland
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized otherwise in any form
or by any means, electronic or mechanical, including photocopying, or posting on the internet or an intranet, without prior
written permission. Permission can be requested from either ISO at the address below or ISO’s member body in the country of
the requester.
ISO copyright office
Ch. de Blandonnet 8 • CP 401
CH-1214 Vernier, Geneva, Switzerland
Tel. +41 22 749 01 11
Fax +41 22 749 09 47
copyright@iso.org
www.iso.org
ii © ISO/IEC 2016 – All rights reserved

---------------------- Page: 2 ----------------------
ISO/IEC 10646:2014/Amd.2:2016 (E)

Foreword
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical
Commission) form the specialized system for worldwide standardization. National bodies that are
members of ISO or IEC participate in the development of International Standards through technical
committees estab-lished by the respective organization to deal with particular fields of technical activity.
ISO and IEC technical committees collaborate in fields of mutual interest. Other international
organizations, governmental and non-governmental, in liaison with ISO and IEC, also take part in the
work. In the field of information technology, ISO and IEC have established a joint technical committee,
ISO/IEC JTC 1.
The procedures used to develop this document and those intended for its further maintenance are
described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for the
different types of document should be noted. This document was drafted in accordance with the editorial
rules of the ISO/IEC Directives, Part 2 (see www.iso.org/directives).
Attention is drawn to the possibility that some of the elements of this document may be the subject of
patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent rights.
Details of any patent rights identified during the development of the document will be in the Introduction
and/or on the ISO list of patent declarations received (see www.iso.org/patents).
Any trade name used in this document is information given for the convenience of users and does
not constitute an endorsement.
For an explanation on the meaning of ISO specific terms and expressions related to conformity
assessment, as well as information about ISO's adherence to the WTO principles in the Technical
Barriers to Trade (TBT) see the following URL: Foreword - Supplementary information.
Amendment 2 to ISO/IEC 10646:2014 was prepared by Joint Technical Committee ISO/IEC JTC1,
Information technology, Subcommittee SC 2, Coded character sets.
© ISO/IEC 2016 – All rights reserved iii

---------------------- Page: 3 ----------------------
ISO/IEC 10646:2014/Amd.2:2016 (E)
iv © ISO/IEC 2016 – All rights reserved

---------------------- Page: 4 ----------------------
ISO/IEC 10646:2014/Amd.2:2016 (E)

Information technology — Universal Coded
Character Set (UCS)
AMENDMENT 2: Bhaiksuki, Marchen, Tangut and
other characters
Page 5 Sub-clause 4.25 extended collection
In the Note 3, insert ‘typically’ between ‘are’ and ‘referenced’.
Page 12 Sub-clause 6.4 Naming of character
In the list, insert the following new items after c) and adjust the next list item numbers accordingly:
d) follows the rule given in 25.7 for Tangut ideographs, or
Page 29 Sub-clause 23.1 List of source references
In the list of Hanzi G sources, insert the following new sources:
GFC Modern Chinese Standard Dictionary (现代汉语规范词典第二版。主编:李行健。北京:外语
教学与研究出版社) 2010, ISBN:978-7-5600-9518-9
GGFZ Tongyong Guifan Hanzi Zidian (通用规范汉字字典)
In the list of Kanji J sources, replace ‘2000’ with ‘2004’ for the J3 and J4 sources. Modify the J3A entry to read:
J3A JIS X 0213:2004 level-3 addendum from JIS X 0213:2000 level-3
Add the following entries:
J13 JIS X 0213:2004 level-3 characters replacing J1 characters
J13A JIS X 0213:2004 level-3 addendum from JIS X 0213:2000 level-3 replacing J1 characters
JA3 JIS X 0213:2004 level-3 characters replacing JA characters
J14 JIS X 0213:2004 level-4 characters replacing J1 characters
JA4 JIS X 0213:2004 level-4 characters replacing JA characters
Replace all J1 and JA sources that are also included in JIS X 0213:2004 by the values defined in JIS X
0213:2004 using the prefixes described above (J13, J13A, JA3, J14, and JA4). This represents 85 original JA
sources and 2743 original J1 sources.
After the list of J sources, add the following notes:
NOTE 3 – When a source such as JIS X 0213:2004 includes encoded characters already
included in older referenced sources, the more recent source references are typically used
to reflect contemporary usage. This also reflects the fact that some of these more recent
references were accompanied by minor adjustment for the graphic representation of the
related characters.
© ISO/IEC 2016 – All rights reserved 1

---------------------- Page: 5 ----------------------
ISO/IEC 10646:2014/Amd.2:2016 (E)
NOTE 4 – J Sources that have been partially superseded (J1 and JA) are described in An-
nex A (see A.4.3 and A4.4). They can also be found as provisional sources in the Unihan
Data base in http://www.unicode.org/reports/tr38/ fully described as kJis1 for J1: JIS X
0212-1990 and kJA for the JA source.
In Table 5: Format details of the tags used in the source reference file for CJK Ideographs,
in the line for kIRG_GSource, insert the additional third field format values:
GFC-ddd, and GGFZ-dddddd.
in the line for kIRG_JSource, insert the additional third field format values:
J13-hhhh, J13A-hhhh, JA3-hhhh, J14-hhhh, and JA4-hhhh.
In the same Table 5, in the last row, remove the last sentence starting by “Only the first Radical-Stroke” from
the fourth paragraph.
Insert new source references for the new 9 characters part of CJK Unified Ideographs.
Perform the additional updates to the source references:
 Move U+3D1D H source to U+2A3ED,
 Remove U+4CA4 H source (already moved to U+9FD0),
 Replace U+2AD12 U source with T source: TE-353F,
 Remove U+2B08F U source,
 Replace U+2F949 T source with U source: UCI-01199.
Provide a new reference file with updated references.
(The following text is identical to ISO/IEC 10646:2014 but is linked to the new file)
Click on this highlighted text to access the reference file.
NOTE – The content is also available as a separate viewable file in the same file directory as this document. The file is named:
“CJKSrc.txt”.
Page 32 Sub-clause 23.2 Source references file for CJK Ideographs
In the third paragraph starting with ‘The Format definition’, replace ‘LINE TABULATION control character (000B)’
by ‘CHARACTER TABULATION control character (0009)’.
Page 37 Before Clause 24 Character names and annotations
Insert new clause, table, and figure as follows, renumbering the following clauses, tables, and figures:
24 Source references for Tangut Ideographs
24.1 List of source references
A Tangut ideograph is always referenced by one source reference. These source references are provided
in a machine-readable format that is accessible as a link from this document. The content pointed by
these links is also normative.
The source reference information establishes the character identity for Tangut ideographs. A source ref-
erence is established by associating a Tangut ideograph code point with one value in the source stand-
ards listed below. The Tangut sources are:
H2004-A Hán Xiǎománg 2004 (PhD dissertation on the correct form of Tangut ideographs), H1
sub-division
2 © ISO/IEC 2016 – All rights reserved

---------------------- Page: 6 ----------------------
ISO/IEC 10646:2014/Amd.2:2016 (E)


H2004-B Hán Xiǎománg 2004 (PhD dissertation on the correct form of Tangut ideographs), H2
sub-division
L1986 Lǐ Fànwén 1986 (Study of the Homophones)
st
L1997 Lǐ Fànwén 1997 (Tangut-Chinese Dictionary, 1 edition)
L2006 Lǐ Fànwén 2006 (Comparative Study of Wuyin Qieyun and Wenhai Baoyun)
nd
L2008 Lǐ Fànwén 2008 (Tangut-Chinese Dictionary, 2 edition)
N1966 Nishida 1966 (Little Dictionary of Tangut)
S1968 Sofronov 1968 (Grammar of the Tangut Language)

24.2 Source reference file for Tangut Ideographs
The content linked to is a plain text file, using ISO/IEC 646-IRV characters with CARRIAGE RETURN/
LINE FEED as end of line mark. The file specifies the sources references data for all Tangut Ideographs.
The file also contains information concerning Radical-Stroke index. Each line in the text file contains the
following information organized in three fields:
st
 1 field: UCS code point in the format (U+hhhhh)
nd
 2 field: Tag indicating the type of information in the third field (kTGT_MergedSrc, kRSTUnicode)
rd
 3 field: Information corresponding to the tag value specified by the second field. The Table 6 pro-
vides the format details.
The format definition for the first field uses ‘h’ as a hexadecimal unit. The three fields are delimited by a
CHARACTER TABULATION control character (0009). Comment lines, starting with a NUMBER SIGN (‘#’)
are informational only. Comment lines and blank lines in the text file should be ignored by any automatic
process which parses the data file to extract the source reference information.
Table 6: Format details of the tags used in the source reference file for Tangut ideographs
Tag Value Tag Third field format
description
kTGT_MergedSrc Tangut (H2004-A-dddd), (H2004-B-dddd), (L1986-dddd), (L1997-dddd),
source (l2006-dddd), (L2008-dddd), (L2008-dddd-dddd), (N1966-ddd-ddx),
(N1966-ddd-ddxx), or (S1968-dddd)
kRSTUnicode (d{1,3}.d{1,2})
Radical
Stroke index
The format definition uses ‘d’ as a decimal unit, and ‘x’ as an alphanumerical unit (0 to 9 and A to Z). Uppercase characters, digits
and all other symbols between parentheses appear as shown.
The Radical-Stroke index (d{1,3}.d{1,2}) is an informative field contains one construct: a radical index (one to three digits),
followed by a full stop, and ending by one or two digits for the stroke count. The stroke count includes all strokes for the character,
including the radical. The radical index is an entry on the Tangut Components block at 18800-18AFF, with the index ‘1’ represent-
ing the code point 18800.
NOTE – Concerning Lǐ Fànwén 2008 entries, the notation (L2008-dddd) denotes a character mapped to a single entry
while the notation (L2008-dddd-dddd) denotes a character mapped to two entries.
The following examples shows file entries for the Tangut ideographs 17000, 17001, 17A7A, and 17E91.
EXAMPLES
U+17000 kTGT_MergedSrc L2008-0008
U+17000 kRSTUnicode 1.6
U+17001 kTGT_MergedSrc L2008-0030
U+17001 kRSTUnicode 1.7

U+17A7A kTGT_MergedSrc N1966-204-10GG
U+17A7A kRSTUnicode 141.14
© ISO/IEC 2016 – All rights reserved 3

---------------------- Page: 7 ----------------------
ISO/IEC 10646:2014/Amd.2:2016 (E)


U+17E91 kTGT_MergedSrc L2008-2027-2350
U+17E91 kRSTUnicode 259.10

Click on this highlighted text to access the reference file.
NOTE – The content is also available as a separate viewable file in the same file directory as this document. The file is named:
“TangutSrc.txt”.
24.3 Source reference presentation for Tangut Ideographs
The code charts for Tangut ideographs show one graphic symbol per character, along with additional in-
formation related to their identity such as radical, stroke count, and numeric value of the various source
references.
The graphic representation for the radical is shown immediately below the code point, along with the radi-
cal number and the stroke count. That stroke count includes the radical itself.
NOTE – The radicals shown in this representation are part of the Tangut Components block at 18800-18AFF.
The figure 7 shows an example for characters 177D0-177D2, 177E4-177E6, 177F8-177FA, 1780C-1780E,
and 17820-17822

Figure 7 – Chart presentation for TANGUT IDEOGRAPHS
Page 37 Clause 24 Character names and annotations
The clause is now renumbered 25. Insert one new sub-clause after 25.6 Character names for CJK Ideographs
and renumber the following sub-clause accordingly.
25.7  Character names for Tangut Ideographs
For Tangut ideographs the names are algorithmically constructed by appending their coded representa-
tion in their five hexadecimal digit notation to “TANGUT IDEOGRAPH-”. For example, the first Tangut
Ideograph has the name “TANGUT IDEOGRAPH-17000”.
Page 47 Sub-Clause 31.2 Code chart
The sub-clause is now renumbered 32.2. Replace the two last sentences starting with “Code charts” with:
Code charts for CJK ideographs and Tangut ideographs have different formats. See Clause 23 and 24.
Page 47-2380 Clause 31, Code charts and list of character names
The clause is now renumbered 32.
1. Modifications to existing blocks
Insert the additional character glyphs and names at the indicated positions in the blocks given below.
4  © ISO/IEC 2016 – All rights reserved

---------------------- Page: 8 ----------------------
ISO/IEC 10646:2014/Amd.2:2016 (E)


Plane 00
Arabic Extended-A
Kannada
Malayalam
Currency Symbols
Miscellaneous Technical
Supplemental Punctuation
CJK Unified Ideographs
Latin Extended-D
Saurashtra
Plane 01
Ancient Greek Numbers
Khojki
Miscellaneous Symbols and Pictographs
Emoticons
Transport and Map Symbols
These blocks contain new characters and names at the following code positions:
08B6-08BD 08D4-08E2 0C80 0D4F 0D54-0D56 0D58-0D5E 0D76-0D78 20BE 23FB-23FE 2E43-
2E44 9FCD-9FD5 A7AE A8C5 1018D-1018E 1123E 1F32D-1F32F 1F37E-1F37F 1F3CF-1F3D3
1F3F8-1F3FF 1F4FF 1F54B-1F54E 1F643-1F644 1F6D0
updated graphic representation at the following code positions:
19B0-19C0, 19C8-19C9, 301C, 3127, 3D1D, 4CA4, 7921, 2AD12, and 2B08F.
2. New blocks
Insert the following additional blocks.
Plane 00
Cyrillic Extended-C
Plane 01
Osage
Mongolian Supplement
Bhaiksuki
Marchen
Ideographic Symbols and Punctuation
Tangut
Tangut Components
Glagolitic Supplement
Supplemental Symbols and Pictographs
These blocks add new characters and names at the following code positions:
1C80-1C88 104B0-104D3 104D8-104FB 11660-1166C 11C00-11C08 11C0A-11C36 11C38-
11C45 11C50-11C6C 11C70-11C8F 11C92-11CA7 11CA9-11CB6 16FE0 17000-187EC 18800-
18AF2 1E000-1E006 1E008-1E018 1E01B-1E021 1E023-1E024 1E026-1E02A 1F910-1F918
1F980-1F984 1F9C0
© ISO/IEC 2016 – All rights reserved 5

---------------------- Page: 9 ----------------------
ISO/IEC 10646:2014/Amd.2:2016 (E)

Page 2381, Annex A.1
Add a ‘*’ (for fixed collection) for the following collection:
1046 EMOTICONS
In the list of collection numbers and names, after
166 CHEROKEE SUPPLEMENT
insert a new entry as follows:
167 CYRILLIC EXTENDED-C 1C80-1C8F
In the list of collection numbers and names, after
1092 SUTTON SIGNWRITING
insert new entries as follows:
1093 OSAGE 104B0-104FF
1094 MONGOLIAN SUPPLEMENT 11660-1167F
1095 BHAIKSUKI 11C00-11C6F
1096 MARCHEN 11C70-11CBF
1097 IDEOGRAPHIC SYMBOLS AND PUNCTUATION
16FE0-16FFF
1098 TANGUT 17000-187EF
1199 TANGUT COMPONENTS 18800-18AFF
1100 GLAGOLITIC SUPPLEMENT 1E000-1E02F
1101 SUPPLEMENTAL SYMBOLS AND PICTOGRAPHS
1F900-1F9FF
In the list of collection numbers and names, after
314 UNICODE 7.0
insert a new entry as follows:
315 UNICODE 8.0 see A.6.14 *
In the list of collection numbers and names, after
372 JAPANESE IDEOGRAPHICS SUPPLEMENT
insert new entries as follows:
373 JAPANESE IT VENDORS CONTEMPORARY IDEOGRAPHS-1993 see A.4.4 *
374 JAPANESE JIS X 0213:2004 IDEOGRAPHS FROM PREVIOUS JIS STANDARDS see A.4.5 *
Page 2385, annex A.1
In the alphabetical list of keywords in Note 3, add the value 167 to the Cyrillic entry, the value 1094 to the Mon-
golian, the value 1097 to the Tamil entry, the value 1098 to the Punctuation entry and the Symbols entry re-
spectively, the value 1101 to the Glagolitic entry, the value 1102 to the Symbols entry, the values 2003, 2004,
and 2005 to the CJK entry; and insert the following entries:
Bhaiksuki 1095
Marchen 1096
Osage 1093
Tangut 1098 1099
6  © ISO/IEC 2016 – All rights reserved

---------------------- Page: 10 ----------------------
ISO/IEC 10646:2014/Amd.2:2016 (E)


Page 2387, Annex A.2.1
In the list of blocks in the BMP, insert the following new entry:
CYRILLIC EXTENDED-C 1C80-1C8F
Page 2388, Annex A.2.2
In the list of blocks in the SMP, insert the following new entries:
OSAGE 104B0-104FF
MONGOLIAN SUPPLEMENT 11660-1667F
BHAIKSUKI 11C00-11C6F
MARCHEN 11C70-11CBF
IDEOGRAPHIC SYMBOLS AND PUNCTUATION
16FE0-16FFF
TANGUT 17000-187EF
TANGUT COMPONENTS 18800-18AFF
GLAGOLITIC SUPPLEMENT 1E000-1E02F
SUPPLEMENTAL SYMBOLS AND PICTOGRAPHS
1F900-1F9FF
Page 2393, Annex A.4
Replace Annex A.4.2 and A.4.3 and insert new A.4.4 and A.4.6 sub-clauses as follows:
A.4.2 371 JIS2004 IDEOGRAPHICS EXTENSION
The fixed collection 371 JIS2004 IDEOGRAPHICS EXTENSION consists of all level 3 and level 4 CJK
characters defined in JIS X 0213:2004. It contains 3 695 characters.
The code points of this collection are identified by J3 Kanji J sources (J3, J3A, J13, J13A, and JA3) and
J4 Kanji J sources (J4, J14, and JA4) in the Source Reference file for CJK Unified Ideographs
(CJKSrc.txt). See 23.1 and 23.2 for details.
A.4.3 372 JAPANESE IDEOGRAPHICS SUPPLEMENT
The fixed collection 372 JAPANESE IDEOGRAPHICS SUPPLEMENT consists of all CJK characters de-
fined in JIS X 0212:1990. It contains 5 801 characters.
NOTE – 2 743 characters are common between the collections 371 and 372.
The code points of this collection are identified by the Kanji J sources: J1, J13, J13A, and J14 in the
Source Reference file for CJK Unified Ideographs (CJKSrc.txt). See 23.1 for further details.
A.4.4 373 JAPANESE IT VENDORS CONTEMPORARY IDEOGRAPHS-1993
The fixed collection 373 JAPANESE IT VENDORS CONTEMPORARY IDEOGRAPHS-1993 consists of all
CJK characters defined in the Unified Japanese IT Vendors Contemporary Ideographs, 1993 set. It con-
tains 660 characters.
The code points of this collection are identified by the Kanji J sources: JA, JA3, and JA4 in the Source
Reference file for CJK Unified Ideographs (CJKSrc.txt). See 23.1 for further details.
A.4.5 374 JAPANESE JIS X 0213:2004 IDEOGRAPHS FROM PREVIOUS JIS STANDARDS
The fixed collection 374 JAPANESE JIS X 0213:2004 IDEOGRAPHS FROM PREVIOUS JIS
STANDARDS The set consists of 2828 characters, of which 2743 were part of JIS X 0212-1990 (referred
as J1 sources in the previous version of this International Standard) and 85 were part of the ‘Unified Jap-
anese IT Vendors Contemporary Ideographs, 1993’ (referred as JA in the previous version of this Interna-
tional Standard). Among these 2828 characters, 205 had their glyph representation slightly modified.
© ISO/IEC 2016 – All rights reserved 7

---------------------- Page: 11 ----------------------
ISO/IEC 10646:2014/Amd.2:2016 (E)

These 2828 characters use the ‘J13, J13A, J14, JA3, and JA4’ notations in their prefix four source identifi-
cation as described in sub-clause 23.1.
NOTE 1 – Given its large size and the large number of sparse ranges, the collection is not specified by code point ranges in
this document but instead by a linked content.
The content linked to is a plain text file, using ISO/IEC 646-IRV characters with CARRIAGE RETURN/
LINE FEED as end of line mark, which specifies, after a 7-lines header, as many lines as characters in the
collection; each containing the following information in fixed length field:
 1st field: BMP UCS code point (hhhh), normative.
nd
 2 field: J Source reference in previous editions of this standard.
rd
 3 field: J Source reference in this standard
The fields are delimited by a SPACE character; the format definition for the code point uses ‘h’ as a hexa-
decimal unit. The J Source references use the format specified in 23.2.
Click on this highlighted text to access the reference file.
NOTE 2 – The content is also available as a separate viewable file in the same file directory as this document. The file is
named: “JIS-X-0213-FromPrevious”.
Page 2409, Annex A.6
Add a new Annex A.6.14 as follows:
A.6.14 315 UNICODE 8.0
The fixed collection 315 UNICODE 8.0 is arranged by planes as follows.
Plane 00
11 00-FF
Row Values within row
12 00-48 4A-4D 50-56 58 5A-5D 60-88 8A-8D
00 20-7E A0-FF
90-B0 B2-B5 B8-BE C0 C2-C5 C8-D6 D8-FF
01-02 0100-02FF
13 00-10 12-15 18-5A 5D-7C 80-99 A0-F5 F8-FD
03 00-77 7A-7F 84-8A 8C 8E-A1 A3-FF
14-15 1400-15FF
04 00-FF
16 00-9C A0-F8
05 00-2F 31-56 59-5F 61-87 89-8A 8D-8F 91-C7
17 00-0C 0E-14 20-36 40-53 60-6C 6E-70 72-73
D0-EA F0-F4
80-DD E0-E9 F0-F9
06 00-1C 1E-FF
18 00-0E 10-19 20-77 80-AA B0-F5
07 00-0D 0F-4A 4D-B1 C0-FA
19 00-1E 20-2B 30-3B 40 44-6D 70-74 80-AB B0-
08 00-2D 30-3E 40-5B 5E A0-B4 E3-FF
C9 D0-DA DE-FF
09 00-83 85-8C 8F-90 93-A8 AA-B0 B2 B6-B9 BC-
1A 00-1B 1E-5E 60-7C 7F-89 90-99 A0-AD B0-BE
C4 C7-C8 CB-CE D7 DC-DD DF-E3 E6-FB
1B 00-4B 50-7C 80-F3 FC-FF
0A 01-03 05-0A 0F-10 13-28 2A-30 32-33 35-36
1C 00-37 3B-49 4D-7F C0-C7 D0-F6 F8-F9
38-39 3C 3E-42 47-48 4B-4D 51 59-5C 5E 66-
1D 00-F5 FC-FF
75 81-83 85-8D 8F-91 93-A8 AA-B0 B2-B3 B5-
1E 00-FF
B9 BC-C5 C7-C9 CB-CD D0 E0-E3 E6-F1 F9
1F 00-15 18-1D 20-45 48-4D 50-57 59 5B 5D 5F-
0B 01-03 05-0C 0F-10 13-28 2A-30 32-33 35-39
7D 80-B4 B6-C4 C6-D3 D6-DB DD-EF F2-F4
3C-44 47-48 4B-4D 56-57 5C-5D 5F-63 66-77
F6-FE
82-83 85-8A 8E-90 92-95 99-9A 9C 9E-9F A3-
20 00-64 66-71 74-8E 90-9C A0-BE D0-F0
A4 A8-AA AE-B9 BE-C2 C6-C8 CA-CD D0 D7
21 00-8B 90-FF
E6-FA
22 00-FF
0C 00-03 05-0C 0E-10 12-28 2A-39 3D-44 46-48
23 00-FA
4A-4D 55-56 58-5A 60-63 66-6F 78-7F 81-83
24 00-26 40-4A 60-FF
85-8C 8E-90 92-A8 AA-B3 B5-B9 BC-C4 C6-C8
25 00-FF
CA-CD D5-D6 DE E0-E3 E6-EF F1-F2
26 00-FF
0D 01-03 05-0C 0E-10 12-3A 3D-44 46-48 4A-4E
27 00-FF
57 5F-63 66-75 79-7F 82-83 85-96 9A-B1 B3-
28-2A 2800-2AFF
BB BD C0-C6 CA CF-D4 D6 D8-DF E6-EF F2-F4
2B 00-73 76-95 98-B9 BD-C8 CA-D1 EC-EF
0E 01-3A 3F-5B 81-82 84 87-88 8A 8D 94-97 99-
2C 00-2E 30-5E 60-F3 F9-FF
9F A1-A3 A5 A7 AA-A8 AD-B9 BB-BD C0-C4 C6
2D 00-25 27 2D 30-67 6F-70 7F-96 A0-A6 A8-AE
C8-CD D0-D9 DC-DF
B0-B6 B8-BE C0-C6 C8-CE D0-D6 D8-DE E0-FF
0F 00-47 49-6C 71-97 99-BC BE-CC CE-DA
2E 00-42 80-99 9B-F3
10 00-C5 C7 CD D0-FF
8  © ISO/IEC 2016 – All rights reserved

---------------------- Page: 12 ----------------------
ISO/IEC 10646:2014/Amd.2:2016 (E)


2F 00-D5 F0-FB AA 00-36 40-4D 50-59 5C-C2 DB-F6
30 00-3F 41-96 99-FF AB 01-06 09-0E 11-16 20-26 28-2E 30-65 70-ED
31 05-2D 31-8E 90-BA C0-E3 F0-FF F0-F9
32 00-1E 20-FE AC-D6 AC00-D6FF
33 00-FF D7 00-A3 B0-C6 CB-FB
34-4C 3400-4CFF E0-F8 E000-F8FF
4D 00-B5 C0-FF F9 00-FF
4E-9F 4E00-9FD5 FA 00-6D 70-D9
A0-A3 A000-A3FF FB 00-06 13-17 1D-36 38-3C 3E 40-41 43-44 46-
A4 00-8C 90-C6 D0-FF C1 D3-FF
A5 00-FF FC 00-FF
A6 00-2B 40-F7 FD 00-3F 50-8F 92-C7 F0-FD
A7 00-AD B0-B7 F7-FF FE 00-19 20-52 54-66 68-6B 70-74 76-FC FF
A8 00-2B 30-39 40-77 80-C4 CE-D9 E0-FD FF 01-BE C2-C7 CA-CF D2-D7 DA-DC E0-E6 E8-
A9 00-53 5F-7C 80-CD CF-D9 DE-FE EE F9-FD

Plane 01
44-46 4400-4646
Row Values within row
68-69 6800-69FF
00 00-0B 0D-26 28-3A 3C-3D 3F-4D 50-5D 80-FA
6A 00-38 40-5E 60-69 6E-6F D0-ED F0-F5
01 00-02 07-33 37-8C 90-9B A0 D0-FD
6B 00-45 50-59 5B-61 63-77 7D-8F
02 80-9C A0-D0 E0-FB
6F 00-44 50-7E 8F-9F
03 00-23 30-4A 50-7A 80-9D 9F-C3 C8-D5
B0 00-01
04 00-9D A0-A9
BC 00-6A 70-7C 80-88 90-99 9C-A3
05 00-27 30-63 6F
D0 00-F5
06 00-FF
D1 00-26 29-E8
07 00-36 40-55 60-67
D2 00-45
08 00-05 08 0A-35 37-38 3C 3F-55 57-9E A7-AF
D3 00-56 60-71
E0-F2 F4-F5 FB-FF
D4 00-54 56-9C 9E-9F A2 A5-A6 A9-AC AE-B9 BB
09 00-1B 1F-39 3F 80-B7 BC-CF D2-FF
BD-C3 C5-FF
0A 00-03 05-06 0C-13 15-17 19-33 38-3A 3F-47
D5 00-05 07-0A 0D-14 16-1C 1E-39 3B-3E 40-44
50-58 60-9F C0-E6 EB-F6
46 4A-50 52-FF
0B 00-35 39-55 58-72 78-91 99-9C A9-AF
D6 00-A5 A8-FF
0C 00-48 80-B2 C0-F2 FA-FF
D7 00-CB CE-FF
0E 60-7E
D8-D9 D800-D9FF
10 00-4D 52-6F 7F-C1 D0-E8 F0-F9
DA 00-8B 9B-AF
11 00-34 36-43 50-76 80-CD D0-DF E1-F4
E8 00-C4 C7-D6
12 00-11 13-3D 80-86 88 8A-8D 8F-9D 9F-A9 B0-
EE 00-03 05-1F 21-22 24 27 29-32 34-37 39 3B
EA F0-F9
42 47 49 4B 4D-4F 51-52 54 57 59 5B 5D 5F
13 00-03 05-0C 0F-10 13-28 2A-30 32-33 35-39
61-62 64 67-6A 6C-72 74-77 79-7C 7E 80-89
3C-44 47-48 4B-4D 50 57 5D-63 66-6C 70-74
8B-9B A1-A3 A5-A9 AB-BB F0-F1
14 80-C7 D0-D9
F0 00-2B 30-93 A0-AE B1-BF C1-CF D1-F5
15 80-B5 B8-DD
F1 00-0C 10-2E 30-6B 70-9A E6-FF
16 00-44 50-59 80-B7 C0-C9
F2 00-02 10-3A 40-48 50-51
17 00-19 1D-2B 30-3F
F3 00-FF
18 A0-F2 FF
F4 00-FF
1A C0-F8
F5 00-79 7B-A3 A5-FF
20-22 2000-22FF
F6 00-D0 E0-EC F0-F3
23 00-99
F7 00-73 80-D4
24 00-6E 70-74 80-FF
F8 00-0B 10-47 50-59 60-87 90-AD
25 00-43
F9 10-18 80-84 C0
30-34 3000-342E

Plane 02
Row Values within row
00-A6 0000-A6D6
A7-B6 A700-B6FF
B7 00-34 40-FF
B8 00-1D 20
B9-CD B900-CDFF
CE 00-A1
F8-FA F800-FA1D

Plane 0E
Row Values within row
00 01 20-7F
01 00-EF
© ISO/IEC 2016 – All rights reserved 9

---------------------- Page: 13 ----------------------
ISO/IEC 10646:2014/Amd.2:2016 (E)
Plane 0F
Row Values within row
00-FF 0000-FFFD
Plane 10
Row Values within row
00-FF 0000-FFFD
NOTE – The collection 315 UNICODE 8.0 can also be determined by using another fixed collection from Error! Reference
source not found. and several ranges of code points.
Plane 00-10
Collection number and name
314 UNICODE 7.0
Plane 00
Row Values within row
2B EC-EF
08 B3-B4 E3
9F CD-D5
0A F9 A6 9E
0C 5A A7 8F B2-B7
0D 5F
A8 FC-FD
13 F5 F8-FD
AB 60-63 70-BF
20 BE FE 2E-2F
21 8A-8B
Plane 01
Row Values within row
25 00-43
08 E0-F2 F4-F5 FB-FF 44-46 4400-4646
09 BC-BD C0-CF D2-FF D1 DE-E8
0C 80-B2 C0-F2 FA-FF
D8-D9 D800-D9FF
11 C9-CC DB-DF
DA 00-8B 9B-AF
12 80-86 88 8A-8D 8F-9D 9F-A9 F3 2D-2F 7E-7F CF-D3 F8-FF
13 00 50 F4 FF
15 CA-DD
F5 4B-4F
17 00-19 1D-2B 30-3F F6 D0
23 99 F9 10-18 80-84 C0
24 80-FF
Plane 02
Row Values within row
B8 20-FF
B9-CD B900-CDFF
CE 00-A1
Page 2421, Annex G
In first paragraph, replace ‘and CJK ideographs’ with ‘, CJK ideographs, and Tangut ideographs’.
In the list of blocks, insert the following new block:
TANGUT
Insert each of the new character name entries at the appropriate position, ordered alphabetically by the charac-
ter name, in the list of character names in Annex G. These new names are provided in a machine-readable
format that is accessible as a link to this document.
Click on this highlighted text to access the file containing the new names.
NOTE – The content is also available as a separate viewable file in the same file directory as this docu-
ment. The file is named: “Am2names.txt”.
10 © ISO/IEC 2016 – All rights reserved

---------------------- Page: 14 ----------------------
ISO/IEC 10646:2014/Amd.2:2016 (E)


Page 2423, Annex I.2
Replace the text from the sub-clause, starting from the first paragraph until the first Note with the following text:
An IDS consist of an IDC followed by a fixed number of Description Component (DC) organized in
subgroups corresponding to script category, such as CJK ideographs or Tangut ideographs. An IDS
should only use items belonging to a single subgroup, to clarify the script that it belongs to. The sub-
groups and their contents are as follows:
 CJK IDS, including DCs which may be of any one of the following:
• a coded CJK ideographs, which consists of any coded character from the CJK
UNIFIED IDEOGRAPHS block or the CJK COMPATIBILITY IDEOGRAPHS
blocks,
• a coded CJK radical, which consists of any coded character from the CJK
RADICALS blocks or the KANGXI RADICALS block,
• a coded stroke, which consists of any coded character from the CJK
STROKES block,
• the character FF1F FULL WIDTH QUESTION MARK to represent an otherwise
un-described DC,
• a private use character (as long as the interchanging parties have agreed that
the particular private use character represents a particular CJK ideograph or
component of a CJK ideograph),
• another CJK IDS.
 Tangut IDS, including DCs which may be of any one of the following:
• a Tangut Ideograph, which consists of any character from the TANGUT block,
• a Tangut component, which consists of any coded character from the TANGUT
COMPONENT block,
• a coded stroke, which consists of any coded character from the CJK
STROKES block,
• another Tangut IDS.
NOTE 1 – The above description implies that any IDS may be nested within another IDS in same subgroup.
Page 2453, Annex P
Insert new text after Table P.1:
The Japanese Industrial Standard JIS X 0213:2004 includes a set of characters which were part of
previous JIS standards. The set consists of 2828 characters, of which 2743 were part of JIS X 0212-
1990 (referred as J1 sources in the previous version of this International Standard) and 85 were part
of the ‘Unified Japanese IT Vendors Contemporary Ideographs, 1993’ (referred as JA in the previous
version of this International Standard). Among these 2828 characters, 205 had their glyph represen-
tation slightly modified. These 2828 characters use the ‘J13, J13A, J14, JA3, and JA4’ notations in
their prefix four source identification as described in sub-clause 23.1. Their glyp
...

DRAFT AMENDMENT
ISO/IEC 10646: DAM 2
ISO/IEC JTC 1/SC 2 Secretariat: JISC
Voting begins on: Voting terminates on:
2015-07-06 2015-10-06
Information technology — Universal Coded Character Set
(UCS)
AMENDMENT 2: Bhaiksuki, Marchen, Tangut, and other
characters
Technologies de l’information — Jeu universel de caractères codés (JUC)
AMENDEMENT 2
ICS: 35.040
THIS DOCUMENT IS A DRAFT CIRCULATED
FOR COMMENT AND APPROVAL. IT IS
THEREFORE SUBJECT TO CHANGE AND MAY
NOT BE REFERRED TO AS AN INTERNATIONAL
STANDARD UNTIL PUBLISHED AS SUCH.
IN ADDITION TO THEIR EVALUATION AS
BEING ACCEPTABLE FOR INDUSTRIAL,
TECHNOLOGICAL, COMMERCIAL AND
USER PURPOSES, DRAFT INTERNATIONAL
STANDARDS MAY ON OCCASION HAVE TO
BE CONSIDERED IN THE LIGHT OF THEIR
POTENTIAL TO BECOME STANDARDS TO
WHICH REFERENCE MAY BE MADE IN
Reference number
NATIONAL REGULATIONS.
ISO/IEC 10646:2014/DAM 2:2015(E)
RECIPIENTS OF THIS DRAFT ARE INVITED
TO SUBMIT, WITH THEIR COMMENTS,
NOTIFICATION OF ANY RELEVANT PATENT
RIGHTS OF WHICH THEY ARE AWARE AND TO
©
PROVIDE SUPPORTING DOCUMENTATION. ISO/IEC 2015

---------------------- Page: 1 ----------------------
ISO/IEC 10646:2014/DAM 2:2015(E)

COPYRIGHT PROTECTED DOCUMENT
© ISO/IEC 2015, Published in Switzerland
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized otherwise in any form
or by any means, electronic or mechanical, including photocopying, or posting on the internet or an intranet, without prior
written permission. Permission can be requested from either ISO at the address below or ISO’s member body in the country of
the requester.
ISO copyright office
Ch. de Blandonnet 8 • CP 401
CH-1214 Vernier, Geneva, Switzerland
Tel. +41 22 749 01 11
Fax +41 22 749 09 47
copyright@iso.org
www.iso.org
ii © ISO/IEC 2015 – All rights reserved

---------------------- Page: 2 ----------------------
Draft Amendment (DAM) 2 ISO/IEC 10646:2014/Amd.2:2015 (E)

Foreword
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Com-
mission) form the specialized system for worldwide standardization. National bodies that are members of
ISO or IEC participate in the development of International Standards through technical committees estab-
lished by the respective organization to deal with particular fields of technical activity. ISO and IEC tech-
nical committees collaborate in fields of mutual interest. Other international organizations, governmental
and non-governmental, in liaison with ISO and IEC, also take part in the work. In the field of information
technology, ISO and IEC have established a joint technical committee, ISO/IEC JTC1.
International Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part 2.
The main task of the joint technical committee is to prepare International Standards. Draft International
Standards adopted by the joint technical committee are circulated to national bodies for voting. Publication
as an International Standard requires approval by at least 75% of the national bodies casting a vote.
Attention is drawn to the possibility that some of the elements of ISO/IEC 10646 may be the subject of
patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent rights.
This International Standard was prepared by Joint Technical Committee ISO/IEC JTC1, Information tech-
nology, Subcommittee SC 2, Coded Character sets.
Amendment 2 to ISO/IEC 10646:2014 was prepared by Joint Technical Committee ISO/IEC JTC1, Infor-
mation technology, Subcommittee SC 2, Coded character sets.

© ISO/IEC 2015 – All rights reserved iii

---------------------- Page: 3 ----------------------
Draft Amendment (DAM) 2
ISO/IEC 10646:2014/Amd.1:2015 (E)


iv  © ISO/IEC 2015 – All rights reserved

---------------------- Page: 4 ----------------------
Draft Amendment (DAM) 2 ISO/IEC 10646:2014/Amd.2:2015 (E)


Information technology — Universal
Coded Character Set (UCS) —


AMENDMENT 2: Bhaiksuki, Marchen, Tangut, and other
characters

Page 5 Sub-clause 4.25 extended collection
In the Note 3, insert ‘typically’ between ‘are’ and ‘referenced’.
Page 12 Sub-clause 6.4 Naming of character
In the list, insert the following new items after c) and adjust the next list item numbers accordingly:
d) follows the rule given in 25.7 for Tangut ideographs, or
Page 29 Sub-clause 23.1 List of source references
In the list of Hanzi G sources, insert the following new sources:
GFC Modern Chinese Standard Dictionary (现代汉语规范词典第二版。主编:李行健。北京:外语
教学与研究出版社) 2010, ISBN:978-7-5600-9518-9
GGFZ Tongyong Guifan Hanzi Zidian (通用规范汉字字典)
In the list of Kanji J sources, replace ‘2000’ with ‘2004’ for the J3 and J4 sources. Modify the J3A entry to read:
J3A JIS X 0213:2004 level-3 addendum from JIS X 0213:2000 level-3
Add the following entries:
J3B JIS X 0213:2004 level-3 characters replacing J1 characters
J3C JIS X 0213:2004 level-3 characters replacing JA characters
J4B JIS X 0213:2004 level-4 characters replacing J1 characters
J4C JIS X 0213:2004 level-4 characters replacing JA characters
Replace all J1 and JA sources that are also included in JIS X 0213:2004 by the values defined in JIS X
0213:2004 using the prefixes described above (J3B, J3C, J4B, and J4C). This represents 85 original JA
sources and 2743 original J1 sources.
After the list of J sources, add the following notes:
NOTE 3 – When a source such as JIS X 0213:2004 includes encoded characters already
included in older referenced sources, the more recent source references are typically used
to reflect contemporary usage. This also reflects the fact that some of these more recent
references were accompanied by minor adjustment for the graphic representation of the
related characters.

NOTE 4 – J Sources that have been partially superseded (J1 and JA) are described in An-
nex A (see A.4.3 and A4.4). They can also be found as provisional sources in the Unihan
© ISO/IEC 2015 – All rights reserved 1

---------------------- Page: 5 ----------------------
Draft Amendment (DAM) 2
ISO/IEC 10646:2014/Amd.2:2015 (E)

Data base in http://www.unicode.org/reports/tr38/ fully described as kJis1 for J1: JIS X
0212-1990 and kJA for the JA source.
In Table 5: Format details of the tags used in the source reference file for CJK Ideographs,
in the line for kIRG_GSource, insert the additional third field format values:
GFC-ddd, and GGFZ-dddddd.
in the line for kIRG_JSource, insert the additional third field format values:
J3B-hhhh, J3C-hhhh, J4B-hhhh, and J4C-hhhh.
In the same Table 5, in the last row, remove the last sentence starting by “Only the first Radical-Stroke” from
the fourth paragraph.
Insert new source references for the new 9 characters part of CJK Unified Ideographs.
Perform the additional updates to the source references:
• Move U+3D1D H source to U+2A3ED,
• Remove U+4CA4 H source (already moved to U+9FD0),
• Replace U+2AD12 U source with T source: TE-353F,
• Remove U+2B08F U source,
• Replace U+2F949 T source with U source: UCI-01199.
Provide a new reference file with updated references.
(The following text is identical to ISO/IEC 10646:2014 but is linked to the new file)
Click on this highlighted text to access the reference file.
NOTE – The content is also available as a separate viewable file in the same file directory as this document. The file is named:
“CJKSrc.txt”.
Page 32 Sub-clause 23.2 Source references file for CJK Ideographs
In the third paragraph starting with ‘The Format definition’, replace ‘LINE TABULATION control character (000B)’
by ‘CHARACTER TABULATION control character (0009)’.
Page 37 Before Clause 24 Character names and annotations
Insert new clause, table, and figure as follows, renumbering the following clauses, tables, and figures:
24 Source references for Tangut Ideographs
24.1 List of source references
A Tangut ideograph is always referenced by one source reference. These source references are provided
in a machine-readable format that is accessible as a link from this document. The content pointed by
these links is also normative.
The source reference information establishes the character identity for Tangut ideographs. A source ref-
erence is established by associating a Tangut ideograph code point with one value in the source stand-
ards listed below. The Tangut sources are:
H2004-A Hán Xiǎománg 2004 (PhD dissertation on the correct form of Tangut ideographs), H1
sub-division
H2004-B Hán Xiǎománg 2004 (PhD dissertation on the correct form of Tangut ideographs), H2
sub-division
2  © ISO/IEC 2015 – All rights reserved

---------------------- Page: 6 ----------------------
Draft Amendment (DAM) 2 ISO/IEC 10646:2014/Amd.2:2015 (E)


L1986 Lǐ Fànwén 1986 (Study of the Homophones)
st
L1997 Lǐ Fànwén 1997 (Tangut-Chinese Dictionary, 1 edition)
L2006 Lǐ Fànwén 2006 (Comparative Study of Wuyin Qieyun and Wenhai Baoyun)
nd
L2008 Lǐ Fànwén 2008 (Tangut-Chinese Dictionary, 2 edition)
N1966 Nishida 1966 (Little Dictionary of Tangut)
S1968 Sofronov 1968 (Grammar of the Tangut Language)

24.2 Source reference file for Tangut Ideographs
The content linked to is a plain text file, using ISO/IEC 646-IRV characters with CARRIAGE RETURN/
LINE FEED as end of line mark. The file specifies the sources references data for all Tangut Ideographs.
The file also contains information concerning Radical-Stroke index. Each line in the text file contains the
following information organized in three fields:
st
• 1 field: UCS code point in the format (U+hhhhh)
nd
• 2 field: Tag indicating the type of information in the third field (kTGT_MergedSrc, kRSTUnicode)
rd
• 3 field: Information corresponding to the tag value specified by the second field. The Table 6 pro-
vides the format details.
The format definition for the first field uses ‘h’ as a hexadecimal unit. The three fields are delimited by a
CHARACTER TABULATION control character (0009). Comment lines, starting with a NUMBER SIGN (‘#’)
are informational only. Comment lines and blank lines in the text file should be ignored by any automatic
process which parses the data file to extract the source reference information.
Table 6: Format details of the tags used in the source reference file for Tangut ideographs
Tag Value Tag Third field format
description
kTGT_MergedSrc Tangut (H2004-A-dddd), (H2004-B-dddd), (L1986-dddd), (L1997-dddd),
source (l2006-dddd), (L2008-dddd), (L2008-dddd-dddd), (N1966-ddd-ddx),
(N1966-ddd-ddxx), or (S1968-dddd)
kRSTUnicode (d{1,3}.d{1,2})
Radical
Stroke index
The format definition uses ‘d’ as a decimal unit, and ‘x’ as an alphanumerical unit (0 to 9 and A to Z). Uppercase characters, digits
and all other symbols between parentheses appear as shown.
The Radical-Stroke index (d{1,3}.d{1,2}) is an informative field contains one construct: a radical index (one to three digits),
followed by a full stop, and ending by one or two digits for the stroke count. The stroke count includes all strokes for the character,
including the radical. The radical index is an entry on the Tangut Components block at 18800-18AFF, with the index ‘1’ represent-
ing the code point 18800.
NOTE – Concerning Lǐ Fànwén 2008 entries, the notation (L2008-dddd) denotes a character mapped to a single entry
while the notation (L2008-dddd-dddd) denotes a character mapped to two entries.
The following examples shows file entries for the Tangut ideographs 17000, 17001, 17A7A, and 17E91.
EXAMPLES
U+17000 kTGT_MergedSrc L2008-0008
U+17000 kRSTUnicode 1.6
U+17001 kTGT_MergedSrc L2008-0030
U+17001 kRSTUnicode 1.7

U+17A7A kTGT_MergedSrc N1966-204-10GG
U+17A7A kRSTUnicode 139.14

U+17E91 kTGT_MergedSrc L2008-2027-2350
U+17E91 kRSTUnicode 257.10
© ISO/IEC 2015 – All rights reserved 3

---------------------- Page: 7 ----------------------
Draft Amendment (DAM) 2
ISO/IEC 10646:2014/Amd.2:2015 (E)


Click on this highlighted text to access the reference file.
NOTE – The content is also available as a separate viewable file in the same file directory as this document. The file is named:
“TangutSrc.txt”.
24.3 Source reference presentation for Tangut Ideographs
The code charts for Tangut ideographs show one graphic symbol per character, along with additional in-
formation related to their identity such as radical, stroke count, and numeric value of the various source
references.
The graphic representation for the radical is shown immediately below the code point, along with the radi-
cal number and the stroke count. That stroke count includes the radical itself.
NOTE – The radicals shown in this representation are part of the Tangut Components block at 18800-18AFF.
The figure 7 shows an example for characters 177D0-177D2, 177E4-177E6, 177F8-177FA, 1780C-1780E,
and 17820-17822

Figure 7 – Chart presentation for TANGUT IDEOGRAPHS
Page 37 Clause 24 Character names and annotations
The clause is now renumbered 25. Insert one new sub-clause after 25.6 Character names for CJK Ideographs
and renumber the following sub-clause accordingly.
25.7  Character names for Tangut Ideographs
For Tangut ideographs the names are algorithmically constructed by appending their coded representa-
tion in their five hexadecimal digit notation to “TANGUT IDEOGRAPH-”. For example, the first Tangut
Ideograph has the name “TANGUT IDEOGRAPH-17000”.
Page 47 Sub-Clause 31.2 Code chart
The sub-clause is now renumbered 32.2. Replace the two last sentences starting with “Code charts” with:
Code charts for CJK ideographs and Tangut ideographs have different formats. See Clause 23 and 24.
Page 47-2380 Clause 31, Code charts and list of character names
The clause is now renumbered 32.
1. Modifications to existing blocks
Insert the additional character glyphs and names at the indicated positions in the blocks given below.
Plane 00
Arabic Extended-A
4  © ISO/IEC 2015 – All rights reserved

---------------------- Page: 8 ----------------------
Draft Amendment (DAM) 2 ISO/IEC 10646:2014/Amd.2:2015 (E)


Kannada
Malayalam
Currency Symbols
Miscellaneous Technical
Supplemental Punctuation
CJK Unified Ideographs
Latin Extended-D
Saurashtra
Plane 01
Ancient Greek Numbers
Khojki
Miscellaneous Symbols and Pictographs
Emoticons
Transport and Map Symbols
These blocks contain new characters and names at the following code positions:
08B6-08BD 08D4-08E2 0C80 0D4F 0D54-0D56 0D58-0D5E 0D76-0D78 20BE 23FB-23FE 2E43-
2E44 9FCD-9FD5 A7AE A8C5 1018D-1018E 1123E 1F32D-1F32F 1F37E-1F37F 1F3CF-1F3D3
1F3F8-1F3FF 1F4FF 1F54B-1F54E 1F643-1F644 1F6D0
updated graphic representation at the following code positions:
19B0-19C0, 19C8-19C9, 1301C, 3127, 3D1D, 4CA4, 7921, 2AD12, and 2B08F.
2. New blocks
Insert the following additional blocks.
Plane 00
Cyrillic Extended-C
Plane 01
Osage
Mongolian Supplement
Bhaiksuki
Marchen
Tamil Supplement
Ideographic Symbols and Punctuation
Tangut
Tangut Components
Glagolitic Supplement
Supplemental Symbols and Pictographs
These blocks add new characters and names at the following code positions:
1C80-1C88 104B0-104D3 104D8-104FB 11660-1166C 11C00-11C08 11C0A-11C36 11C38-
11C45 11C50-11C6C 11C70-11C8F 11C92-11CA7 11CA9-11CB6 11FC0-11FF5 11FFF 16FE0
17000-187ED 18800-18AF0 1E000-1E006 1E008-1E018 1E01B-1E021 1E023-1E024 1E026-
1E02A 1F910-1F918 1F980-1F984 1F9C0
© ISO/IEC 2015 – All rights reserved 5

---------------------- Page: 9 ----------------------
Draft Amendment (DAM) 2
ISO/IEC 10646:2014/Amd.2:2015 (E)

Page 2381, Annex A.1
Add a ‘*’ (for fixed collection) for the following collection:
1046 EMOTICONS
In the list of collection numbers and names, after
165 LATIN EXTENDED-E
insert a new entry as follows:
166 CYRILLIC EXTENDED-C 1C80-1C8F
In the list of collection numbers and names, after
1092 SUTTON SIGNWRITING
insert new entries as follows:
1093 OSAGE 104B0-104FF
1094 MONGOLIAN SUPPLEMENT 11660-1167F
1095 BHAIKSUKI 11C00-11C6F
1096 MARCHEN 11C70-11CBF
1097 TAMIL SUPPLEMENT 11FC0-11FFF
1098 IDEOGRAPHIC SYMBOLS AND PUNCTUATION
16FE0-16FFF
1099 TANGUT 17000-187EF
1100 TANGUT COMPONENTS 18800-18AFF
1101 GLAGOLITIC SUPPLEMENT 1E000-1E02F
1102 SUPPLEMENTAL SYMBOLS AND PICTOGRAPHS
1F900-1F9FF
In the list of collection numbers and names, after
314 UNICODE 7.0
insert a new entry as follows:
315 UNICODE 8.0 see A.6.14 *
In the list of collection numbers and names, after
372 JAPANESE IDEOGRAPHICS SUPPLEMENT
insert a new entry as follows:
373 JAPANESE IT VENDORS CONTEMPORARY IDEOGRAPHS-1993 see A.4.4 *
Page 2385, annex A.1
In the alphabetical list of keywords in Note 3, add the value 166 to the Cyrillic entry, the value 1094 to the Mon-
golian, the value 1097 to the Tamil entry, the value 1098 to the Punctuation entry and the Symbols entry re-
spectively, the value 1101 to the Glagolitic entry, the value 1102 to the Symbols entry, the values 2003, 2004,
and 2005 to the CJK entry; and insert the following entries:
Bhaiksuki 1095
Marchen 1096
Osage 1093
Tangut 1099 1100
6  © ISO/IEC 2015 – All rights reserved

---------------------- Page: 10 ----------------------
Draft Amendment (DAM) 2 ISO/IEC 10646:2014/Amd.2:2015 (E)


Page 2387, Annex A.2.1
In the list of blocks in the BMP, insert the following new entry:
CYRILLIC EXTENDED-C 1C80-1C8F
Page 2388, Annex A.2.2
In the list of blocks in the SMP, insert the following new entries:
OSAGE 104B0-104FF
MONGOLIAN SUPPLEMENT 11660-1667F
BHAIKSUKI 11C00-11C6F
MARCHEN 11C70-11CBF
TAMIL SUPPLEMENT 11FC0-11FFF
IDEOGRAPHIC SYMBOLS AND PUNCTUATION
16FE0-16FFF
TANGUT 17000-187EF
TANGUT COMPONENTS 18800-18AFF
GLAGOLITIC SUPPLEMENT 1E000-1E02F
SUPPLEMENTAL SYMBOLS AND PICTOGRAPHS
1F900-1F9FF
Page 2393, Annex A.4
Replace Annex A.4.2 and A.4.3 as follows:
A.4.2 371 JIS2004 IDEOGRAPHICS EXTENSION
The fixed collection 371 JIS2004 IDEOGRAPHICS EXTENSION consists of all level 3 and level 4 CJK
characters defined in JIS X 0213:2004. It contains 3 695 characters.
The code points of this collection are identified by 1 249 J3 Kanji J sources, 10 J3A Kanji J sources, and
2 436 J4 Kanji J sources in the Source Reference file for CJK Unified Ideographs (CJKSrc.txt). See 23.1
and 23.2 for details.
A.4.3 372 JAPANESE IDEOGRAPHICS SUPPLEMENT
The fixed collection 372 JAPANESE IDEOGRAPHICS SUPPLEMENT consists of all CJK characters de-
fined in JIS X 0212:1990. It contains 5 801 characters
NOTE 1 – Given its large size and the large number of sparse ranges, the collection is not specified by code point ranges in
this document but instead by a linked content. 2 742 characters are common between the collections 371 and 372.
The content linked to is a plain text file, using ISO/IEC 646-IRV characters with CARRIAGE RETURN/
LINE FEED as end of line mark, which specifies, after a 3-lines header, as many lines as characters in the
collection; each containing the following information in fixed length field:
• BMP code point (hhhh), normative.
The format definition uses ‘h’ as a hexadecimal unit. Digits between parentheses appear as shown.
Click on this highlighted text to access the reference file.
NOTE 2 – The content is also available as a separate viewable file in the same file directory as this document. The file is
named: “JIS-X-0212-1990.txt”.
A.4.4 373 JAPANESE IT VENDORS CONTEMPORARY IDEOGRAPHS-1993
The fixed collection 373 JAPANESE IT VENDORS CONTEMPORARY IDEOGRAPHS-1993 consists of all
CJK characters defined in the Unified Japanese IT Vendors Contemporary Ideographs, 1993 set. It con-
tains 660 characters
© ISO/IEC 2015 – All rights reserved 7

---------------------- Page: 11 ----------------------
Draft Amendment (DAM) 2
ISO/IEC 10646:2014/Amd.2:2015 (E)

NOTE 1 – Given its large size and the large number of sparse ranges, the collection is not specified by code point ranges in
this document but instead by a linked content.
The content linked to is a plain text file, using ISO/IEC 646-IRV characters with CARRIAGE RETURN/
LINE FEED as end of line mark, which specifies, after a 3-lines header, as many lines as characters in the
collection; each containing the following information in fixed length field:
• BMP code point (hhhh), normative.
The format definition uses ‘h’ as a hexadecimal unit. Digits between parentheses appear as shown.
Click on this highlighted text to access the reference file.
NOTE 2 – The content is also available as a separate viewable file in the same file directory as this document. The file is
named: “JA-IT-Vendors-1993.txt”.
Page 2409, Annex A.6
Add a new Annex A.6.14 as follows:
A.6.14 315 UNICODE 8.0
The fixed collection 315 UNICODE 8.0 is arranged by planes as follows.
Plane 00
18 00-0E 10-19 20-77 80-AA B0-F5
Row Values within row
19 00-1E 20-2B 30-3B 40 44-6D 70-74 80-AB B0-
00 20-7E A0-FF
C9 D0-DA DE-FF
01-02 0100-02FF
1A 00-1B 1E-5E 60-7C 7F-89 90-99 A0-AD B0-BE
03 00-77 7A-7F 84-8A 8C 8E-A1 A3-FF
1B 00-4B 50-7C 80-F3 FC-FF
04 00-FF
1C 00-37 3B-49 4D-7F C0-C7 D0-F6 F8-F9
05 00-2F 31-56 59-5F 61-87 89-8A 8D-8F 91-C7
1D 00-F5 FC-FF
D0-EA F0-F4
1E 00-FF
06 00-1C 1E-FF
1F 00-15 18-1D 20-45 48-4D 50-57 59 5B 5D 5F-
07 00-0D 0F-4A 4D-B1 C0-FA
7D 80-B4 B6-C4 C6-D3 D6-DB DD-EF F2-F4
08 00-2D 30-3E 40-5B 5E A0-B4 E3-FF
F6-FE
09 00-83 85-8C 8F-90 93-A8 AA-B0 B2 B6-B9 BC-
20 00-64 66-71 74-8E 90-9C A0-BE D0-F0
C4 C7-C8 CB-CE D7 DC-DD DF-E3 E6-FB
21 00-8B 90-FF
0A 01-03 05-0A 0F-10 13-28 2A-30 32-33 35-36
22 00-FF
38-39 3C 3E-42 47-48 4B-4D 51 59-5C 5E 66-
23 00-FA
75 81-83 85-8D 8F-91 93-A8 AA-B0 B2-B3 B5-
24 00-26 40-4A 60-FF
B9 BC-C5 C7-C9 CB-CD D0 E0-E3 E6-F1 F9
25 00-FF
0B 01-03 05-0C 0F-10 13-28 2A-30 32-33 35-39
26 00-FF
3C-44 47-48 4B-4D 56-57 5C-5D 5F-63 66-77
27 00-FF
82-83 85-8A 8E-90 92-95 99-9A 9C 9E-9F A3-
28-2A 2800-2AFF
A4 A8-AA AE-B9 BE-C2 C6-C8 CA-CD D0 D7
2B 00-73 76-95 98-B9 BD-C8 CA-D1 EC-EF
E6-FA
2C 00-2E 30-5E 60-F3 F9-FF
0C 00-03 05-0C 0E-10 12-28 2A-39 3D-44 46-48
2D 00-25 27 2D 30-67 6F-70 7F-96 A0-A6 A8-AE
4A-4D 55-56 58-5A 60-63 66-6F 78-7F 81-83
B0-B6 B8-BE C0-C6 C8-CE D0-D6 D8-DE E0-FF
85-8C 8E-90 92-A8 AA-B3 B5-B9 BC-C4 C6-C8
2E 00-42 80-99 9B-F3
CA-CD D5-D6 DE E0-E3 E6-EF F1-F2
2F 00-D5 F0-FB
0D 01-03 05-0C 0E-10 12-3A 3D-44 46-48 4A-4E
30 00-3F 41-96 99-FF
57 5F-63 66-75 79-7F 82-83 85-96 9A-B1 B3-
31 05-2D 31-8E 90-BA C0-E3 F0-FF
BB BD C0-C6 CA CF-D4 D6 D8-DF E6-EF F2-F4
32 00-1E 20-FE
0E 01-3A 3F-5B 81-82 84 87-88 8A 8D 94-97 99-
33 00-FF
9F A1-A3 A5 A7 AA-A8 AD-B9 BB-BD C0-C4 C6
34-4C 3400-4CFF
C8-CD D0-D9 DC-DF
4D 00-B5 C0-FF
0F 00-47 49-6C 71-97 99-BC BE-CC CE-DA
4E-9F 4E00-9FD5
10 00-C5 C7 CD D0-FF
A0-A3 A000-A3FF
11 00-FF
A4 00-8C 90-C6 D0-FF
12 00-48 4A-4D 50-56 58 5A-5D 60-88 8A-8D
A5 00-FF
90-B0 B2-B5 B8-BE C0 C2-C5 C8-D6 D8-FF
A6 00-2B 40-F7
13 00-10 12-15 18-5A 5D-7C 80-99 A0-F5 F8-FD
A7 00-AD B0-B7 F7-FF
14-15 1400-15FF
A8 00-2B 30-39 40-77 80-C4 CE-D9 E0-FD
16 00-9C A0-F8
A9 00-53 5F-7C 80-CD CF-D9 DE-FE
17 00-0C 0E-14 20-36 40-53 60-6C 6E-70 72-73
AA 00-36 40-4D 50-59 5C-C2 DB-F6
80-DD E0-E9 F0-F9
8  © ISO/IEC 2015 – All rights reserved

---------------------- Page: 12 ----------------------
Draft Amendment (DAM) 2 ISO/IEC 10646:2014/Amd.2:2015 (E)


AB 01-06 09-0E 11-16 20-26 28-2E 30-65 70-ED FB 00-06 13-17 1D-36 38-3C 3E 40-41 43-44 46-
F0-F9 C1 D3-FF
AC-D6 AC00-D6FF FC 00-FF
D7 00-A3 B0-C6 CB-FB FD 00-3F 50-8F 92-C7 F0-FD
E0-F8 E000-F8FF FE 00-19 20-52 54-66 68-6B 70-74 76-FC FF
F9 00-FF FF 01-BE C2-C7 CA-CF D2-D7 DA-DC E0-E6 E8-
FA 00-6D 70-D9 EE F9-FD

Plane 01
44-46 4400-4646
Row Values within row
68-69 6800-69FF
00 00-0B 0D-26 28-3A 3C-3D 3F-4D 50-5D 80-FA
6A 00-38 40-5E 60-69 6E-6F D0-ED F0-F5
01 00-02 07-33 37-8C 90-9B A0 D0-FD
6B 00-45 50-59 5B-61 63-77 7D-8F
02 80-9C A0-D0 E0-FB
6F 00-44 50-7E 8F-9F
03 00-23 30-4A 50-7A 80-9D 9F-C3 C8-D5
B0 00-01
04 00-9D A0-A9
BC 00-6A 70-7C 80-88 90-99 9C-A3
05 00-27 30-63 6F
D0 00-F5
06 00-FF
D1 00-26 29-E8
07 00-36 40-55 60-67
D2 00-45
08 00-05 08 0A-35 37-38 3C 3F-55 57-9E A7-AF
D3 00-56 60-71
E0-F2 F4-F5 FB-FF
D4 00-54 56-9C 9E-9F A2 A5-A6 A9-AC AE-B9 BB
09 00-1B 1F-39 3F 80-B7 BC-CF D2-FF
BD-C3 C5-FF
0A 00-03 05-06 0C-13 15-17 19-33 38-3A 3F-47
D5 00-05 07-0A 0D-14 16-1C 1E-39 3B-3E 40-44
50-58 60-9F C0-E6 EB-F6
46 4A-50 52-FF
0B 00-35 39-55 58-72 78-91 99-9C A9-AF
D6 00-A5 A8-FF
0C 00-48 80-B2 C0-F2 FA-FF
D7 00-CB CE-FF
0E 60-7E
D8-D9 D800-D9FF
10 00-4D 52-6F 7F-C1 D0-E8 F0-F9
DA 00-8B 9B-AF
11 00-34 36-43 50-76 80-CD D0-DF E1-F4
E8 00-C4 C7-D6
12 00-11 13-3D 80-86 88 8A-8D 8F-9D 9F-A9 B0-
EE 00-03 05-1F 21-22 24 27 29-32 34-37 39 3B
EA F0-F9
42 47 49 4B 4D-4F 51-52 54 57 59 5B 5D 5F
13 00-03 05-0C 0F-10 13-28 2A-30 32-33 35-39
61-62 64 67-6A 6C-72 74-77 79-7C 7E 80-89
3C-44 47-48 4B-4D 50 57 5D-63 66-6C 70-74
8B-9B A1-A3 A5-A9 AB-BB F0-F1
14 80-C7 D0-D9
F0 00-2B 30-93 A0-AE B1-BF C1-CF D1-F5
15 80-B5 B8-DD
F1 00-0C 10-2E 30-6B 70-9A E6-FF
16 00-44 50-59 80-B7 C0-C9
F2 00-02 10-3A 40-48 50-51
17 00-19 1D-2B 30-3F
F3 00-FF
18 A0-F2 FF
F4 00-FF
1A C0-F8
F5 00-79 7B-A3 A5-FF
20-22 2000-22FF
F6 00-D0 E0-EC F0-F3
23 00-99
F7 00-73 80-D4
24 00-6E 70-74 80-FF
F8 00-0B 10-47 50-59 60-87 90-AD
25 00-43
F9 10-18 80-84 C0
30-34 3000-342E

Plane 02
Row Values within row
00-A6 0000-A6D6
A7-B6 A700-B6FF
B7 00-34 40-FF
B8 00-1D 20
B9-CD B900-CDFF
CE 00-A1
F8-FA F800-FA1D

Plane 0E
Row Values within row
00 01 20-7F
01 00-EF

Plane 0F
Row Values within row
00-FF 0000-FFFD

Plane 10
© ISO/IEC 2015 – All rights reserved 9

---------------------- Page: 13 ----------------------
Draft Amendment (DAM) 2
ISO/IEC 10646:2014/Amd.2:2015 (E)

Row Values within row
00-FF 0000-FFFD

NOTE – The collection 315 UNICODE 8.0 can also be determined by using another fixed collection from A.1 and several
ranges of code points.
Plane 00-10
Collection number and name
314 UNICODE 7.0

Plane 00
Row Values within row
2B EC-EF
08 B3-B4 E3
9F CD-D5
0A F9
A6 9E
0C 5A A7 8F B2-B7
0D 5F A8 FC-FD
13 F5 F8-FD
AB 60-63 70-BF
20 BE FE 2E-2F
21 8A-8B

Plane 01
Row Values within row
25 00-43
08 E0-F2 F4-F5 FB-FF
44-46 4400-4646
09 BC-BD C0-CF D2-FF D1 DE-E8
0C 80-B2 C0-F2 FA-FF D8-D9 D800-D9FF
11 C9-CC DB-DF
DA 00-8B 9B-AF
12 80-86 88 8A-8D 8F-9D 9F-A9
F3 2D-2F 7E-7F CF-D3 F8-FF
13 00 50 F4 FF
15 CA-DD
F5 4B-4F
17 00-19 1D-2B 30-3F
F6 D0
23 99 F9 10-18 80-84 C0
24 80-FF

Plane 02
Row Values within row
B8 20-FF
B9-CD B900-CDFF
CE 00-A1

Page 2421, Annex G
In first paragraph, replace ‘and CJK ideographs’ with ‘, CJK ideographs, and Tangut ideographs’.
In the list of blocks, insert the following new block:
TANGUT
Insert each of the new character name entries at the appropriate position, ordered alphabetically by the charac-
ter name, in the list of character names in Annex G. These new names are provided in a machine-readable
format that is accessible as a link to this document.
Click on this highlighted text to access the file containing the new names.
NOTE – The content is also available as a separate viewable file in the same file directory as this docu-
ment. The file is named: “Am2names.txt”.
Page 2423, Annex I.2
Replace the text from the sub-clause, starting from the first paragraph until the first Note with the following text:
An IDS consist of an IDC followed by a fixed number of Description Component (DC) organized in
subgroups corresponding to script category, such as CJK ideographs or Tangut ideographs. An IDS
10  © ISO/IEC 2015 – All rights reserved

---------------------- Page: 14 ----------------------
Draft Amendment (DAM) 2 ISO/IEC 10646:2014/Amd.2:2015 (E)


should only use items belonging to a single subgroup, to clarify the script that it belongs to. The sub-
groups and their contents is as follows:
• CJK IDS, including DCs which may be of any one of the following:
• a coded CJK ideographs, which consists of any coded character from the CJK
UNIFIED IDEOGRAPHS block or the CJK COMPATIBILITY IDEOGRAPHS
blocks,
• a coded CJK radical, which consists of any coded character from the CJK
RADICALS blocks or the KANGXI RADICALS block,
• a coded stroke, which consists of any coded character from the CJK
STROKES block,
• the character FF1F FULL WIDTH QUESTION MARK to represent an otherwise
un-described DC,
• a private use character (as long as the interchanging parties have agreed that
the particular private use character represents a particular CJK ideograph or
component of a CJK ideograph),
• another CJK IDS.
• Tangut IDS, including DCs which may be of any one of the following:
• a Tangut Ideograph, which consists of any character from the TANGUT block,
• a Tangut component, which consists of any coded character from the TANGUT
COMPONENT block,
• a coded stroke, which consists of any coded character from the CJK
STROKES block,
• another Tangut IDS.
NOTE 1 – The above description implies that any IDS may be nested within another IDS in same subgroup.
Page 2453, Annex P
Insert new text after Table P.1:
The Japanese Industrial Standard JIS X 0213:2004 contains characters in its levels 3 and 4 which
were part of previous JIS standards. The set consists of 2828 characters, of which 2723 were part of
JIS X 0212-1990 (referred as J1 sources in this International Standard) and 85 were part of the ‘Uni-
fied Japanese IT Vendors Contemporary Ideographs, 1993’ (referred as JA in this International
Standard). Among these 2828 characters, 205 had their glyph representation slightly modified. These
2828 characters use the ‘J3B, J3C, J4B, and J4C’ notations in their prefix four source identification
as described in sub-clause 23.1.
Page 2459, Annex S.1.6
Insert a new note before the first note, adjusting the consequent not
...

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.