僧伽罗文 (Unicode区段)

僧伽罗文是一个位于基本多文种平面的Unicode区块,收录了用于书写僧伽罗语巴利语梵语僧伽罗文字符[3]。僧伽罗文区块的编码并不完全基于ISCII,导致其与基于ISCII编码的Unicode区块(如孟加拉文淡米尔文)不能完全对应。

僧伽罗文
Sinhala
范围U+0D80..U+0DFF
(128个码位)
平面基本多文种平面BMP
文字僧伽罗文
应用僧伽罗语
巴利语
梵语
已分配91个码位
未分配37个保留码位
统一码版本历史
3.080 (+80)
7.090 (+10)
13.091 (+1)
相关区块
僧伽罗文古数字
码表
点击此处
注释[1][2]

区块

僧伽罗文
Sinhala[1][2]
Unicode Consortium 官方码表 (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0D8x
U+0D9x
U+0DAx
U+0DBx
U+0DCx
U+0DDx
U+0DEx
U+0DFx
注释
1.^ 依据 Unicode 14.0
2.^ 灰色区域表示未被分配的码位

历史

下列与Unicode相关的文档记录了在僧伽罗文区块中定义特定字符的目的和过程:

版本 最终码位[a] 码位数 L2英语International Committee for Information Technology Standards ID WG2英语ISO/IEC JTC 1/SC 2 ID 文档
3.0 U+0D82..0D83, 0D85..0D96, 0D9A..0DB1, 0DB3..0DBB, 0DBD, 0DC0..0DC6, 0DCA, 0DCF..0DD4, 0DD6, 0DD8..0DDF, 0DF2..0DF4 80 L2/97-019 N1480 Ginige, S. L., Request to add Sinhalese to 10646 based on recent Sinhalese Standard (SLS 1134-1996), 1996-11-15 
L2/97-019.1 Sri Lanka standard Sinhala character code for information interchange 
L2/97-018 N1473R页面存档备份,存于互联网档案馆 Everson, Michael, Proposal for encoding the Sinhala script in ISO/IEC 10646 (revision 1), 1997-03-01 
N1532 Ross, Hugh McGregor, Comment on Sri Lanka Proposal for Sinhala Script, 1997-03-07 
L2/97-030 N1503 (pdf, doc页面存档备份,存于互联网档案馆)) Umamaheswaran, V. S.; Ksar, Mike, 8.6, Unconfirmed Minutes of WG 2 Meeting #32, Singapore; 1997-01-20--24, 1997-04-01 
L2/97-145 N1589 Everson, Michael, Mapping of Sinhala between ISO/IEC 10646 and SLS 1134, 1997-06-13 
N1585 Disanayaka, J. B., Towards Standard Sinhala Character Code, 1997-06-25 
N1584 Adams, Glenn, Feedback on Sinhala Script Proposals (N1480, N 1473R, etc.), 1997-06-30 
L2/97-158 N1613 Report of ad-hoc group on Sinhala encoding, 1997-07-02 
L2/97-288 N1603页面存档备份,存于互联网档案馆 Umamaheswaran, V. S., 8.7, Unconfirmed Meeting Minutes, WG 2 Meeting # 33, Heraklion, Crete, Greece, 20 June – 4 July 1997, 1997-10-24 
L2/98-319 N1896页面存档备份,存于互联网档案馆 Revised text of 10646-1/FPDAM 21, AMENDMENT 21: Sinhala, 1998-10-22 
L2/99-010 N1903 (pdf, html页面存档备份,存于互联网档案馆), doc页面存档备份,存于互联网档案馆)) Umamaheswaran, V. S., Minutes of WG 2 meeting 35, London, U.K.; 1998-09-21--25, 1998-12-30 
L2/10-164 Dias, Gihan, Sinhala Named Sequences, 2010-05-05 
L2/10-108 Moore, Lisa, Consensus 123-C32, UTC #123 / L2 #220 Minutes, 2010-05-19, Accept three Sinhala named sequences as provisional in Unicode 6.0... 
N3903 (pdf, doc页面存档备份,存于互联网档案馆)) M57.09 (Named USIs for Sinhala), Unconfirmed minutes of WG2 meeting 57, 2011-03-31 
7.0 U+0DE6..0DEF 10 L2/07-268 N3253 (pdf, doc页面存档备份,存于互联网档案馆)) Umamaheswaran, V. S., M50.28, Unconfirmed minutes of WG 2 meeting 50, Frankfurt-am-Main, Germany; 2007-04-24/27, 2007-07-26 
L2/08-007 Inclusion of archaic Sinhala numerals in the Sinhala character code range, 2008-01-07 
L2/08-068 Dias, Gihan, Archaic Sinhala Numerals, 2008-01-28 
L2/08-105 Observations on the Encoding of Archaic Sinhala Numerals in Unicode/UCS, 2008-02-05 
L2/08-003 Moore, Lisa, Archaic Sinhala Numerals, UTC #114 Minutes, 2008-02-14 
L2/10-165 Dias, Gihan, Preliminary Proposal to Encode Sinhala Digits and Numerals, 2010-05-03 
L2/10-312 Dias, Gihan, Proposal to Encode Sinhala Archaic Numerals and Numbers, 2010-08-10 
L2/10-337 N3888页面存档备份,存于互联网档案馆 Proposal to include Sinhala Numerals to the BMP and SMP of the UCS, 2010-08-19 
N3888-A Senaweera, L. N., Sri Lanka's proposal on Sinhala Numerals for inclusion in Information Technology - Universal Multiple Octet Coded Character Set, ISO/IEC 10646 : 2003, 2010-09-10 
N3888-B Unicode Character Properties of Sinhala Lith Illakkam (Sinhala Astrological Digits) and Sinhala Illakkam or Sinhala Archaic Numbers 
L2/10-433 Wijayawardhana, Harsha; et al, RE: Background information on the use of Sinhala Numerals (L2/10-337), 2010-10-23 
L2/10-416R Moore, Lisa, Sinhala Numerals, UTC #125 / L2 #222 Minutes, 2010-11-09 
N3903 (pdf, doc页面存档备份,存于互联网档案馆)) M57.14, Unconfirmed minutes of WG2 meeting 57, 2011-03-31 
13.0 U+0D81 1 L2/18-060 N4964页面存档备份,存于互联网档案馆 A, Srinidhi; A, Sridatta, Proposal to encode the CANDRABINDU for Sinhala, 2018-02-05 
L2/18-079 Anderson, Deborah, Feedback on Sinhala candrabindu (L2/18-060), 2018-03-21 
L2/18-168 Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai; Chapman, Chris; Cook, Richard, 10. Sinhala, Recommendations to UTC #155 April-May 2018 on Script Proposals, 2018-04-28 
L2/18-115 Moore, Lisa, D.4.1, UTC #155 Minutes, 2018-05-09 
L2/18-183 Moore, Lisa, B.1.3.1.1.1, UTC #156 Minutes, 2018-11-20 
N5020 (pdf, doc页面存档备份,存于互联网档案馆)) Umamaheswaran, V. S., 10.3.14, Unconfirmed minutes of WG 2 meeting 67, 2019-01-11 
L2/20-052 Pournader, Roozbeh, Changes to Identifier_Type of some Unicode 13.0 characters, 2020-01-15 
L2/20-015R Moore, Lisa, B.13.4 Changes to Identifier_Type of some Unicode 13.0 characters, Draft Minutes of UTC Meeting 162, 2020-05-14 
  1. ^ 提案中建议的码位和字符名称可能与最终结果不同。

参考资料

  1. ^ Unicode character database. The Unicode Standard. [2016-07-09]. (原始内容存档于2021-02-18). 
  2. ^ Enumerated Versions of The Unicode Standard. The Unicode Standard. [2016-07-09]. (原始内容存档于2020-12-19). 
  3. ^ Chapter 13: South and Central Asia-II - Other Modern Scripts (PDF). The Unicode Standard, Version 15.0. Mountain View, CA: Unicode, Inc. 2022-09 [2023-05-30]. ISBN 978-1-936213-32-0. (原始内容存档 (PDF)于2023-05-16).