17
1 JTC1/SC2/WG2 N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation Международная организация по стандартизации Doc Type: Working Group Document Title: Proposal for encoding the Chakma script in the UCS Source: Michael Everson Status: Individual Contribution Action: For consideration by JTC1/SC2/WG2 and UTC Date: 2009-02-12 1. Introduction. The Chakma people were in origin Tibeto-Burman, related to the Burmese. The language which they now speak is Indo-European, part of the Southeastern Bengali branch of Eastern Indo-Aryan. Its better-known closest relatives are Bengali, Assamese, Chittagonian, Bishnupriya, and Sylheti. It is spoken by 312,000 people in southeast Bangladesh near Chittagong City, and another 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh. Literacy in Chakma script is low. The script itself is also called Ajhā pāṭh, sometimes romanized Ojhopath. The Tanchangya lanugage is closely related to Chakma. An effort to develop the orthography is currently underway, and it appears that there may be additional letters, vowel signs, and tone marks added to cover this script. This is a subject for future standardization, as the orthography for Tanchangya is still under development and testing. There is a certain amount of glyph variation between the script as used in India and Bangladesh. Some fonts are rounder, similar to the style used in Myanmar; compare a similar variation in the Tai Tham script as used in Khün Tai. The glyphs used in this proposal are based on the Chadigang font, with some alterations toward more “generic” shapes for some characters. 2. Structure. Chakma is of the Brahmic type: the consonant letters contain an inherent vowel. Consonant clusters are written with conjunct characters, and a visible vowel killer shows the deletion of the inherent vowel when there is no conjunct. The inherent vowel in Chakma is , and the VOWEL SIGN A is used to shorten it. The other vowel signs function as expected in Brahmic scripts: = ka = kā + ˇ -a ki = kā + ˇ i ¯ = kā + ˇ ku = kā + ˇ -u ku ¯ = kā + ˇ ke = kā + ˇ -e ko = kā + ˇ -o kāi = kā + ˇ -āi kau = kā + ˇ -au koi = kā + ˇ -oi

JTC1/SC2/WG2 L2/09-xxx - Evertype N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set ... 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh

Embed Size (px)

Citation preview

Page 1: JTC1/SC2/WG2 L2/09-xxx - Evertype N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set ... 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh

1

JTC1SC2WG2 N3xxxL209-xxx2009-02-12

Universal Multiple-Octet Coded Character SetInternational Organization for StandardizationOrganisation Internationale de Normalisation

Международная организация по стандартизации

Doc Type Working Group DocumentTitle Proposal for encoding the Chakma script in the UCSSource Michael EversonStatus Individual ContributionAction For consideration by JTC1SC2WG2 and UTCDate 2009-02-12

1 Introduction The Chakma people were in origin Tibeto-Burman related to the Burmese Thelanguage which they now speak is Indo-European part of the Southeastern Bengali branch of EasternIndo-Aryan Its better-known closest relatives are Bengali Assamese Chittagonian Bishnupriya andSylheti It is spoken by 312000 people in southeast Bangladesh near Chittagong City and another176000 in India in Mizoram Assam Tripura and Arunachal Pradesh Literacy in Chakma script is lowThe script itself is also called Ajhā pāṭh sometimes romanized Ojhopath The Tanchangyalanugage is closely related to Chakma An effort to develop the orthography is currently underway and itappears that there may be additional letters vowel signs and tone marks added to cover this script Thisis a subject for future standardization as the orthography for Tanchangya is still under development andtesting There is a certain amount of glyph variation between the script as used in India and BangladeshSome fonts are rounder similar to the style used in Myanmar compare a similar variation in the TaiTham script as used in Khuumln Tai The glyphs used in this proposal are based on the Chadigang font withsome alterations toward more ldquogenericrdquo shapes for some characters

2 Structure Chakma is of the Brahmic type the consonant letters contain an inherent vowel Consonantclusters are written with conjunct characters and a visible vowel killer shows the deletion of the inherentvowel when there is no conjunct The inherent vowel in Chakma is -ā and the VOWEL SIGN A is used toshorten it The other vowel signs function as expected in Brahmic scripts

kā = kā

ka = kā + -a ki = kā + i kı = kā + ˇ -ī ku = kā + -u ku = kā + -ū ke = kā + ˇ -e

ko = kā + -o kāi = kā + -āi

kau = kā + -au

koi = kā + -oi

kaṁ = kā + ˇ kaṃ = kā + kaḥ = kā + k = kā + ˇ MAAYAA

One of the interesting features of Chakma writing is that CANDRABINDU (cānaphupudā) can be usedtogether with ANUSVARA (ekaphudā) and VISARGA (dviphudā)

aḥṁ = ā + ḥ + ˇ ṁ aṃṁ = ā + ṃ + ˇ ṁ uṃṁ = u + ṃ + ˇ ṁ muṁ = mā + u + ˇ ṁ

3 Consonants with killed vowels and conjunct consonants Like other Brahmic scripts Chakmamakes use of the MAAYAA (killer) to invoke conjoined consonants In the past practice was much morecommon than it is today Like the Myanmar script Chakma is encoded with two vowel-killing charactersin order to conform to modern user expectations As shown above most letters have their vowels killedwith the use of the explicit MAAYAA character

k = kā + ˇ MAAYAA

In 2001 an orthographic reform was recommended in the book Cāṅmā pattham pāt which would limit thestandard repertoire of conjuncts to those composed with the five letters yā rā lā wā and nāThe four here are the most widely-accepted repertoire of conjuncts

ya X + VIRAMA + yā

- - - -

ra X + VIRAMA + rā

- - - -

la X + VIRAMA + lā

- - - -

wa X + VIRAMA + wā

- - - -

2

No separate conjunct forms of subjoined full-form -yā -rā -lā or -wā appear to exist

na X + VIRAMA + nā

- - - -

An additional conjunct the -na conjunct is exemplary of the orthographic shift which has taken place inChakma While some writers would indeed write kakna as most now would probably expect it tobe written as As with Myanmar and Meetei Mayek encoding a visible killer for modern usersalongside an explicit conjoin-former permits the user to make specific choices about spelling more easilyBoth the Myanmar encoding model and the Devanagari encoding model have been explained to the usercommunity and feedback is that they Myanmar model fits the script better (This is little surpriseconsidering the close relationship between the Myanmar and Chakma scripts)

The 2004 book Phadagaṅ shows examples of the five conjuncts above together alongside conjunctsformed with bā mā and hā These are all formed by simple subjoining

ba X + VIRAMA + bā

- - - -

ma X + VIRAMA + mā

- - - -

ha X + VIRAMA + hā

- - - -

In the 1982 book Cāṅmār āg pudhi a much wider range of conjunct pairs is shown some of them withfairly complicated glyphs

kkā = kā + VIRAMA + kā

kṭā = kā + VIRAMA + ṭā ktā = kā + VIRAMA + tā kmā = kā + VIRAMA + mā

kcā = kā + VIRAMA + cā (conjunct shows old-style glyph)

ṅkā = ṅā + VIRAMA + kā

ṅgā = ṅā + VIRAMA + gā

3

ccā = cā + VIRAMA + cā (conjunct shows old-style glyph)

cchā = cā + VIRAMA + chā (conjunct shows old-style glyph)

ntildecā = ntildeā + VIRAMA + cā (conjunct shows old-style glyph)

ntildejā = ntildeā + VIRAMA + jā ntildejhā = ntildeā + VIRAMA + jhā

ṭṭā = ṭā + VIRAMA + ṭā ttā = tā + VIRAMA + tā tmā = tā + VIRAMA + mā

tthā = tā + VIRAMA + thā

ddā = dā + VIRAMA + dā

ddhā = dā + VIRAMA + dhā

ntā = nā + VIRAMA + tā nthā = nā + VIRAMA + thā

nmā = nā + VIRAMA + mā

ppā = pā + VIRAMA + pā

bbā = bā + VIRAMA + bā

mmā = mā + VIRAMA + mā

jjā = jā + VIRAMA + jā lkā = lā + VIRAMA + kā

lgā = lā + VIRAMA + gā

llā = lā + VIRAMA + lā lṭā = lā + VIRAMA + ṭā lpā = lā + VIRAMA + pā

schā = sā + VIRAMA + chā (conjunct shows old-style glyph)

sṭā = sā + VIRAMA + ṭā skā = sā + VIRAMA + kā

spā = sā + VIRAMA + pā

smā = sā + VIRAMA + mā

hmā = hā + VIRAMA + mā

4

The implication of this variety for implementors is simply one of how much conjunct support they wishto build into their fonts It would seem prudent to support the eight conjuncts seen in publications since2001 The 1982 style font would be considered archaic note how it differs in writing kmā tmā nmā bbā mmā llā smā and hmā which would be written kmā tmā nmā bbā mmā llā smā and hmā according to more recent sources This distinction isstylistic and not orthographic In Chakma the encoding model supports conjunct behaviour and a fontwithout any of these conjuncts would render them with the visible VIRAMA kmā tmā nmā bbā mmā llā smā and hmā To reiterate one can include as manyconjuncts as one wishes in a font but those included in the lists above were listed because they appearedin printed sources which were available

4 Independent vowels Four independent vowels exist a i u and e Other vowels in initialposition are formed by adding the vowel sign to a as in ī ū ai oi Some modern writers aregeneralizing this spelling in i u and e

5 Dependent vowels Independent vowel signs have been encoded according to their phonetic value notaccording to their glyph fragments Thus ṁ ANUSVARA and ḥ VISARGA are distinguished u and ūare distinguished ai and oi are distinguished and o and au are distinguished None of thesewould be equivalent to strings of characters (so ū is not u + u etc)

6 Collating order As an Indo-European language the standard Brahmic sorting order applies toChakma

7 Character names Consonant letter names use the typical Brahmic transliteration used in the UCSChakma letters have a descriptive name followed by a traditional Brahmic consonant Both are used herein the character names

8 Punctuation and digits Alongside a single and double danda punctuation Chakma has a uniquequestion mark and a section sign PHULACIHR There is some variation in the glyphs for the PHULICIHRsome looking like flowers or leaves A set of digits exists and is encoded although Bengali digits are alsoused The Tanchangya use Myanmar digits

9 Linebreaking Letters and digits behave as in Bengali

10 Unicode Character Properties

11100CHAKMA SIGN CANDRABINDUMn0NSMN11101CHAKMA SIGN ANUSVARAMn0NSMN11102CHAKMA SIGN VISARGAMc0LN11103CHAKMA LETTER AALo0LN11104CHAKMA LETTER ILo0LN11105CHAKMA LETTER ULo0LN11106CHAKMA LETTER ELo0LN11107CHAKMA LETTER KAALo0LN11108CHAKMA LETTER KHAALo0LN11109CHAKMA LETTER GAALo0LN1110ACHAKMA LETTER GHAALo0LN1110BCHAKMA LETTER NGAALo0LN1110CCHAKMA LETTER CAALo0LN1110DCHAKMA LETTER CHAALo0LN1110ECHAKMA LETTER JAALo0LN1110FCHAKMA LETTER JHAALo0LN11110CHAKMA LETTER NYAALo0LN11111CHAKMA LETTER TTAALo0LN11112CHAKMA LETTER TTHAALo0LN11113CHAKMA LETTER DDAALo0LN11114CHAKMA LETTER DDHAALo0LN11115CHAKMA LETTER NNAALo0LN11116CHAKMA LETTER TAALo0LN11117CHAKMA LETTER THAALo0LN

5

11118CHAKMA LETTER DAALo0LN11119CHAKMA LETTER DHAALo0LN1111ACHAKMA LETTER NAALo0LN1111BCHAKMA LETTER PAALo0LN1111CCHAKMA LETTER PHAALo0LN1111DCHAKMA LETTER BAALo0LN1111ECHAKMA LETTER BHAALo0LN1111FCHAKMA LETTER MAALo0LN11120CHAKMA LETTER YYAALo0LN11121CHAKMA LETTER YAALo0LN11122CHAKMA LETTER RAALo0LN11123CHAKMA LETTER LAALo0LN11124CHAKMA LETTER WAALo0LN11125CHAKMA LETTER SAALo0LN11126CHAKMA LETTER HAALo0LN11127CHAKMA VOWEL SIGN AMn0NSMN11128CHAKMA VOWEL SIGN IMn0NSMN11129CHAKMA VOWEL SIGN IIMn0NSMN1112ACHAKMA VOWEL SIGN UMn0NSMN1112BCHAKMA VOWEL SIGN UUMn0NSMN1112CCHAKMA VOWEL SIGN EMc0LN1112DCHAKMA VOWEL SIGN AIMn0NSMN1112ECHAKMA VOWEL SIGN OMn0NSMN1112FCHAKMA VOWEL SIGN AUMn0NSMN11130CHAKMA VOWEL SIGN OIMn0NSMN11131CHAKMA VIRAMAMn9NSMN11132CHAKMA MAAYYAAMn0NSMN11133CHAKMA DANDAPo0LN11134CHAKMA DOUBLE DANDAPo0LN11135CHAKMA QUESTION MARKPo0LN11136CHAKMA DIGIT ZERONd0L000N11137CHAKMA DIGIT ONENd0L111N11138CHAKMA DIGIT TWONd0L222N11139CHAKMA DIGIT THREENd0L333N1113ACHAKMA DIGIT FOURNd0L444N1113BCHAKMA DIGIT FIVENd0L555N1113CCHAKMA DIGIT SIXNd0L666N1113DCHAKMA DIGIT SEVENNd0L777N1113ECHAKMA DIGIT EIGHTNd0L888N1113FCHAKMA DIGIT NINENd0L999N11140CHAKMA PHULACIHRPo0LN

11 BibliographyCāṅmā Cirajyoti and Maṅgal Cāṅgmā 1982 Cāṅmār āg pudhi (Chakma primer) Rāṅamāṭi

Cāṅmābhāṣā Prakāśanā Pariṣad Khisa Bhagadatta 2001 Cāṅmā pattham pāt = Chakma primer Rāṅamāṭi Tribal Cultural Institute

(TCI)Singā 2004 Phagadāṅ

12 Acknowledgements This project was made possible in part by a grant from the US NationalEndowment for the Humanities which funded the Universal Scripts Project (part of the Script EncodingInitiative at UC Berkeley) in respect of the Chakma encoding Any views findings conclusions orrecommendations expressed in this publication do not necessarily reflect those of the NationalEndowment of the Humanities

6

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-12 7

1114FChakma11100

1110 1111 1112 1113 1114

$69888

$ 69889

$ 69890

69891

69892

69893

69894

69895

69896

69897

69898

69899

69900

69901

69902

69903

69904

69905

69906

69907

69908

69909

69910

69911

69912

69913

69914

69915

69916

69917

69918

69919

69920

69921

69922

69923

69924

69925

69926

$ 69927

$ 69928

$69929

$ 69930

$ 69931

$ 69932

$ 69933

$ 69934

$ 69935

$69936

69937

$69938

69939

69940

69941

69942

69943

69944

69945

69946

69947

69948

69949

69950

69951

6995211100

11101

11102

11103

11104

11105

11106

11107

11108

11109

1110A

1110B

1110C

1110D

1110E

1110F

11110

11111

11112

11113

11114

11115

11116

11117

11118

11119

1111A

1111B

1111C

1111D

1111E

1111F

11120

11121

11122

11123

11124

11125

11126

11127

11128

11129

1112A

1112B

1112C

1112D

1112E

1112F

11130

11131

11132

11133

11134

11135

11136

11137

11138

11139

1113A

1113B

1113C

1113D

1113E

1113F

11140

0

1

2

3

4

5

6

7

8

9

A

B

C

D

E

F

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-128

1113AChakma11100

1111D 69917 CHAKMA LETTER BAA= ubaramuyaa baa

1111E 69918 CHAKMA LETTER BHAA= ciraddaalyaa bhaa

1111F 69919 CHAKMA LETTER MAA= bugatpadalaa maa

11120 69920 CHAKMA LETTER YYAA= cimayyaa yyaa

11121 69921 CHAKMA LETTER YAA= jilyaa yaa

11122 69922 CHAKMA LETTER RAA= dvidaayyaa raa

11123 69923 CHAKMA LETTER LAA= talamuyaa laa

11124 69924 CHAKMA LETTER WAA= bajhonyaa waa

11125 69925 CHAKMA LETTER SAA= bhudibukyaa saa

11126 69926 CHAKMA LETTER HAA= ubaramuyaa haa

Dependent vowel signs11127 $69927 CHAKMA VOWEL SIGN A

= ubaratulyaa a

11128 $69928 CHAKMA VOWEL SIGN I= bahryaa i

11129 $69929 CHAKMA VOWEL SIGN II= baaniiphadaa ii

1112A $69930 CHAKMA VOWEL SIGN U= ekattaana u

1112B $69931 CHAKMA VOWEL SIGN UU= dvittaana uu

1112C $69932 CHAKMA VOWEL SIGN E= ekaara e

1112D $69933 CHAKMA VOWEL SIGN AI= delabhaanga ai

1112E $69934 CHAKMA VOWEL SIGN O= okaara o

1112F $69935 CHAKMA VOWEL SIGN AU= aukaara au

11130 $69936 CHAKMA VOWEL SIGN OI= oikaara oi

Various signs11131 69937 CHAKMA VIRAMA

bull used to form conjuncts

rarr 1039 myanmar sign virama

11132 $69938 CHAKMA MAAYYAA

bull killer

rarr 103A $ myanmar sign asat

11133 69939 CHAKMA DANDA= ekacilyaa

11134 69940 CHAKMA DOUBLE DANDA= dvicilyaa

11135 69941 CHAKMA QUESTION MARK= pujhaar

Digits11136 69942 CHAKMA DIGIT ZERO

11137 69943 CHAKMA DIGIT ONE

11138 69944 CHAKMA DIGIT TWO

11139 69945 CHAKMA DIGIT THREE

1113A 69946 CHAKMA DIGIT FOUR

Various signs11100 $69888 CHAKMA SIGN CANDRABINDU

= caanaphupudaa

11101 $69889 CHAKMA SIGN ANUSVARA= ekaphudaa

11102 $69890 CHAKMA SIGN VISARGA= dviphudaa

Independent vowels11103 69891 CHAKMA LETTER AA

= pichapujhaa aa

11104 69892 CHAKMA LETTER I= delabhaangagaa i

11105 69893 CHAKMA LETTER U= bacacu u

11106 69894 CHAKMA LETTER E= lejaubaa e

Consonants11107 69895 CHAKMA LETTER KAA

= cucyaangyaa kaa

11108 69896 CHAKMA LETTER KHAA= grajaangyaa khaa

11109 69897 CHAKMA LETTER GAA= caandyaa gaa

1110A 69898 CHAKMA LETTER GHAA= tinaddaalyaa ghaa

1110B 69899 CHAKMA LETTER NGAA= cilaama ngaa

1110C 69900 CHAKMA LETTER CAA= dvibhalyaa caa

1110D 69901 CHAKMA LETTER CHAA= majaraa chaa

1110E 69902 CHAKMA LETTER JAA= dvipadalaa haa

1110F 69903 CHAKMA LETTER JHAA= uraauraa jhaa

11110 69904 CHAKMA LETTER NYAA= silaacyaa nyaa

11111 69905 CHAKMA LETTER TTAA= dviyaadaat ttaa

11112 69906 CHAKMA LETTER TTHAA= phudaadviyaat tthaa

11113 69907 CHAKMA LETTER DDAA= aadudaangaat ddaa

11114 69908 CHAKMA LETTER DDHAA= lejabharaat ddhaa

11115 69909 CHAKMA LETTER NNAA= pettttuyaa nnaa

11116 69910 CHAKMA LETTER TAA= ghangadaat taa

11117 69911 CHAKMA LETTER THAA= jagadaat thaa

11118 69912 CHAKMA LETTER DAA= dolaniit daa

11119 69913 CHAKMA LETTER DHAA= talamuyaat dhaa

1111A 69914 CHAKMA LETTER NAA= phaarabaanyaa naa

1111B 69915 CHAKMA LETTER PAA= paalyaa paa

1111C 69916 CHAKMA LETTER PHAA= ubaraphudaa phaa

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-12 9

11140Chakma1113B

1113B 69947 CHAKMA DIGIT FIVE

1113C 69948 CHAKMA DIGIT SIX

1113D 69949 CHAKMA DIGIT SEVEN

1113E 69950 CHAKMA DIGIT EIGHT

1113F 69951 CHAKMA DIGIT NINE

Punctuation11140 69952 CHAKMA PHULACIHR

= section sign

Figure 1 Chakma chart from Griersonrsquos Linguistic Survey of India 1903

10

Figure 2 Charts taken from a paper written by Mr Sugata Chakma of the Tribal Cultural Institute onldquothe Primary classification of languagesrdquo

Figure 3 Example of poetry from the book Phagadāṅ 2004

11

Figure 4 Alphabet chart from Khisa 2001

12

Figure 5 Chart of vowel signs and conjuncts from Khisa 2001

13

Figure 6 Chart with old-style conjuncts from Cāṅmā 1982

14

Figure 7 Chart with punctuation and digits from Cāṅmā 1982

15

A Administrative1 TitlePro po s al fo r enco di ng the Chakma s cri pt i n the UCS2 Requesterrsquos nameMi chael Ev ers o n3 Requester type (Member bodyLiaisonIndividual contribution)Indi v i dual co ntri buti o n4 Submission date2 0 0 8 -0 8 -2 85 Requesterrsquos reference (if applicable)6 Choose one of the following6a This is a complete proposalNo 6b More information will be provided laterYes

B Technical ndash General1 Choose one of the following1a This proposal is for a new script (set of characters)Yes 1b Proposed name of scriptChakma1c The proposal is for addition of character(s) to an existing blockNo 1d Name of the existing block2 Number of characters in proposal6 3 3 Proposed category (A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols)Categ o ry A4a Is a repertoire including character names providedYes 4b If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP documentYes 4c Are the character shapes attached in a legible form suitable for reviewYes 5a Who will provide the appropriate computerized font (ordered preference True Type or PostScript format) for publishing thestandardMi chael Ev ers o n5b If available now identify source(s) for the font (include address e-mail ftp-site etc) and indicate the tools usedMi chael Ev ers o n Fo nto g rapher6a Are references (to other character sets dictionaries descriptive texts etc) providedYes 6b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attachedYes 7 Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searchingindexing transliteration etc (if yes please enclose information)Yes 8 Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assistin correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties areCasing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contextsCompatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorgfor such informat ion on o ther scrip ts Also see Unicode Character Database h t tp www unicode org Publ icUNIDATAUnicodeCharacterDatabasehtml and associated Unicode Technical Reports for information needed for consideration by the UnicodeTechnical Committee for inclusion in the Unicode StandardSee abo v e

C Technical ndash Justification1 Has this proposal for addition of character(s) been submitted before If YES explainNo 2a Has contact been made to members of the user community (for example National Body user groups of the script or charactersother experts etc)Yes 2b If YES with whom2c If YES available relevant documents3 Information on the user community for the proposed characters (for example size demographics information technology use orpublishing use) is includedPeo pl e l i v i ng i n Bang l ades h and i n Indi a

16

4a The context of use for the proposed characters (type of use common or rare)Co mmo n4b Reference5a Are the proposed characters in current use by the user communityYes 5b If YES whereIn Bang l ades h and i n Indi a6a After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMPYes 6b If YES is a rationale providedYes 6c If YES referenceCo ntempo rary us e and acco rdance wi th the Ro admap7 Should the proposed characters be kept together in a contiguous range (rather than being scattered)Yes 8a Can any of the proposed characters be considered a presentation form of an existing character or character sequenceNo 8b If YES is a rationale for its inclusion provided8c If YES reference9a Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposedcharactersNo 9b If YES is a rationale for its inclusion provided9c If YES reference10a Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing characterNo 10b If YES is a rationale for its inclusion provided10c If YES reference11a Does the proposal include use of combining characters andor use of composite sequences (see clauses 412 and 414 in ISOIEC10646-1 2000)No 11b If YES is a rationale for such use provided11c If YES reference11d Is a list of composite sequences and their corresponding glyph images (graphic symbols) providedNo 11e If YES reference12a Does the proposal contain characters with any special properties such as control function or similar semanticsNo 12b If YES describe in detail (include attachment if necessary)13a Does the proposal contain any Ideographic compatibility character(s)No 13b If YES is the equivalent corresponding unified ideographic character(s) identified

17

Page 2: JTC1/SC2/WG2 L2/09-xxx - Evertype N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set ... 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh

kaṁ = kā + ˇ kaṃ = kā + kaḥ = kā + k = kā + ˇ MAAYAA

One of the interesting features of Chakma writing is that CANDRABINDU (cānaphupudā) can be usedtogether with ANUSVARA (ekaphudā) and VISARGA (dviphudā)

aḥṁ = ā + ḥ + ˇ ṁ aṃṁ = ā + ṃ + ˇ ṁ uṃṁ = u + ṃ + ˇ ṁ muṁ = mā + u + ˇ ṁ

3 Consonants with killed vowels and conjunct consonants Like other Brahmic scripts Chakmamakes use of the MAAYAA (killer) to invoke conjoined consonants In the past practice was much morecommon than it is today Like the Myanmar script Chakma is encoded with two vowel-killing charactersin order to conform to modern user expectations As shown above most letters have their vowels killedwith the use of the explicit MAAYAA character

k = kā + ˇ MAAYAA

In 2001 an orthographic reform was recommended in the book Cāṅmā pattham pāt which would limit thestandard repertoire of conjuncts to those composed with the five letters yā rā lā wā and nāThe four here are the most widely-accepted repertoire of conjuncts

ya X + VIRAMA + yā

- - - -

ra X + VIRAMA + rā

- - - -

la X + VIRAMA + lā

- - - -

wa X + VIRAMA + wā

- - - -

2

No separate conjunct forms of subjoined full-form -yā -rā -lā or -wā appear to exist

na X + VIRAMA + nā

- - - -

An additional conjunct the -na conjunct is exemplary of the orthographic shift which has taken place inChakma While some writers would indeed write kakna as most now would probably expect it tobe written as As with Myanmar and Meetei Mayek encoding a visible killer for modern usersalongside an explicit conjoin-former permits the user to make specific choices about spelling more easilyBoth the Myanmar encoding model and the Devanagari encoding model have been explained to the usercommunity and feedback is that they Myanmar model fits the script better (This is little surpriseconsidering the close relationship between the Myanmar and Chakma scripts)

The 2004 book Phadagaṅ shows examples of the five conjuncts above together alongside conjunctsformed with bā mā and hā These are all formed by simple subjoining

ba X + VIRAMA + bā

- - - -

ma X + VIRAMA + mā

- - - -

ha X + VIRAMA + hā

- - - -

In the 1982 book Cāṅmār āg pudhi a much wider range of conjunct pairs is shown some of them withfairly complicated glyphs

kkā = kā + VIRAMA + kā

kṭā = kā + VIRAMA + ṭā ktā = kā + VIRAMA + tā kmā = kā + VIRAMA + mā

kcā = kā + VIRAMA + cā (conjunct shows old-style glyph)

ṅkā = ṅā + VIRAMA + kā

ṅgā = ṅā + VIRAMA + gā

3

ccā = cā + VIRAMA + cā (conjunct shows old-style glyph)

cchā = cā + VIRAMA + chā (conjunct shows old-style glyph)

ntildecā = ntildeā + VIRAMA + cā (conjunct shows old-style glyph)

ntildejā = ntildeā + VIRAMA + jā ntildejhā = ntildeā + VIRAMA + jhā

ṭṭā = ṭā + VIRAMA + ṭā ttā = tā + VIRAMA + tā tmā = tā + VIRAMA + mā

tthā = tā + VIRAMA + thā

ddā = dā + VIRAMA + dā

ddhā = dā + VIRAMA + dhā

ntā = nā + VIRAMA + tā nthā = nā + VIRAMA + thā

nmā = nā + VIRAMA + mā

ppā = pā + VIRAMA + pā

bbā = bā + VIRAMA + bā

mmā = mā + VIRAMA + mā

jjā = jā + VIRAMA + jā lkā = lā + VIRAMA + kā

lgā = lā + VIRAMA + gā

llā = lā + VIRAMA + lā lṭā = lā + VIRAMA + ṭā lpā = lā + VIRAMA + pā

schā = sā + VIRAMA + chā (conjunct shows old-style glyph)

sṭā = sā + VIRAMA + ṭā skā = sā + VIRAMA + kā

spā = sā + VIRAMA + pā

smā = sā + VIRAMA + mā

hmā = hā + VIRAMA + mā

4

The implication of this variety for implementors is simply one of how much conjunct support they wishto build into their fonts It would seem prudent to support the eight conjuncts seen in publications since2001 The 1982 style font would be considered archaic note how it differs in writing kmā tmā nmā bbā mmā llā smā and hmā which would be written kmā tmā nmā bbā mmā llā smā and hmā according to more recent sources This distinction isstylistic and not orthographic In Chakma the encoding model supports conjunct behaviour and a fontwithout any of these conjuncts would render them with the visible VIRAMA kmā tmā nmā bbā mmā llā smā and hmā To reiterate one can include as manyconjuncts as one wishes in a font but those included in the lists above were listed because they appearedin printed sources which were available

4 Independent vowels Four independent vowels exist a i u and e Other vowels in initialposition are formed by adding the vowel sign to a as in ī ū ai oi Some modern writers aregeneralizing this spelling in i u and e

5 Dependent vowels Independent vowel signs have been encoded according to their phonetic value notaccording to their glyph fragments Thus ṁ ANUSVARA and ḥ VISARGA are distinguished u and ūare distinguished ai and oi are distinguished and o and au are distinguished None of thesewould be equivalent to strings of characters (so ū is not u + u etc)

6 Collating order As an Indo-European language the standard Brahmic sorting order applies toChakma

7 Character names Consonant letter names use the typical Brahmic transliteration used in the UCSChakma letters have a descriptive name followed by a traditional Brahmic consonant Both are used herein the character names

8 Punctuation and digits Alongside a single and double danda punctuation Chakma has a uniquequestion mark and a section sign PHULACIHR There is some variation in the glyphs for the PHULICIHRsome looking like flowers or leaves A set of digits exists and is encoded although Bengali digits are alsoused The Tanchangya use Myanmar digits

9 Linebreaking Letters and digits behave as in Bengali

10 Unicode Character Properties

11100CHAKMA SIGN CANDRABINDUMn0NSMN11101CHAKMA SIGN ANUSVARAMn0NSMN11102CHAKMA SIGN VISARGAMc0LN11103CHAKMA LETTER AALo0LN11104CHAKMA LETTER ILo0LN11105CHAKMA LETTER ULo0LN11106CHAKMA LETTER ELo0LN11107CHAKMA LETTER KAALo0LN11108CHAKMA LETTER KHAALo0LN11109CHAKMA LETTER GAALo0LN1110ACHAKMA LETTER GHAALo0LN1110BCHAKMA LETTER NGAALo0LN1110CCHAKMA LETTER CAALo0LN1110DCHAKMA LETTER CHAALo0LN1110ECHAKMA LETTER JAALo0LN1110FCHAKMA LETTER JHAALo0LN11110CHAKMA LETTER NYAALo0LN11111CHAKMA LETTER TTAALo0LN11112CHAKMA LETTER TTHAALo0LN11113CHAKMA LETTER DDAALo0LN11114CHAKMA LETTER DDHAALo0LN11115CHAKMA LETTER NNAALo0LN11116CHAKMA LETTER TAALo0LN11117CHAKMA LETTER THAALo0LN

5

11118CHAKMA LETTER DAALo0LN11119CHAKMA LETTER DHAALo0LN1111ACHAKMA LETTER NAALo0LN1111BCHAKMA LETTER PAALo0LN1111CCHAKMA LETTER PHAALo0LN1111DCHAKMA LETTER BAALo0LN1111ECHAKMA LETTER BHAALo0LN1111FCHAKMA LETTER MAALo0LN11120CHAKMA LETTER YYAALo0LN11121CHAKMA LETTER YAALo0LN11122CHAKMA LETTER RAALo0LN11123CHAKMA LETTER LAALo0LN11124CHAKMA LETTER WAALo0LN11125CHAKMA LETTER SAALo0LN11126CHAKMA LETTER HAALo0LN11127CHAKMA VOWEL SIGN AMn0NSMN11128CHAKMA VOWEL SIGN IMn0NSMN11129CHAKMA VOWEL SIGN IIMn0NSMN1112ACHAKMA VOWEL SIGN UMn0NSMN1112BCHAKMA VOWEL SIGN UUMn0NSMN1112CCHAKMA VOWEL SIGN EMc0LN1112DCHAKMA VOWEL SIGN AIMn0NSMN1112ECHAKMA VOWEL SIGN OMn0NSMN1112FCHAKMA VOWEL SIGN AUMn0NSMN11130CHAKMA VOWEL SIGN OIMn0NSMN11131CHAKMA VIRAMAMn9NSMN11132CHAKMA MAAYYAAMn0NSMN11133CHAKMA DANDAPo0LN11134CHAKMA DOUBLE DANDAPo0LN11135CHAKMA QUESTION MARKPo0LN11136CHAKMA DIGIT ZERONd0L000N11137CHAKMA DIGIT ONENd0L111N11138CHAKMA DIGIT TWONd0L222N11139CHAKMA DIGIT THREENd0L333N1113ACHAKMA DIGIT FOURNd0L444N1113BCHAKMA DIGIT FIVENd0L555N1113CCHAKMA DIGIT SIXNd0L666N1113DCHAKMA DIGIT SEVENNd0L777N1113ECHAKMA DIGIT EIGHTNd0L888N1113FCHAKMA DIGIT NINENd0L999N11140CHAKMA PHULACIHRPo0LN

11 BibliographyCāṅmā Cirajyoti and Maṅgal Cāṅgmā 1982 Cāṅmār āg pudhi (Chakma primer) Rāṅamāṭi

Cāṅmābhāṣā Prakāśanā Pariṣad Khisa Bhagadatta 2001 Cāṅmā pattham pāt = Chakma primer Rāṅamāṭi Tribal Cultural Institute

(TCI)Singā 2004 Phagadāṅ

12 Acknowledgements This project was made possible in part by a grant from the US NationalEndowment for the Humanities which funded the Universal Scripts Project (part of the Script EncodingInitiative at UC Berkeley) in respect of the Chakma encoding Any views findings conclusions orrecommendations expressed in this publication do not necessarily reflect those of the NationalEndowment of the Humanities

6

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-12 7

1114FChakma11100

1110 1111 1112 1113 1114

$69888

$ 69889

$ 69890

69891

69892

69893

69894

69895

69896

69897

69898

69899

69900

69901

69902

69903

69904

69905

69906

69907

69908

69909

69910

69911

69912

69913

69914

69915

69916

69917

69918

69919

69920

69921

69922

69923

69924

69925

69926

$ 69927

$ 69928

$69929

$ 69930

$ 69931

$ 69932

$ 69933

$ 69934

$ 69935

$69936

69937

$69938

69939

69940

69941

69942

69943

69944

69945

69946

69947

69948

69949

69950

69951

6995211100

11101

11102

11103

11104

11105

11106

11107

11108

11109

1110A

1110B

1110C

1110D

1110E

1110F

11110

11111

11112

11113

11114

11115

11116

11117

11118

11119

1111A

1111B

1111C

1111D

1111E

1111F

11120

11121

11122

11123

11124

11125

11126

11127

11128

11129

1112A

1112B

1112C

1112D

1112E

1112F

11130

11131

11132

11133

11134

11135

11136

11137

11138

11139

1113A

1113B

1113C

1113D

1113E

1113F

11140

0

1

2

3

4

5

6

7

8

9

A

B

C

D

E

F

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-128

1113AChakma11100

1111D 69917 CHAKMA LETTER BAA= ubaramuyaa baa

1111E 69918 CHAKMA LETTER BHAA= ciraddaalyaa bhaa

1111F 69919 CHAKMA LETTER MAA= bugatpadalaa maa

11120 69920 CHAKMA LETTER YYAA= cimayyaa yyaa

11121 69921 CHAKMA LETTER YAA= jilyaa yaa

11122 69922 CHAKMA LETTER RAA= dvidaayyaa raa

11123 69923 CHAKMA LETTER LAA= talamuyaa laa

11124 69924 CHAKMA LETTER WAA= bajhonyaa waa

11125 69925 CHAKMA LETTER SAA= bhudibukyaa saa

11126 69926 CHAKMA LETTER HAA= ubaramuyaa haa

Dependent vowel signs11127 $69927 CHAKMA VOWEL SIGN A

= ubaratulyaa a

11128 $69928 CHAKMA VOWEL SIGN I= bahryaa i

11129 $69929 CHAKMA VOWEL SIGN II= baaniiphadaa ii

1112A $69930 CHAKMA VOWEL SIGN U= ekattaana u

1112B $69931 CHAKMA VOWEL SIGN UU= dvittaana uu

1112C $69932 CHAKMA VOWEL SIGN E= ekaara e

1112D $69933 CHAKMA VOWEL SIGN AI= delabhaanga ai

1112E $69934 CHAKMA VOWEL SIGN O= okaara o

1112F $69935 CHAKMA VOWEL SIGN AU= aukaara au

11130 $69936 CHAKMA VOWEL SIGN OI= oikaara oi

Various signs11131 69937 CHAKMA VIRAMA

bull used to form conjuncts

rarr 1039 myanmar sign virama

11132 $69938 CHAKMA MAAYYAA

bull killer

rarr 103A $ myanmar sign asat

11133 69939 CHAKMA DANDA= ekacilyaa

11134 69940 CHAKMA DOUBLE DANDA= dvicilyaa

11135 69941 CHAKMA QUESTION MARK= pujhaar

Digits11136 69942 CHAKMA DIGIT ZERO

11137 69943 CHAKMA DIGIT ONE

11138 69944 CHAKMA DIGIT TWO

11139 69945 CHAKMA DIGIT THREE

1113A 69946 CHAKMA DIGIT FOUR

Various signs11100 $69888 CHAKMA SIGN CANDRABINDU

= caanaphupudaa

11101 $69889 CHAKMA SIGN ANUSVARA= ekaphudaa

11102 $69890 CHAKMA SIGN VISARGA= dviphudaa

Independent vowels11103 69891 CHAKMA LETTER AA

= pichapujhaa aa

11104 69892 CHAKMA LETTER I= delabhaangagaa i

11105 69893 CHAKMA LETTER U= bacacu u

11106 69894 CHAKMA LETTER E= lejaubaa e

Consonants11107 69895 CHAKMA LETTER KAA

= cucyaangyaa kaa

11108 69896 CHAKMA LETTER KHAA= grajaangyaa khaa

11109 69897 CHAKMA LETTER GAA= caandyaa gaa

1110A 69898 CHAKMA LETTER GHAA= tinaddaalyaa ghaa

1110B 69899 CHAKMA LETTER NGAA= cilaama ngaa

1110C 69900 CHAKMA LETTER CAA= dvibhalyaa caa

1110D 69901 CHAKMA LETTER CHAA= majaraa chaa

1110E 69902 CHAKMA LETTER JAA= dvipadalaa haa

1110F 69903 CHAKMA LETTER JHAA= uraauraa jhaa

11110 69904 CHAKMA LETTER NYAA= silaacyaa nyaa

11111 69905 CHAKMA LETTER TTAA= dviyaadaat ttaa

11112 69906 CHAKMA LETTER TTHAA= phudaadviyaat tthaa

11113 69907 CHAKMA LETTER DDAA= aadudaangaat ddaa

11114 69908 CHAKMA LETTER DDHAA= lejabharaat ddhaa

11115 69909 CHAKMA LETTER NNAA= pettttuyaa nnaa

11116 69910 CHAKMA LETTER TAA= ghangadaat taa

11117 69911 CHAKMA LETTER THAA= jagadaat thaa

11118 69912 CHAKMA LETTER DAA= dolaniit daa

11119 69913 CHAKMA LETTER DHAA= talamuyaat dhaa

1111A 69914 CHAKMA LETTER NAA= phaarabaanyaa naa

1111B 69915 CHAKMA LETTER PAA= paalyaa paa

1111C 69916 CHAKMA LETTER PHAA= ubaraphudaa phaa

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-12 9

11140Chakma1113B

1113B 69947 CHAKMA DIGIT FIVE

1113C 69948 CHAKMA DIGIT SIX

1113D 69949 CHAKMA DIGIT SEVEN

1113E 69950 CHAKMA DIGIT EIGHT

1113F 69951 CHAKMA DIGIT NINE

Punctuation11140 69952 CHAKMA PHULACIHR

= section sign

Figure 1 Chakma chart from Griersonrsquos Linguistic Survey of India 1903

10

Figure 2 Charts taken from a paper written by Mr Sugata Chakma of the Tribal Cultural Institute onldquothe Primary classification of languagesrdquo

Figure 3 Example of poetry from the book Phagadāṅ 2004

11

Figure 4 Alphabet chart from Khisa 2001

12

Figure 5 Chart of vowel signs and conjuncts from Khisa 2001

13

Figure 6 Chart with old-style conjuncts from Cāṅmā 1982

14

Figure 7 Chart with punctuation and digits from Cāṅmā 1982

15

A Administrative1 TitlePro po s al fo r enco di ng the Chakma s cri pt i n the UCS2 Requesterrsquos nameMi chael Ev ers o n3 Requester type (Member bodyLiaisonIndividual contribution)Indi v i dual co ntri buti o n4 Submission date2 0 0 8 -0 8 -2 85 Requesterrsquos reference (if applicable)6 Choose one of the following6a This is a complete proposalNo 6b More information will be provided laterYes

B Technical ndash General1 Choose one of the following1a This proposal is for a new script (set of characters)Yes 1b Proposed name of scriptChakma1c The proposal is for addition of character(s) to an existing blockNo 1d Name of the existing block2 Number of characters in proposal6 3 3 Proposed category (A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols)Categ o ry A4a Is a repertoire including character names providedYes 4b If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP documentYes 4c Are the character shapes attached in a legible form suitable for reviewYes 5a Who will provide the appropriate computerized font (ordered preference True Type or PostScript format) for publishing thestandardMi chael Ev ers o n5b If available now identify source(s) for the font (include address e-mail ftp-site etc) and indicate the tools usedMi chael Ev ers o n Fo nto g rapher6a Are references (to other character sets dictionaries descriptive texts etc) providedYes 6b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attachedYes 7 Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searchingindexing transliteration etc (if yes please enclose information)Yes 8 Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assistin correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties areCasing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contextsCompatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorgfor such informat ion on o ther scrip ts Also see Unicode Character Database h t tp www unicode org Publ icUNIDATAUnicodeCharacterDatabasehtml and associated Unicode Technical Reports for information needed for consideration by the UnicodeTechnical Committee for inclusion in the Unicode StandardSee abo v e

C Technical ndash Justification1 Has this proposal for addition of character(s) been submitted before If YES explainNo 2a Has contact been made to members of the user community (for example National Body user groups of the script or charactersother experts etc)Yes 2b If YES with whom2c If YES available relevant documents3 Information on the user community for the proposed characters (for example size demographics information technology use orpublishing use) is includedPeo pl e l i v i ng i n Bang l ades h and i n Indi a

16

4a The context of use for the proposed characters (type of use common or rare)Co mmo n4b Reference5a Are the proposed characters in current use by the user communityYes 5b If YES whereIn Bang l ades h and i n Indi a6a After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMPYes 6b If YES is a rationale providedYes 6c If YES referenceCo ntempo rary us e and acco rdance wi th the Ro admap7 Should the proposed characters be kept together in a contiguous range (rather than being scattered)Yes 8a Can any of the proposed characters be considered a presentation form of an existing character or character sequenceNo 8b If YES is a rationale for its inclusion provided8c If YES reference9a Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposedcharactersNo 9b If YES is a rationale for its inclusion provided9c If YES reference10a Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing characterNo 10b If YES is a rationale for its inclusion provided10c If YES reference11a Does the proposal include use of combining characters andor use of composite sequences (see clauses 412 and 414 in ISOIEC10646-1 2000)No 11b If YES is a rationale for such use provided11c If YES reference11d Is a list of composite sequences and their corresponding glyph images (graphic symbols) providedNo 11e If YES reference12a Does the proposal contain characters with any special properties such as control function or similar semanticsNo 12b If YES describe in detail (include attachment if necessary)13a Does the proposal contain any Ideographic compatibility character(s)No 13b If YES is the equivalent corresponding unified ideographic character(s) identified

17

Page 3: JTC1/SC2/WG2 L2/09-xxx - Evertype N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set ... 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh

No separate conjunct forms of subjoined full-form -yā -rā -lā or -wā appear to exist

na X + VIRAMA + nā

- - - -

An additional conjunct the -na conjunct is exemplary of the orthographic shift which has taken place inChakma While some writers would indeed write kakna as most now would probably expect it tobe written as As with Myanmar and Meetei Mayek encoding a visible killer for modern usersalongside an explicit conjoin-former permits the user to make specific choices about spelling more easilyBoth the Myanmar encoding model and the Devanagari encoding model have been explained to the usercommunity and feedback is that they Myanmar model fits the script better (This is little surpriseconsidering the close relationship between the Myanmar and Chakma scripts)

The 2004 book Phadagaṅ shows examples of the five conjuncts above together alongside conjunctsformed with bā mā and hā These are all formed by simple subjoining

ba X + VIRAMA + bā

- - - -

ma X + VIRAMA + mā

- - - -

ha X + VIRAMA + hā

- - - -

In the 1982 book Cāṅmār āg pudhi a much wider range of conjunct pairs is shown some of them withfairly complicated glyphs

kkā = kā + VIRAMA + kā

kṭā = kā + VIRAMA + ṭā ktā = kā + VIRAMA + tā kmā = kā + VIRAMA + mā

kcā = kā + VIRAMA + cā (conjunct shows old-style glyph)

ṅkā = ṅā + VIRAMA + kā

ṅgā = ṅā + VIRAMA + gā

3

ccā = cā + VIRAMA + cā (conjunct shows old-style glyph)

cchā = cā + VIRAMA + chā (conjunct shows old-style glyph)

ntildecā = ntildeā + VIRAMA + cā (conjunct shows old-style glyph)

ntildejā = ntildeā + VIRAMA + jā ntildejhā = ntildeā + VIRAMA + jhā

ṭṭā = ṭā + VIRAMA + ṭā ttā = tā + VIRAMA + tā tmā = tā + VIRAMA + mā

tthā = tā + VIRAMA + thā

ddā = dā + VIRAMA + dā

ddhā = dā + VIRAMA + dhā

ntā = nā + VIRAMA + tā nthā = nā + VIRAMA + thā

nmā = nā + VIRAMA + mā

ppā = pā + VIRAMA + pā

bbā = bā + VIRAMA + bā

mmā = mā + VIRAMA + mā

jjā = jā + VIRAMA + jā lkā = lā + VIRAMA + kā

lgā = lā + VIRAMA + gā

llā = lā + VIRAMA + lā lṭā = lā + VIRAMA + ṭā lpā = lā + VIRAMA + pā

schā = sā + VIRAMA + chā (conjunct shows old-style glyph)

sṭā = sā + VIRAMA + ṭā skā = sā + VIRAMA + kā

spā = sā + VIRAMA + pā

smā = sā + VIRAMA + mā

hmā = hā + VIRAMA + mā

4

The implication of this variety for implementors is simply one of how much conjunct support they wishto build into their fonts It would seem prudent to support the eight conjuncts seen in publications since2001 The 1982 style font would be considered archaic note how it differs in writing kmā tmā nmā bbā mmā llā smā and hmā which would be written kmā tmā nmā bbā mmā llā smā and hmā according to more recent sources This distinction isstylistic and not orthographic In Chakma the encoding model supports conjunct behaviour and a fontwithout any of these conjuncts would render them with the visible VIRAMA kmā tmā nmā bbā mmā llā smā and hmā To reiterate one can include as manyconjuncts as one wishes in a font but those included in the lists above were listed because they appearedin printed sources which were available

4 Independent vowels Four independent vowels exist a i u and e Other vowels in initialposition are formed by adding the vowel sign to a as in ī ū ai oi Some modern writers aregeneralizing this spelling in i u and e

5 Dependent vowels Independent vowel signs have been encoded according to their phonetic value notaccording to their glyph fragments Thus ṁ ANUSVARA and ḥ VISARGA are distinguished u and ūare distinguished ai and oi are distinguished and o and au are distinguished None of thesewould be equivalent to strings of characters (so ū is not u + u etc)

6 Collating order As an Indo-European language the standard Brahmic sorting order applies toChakma

7 Character names Consonant letter names use the typical Brahmic transliteration used in the UCSChakma letters have a descriptive name followed by a traditional Brahmic consonant Both are used herein the character names

8 Punctuation and digits Alongside a single and double danda punctuation Chakma has a uniquequestion mark and a section sign PHULACIHR There is some variation in the glyphs for the PHULICIHRsome looking like flowers or leaves A set of digits exists and is encoded although Bengali digits are alsoused The Tanchangya use Myanmar digits

9 Linebreaking Letters and digits behave as in Bengali

10 Unicode Character Properties

11100CHAKMA SIGN CANDRABINDUMn0NSMN11101CHAKMA SIGN ANUSVARAMn0NSMN11102CHAKMA SIGN VISARGAMc0LN11103CHAKMA LETTER AALo0LN11104CHAKMA LETTER ILo0LN11105CHAKMA LETTER ULo0LN11106CHAKMA LETTER ELo0LN11107CHAKMA LETTER KAALo0LN11108CHAKMA LETTER KHAALo0LN11109CHAKMA LETTER GAALo0LN1110ACHAKMA LETTER GHAALo0LN1110BCHAKMA LETTER NGAALo0LN1110CCHAKMA LETTER CAALo0LN1110DCHAKMA LETTER CHAALo0LN1110ECHAKMA LETTER JAALo0LN1110FCHAKMA LETTER JHAALo0LN11110CHAKMA LETTER NYAALo0LN11111CHAKMA LETTER TTAALo0LN11112CHAKMA LETTER TTHAALo0LN11113CHAKMA LETTER DDAALo0LN11114CHAKMA LETTER DDHAALo0LN11115CHAKMA LETTER NNAALo0LN11116CHAKMA LETTER TAALo0LN11117CHAKMA LETTER THAALo0LN

5

11118CHAKMA LETTER DAALo0LN11119CHAKMA LETTER DHAALo0LN1111ACHAKMA LETTER NAALo0LN1111BCHAKMA LETTER PAALo0LN1111CCHAKMA LETTER PHAALo0LN1111DCHAKMA LETTER BAALo0LN1111ECHAKMA LETTER BHAALo0LN1111FCHAKMA LETTER MAALo0LN11120CHAKMA LETTER YYAALo0LN11121CHAKMA LETTER YAALo0LN11122CHAKMA LETTER RAALo0LN11123CHAKMA LETTER LAALo0LN11124CHAKMA LETTER WAALo0LN11125CHAKMA LETTER SAALo0LN11126CHAKMA LETTER HAALo0LN11127CHAKMA VOWEL SIGN AMn0NSMN11128CHAKMA VOWEL SIGN IMn0NSMN11129CHAKMA VOWEL SIGN IIMn0NSMN1112ACHAKMA VOWEL SIGN UMn0NSMN1112BCHAKMA VOWEL SIGN UUMn0NSMN1112CCHAKMA VOWEL SIGN EMc0LN1112DCHAKMA VOWEL SIGN AIMn0NSMN1112ECHAKMA VOWEL SIGN OMn0NSMN1112FCHAKMA VOWEL SIGN AUMn0NSMN11130CHAKMA VOWEL SIGN OIMn0NSMN11131CHAKMA VIRAMAMn9NSMN11132CHAKMA MAAYYAAMn0NSMN11133CHAKMA DANDAPo0LN11134CHAKMA DOUBLE DANDAPo0LN11135CHAKMA QUESTION MARKPo0LN11136CHAKMA DIGIT ZERONd0L000N11137CHAKMA DIGIT ONENd0L111N11138CHAKMA DIGIT TWONd0L222N11139CHAKMA DIGIT THREENd0L333N1113ACHAKMA DIGIT FOURNd0L444N1113BCHAKMA DIGIT FIVENd0L555N1113CCHAKMA DIGIT SIXNd0L666N1113DCHAKMA DIGIT SEVENNd0L777N1113ECHAKMA DIGIT EIGHTNd0L888N1113FCHAKMA DIGIT NINENd0L999N11140CHAKMA PHULACIHRPo0LN

11 BibliographyCāṅmā Cirajyoti and Maṅgal Cāṅgmā 1982 Cāṅmār āg pudhi (Chakma primer) Rāṅamāṭi

Cāṅmābhāṣā Prakāśanā Pariṣad Khisa Bhagadatta 2001 Cāṅmā pattham pāt = Chakma primer Rāṅamāṭi Tribal Cultural Institute

(TCI)Singā 2004 Phagadāṅ

12 Acknowledgements This project was made possible in part by a grant from the US NationalEndowment for the Humanities which funded the Universal Scripts Project (part of the Script EncodingInitiative at UC Berkeley) in respect of the Chakma encoding Any views findings conclusions orrecommendations expressed in this publication do not necessarily reflect those of the NationalEndowment of the Humanities

6

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-12 7

1114FChakma11100

1110 1111 1112 1113 1114

$69888

$ 69889

$ 69890

69891

69892

69893

69894

69895

69896

69897

69898

69899

69900

69901

69902

69903

69904

69905

69906

69907

69908

69909

69910

69911

69912

69913

69914

69915

69916

69917

69918

69919

69920

69921

69922

69923

69924

69925

69926

$ 69927

$ 69928

$69929

$ 69930

$ 69931

$ 69932

$ 69933

$ 69934

$ 69935

$69936

69937

$69938

69939

69940

69941

69942

69943

69944

69945

69946

69947

69948

69949

69950

69951

6995211100

11101

11102

11103

11104

11105

11106

11107

11108

11109

1110A

1110B

1110C

1110D

1110E

1110F

11110

11111

11112

11113

11114

11115

11116

11117

11118

11119

1111A

1111B

1111C

1111D

1111E

1111F

11120

11121

11122

11123

11124

11125

11126

11127

11128

11129

1112A

1112B

1112C

1112D

1112E

1112F

11130

11131

11132

11133

11134

11135

11136

11137

11138

11139

1113A

1113B

1113C

1113D

1113E

1113F

11140

0

1

2

3

4

5

6

7

8

9

A

B

C

D

E

F

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-128

1113AChakma11100

1111D 69917 CHAKMA LETTER BAA= ubaramuyaa baa

1111E 69918 CHAKMA LETTER BHAA= ciraddaalyaa bhaa

1111F 69919 CHAKMA LETTER MAA= bugatpadalaa maa

11120 69920 CHAKMA LETTER YYAA= cimayyaa yyaa

11121 69921 CHAKMA LETTER YAA= jilyaa yaa

11122 69922 CHAKMA LETTER RAA= dvidaayyaa raa

11123 69923 CHAKMA LETTER LAA= talamuyaa laa

11124 69924 CHAKMA LETTER WAA= bajhonyaa waa

11125 69925 CHAKMA LETTER SAA= bhudibukyaa saa

11126 69926 CHAKMA LETTER HAA= ubaramuyaa haa

Dependent vowel signs11127 $69927 CHAKMA VOWEL SIGN A

= ubaratulyaa a

11128 $69928 CHAKMA VOWEL SIGN I= bahryaa i

11129 $69929 CHAKMA VOWEL SIGN II= baaniiphadaa ii

1112A $69930 CHAKMA VOWEL SIGN U= ekattaana u

1112B $69931 CHAKMA VOWEL SIGN UU= dvittaana uu

1112C $69932 CHAKMA VOWEL SIGN E= ekaara e

1112D $69933 CHAKMA VOWEL SIGN AI= delabhaanga ai

1112E $69934 CHAKMA VOWEL SIGN O= okaara o

1112F $69935 CHAKMA VOWEL SIGN AU= aukaara au

11130 $69936 CHAKMA VOWEL SIGN OI= oikaara oi

Various signs11131 69937 CHAKMA VIRAMA

bull used to form conjuncts

rarr 1039 myanmar sign virama

11132 $69938 CHAKMA MAAYYAA

bull killer

rarr 103A $ myanmar sign asat

11133 69939 CHAKMA DANDA= ekacilyaa

11134 69940 CHAKMA DOUBLE DANDA= dvicilyaa

11135 69941 CHAKMA QUESTION MARK= pujhaar

Digits11136 69942 CHAKMA DIGIT ZERO

11137 69943 CHAKMA DIGIT ONE

11138 69944 CHAKMA DIGIT TWO

11139 69945 CHAKMA DIGIT THREE

1113A 69946 CHAKMA DIGIT FOUR

Various signs11100 $69888 CHAKMA SIGN CANDRABINDU

= caanaphupudaa

11101 $69889 CHAKMA SIGN ANUSVARA= ekaphudaa

11102 $69890 CHAKMA SIGN VISARGA= dviphudaa

Independent vowels11103 69891 CHAKMA LETTER AA

= pichapujhaa aa

11104 69892 CHAKMA LETTER I= delabhaangagaa i

11105 69893 CHAKMA LETTER U= bacacu u

11106 69894 CHAKMA LETTER E= lejaubaa e

Consonants11107 69895 CHAKMA LETTER KAA

= cucyaangyaa kaa

11108 69896 CHAKMA LETTER KHAA= grajaangyaa khaa

11109 69897 CHAKMA LETTER GAA= caandyaa gaa

1110A 69898 CHAKMA LETTER GHAA= tinaddaalyaa ghaa

1110B 69899 CHAKMA LETTER NGAA= cilaama ngaa

1110C 69900 CHAKMA LETTER CAA= dvibhalyaa caa

1110D 69901 CHAKMA LETTER CHAA= majaraa chaa

1110E 69902 CHAKMA LETTER JAA= dvipadalaa haa

1110F 69903 CHAKMA LETTER JHAA= uraauraa jhaa

11110 69904 CHAKMA LETTER NYAA= silaacyaa nyaa

11111 69905 CHAKMA LETTER TTAA= dviyaadaat ttaa

11112 69906 CHAKMA LETTER TTHAA= phudaadviyaat tthaa

11113 69907 CHAKMA LETTER DDAA= aadudaangaat ddaa

11114 69908 CHAKMA LETTER DDHAA= lejabharaat ddhaa

11115 69909 CHAKMA LETTER NNAA= pettttuyaa nnaa

11116 69910 CHAKMA LETTER TAA= ghangadaat taa

11117 69911 CHAKMA LETTER THAA= jagadaat thaa

11118 69912 CHAKMA LETTER DAA= dolaniit daa

11119 69913 CHAKMA LETTER DHAA= talamuyaat dhaa

1111A 69914 CHAKMA LETTER NAA= phaarabaanyaa naa

1111B 69915 CHAKMA LETTER PAA= paalyaa paa

1111C 69916 CHAKMA LETTER PHAA= ubaraphudaa phaa

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-12 9

11140Chakma1113B

1113B 69947 CHAKMA DIGIT FIVE

1113C 69948 CHAKMA DIGIT SIX

1113D 69949 CHAKMA DIGIT SEVEN

1113E 69950 CHAKMA DIGIT EIGHT

1113F 69951 CHAKMA DIGIT NINE

Punctuation11140 69952 CHAKMA PHULACIHR

= section sign

Figure 1 Chakma chart from Griersonrsquos Linguistic Survey of India 1903

10

Figure 2 Charts taken from a paper written by Mr Sugata Chakma of the Tribal Cultural Institute onldquothe Primary classification of languagesrdquo

Figure 3 Example of poetry from the book Phagadāṅ 2004

11

Figure 4 Alphabet chart from Khisa 2001

12

Figure 5 Chart of vowel signs and conjuncts from Khisa 2001

13

Figure 6 Chart with old-style conjuncts from Cāṅmā 1982

14

Figure 7 Chart with punctuation and digits from Cāṅmā 1982

15

A Administrative1 TitlePro po s al fo r enco di ng the Chakma s cri pt i n the UCS2 Requesterrsquos nameMi chael Ev ers o n3 Requester type (Member bodyLiaisonIndividual contribution)Indi v i dual co ntri buti o n4 Submission date2 0 0 8 -0 8 -2 85 Requesterrsquos reference (if applicable)6 Choose one of the following6a This is a complete proposalNo 6b More information will be provided laterYes

B Technical ndash General1 Choose one of the following1a This proposal is for a new script (set of characters)Yes 1b Proposed name of scriptChakma1c The proposal is for addition of character(s) to an existing blockNo 1d Name of the existing block2 Number of characters in proposal6 3 3 Proposed category (A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols)Categ o ry A4a Is a repertoire including character names providedYes 4b If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP documentYes 4c Are the character shapes attached in a legible form suitable for reviewYes 5a Who will provide the appropriate computerized font (ordered preference True Type or PostScript format) for publishing thestandardMi chael Ev ers o n5b If available now identify source(s) for the font (include address e-mail ftp-site etc) and indicate the tools usedMi chael Ev ers o n Fo nto g rapher6a Are references (to other character sets dictionaries descriptive texts etc) providedYes 6b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attachedYes 7 Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searchingindexing transliteration etc (if yes please enclose information)Yes 8 Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assistin correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties areCasing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contextsCompatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorgfor such informat ion on o ther scrip ts Also see Unicode Character Database h t tp www unicode org Publ icUNIDATAUnicodeCharacterDatabasehtml and associated Unicode Technical Reports for information needed for consideration by the UnicodeTechnical Committee for inclusion in the Unicode StandardSee abo v e

C Technical ndash Justification1 Has this proposal for addition of character(s) been submitted before If YES explainNo 2a Has contact been made to members of the user community (for example National Body user groups of the script or charactersother experts etc)Yes 2b If YES with whom2c If YES available relevant documents3 Information on the user community for the proposed characters (for example size demographics information technology use orpublishing use) is includedPeo pl e l i v i ng i n Bang l ades h and i n Indi a

16

4a The context of use for the proposed characters (type of use common or rare)Co mmo n4b Reference5a Are the proposed characters in current use by the user communityYes 5b If YES whereIn Bang l ades h and i n Indi a6a After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMPYes 6b If YES is a rationale providedYes 6c If YES referenceCo ntempo rary us e and acco rdance wi th the Ro admap7 Should the proposed characters be kept together in a contiguous range (rather than being scattered)Yes 8a Can any of the proposed characters be considered a presentation form of an existing character or character sequenceNo 8b If YES is a rationale for its inclusion provided8c If YES reference9a Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposedcharactersNo 9b If YES is a rationale for its inclusion provided9c If YES reference10a Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing characterNo 10b If YES is a rationale for its inclusion provided10c If YES reference11a Does the proposal include use of combining characters andor use of composite sequences (see clauses 412 and 414 in ISOIEC10646-1 2000)No 11b If YES is a rationale for such use provided11c If YES reference11d Is a list of composite sequences and their corresponding glyph images (graphic symbols) providedNo 11e If YES reference12a Does the proposal contain characters with any special properties such as control function or similar semanticsNo 12b If YES describe in detail (include attachment if necessary)13a Does the proposal contain any Ideographic compatibility character(s)No 13b If YES is the equivalent corresponding unified ideographic character(s) identified

17

Page 4: JTC1/SC2/WG2 L2/09-xxx - Evertype N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set ... 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh

ccā = cā + VIRAMA + cā (conjunct shows old-style glyph)

cchā = cā + VIRAMA + chā (conjunct shows old-style glyph)

ntildecā = ntildeā + VIRAMA + cā (conjunct shows old-style glyph)

ntildejā = ntildeā + VIRAMA + jā ntildejhā = ntildeā + VIRAMA + jhā

ṭṭā = ṭā + VIRAMA + ṭā ttā = tā + VIRAMA + tā tmā = tā + VIRAMA + mā

tthā = tā + VIRAMA + thā

ddā = dā + VIRAMA + dā

ddhā = dā + VIRAMA + dhā

ntā = nā + VIRAMA + tā nthā = nā + VIRAMA + thā

nmā = nā + VIRAMA + mā

ppā = pā + VIRAMA + pā

bbā = bā + VIRAMA + bā

mmā = mā + VIRAMA + mā

jjā = jā + VIRAMA + jā lkā = lā + VIRAMA + kā

lgā = lā + VIRAMA + gā

llā = lā + VIRAMA + lā lṭā = lā + VIRAMA + ṭā lpā = lā + VIRAMA + pā

schā = sā + VIRAMA + chā (conjunct shows old-style glyph)

sṭā = sā + VIRAMA + ṭā skā = sā + VIRAMA + kā

spā = sā + VIRAMA + pā

smā = sā + VIRAMA + mā

hmā = hā + VIRAMA + mā

4

The implication of this variety for implementors is simply one of how much conjunct support they wishto build into their fonts It would seem prudent to support the eight conjuncts seen in publications since2001 The 1982 style font would be considered archaic note how it differs in writing kmā tmā nmā bbā mmā llā smā and hmā which would be written kmā tmā nmā bbā mmā llā smā and hmā according to more recent sources This distinction isstylistic and not orthographic In Chakma the encoding model supports conjunct behaviour and a fontwithout any of these conjuncts would render them with the visible VIRAMA kmā tmā nmā bbā mmā llā smā and hmā To reiterate one can include as manyconjuncts as one wishes in a font but those included in the lists above were listed because they appearedin printed sources which were available

4 Independent vowels Four independent vowels exist a i u and e Other vowels in initialposition are formed by adding the vowel sign to a as in ī ū ai oi Some modern writers aregeneralizing this spelling in i u and e

5 Dependent vowels Independent vowel signs have been encoded according to their phonetic value notaccording to their glyph fragments Thus ṁ ANUSVARA and ḥ VISARGA are distinguished u and ūare distinguished ai and oi are distinguished and o and au are distinguished None of thesewould be equivalent to strings of characters (so ū is not u + u etc)

6 Collating order As an Indo-European language the standard Brahmic sorting order applies toChakma

7 Character names Consonant letter names use the typical Brahmic transliteration used in the UCSChakma letters have a descriptive name followed by a traditional Brahmic consonant Both are used herein the character names

8 Punctuation and digits Alongside a single and double danda punctuation Chakma has a uniquequestion mark and a section sign PHULACIHR There is some variation in the glyphs for the PHULICIHRsome looking like flowers or leaves A set of digits exists and is encoded although Bengali digits are alsoused The Tanchangya use Myanmar digits

9 Linebreaking Letters and digits behave as in Bengali

10 Unicode Character Properties

11100CHAKMA SIGN CANDRABINDUMn0NSMN11101CHAKMA SIGN ANUSVARAMn0NSMN11102CHAKMA SIGN VISARGAMc0LN11103CHAKMA LETTER AALo0LN11104CHAKMA LETTER ILo0LN11105CHAKMA LETTER ULo0LN11106CHAKMA LETTER ELo0LN11107CHAKMA LETTER KAALo0LN11108CHAKMA LETTER KHAALo0LN11109CHAKMA LETTER GAALo0LN1110ACHAKMA LETTER GHAALo0LN1110BCHAKMA LETTER NGAALo0LN1110CCHAKMA LETTER CAALo0LN1110DCHAKMA LETTER CHAALo0LN1110ECHAKMA LETTER JAALo0LN1110FCHAKMA LETTER JHAALo0LN11110CHAKMA LETTER NYAALo0LN11111CHAKMA LETTER TTAALo0LN11112CHAKMA LETTER TTHAALo0LN11113CHAKMA LETTER DDAALo0LN11114CHAKMA LETTER DDHAALo0LN11115CHAKMA LETTER NNAALo0LN11116CHAKMA LETTER TAALo0LN11117CHAKMA LETTER THAALo0LN

5

11118CHAKMA LETTER DAALo0LN11119CHAKMA LETTER DHAALo0LN1111ACHAKMA LETTER NAALo0LN1111BCHAKMA LETTER PAALo0LN1111CCHAKMA LETTER PHAALo0LN1111DCHAKMA LETTER BAALo0LN1111ECHAKMA LETTER BHAALo0LN1111FCHAKMA LETTER MAALo0LN11120CHAKMA LETTER YYAALo0LN11121CHAKMA LETTER YAALo0LN11122CHAKMA LETTER RAALo0LN11123CHAKMA LETTER LAALo0LN11124CHAKMA LETTER WAALo0LN11125CHAKMA LETTER SAALo0LN11126CHAKMA LETTER HAALo0LN11127CHAKMA VOWEL SIGN AMn0NSMN11128CHAKMA VOWEL SIGN IMn0NSMN11129CHAKMA VOWEL SIGN IIMn0NSMN1112ACHAKMA VOWEL SIGN UMn0NSMN1112BCHAKMA VOWEL SIGN UUMn0NSMN1112CCHAKMA VOWEL SIGN EMc0LN1112DCHAKMA VOWEL SIGN AIMn0NSMN1112ECHAKMA VOWEL SIGN OMn0NSMN1112FCHAKMA VOWEL SIGN AUMn0NSMN11130CHAKMA VOWEL SIGN OIMn0NSMN11131CHAKMA VIRAMAMn9NSMN11132CHAKMA MAAYYAAMn0NSMN11133CHAKMA DANDAPo0LN11134CHAKMA DOUBLE DANDAPo0LN11135CHAKMA QUESTION MARKPo0LN11136CHAKMA DIGIT ZERONd0L000N11137CHAKMA DIGIT ONENd0L111N11138CHAKMA DIGIT TWONd0L222N11139CHAKMA DIGIT THREENd0L333N1113ACHAKMA DIGIT FOURNd0L444N1113BCHAKMA DIGIT FIVENd0L555N1113CCHAKMA DIGIT SIXNd0L666N1113DCHAKMA DIGIT SEVENNd0L777N1113ECHAKMA DIGIT EIGHTNd0L888N1113FCHAKMA DIGIT NINENd0L999N11140CHAKMA PHULACIHRPo0LN

11 BibliographyCāṅmā Cirajyoti and Maṅgal Cāṅgmā 1982 Cāṅmār āg pudhi (Chakma primer) Rāṅamāṭi

Cāṅmābhāṣā Prakāśanā Pariṣad Khisa Bhagadatta 2001 Cāṅmā pattham pāt = Chakma primer Rāṅamāṭi Tribal Cultural Institute

(TCI)Singā 2004 Phagadāṅ

12 Acknowledgements This project was made possible in part by a grant from the US NationalEndowment for the Humanities which funded the Universal Scripts Project (part of the Script EncodingInitiative at UC Berkeley) in respect of the Chakma encoding Any views findings conclusions orrecommendations expressed in this publication do not necessarily reflect those of the NationalEndowment of the Humanities

6

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-12 7

1114FChakma11100

1110 1111 1112 1113 1114

$69888

$ 69889

$ 69890

69891

69892

69893

69894

69895

69896

69897

69898

69899

69900

69901

69902

69903

69904

69905

69906

69907

69908

69909

69910

69911

69912

69913

69914

69915

69916

69917

69918

69919

69920

69921

69922

69923

69924

69925

69926

$ 69927

$ 69928

$69929

$ 69930

$ 69931

$ 69932

$ 69933

$ 69934

$ 69935

$69936

69937

$69938

69939

69940

69941

69942

69943

69944

69945

69946

69947

69948

69949

69950

69951

6995211100

11101

11102

11103

11104

11105

11106

11107

11108

11109

1110A

1110B

1110C

1110D

1110E

1110F

11110

11111

11112

11113

11114

11115

11116

11117

11118

11119

1111A

1111B

1111C

1111D

1111E

1111F

11120

11121

11122

11123

11124

11125

11126

11127

11128

11129

1112A

1112B

1112C

1112D

1112E

1112F

11130

11131

11132

11133

11134

11135

11136

11137

11138

11139

1113A

1113B

1113C

1113D

1113E

1113F

11140

0

1

2

3

4

5

6

7

8

9

A

B

C

D

E

F

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-128

1113AChakma11100

1111D 69917 CHAKMA LETTER BAA= ubaramuyaa baa

1111E 69918 CHAKMA LETTER BHAA= ciraddaalyaa bhaa

1111F 69919 CHAKMA LETTER MAA= bugatpadalaa maa

11120 69920 CHAKMA LETTER YYAA= cimayyaa yyaa

11121 69921 CHAKMA LETTER YAA= jilyaa yaa

11122 69922 CHAKMA LETTER RAA= dvidaayyaa raa

11123 69923 CHAKMA LETTER LAA= talamuyaa laa

11124 69924 CHAKMA LETTER WAA= bajhonyaa waa

11125 69925 CHAKMA LETTER SAA= bhudibukyaa saa

11126 69926 CHAKMA LETTER HAA= ubaramuyaa haa

Dependent vowel signs11127 $69927 CHAKMA VOWEL SIGN A

= ubaratulyaa a

11128 $69928 CHAKMA VOWEL SIGN I= bahryaa i

11129 $69929 CHAKMA VOWEL SIGN II= baaniiphadaa ii

1112A $69930 CHAKMA VOWEL SIGN U= ekattaana u

1112B $69931 CHAKMA VOWEL SIGN UU= dvittaana uu

1112C $69932 CHAKMA VOWEL SIGN E= ekaara e

1112D $69933 CHAKMA VOWEL SIGN AI= delabhaanga ai

1112E $69934 CHAKMA VOWEL SIGN O= okaara o

1112F $69935 CHAKMA VOWEL SIGN AU= aukaara au

11130 $69936 CHAKMA VOWEL SIGN OI= oikaara oi

Various signs11131 69937 CHAKMA VIRAMA

bull used to form conjuncts

rarr 1039 myanmar sign virama

11132 $69938 CHAKMA MAAYYAA

bull killer

rarr 103A $ myanmar sign asat

11133 69939 CHAKMA DANDA= ekacilyaa

11134 69940 CHAKMA DOUBLE DANDA= dvicilyaa

11135 69941 CHAKMA QUESTION MARK= pujhaar

Digits11136 69942 CHAKMA DIGIT ZERO

11137 69943 CHAKMA DIGIT ONE

11138 69944 CHAKMA DIGIT TWO

11139 69945 CHAKMA DIGIT THREE

1113A 69946 CHAKMA DIGIT FOUR

Various signs11100 $69888 CHAKMA SIGN CANDRABINDU

= caanaphupudaa

11101 $69889 CHAKMA SIGN ANUSVARA= ekaphudaa

11102 $69890 CHAKMA SIGN VISARGA= dviphudaa

Independent vowels11103 69891 CHAKMA LETTER AA

= pichapujhaa aa

11104 69892 CHAKMA LETTER I= delabhaangagaa i

11105 69893 CHAKMA LETTER U= bacacu u

11106 69894 CHAKMA LETTER E= lejaubaa e

Consonants11107 69895 CHAKMA LETTER KAA

= cucyaangyaa kaa

11108 69896 CHAKMA LETTER KHAA= grajaangyaa khaa

11109 69897 CHAKMA LETTER GAA= caandyaa gaa

1110A 69898 CHAKMA LETTER GHAA= tinaddaalyaa ghaa

1110B 69899 CHAKMA LETTER NGAA= cilaama ngaa

1110C 69900 CHAKMA LETTER CAA= dvibhalyaa caa

1110D 69901 CHAKMA LETTER CHAA= majaraa chaa

1110E 69902 CHAKMA LETTER JAA= dvipadalaa haa

1110F 69903 CHAKMA LETTER JHAA= uraauraa jhaa

11110 69904 CHAKMA LETTER NYAA= silaacyaa nyaa

11111 69905 CHAKMA LETTER TTAA= dviyaadaat ttaa

11112 69906 CHAKMA LETTER TTHAA= phudaadviyaat tthaa

11113 69907 CHAKMA LETTER DDAA= aadudaangaat ddaa

11114 69908 CHAKMA LETTER DDHAA= lejabharaat ddhaa

11115 69909 CHAKMA LETTER NNAA= pettttuyaa nnaa

11116 69910 CHAKMA LETTER TAA= ghangadaat taa

11117 69911 CHAKMA LETTER THAA= jagadaat thaa

11118 69912 CHAKMA LETTER DAA= dolaniit daa

11119 69913 CHAKMA LETTER DHAA= talamuyaat dhaa

1111A 69914 CHAKMA LETTER NAA= phaarabaanyaa naa

1111B 69915 CHAKMA LETTER PAA= paalyaa paa

1111C 69916 CHAKMA LETTER PHAA= ubaraphudaa phaa

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-12 9

11140Chakma1113B

1113B 69947 CHAKMA DIGIT FIVE

1113C 69948 CHAKMA DIGIT SIX

1113D 69949 CHAKMA DIGIT SEVEN

1113E 69950 CHAKMA DIGIT EIGHT

1113F 69951 CHAKMA DIGIT NINE

Punctuation11140 69952 CHAKMA PHULACIHR

= section sign

Figure 1 Chakma chart from Griersonrsquos Linguistic Survey of India 1903

10

Figure 2 Charts taken from a paper written by Mr Sugata Chakma of the Tribal Cultural Institute onldquothe Primary classification of languagesrdquo

Figure 3 Example of poetry from the book Phagadāṅ 2004

11

Figure 4 Alphabet chart from Khisa 2001

12

Figure 5 Chart of vowel signs and conjuncts from Khisa 2001

13

Figure 6 Chart with old-style conjuncts from Cāṅmā 1982

14

Figure 7 Chart with punctuation and digits from Cāṅmā 1982

15

A Administrative1 TitlePro po s al fo r enco di ng the Chakma s cri pt i n the UCS2 Requesterrsquos nameMi chael Ev ers o n3 Requester type (Member bodyLiaisonIndividual contribution)Indi v i dual co ntri buti o n4 Submission date2 0 0 8 -0 8 -2 85 Requesterrsquos reference (if applicable)6 Choose one of the following6a This is a complete proposalNo 6b More information will be provided laterYes

B Technical ndash General1 Choose one of the following1a This proposal is for a new script (set of characters)Yes 1b Proposed name of scriptChakma1c The proposal is for addition of character(s) to an existing blockNo 1d Name of the existing block2 Number of characters in proposal6 3 3 Proposed category (A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols)Categ o ry A4a Is a repertoire including character names providedYes 4b If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP documentYes 4c Are the character shapes attached in a legible form suitable for reviewYes 5a Who will provide the appropriate computerized font (ordered preference True Type or PostScript format) for publishing thestandardMi chael Ev ers o n5b If available now identify source(s) for the font (include address e-mail ftp-site etc) and indicate the tools usedMi chael Ev ers o n Fo nto g rapher6a Are references (to other character sets dictionaries descriptive texts etc) providedYes 6b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attachedYes 7 Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searchingindexing transliteration etc (if yes please enclose information)Yes 8 Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assistin correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties areCasing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contextsCompatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorgfor such informat ion on o ther scrip ts Also see Unicode Character Database h t tp www unicode org Publ icUNIDATAUnicodeCharacterDatabasehtml and associated Unicode Technical Reports for information needed for consideration by the UnicodeTechnical Committee for inclusion in the Unicode StandardSee abo v e

C Technical ndash Justification1 Has this proposal for addition of character(s) been submitted before If YES explainNo 2a Has contact been made to members of the user community (for example National Body user groups of the script or charactersother experts etc)Yes 2b If YES with whom2c If YES available relevant documents3 Information on the user community for the proposed characters (for example size demographics information technology use orpublishing use) is includedPeo pl e l i v i ng i n Bang l ades h and i n Indi a

16

4a The context of use for the proposed characters (type of use common or rare)Co mmo n4b Reference5a Are the proposed characters in current use by the user communityYes 5b If YES whereIn Bang l ades h and i n Indi a6a After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMPYes 6b If YES is a rationale providedYes 6c If YES referenceCo ntempo rary us e and acco rdance wi th the Ro admap7 Should the proposed characters be kept together in a contiguous range (rather than being scattered)Yes 8a Can any of the proposed characters be considered a presentation form of an existing character or character sequenceNo 8b If YES is a rationale for its inclusion provided8c If YES reference9a Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposedcharactersNo 9b If YES is a rationale for its inclusion provided9c If YES reference10a Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing characterNo 10b If YES is a rationale for its inclusion provided10c If YES reference11a Does the proposal include use of combining characters andor use of composite sequences (see clauses 412 and 414 in ISOIEC10646-1 2000)No 11b If YES is a rationale for such use provided11c If YES reference11d Is a list of composite sequences and their corresponding glyph images (graphic symbols) providedNo 11e If YES reference12a Does the proposal contain characters with any special properties such as control function or similar semanticsNo 12b If YES describe in detail (include attachment if necessary)13a Does the proposal contain any Ideographic compatibility character(s)No 13b If YES is the equivalent corresponding unified ideographic character(s) identified

17

Page 5: JTC1/SC2/WG2 L2/09-xxx - Evertype N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set ... 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh

The implication of this variety for implementors is simply one of how much conjunct support they wishto build into their fonts It would seem prudent to support the eight conjuncts seen in publications since2001 The 1982 style font would be considered archaic note how it differs in writing kmā tmā nmā bbā mmā llā smā and hmā which would be written kmā tmā nmā bbā mmā llā smā and hmā according to more recent sources This distinction isstylistic and not orthographic In Chakma the encoding model supports conjunct behaviour and a fontwithout any of these conjuncts would render them with the visible VIRAMA kmā tmā nmā bbā mmā llā smā and hmā To reiterate one can include as manyconjuncts as one wishes in a font but those included in the lists above were listed because they appearedin printed sources which were available

4 Independent vowels Four independent vowels exist a i u and e Other vowels in initialposition are formed by adding the vowel sign to a as in ī ū ai oi Some modern writers aregeneralizing this spelling in i u and e

5 Dependent vowels Independent vowel signs have been encoded according to their phonetic value notaccording to their glyph fragments Thus ṁ ANUSVARA and ḥ VISARGA are distinguished u and ūare distinguished ai and oi are distinguished and o and au are distinguished None of thesewould be equivalent to strings of characters (so ū is not u + u etc)

6 Collating order As an Indo-European language the standard Brahmic sorting order applies toChakma

7 Character names Consonant letter names use the typical Brahmic transliteration used in the UCSChakma letters have a descriptive name followed by a traditional Brahmic consonant Both are used herein the character names

8 Punctuation and digits Alongside a single and double danda punctuation Chakma has a uniquequestion mark and a section sign PHULACIHR There is some variation in the glyphs for the PHULICIHRsome looking like flowers or leaves A set of digits exists and is encoded although Bengali digits are alsoused The Tanchangya use Myanmar digits

9 Linebreaking Letters and digits behave as in Bengali

10 Unicode Character Properties

11100CHAKMA SIGN CANDRABINDUMn0NSMN11101CHAKMA SIGN ANUSVARAMn0NSMN11102CHAKMA SIGN VISARGAMc0LN11103CHAKMA LETTER AALo0LN11104CHAKMA LETTER ILo0LN11105CHAKMA LETTER ULo0LN11106CHAKMA LETTER ELo0LN11107CHAKMA LETTER KAALo0LN11108CHAKMA LETTER KHAALo0LN11109CHAKMA LETTER GAALo0LN1110ACHAKMA LETTER GHAALo0LN1110BCHAKMA LETTER NGAALo0LN1110CCHAKMA LETTER CAALo0LN1110DCHAKMA LETTER CHAALo0LN1110ECHAKMA LETTER JAALo0LN1110FCHAKMA LETTER JHAALo0LN11110CHAKMA LETTER NYAALo0LN11111CHAKMA LETTER TTAALo0LN11112CHAKMA LETTER TTHAALo0LN11113CHAKMA LETTER DDAALo0LN11114CHAKMA LETTER DDHAALo0LN11115CHAKMA LETTER NNAALo0LN11116CHAKMA LETTER TAALo0LN11117CHAKMA LETTER THAALo0LN

5

11118CHAKMA LETTER DAALo0LN11119CHAKMA LETTER DHAALo0LN1111ACHAKMA LETTER NAALo0LN1111BCHAKMA LETTER PAALo0LN1111CCHAKMA LETTER PHAALo0LN1111DCHAKMA LETTER BAALo0LN1111ECHAKMA LETTER BHAALo0LN1111FCHAKMA LETTER MAALo0LN11120CHAKMA LETTER YYAALo0LN11121CHAKMA LETTER YAALo0LN11122CHAKMA LETTER RAALo0LN11123CHAKMA LETTER LAALo0LN11124CHAKMA LETTER WAALo0LN11125CHAKMA LETTER SAALo0LN11126CHAKMA LETTER HAALo0LN11127CHAKMA VOWEL SIGN AMn0NSMN11128CHAKMA VOWEL SIGN IMn0NSMN11129CHAKMA VOWEL SIGN IIMn0NSMN1112ACHAKMA VOWEL SIGN UMn0NSMN1112BCHAKMA VOWEL SIGN UUMn0NSMN1112CCHAKMA VOWEL SIGN EMc0LN1112DCHAKMA VOWEL SIGN AIMn0NSMN1112ECHAKMA VOWEL SIGN OMn0NSMN1112FCHAKMA VOWEL SIGN AUMn0NSMN11130CHAKMA VOWEL SIGN OIMn0NSMN11131CHAKMA VIRAMAMn9NSMN11132CHAKMA MAAYYAAMn0NSMN11133CHAKMA DANDAPo0LN11134CHAKMA DOUBLE DANDAPo0LN11135CHAKMA QUESTION MARKPo0LN11136CHAKMA DIGIT ZERONd0L000N11137CHAKMA DIGIT ONENd0L111N11138CHAKMA DIGIT TWONd0L222N11139CHAKMA DIGIT THREENd0L333N1113ACHAKMA DIGIT FOURNd0L444N1113BCHAKMA DIGIT FIVENd0L555N1113CCHAKMA DIGIT SIXNd0L666N1113DCHAKMA DIGIT SEVENNd0L777N1113ECHAKMA DIGIT EIGHTNd0L888N1113FCHAKMA DIGIT NINENd0L999N11140CHAKMA PHULACIHRPo0LN

11 BibliographyCāṅmā Cirajyoti and Maṅgal Cāṅgmā 1982 Cāṅmār āg pudhi (Chakma primer) Rāṅamāṭi

Cāṅmābhāṣā Prakāśanā Pariṣad Khisa Bhagadatta 2001 Cāṅmā pattham pāt = Chakma primer Rāṅamāṭi Tribal Cultural Institute

(TCI)Singā 2004 Phagadāṅ

12 Acknowledgements This project was made possible in part by a grant from the US NationalEndowment for the Humanities which funded the Universal Scripts Project (part of the Script EncodingInitiative at UC Berkeley) in respect of the Chakma encoding Any views findings conclusions orrecommendations expressed in this publication do not necessarily reflect those of the NationalEndowment of the Humanities

6

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-12 7

1114FChakma11100

1110 1111 1112 1113 1114

$69888

$ 69889

$ 69890

69891

69892

69893

69894

69895

69896

69897

69898

69899

69900

69901

69902

69903

69904

69905

69906

69907

69908

69909

69910

69911

69912

69913

69914

69915

69916

69917

69918

69919

69920

69921

69922

69923

69924

69925

69926

$ 69927

$ 69928

$69929

$ 69930

$ 69931

$ 69932

$ 69933

$ 69934

$ 69935

$69936

69937

$69938

69939

69940

69941

69942

69943

69944

69945

69946

69947

69948

69949

69950

69951

6995211100

11101

11102

11103

11104

11105

11106

11107

11108

11109

1110A

1110B

1110C

1110D

1110E

1110F

11110

11111

11112

11113

11114

11115

11116

11117

11118

11119

1111A

1111B

1111C

1111D

1111E

1111F

11120

11121

11122

11123

11124

11125

11126

11127

11128

11129

1112A

1112B

1112C

1112D

1112E

1112F

11130

11131

11132

11133

11134

11135

11136

11137

11138

11139

1113A

1113B

1113C

1113D

1113E

1113F

11140

0

1

2

3

4

5

6

7

8

9

A

B

C

D

E

F

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-128

1113AChakma11100

1111D 69917 CHAKMA LETTER BAA= ubaramuyaa baa

1111E 69918 CHAKMA LETTER BHAA= ciraddaalyaa bhaa

1111F 69919 CHAKMA LETTER MAA= bugatpadalaa maa

11120 69920 CHAKMA LETTER YYAA= cimayyaa yyaa

11121 69921 CHAKMA LETTER YAA= jilyaa yaa

11122 69922 CHAKMA LETTER RAA= dvidaayyaa raa

11123 69923 CHAKMA LETTER LAA= talamuyaa laa

11124 69924 CHAKMA LETTER WAA= bajhonyaa waa

11125 69925 CHAKMA LETTER SAA= bhudibukyaa saa

11126 69926 CHAKMA LETTER HAA= ubaramuyaa haa

Dependent vowel signs11127 $69927 CHAKMA VOWEL SIGN A

= ubaratulyaa a

11128 $69928 CHAKMA VOWEL SIGN I= bahryaa i

11129 $69929 CHAKMA VOWEL SIGN II= baaniiphadaa ii

1112A $69930 CHAKMA VOWEL SIGN U= ekattaana u

1112B $69931 CHAKMA VOWEL SIGN UU= dvittaana uu

1112C $69932 CHAKMA VOWEL SIGN E= ekaara e

1112D $69933 CHAKMA VOWEL SIGN AI= delabhaanga ai

1112E $69934 CHAKMA VOWEL SIGN O= okaara o

1112F $69935 CHAKMA VOWEL SIGN AU= aukaara au

11130 $69936 CHAKMA VOWEL SIGN OI= oikaara oi

Various signs11131 69937 CHAKMA VIRAMA

bull used to form conjuncts

rarr 1039 myanmar sign virama

11132 $69938 CHAKMA MAAYYAA

bull killer

rarr 103A $ myanmar sign asat

11133 69939 CHAKMA DANDA= ekacilyaa

11134 69940 CHAKMA DOUBLE DANDA= dvicilyaa

11135 69941 CHAKMA QUESTION MARK= pujhaar

Digits11136 69942 CHAKMA DIGIT ZERO

11137 69943 CHAKMA DIGIT ONE

11138 69944 CHAKMA DIGIT TWO

11139 69945 CHAKMA DIGIT THREE

1113A 69946 CHAKMA DIGIT FOUR

Various signs11100 $69888 CHAKMA SIGN CANDRABINDU

= caanaphupudaa

11101 $69889 CHAKMA SIGN ANUSVARA= ekaphudaa

11102 $69890 CHAKMA SIGN VISARGA= dviphudaa

Independent vowels11103 69891 CHAKMA LETTER AA

= pichapujhaa aa

11104 69892 CHAKMA LETTER I= delabhaangagaa i

11105 69893 CHAKMA LETTER U= bacacu u

11106 69894 CHAKMA LETTER E= lejaubaa e

Consonants11107 69895 CHAKMA LETTER KAA

= cucyaangyaa kaa

11108 69896 CHAKMA LETTER KHAA= grajaangyaa khaa

11109 69897 CHAKMA LETTER GAA= caandyaa gaa

1110A 69898 CHAKMA LETTER GHAA= tinaddaalyaa ghaa

1110B 69899 CHAKMA LETTER NGAA= cilaama ngaa

1110C 69900 CHAKMA LETTER CAA= dvibhalyaa caa

1110D 69901 CHAKMA LETTER CHAA= majaraa chaa

1110E 69902 CHAKMA LETTER JAA= dvipadalaa haa

1110F 69903 CHAKMA LETTER JHAA= uraauraa jhaa

11110 69904 CHAKMA LETTER NYAA= silaacyaa nyaa

11111 69905 CHAKMA LETTER TTAA= dviyaadaat ttaa

11112 69906 CHAKMA LETTER TTHAA= phudaadviyaat tthaa

11113 69907 CHAKMA LETTER DDAA= aadudaangaat ddaa

11114 69908 CHAKMA LETTER DDHAA= lejabharaat ddhaa

11115 69909 CHAKMA LETTER NNAA= pettttuyaa nnaa

11116 69910 CHAKMA LETTER TAA= ghangadaat taa

11117 69911 CHAKMA LETTER THAA= jagadaat thaa

11118 69912 CHAKMA LETTER DAA= dolaniit daa

11119 69913 CHAKMA LETTER DHAA= talamuyaat dhaa

1111A 69914 CHAKMA LETTER NAA= phaarabaanyaa naa

1111B 69915 CHAKMA LETTER PAA= paalyaa paa

1111C 69916 CHAKMA LETTER PHAA= ubaraphudaa phaa

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-12 9

11140Chakma1113B

1113B 69947 CHAKMA DIGIT FIVE

1113C 69948 CHAKMA DIGIT SIX

1113D 69949 CHAKMA DIGIT SEVEN

1113E 69950 CHAKMA DIGIT EIGHT

1113F 69951 CHAKMA DIGIT NINE

Punctuation11140 69952 CHAKMA PHULACIHR

= section sign

Figure 1 Chakma chart from Griersonrsquos Linguistic Survey of India 1903

10

Figure 2 Charts taken from a paper written by Mr Sugata Chakma of the Tribal Cultural Institute onldquothe Primary classification of languagesrdquo

Figure 3 Example of poetry from the book Phagadāṅ 2004

11

Figure 4 Alphabet chart from Khisa 2001

12

Figure 5 Chart of vowel signs and conjuncts from Khisa 2001

13

Figure 6 Chart with old-style conjuncts from Cāṅmā 1982

14

Figure 7 Chart with punctuation and digits from Cāṅmā 1982

15

A Administrative1 TitlePro po s al fo r enco di ng the Chakma s cri pt i n the UCS2 Requesterrsquos nameMi chael Ev ers o n3 Requester type (Member bodyLiaisonIndividual contribution)Indi v i dual co ntri buti o n4 Submission date2 0 0 8 -0 8 -2 85 Requesterrsquos reference (if applicable)6 Choose one of the following6a This is a complete proposalNo 6b More information will be provided laterYes

B Technical ndash General1 Choose one of the following1a This proposal is for a new script (set of characters)Yes 1b Proposed name of scriptChakma1c The proposal is for addition of character(s) to an existing blockNo 1d Name of the existing block2 Number of characters in proposal6 3 3 Proposed category (A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols)Categ o ry A4a Is a repertoire including character names providedYes 4b If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP documentYes 4c Are the character shapes attached in a legible form suitable for reviewYes 5a Who will provide the appropriate computerized font (ordered preference True Type or PostScript format) for publishing thestandardMi chael Ev ers o n5b If available now identify source(s) for the font (include address e-mail ftp-site etc) and indicate the tools usedMi chael Ev ers o n Fo nto g rapher6a Are references (to other character sets dictionaries descriptive texts etc) providedYes 6b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attachedYes 7 Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searchingindexing transliteration etc (if yes please enclose information)Yes 8 Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assistin correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties areCasing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contextsCompatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorgfor such informat ion on o ther scrip ts Also see Unicode Character Database h t tp www unicode org Publ icUNIDATAUnicodeCharacterDatabasehtml and associated Unicode Technical Reports for information needed for consideration by the UnicodeTechnical Committee for inclusion in the Unicode StandardSee abo v e

C Technical ndash Justification1 Has this proposal for addition of character(s) been submitted before If YES explainNo 2a Has contact been made to members of the user community (for example National Body user groups of the script or charactersother experts etc)Yes 2b If YES with whom2c If YES available relevant documents3 Information on the user community for the proposed characters (for example size demographics information technology use orpublishing use) is includedPeo pl e l i v i ng i n Bang l ades h and i n Indi a

16

4a The context of use for the proposed characters (type of use common or rare)Co mmo n4b Reference5a Are the proposed characters in current use by the user communityYes 5b If YES whereIn Bang l ades h and i n Indi a6a After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMPYes 6b If YES is a rationale providedYes 6c If YES referenceCo ntempo rary us e and acco rdance wi th the Ro admap7 Should the proposed characters be kept together in a contiguous range (rather than being scattered)Yes 8a Can any of the proposed characters be considered a presentation form of an existing character or character sequenceNo 8b If YES is a rationale for its inclusion provided8c If YES reference9a Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposedcharactersNo 9b If YES is a rationale for its inclusion provided9c If YES reference10a Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing characterNo 10b If YES is a rationale for its inclusion provided10c If YES reference11a Does the proposal include use of combining characters andor use of composite sequences (see clauses 412 and 414 in ISOIEC10646-1 2000)No 11b If YES is a rationale for such use provided11c If YES reference11d Is a list of composite sequences and their corresponding glyph images (graphic symbols) providedNo 11e If YES reference12a Does the proposal contain characters with any special properties such as control function or similar semanticsNo 12b If YES describe in detail (include attachment if necessary)13a Does the proposal contain any Ideographic compatibility character(s)No 13b If YES is the equivalent corresponding unified ideographic character(s) identified

17

Page 6: JTC1/SC2/WG2 L2/09-xxx - Evertype N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set ... 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh

11118CHAKMA LETTER DAALo0LN11119CHAKMA LETTER DHAALo0LN1111ACHAKMA LETTER NAALo0LN1111BCHAKMA LETTER PAALo0LN1111CCHAKMA LETTER PHAALo0LN1111DCHAKMA LETTER BAALo0LN1111ECHAKMA LETTER BHAALo0LN1111FCHAKMA LETTER MAALo0LN11120CHAKMA LETTER YYAALo0LN11121CHAKMA LETTER YAALo0LN11122CHAKMA LETTER RAALo0LN11123CHAKMA LETTER LAALo0LN11124CHAKMA LETTER WAALo0LN11125CHAKMA LETTER SAALo0LN11126CHAKMA LETTER HAALo0LN11127CHAKMA VOWEL SIGN AMn0NSMN11128CHAKMA VOWEL SIGN IMn0NSMN11129CHAKMA VOWEL SIGN IIMn0NSMN1112ACHAKMA VOWEL SIGN UMn0NSMN1112BCHAKMA VOWEL SIGN UUMn0NSMN1112CCHAKMA VOWEL SIGN EMc0LN1112DCHAKMA VOWEL SIGN AIMn0NSMN1112ECHAKMA VOWEL SIGN OMn0NSMN1112FCHAKMA VOWEL SIGN AUMn0NSMN11130CHAKMA VOWEL SIGN OIMn0NSMN11131CHAKMA VIRAMAMn9NSMN11132CHAKMA MAAYYAAMn0NSMN11133CHAKMA DANDAPo0LN11134CHAKMA DOUBLE DANDAPo0LN11135CHAKMA QUESTION MARKPo0LN11136CHAKMA DIGIT ZERONd0L000N11137CHAKMA DIGIT ONENd0L111N11138CHAKMA DIGIT TWONd0L222N11139CHAKMA DIGIT THREENd0L333N1113ACHAKMA DIGIT FOURNd0L444N1113BCHAKMA DIGIT FIVENd0L555N1113CCHAKMA DIGIT SIXNd0L666N1113DCHAKMA DIGIT SEVENNd0L777N1113ECHAKMA DIGIT EIGHTNd0L888N1113FCHAKMA DIGIT NINENd0L999N11140CHAKMA PHULACIHRPo0LN

11 BibliographyCāṅmā Cirajyoti and Maṅgal Cāṅgmā 1982 Cāṅmār āg pudhi (Chakma primer) Rāṅamāṭi

Cāṅmābhāṣā Prakāśanā Pariṣad Khisa Bhagadatta 2001 Cāṅmā pattham pāt = Chakma primer Rāṅamāṭi Tribal Cultural Institute

(TCI)Singā 2004 Phagadāṅ

12 Acknowledgements This project was made possible in part by a grant from the US NationalEndowment for the Humanities which funded the Universal Scripts Project (part of the Script EncodingInitiative at UC Berkeley) in respect of the Chakma encoding Any views findings conclusions orrecommendations expressed in this publication do not necessarily reflect those of the NationalEndowment of the Humanities

6

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-12 7

1114FChakma11100

1110 1111 1112 1113 1114

$69888

$ 69889

$ 69890

69891

69892

69893

69894

69895

69896

69897

69898

69899

69900

69901

69902

69903

69904

69905

69906

69907

69908

69909

69910

69911

69912

69913

69914

69915

69916

69917

69918

69919

69920

69921

69922

69923

69924

69925

69926

$ 69927

$ 69928

$69929

$ 69930

$ 69931

$ 69932

$ 69933

$ 69934

$ 69935

$69936

69937

$69938

69939

69940

69941

69942

69943

69944

69945

69946

69947

69948

69949

69950

69951

6995211100

11101

11102

11103

11104

11105

11106

11107

11108

11109

1110A

1110B

1110C

1110D

1110E

1110F

11110

11111

11112

11113

11114

11115

11116

11117

11118

11119

1111A

1111B

1111C

1111D

1111E

1111F

11120

11121

11122

11123

11124

11125

11126

11127

11128

11129

1112A

1112B

1112C

1112D

1112E

1112F

11130

11131

11132

11133

11134

11135

11136

11137

11138

11139

1113A

1113B

1113C

1113D

1113E

1113F

11140

0

1

2

3

4

5

6

7

8

9

A

B

C

D

E

F

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-128

1113AChakma11100

1111D 69917 CHAKMA LETTER BAA= ubaramuyaa baa

1111E 69918 CHAKMA LETTER BHAA= ciraddaalyaa bhaa

1111F 69919 CHAKMA LETTER MAA= bugatpadalaa maa

11120 69920 CHAKMA LETTER YYAA= cimayyaa yyaa

11121 69921 CHAKMA LETTER YAA= jilyaa yaa

11122 69922 CHAKMA LETTER RAA= dvidaayyaa raa

11123 69923 CHAKMA LETTER LAA= talamuyaa laa

11124 69924 CHAKMA LETTER WAA= bajhonyaa waa

11125 69925 CHAKMA LETTER SAA= bhudibukyaa saa

11126 69926 CHAKMA LETTER HAA= ubaramuyaa haa

Dependent vowel signs11127 $69927 CHAKMA VOWEL SIGN A

= ubaratulyaa a

11128 $69928 CHAKMA VOWEL SIGN I= bahryaa i

11129 $69929 CHAKMA VOWEL SIGN II= baaniiphadaa ii

1112A $69930 CHAKMA VOWEL SIGN U= ekattaana u

1112B $69931 CHAKMA VOWEL SIGN UU= dvittaana uu

1112C $69932 CHAKMA VOWEL SIGN E= ekaara e

1112D $69933 CHAKMA VOWEL SIGN AI= delabhaanga ai

1112E $69934 CHAKMA VOWEL SIGN O= okaara o

1112F $69935 CHAKMA VOWEL SIGN AU= aukaara au

11130 $69936 CHAKMA VOWEL SIGN OI= oikaara oi

Various signs11131 69937 CHAKMA VIRAMA

bull used to form conjuncts

rarr 1039 myanmar sign virama

11132 $69938 CHAKMA MAAYYAA

bull killer

rarr 103A $ myanmar sign asat

11133 69939 CHAKMA DANDA= ekacilyaa

11134 69940 CHAKMA DOUBLE DANDA= dvicilyaa

11135 69941 CHAKMA QUESTION MARK= pujhaar

Digits11136 69942 CHAKMA DIGIT ZERO

11137 69943 CHAKMA DIGIT ONE

11138 69944 CHAKMA DIGIT TWO

11139 69945 CHAKMA DIGIT THREE

1113A 69946 CHAKMA DIGIT FOUR

Various signs11100 $69888 CHAKMA SIGN CANDRABINDU

= caanaphupudaa

11101 $69889 CHAKMA SIGN ANUSVARA= ekaphudaa

11102 $69890 CHAKMA SIGN VISARGA= dviphudaa

Independent vowels11103 69891 CHAKMA LETTER AA

= pichapujhaa aa

11104 69892 CHAKMA LETTER I= delabhaangagaa i

11105 69893 CHAKMA LETTER U= bacacu u

11106 69894 CHAKMA LETTER E= lejaubaa e

Consonants11107 69895 CHAKMA LETTER KAA

= cucyaangyaa kaa

11108 69896 CHAKMA LETTER KHAA= grajaangyaa khaa

11109 69897 CHAKMA LETTER GAA= caandyaa gaa

1110A 69898 CHAKMA LETTER GHAA= tinaddaalyaa ghaa

1110B 69899 CHAKMA LETTER NGAA= cilaama ngaa

1110C 69900 CHAKMA LETTER CAA= dvibhalyaa caa

1110D 69901 CHAKMA LETTER CHAA= majaraa chaa

1110E 69902 CHAKMA LETTER JAA= dvipadalaa haa

1110F 69903 CHAKMA LETTER JHAA= uraauraa jhaa

11110 69904 CHAKMA LETTER NYAA= silaacyaa nyaa

11111 69905 CHAKMA LETTER TTAA= dviyaadaat ttaa

11112 69906 CHAKMA LETTER TTHAA= phudaadviyaat tthaa

11113 69907 CHAKMA LETTER DDAA= aadudaangaat ddaa

11114 69908 CHAKMA LETTER DDHAA= lejabharaat ddhaa

11115 69909 CHAKMA LETTER NNAA= pettttuyaa nnaa

11116 69910 CHAKMA LETTER TAA= ghangadaat taa

11117 69911 CHAKMA LETTER THAA= jagadaat thaa

11118 69912 CHAKMA LETTER DAA= dolaniit daa

11119 69913 CHAKMA LETTER DHAA= talamuyaat dhaa

1111A 69914 CHAKMA LETTER NAA= phaarabaanyaa naa

1111B 69915 CHAKMA LETTER PAA= paalyaa paa

1111C 69916 CHAKMA LETTER PHAA= ubaraphudaa phaa

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-12 9

11140Chakma1113B

1113B 69947 CHAKMA DIGIT FIVE

1113C 69948 CHAKMA DIGIT SIX

1113D 69949 CHAKMA DIGIT SEVEN

1113E 69950 CHAKMA DIGIT EIGHT

1113F 69951 CHAKMA DIGIT NINE

Punctuation11140 69952 CHAKMA PHULACIHR

= section sign

Figure 1 Chakma chart from Griersonrsquos Linguistic Survey of India 1903

10

Figure 2 Charts taken from a paper written by Mr Sugata Chakma of the Tribal Cultural Institute onldquothe Primary classification of languagesrdquo

Figure 3 Example of poetry from the book Phagadāṅ 2004

11

Figure 4 Alphabet chart from Khisa 2001

12

Figure 5 Chart of vowel signs and conjuncts from Khisa 2001

13

Figure 6 Chart with old-style conjuncts from Cāṅmā 1982

14

Figure 7 Chart with punctuation and digits from Cāṅmā 1982

15

A Administrative1 TitlePro po s al fo r enco di ng the Chakma s cri pt i n the UCS2 Requesterrsquos nameMi chael Ev ers o n3 Requester type (Member bodyLiaisonIndividual contribution)Indi v i dual co ntri buti o n4 Submission date2 0 0 8 -0 8 -2 85 Requesterrsquos reference (if applicable)6 Choose one of the following6a This is a complete proposalNo 6b More information will be provided laterYes

B Technical ndash General1 Choose one of the following1a This proposal is for a new script (set of characters)Yes 1b Proposed name of scriptChakma1c The proposal is for addition of character(s) to an existing blockNo 1d Name of the existing block2 Number of characters in proposal6 3 3 Proposed category (A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols)Categ o ry A4a Is a repertoire including character names providedYes 4b If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP documentYes 4c Are the character shapes attached in a legible form suitable for reviewYes 5a Who will provide the appropriate computerized font (ordered preference True Type or PostScript format) for publishing thestandardMi chael Ev ers o n5b If available now identify source(s) for the font (include address e-mail ftp-site etc) and indicate the tools usedMi chael Ev ers o n Fo nto g rapher6a Are references (to other character sets dictionaries descriptive texts etc) providedYes 6b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attachedYes 7 Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searchingindexing transliteration etc (if yes please enclose information)Yes 8 Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assistin correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties areCasing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contextsCompatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorgfor such informat ion on o ther scrip ts Also see Unicode Character Database h t tp www unicode org Publ icUNIDATAUnicodeCharacterDatabasehtml and associated Unicode Technical Reports for information needed for consideration by the UnicodeTechnical Committee for inclusion in the Unicode StandardSee abo v e

C Technical ndash Justification1 Has this proposal for addition of character(s) been submitted before If YES explainNo 2a Has contact been made to members of the user community (for example National Body user groups of the script or charactersother experts etc)Yes 2b If YES with whom2c If YES available relevant documents3 Information on the user community for the proposed characters (for example size demographics information technology use orpublishing use) is includedPeo pl e l i v i ng i n Bang l ades h and i n Indi a

16

4a The context of use for the proposed characters (type of use common or rare)Co mmo n4b Reference5a Are the proposed characters in current use by the user communityYes 5b If YES whereIn Bang l ades h and i n Indi a6a After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMPYes 6b If YES is a rationale providedYes 6c If YES referenceCo ntempo rary us e and acco rdance wi th the Ro admap7 Should the proposed characters be kept together in a contiguous range (rather than being scattered)Yes 8a Can any of the proposed characters be considered a presentation form of an existing character or character sequenceNo 8b If YES is a rationale for its inclusion provided8c If YES reference9a Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposedcharactersNo 9b If YES is a rationale for its inclusion provided9c If YES reference10a Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing characterNo 10b If YES is a rationale for its inclusion provided10c If YES reference11a Does the proposal include use of combining characters andor use of composite sequences (see clauses 412 and 414 in ISOIEC10646-1 2000)No 11b If YES is a rationale for such use provided11c If YES reference11d Is a list of composite sequences and their corresponding glyph images (graphic symbols) providedNo 11e If YES reference12a Does the proposal contain characters with any special properties such as control function or similar semanticsNo 12b If YES describe in detail (include attachment if necessary)13a Does the proposal contain any Ideographic compatibility character(s)No 13b If YES is the equivalent corresponding unified ideographic character(s) identified

17

Page 7: JTC1/SC2/WG2 L2/09-xxx - Evertype N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set ... 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-12 7

1114FChakma11100

1110 1111 1112 1113 1114

$69888

$ 69889

$ 69890

69891

69892

69893

69894

69895

69896

69897

69898

69899

69900

69901

69902

69903

69904

69905

69906

69907

69908

69909

69910

69911

69912

69913

69914

69915

69916

69917

69918

69919

69920

69921

69922

69923

69924

69925

69926

$ 69927

$ 69928

$69929

$ 69930

$ 69931

$ 69932

$ 69933

$ 69934

$ 69935

$69936

69937

$69938

69939

69940

69941

69942

69943

69944

69945

69946

69947

69948

69949

69950

69951

6995211100

11101

11102

11103

11104

11105

11106

11107

11108

11109

1110A

1110B

1110C

1110D

1110E

1110F

11110

11111

11112

11113

11114

11115

11116

11117

11118

11119

1111A

1111B

1111C

1111D

1111E

1111F

11120

11121

11122

11123

11124

11125

11126

11127

11128

11129

1112A

1112B

1112C

1112D

1112E

1112F

11130

11131

11132

11133

11134

11135

11136

11137

11138

11139

1113A

1113B

1113C

1113D

1113E

1113F

11140

0

1

2

3

4

5

6

7

8

9

A

B

C

D

E

F

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-128

1113AChakma11100

1111D 69917 CHAKMA LETTER BAA= ubaramuyaa baa

1111E 69918 CHAKMA LETTER BHAA= ciraddaalyaa bhaa

1111F 69919 CHAKMA LETTER MAA= bugatpadalaa maa

11120 69920 CHAKMA LETTER YYAA= cimayyaa yyaa

11121 69921 CHAKMA LETTER YAA= jilyaa yaa

11122 69922 CHAKMA LETTER RAA= dvidaayyaa raa

11123 69923 CHAKMA LETTER LAA= talamuyaa laa

11124 69924 CHAKMA LETTER WAA= bajhonyaa waa

11125 69925 CHAKMA LETTER SAA= bhudibukyaa saa

11126 69926 CHAKMA LETTER HAA= ubaramuyaa haa

Dependent vowel signs11127 $69927 CHAKMA VOWEL SIGN A

= ubaratulyaa a

11128 $69928 CHAKMA VOWEL SIGN I= bahryaa i

11129 $69929 CHAKMA VOWEL SIGN II= baaniiphadaa ii

1112A $69930 CHAKMA VOWEL SIGN U= ekattaana u

1112B $69931 CHAKMA VOWEL SIGN UU= dvittaana uu

1112C $69932 CHAKMA VOWEL SIGN E= ekaara e

1112D $69933 CHAKMA VOWEL SIGN AI= delabhaanga ai

1112E $69934 CHAKMA VOWEL SIGN O= okaara o

1112F $69935 CHAKMA VOWEL SIGN AU= aukaara au

11130 $69936 CHAKMA VOWEL SIGN OI= oikaara oi

Various signs11131 69937 CHAKMA VIRAMA

bull used to form conjuncts

rarr 1039 myanmar sign virama

11132 $69938 CHAKMA MAAYYAA

bull killer

rarr 103A $ myanmar sign asat

11133 69939 CHAKMA DANDA= ekacilyaa

11134 69940 CHAKMA DOUBLE DANDA= dvicilyaa

11135 69941 CHAKMA QUESTION MARK= pujhaar

Digits11136 69942 CHAKMA DIGIT ZERO

11137 69943 CHAKMA DIGIT ONE

11138 69944 CHAKMA DIGIT TWO

11139 69945 CHAKMA DIGIT THREE

1113A 69946 CHAKMA DIGIT FOUR

Various signs11100 $69888 CHAKMA SIGN CANDRABINDU

= caanaphupudaa

11101 $69889 CHAKMA SIGN ANUSVARA= ekaphudaa

11102 $69890 CHAKMA SIGN VISARGA= dviphudaa

Independent vowels11103 69891 CHAKMA LETTER AA

= pichapujhaa aa

11104 69892 CHAKMA LETTER I= delabhaangagaa i

11105 69893 CHAKMA LETTER U= bacacu u

11106 69894 CHAKMA LETTER E= lejaubaa e

Consonants11107 69895 CHAKMA LETTER KAA

= cucyaangyaa kaa

11108 69896 CHAKMA LETTER KHAA= grajaangyaa khaa

11109 69897 CHAKMA LETTER GAA= caandyaa gaa

1110A 69898 CHAKMA LETTER GHAA= tinaddaalyaa ghaa

1110B 69899 CHAKMA LETTER NGAA= cilaama ngaa

1110C 69900 CHAKMA LETTER CAA= dvibhalyaa caa

1110D 69901 CHAKMA LETTER CHAA= majaraa chaa

1110E 69902 CHAKMA LETTER JAA= dvipadalaa haa

1110F 69903 CHAKMA LETTER JHAA= uraauraa jhaa

11110 69904 CHAKMA LETTER NYAA= silaacyaa nyaa

11111 69905 CHAKMA LETTER TTAA= dviyaadaat ttaa

11112 69906 CHAKMA LETTER TTHAA= phudaadviyaat tthaa

11113 69907 CHAKMA LETTER DDAA= aadudaangaat ddaa

11114 69908 CHAKMA LETTER DDHAA= lejabharaat ddhaa

11115 69909 CHAKMA LETTER NNAA= pettttuyaa nnaa

11116 69910 CHAKMA LETTER TAA= ghangadaat taa

11117 69911 CHAKMA LETTER THAA= jagadaat thaa

11118 69912 CHAKMA LETTER DAA= dolaniit daa

11119 69913 CHAKMA LETTER DHAA= talamuyaat dhaa

1111A 69914 CHAKMA LETTER NAA= phaarabaanyaa naa

1111B 69915 CHAKMA LETTER PAA= paalyaa paa

1111C 69916 CHAKMA LETTER PHAA= ubaraphudaa phaa

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-12 9

11140Chakma1113B

1113B 69947 CHAKMA DIGIT FIVE

1113C 69948 CHAKMA DIGIT SIX

1113D 69949 CHAKMA DIGIT SEVEN

1113E 69950 CHAKMA DIGIT EIGHT

1113F 69951 CHAKMA DIGIT NINE

Punctuation11140 69952 CHAKMA PHULACIHR

= section sign

Figure 1 Chakma chart from Griersonrsquos Linguistic Survey of India 1903

10

Figure 2 Charts taken from a paper written by Mr Sugata Chakma of the Tribal Cultural Institute onldquothe Primary classification of languagesrdquo

Figure 3 Example of poetry from the book Phagadāṅ 2004

11

Figure 4 Alphabet chart from Khisa 2001

12

Figure 5 Chart of vowel signs and conjuncts from Khisa 2001

13

Figure 6 Chart with old-style conjuncts from Cāṅmā 1982

14

Figure 7 Chart with punctuation and digits from Cāṅmā 1982

15

A Administrative1 TitlePro po s al fo r enco di ng the Chakma s cri pt i n the UCS2 Requesterrsquos nameMi chael Ev ers o n3 Requester type (Member bodyLiaisonIndividual contribution)Indi v i dual co ntri buti o n4 Submission date2 0 0 8 -0 8 -2 85 Requesterrsquos reference (if applicable)6 Choose one of the following6a This is a complete proposalNo 6b More information will be provided laterYes

B Technical ndash General1 Choose one of the following1a This proposal is for a new script (set of characters)Yes 1b Proposed name of scriptChakma1c The proposal is for addition of character(s) to an existing blockNo 1d Name of the existing block2 Number of characters in proposal6 3 3 Proposed category (A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols)Categ o ry A4a Is a repertoire including character names providedYes 4b If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP documentYes 4c Are the character shapes attached in a legible form suitable for reviewYes 5a Who will provide the appropriate computerized font (ordered preference True Type or PostScript format) for publishing thestandardMi chael Ev ers o n5b If available now identify source(s) for the font (include address e-mail ftp-site etc) and indicate the tools usedMi chael Ev ers o n Fo nto g rapher6a Are references (to other character sets dictionaries descriptive texts etc) providedYes 6b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attachedYes 7 Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searchingindexing transliteration etc (if yes please enclose information)Yes 8 Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assistin correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties areCasing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contextsCompatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorgfor such informat ion on o ther scrip ts Also see Unicode Character Database h t tp www unicode org Publ icUNIDATAUnicodeCharacterDatabasehtml and associated Unicode Technical Reports for information needed for consideration by the UnicodeTechnical Committee for inclusion in the Unicode StandardSee abo v e

C Technical ndash Justification1 Has this proposal for addition of character(s) been submitted before If YES explainNo 2a Has contact been made to members of the user community (for example National Body user groups of the script or charactersother experts etc)Yes 2b If YES with whom2c If YES available relevant documents3 Information on the user community for the proposed characters (for example size demographics information technology use orpublishing use) is includedPeo pl e l i v i ng i n Bang l ades h and i n Indi a

16

4a The context of use for the proposed characters (type of use common or rare)Co mmo n4b Reference5a Are the proposed characters in current use by the user communityYes 5b If YES whereIn Bang l ades h and i n Indi a6a After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMPYes 6b If YES is a rationale providedYes 6c If YES referenceCo ntempo rary us e and acco rdance wi th the Ro admap7 Should the proposed characters be kept together in a contiguous range (rather than being scattered)Yes 8a Can any of the proposed characters be considered a presentation form of an existing character or character sequenceNo 8b If YES is a rationale for its inclusion provided8c If YES reference9a Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposedcharactersNo 9b If YES is a rationale for its inclusion provided9c If YES reference10a Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing characterNo 10b If YES is a rationale for its inclusion provided10c If YES reference11a Does the proposal include use of combining characters andor use of composite sequences (see clauses 412 and 414 in ISOIEC10646-1 2000)No 11b If YES is a rationale for such use provided11c If YES reference11d Is a list of composite sequences and their corresponding glyph images (graphic symbols) providedNo 11e If YES reference12a Does the proposal contain characters with any special properties such as control function or similar semanticsNo 12b If YES describe in detail (include attachment if necessary)13a Does the proposal contain any Ideographic compatibility character(s)No 13b If YES is the equivalent corresponding unified ideographic character(s) identified

17

Page 8: JTC1/SC2/WG2 L2/09-xxx - Evertype N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set ... 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-128

1113AChakma11100

1111D 69917 CHAKMA LETTER BAA= ubaramuyaa baa

1111E 69918 CHAKMA LETTER BHAA= ciraddaalyaa bhaa

1111F 69919 CHAKMA LETTER MAA= bugatpadalaa maa

11120 69920 CHAKMA LETTER YYAA= cimayyaa yyaa

11121 69921 CHAKMA LETTER YAA= jilyaa yaa

11122 69922 CHAKMA LETTER RAA= dvidaayyaa raa

11123 69923 CHAKMA LETTER LAA= talamuyaa laa

11124 69924 CHAKMA LETTER WAA= bajhonyaa waa

11125 69925 CHAKMA LETTER SAA= bhudibukyaa saa

11126 69926 CHAKMA LETTER HAA= ubaramuyaa haa

Dependent vowel signs11127 $69927 CHAKMA VOWEL SIGN A

= ubaratulyaa a

11128 $69928 CHAKMA VOWEL SIGN I= bahryaa i

11129 $69929 CHAKMA VOWEL SIGN II= baaniiphadaa ii

1112A $69930 CHAKMA VOWEL SIGN U= ekattaana u

1112B $69931 CHAKMA VOWEL SIGN UU= dvittaana uu

1112C $69932 CHAKMA VOWEL SIGN E= ekaara e

1112D $69933 CHAKMA VOWEL SIGN AI= delabhaanga ai

1112E $69934 CHAKMA VOWEL SIGN O= okaara o

1112F $69935 CHAKMA VOWEL SIGN AU= aukaara au

11130 $69936 CHAKMA VOWEL SIGN OI= oikaara oi

Various signs11131 69937 CHAKMA VIRAMA

bull used to form conjuncts

rarr 1039 myanmar sign virama

11132 $69938 CHAKMA MAAYYAA

bull killer

rarr 103A $ myanmar sign asat

11133 69939 CHAKMA DANDA= ekacilyaa

11134 69940 CHAKMA DOUBLE DANDA= dvicilyaa

11135 69941 CHAKMA QUESTION MARK= pujhaar

Digits11136 69942 CHAKMA DIGIT ZERO

11137 69943 CHAKMA DIGIT ONE

11138 69944 CHAKMA DIGIT TWO

11139 69945 CHAKMA DIGIT THREE

1113A 69946 CHAKMA DIGIT FOUR

Various signs11100 $69888 CHAKMA SIGN CANDRABINDU

= caanaphupudaa

11101 $69889 CHAKMA SIGN ANUSVARA= ekaphudaa

11102 $69890 CHAKMA SIGN VISARGA= dviphudaa

Independent vowels11103 69891 CHAKMA LETTER AA

= pichapujhaa aa

11104 69892 CHAKMA LETTER I= delabhaangagaa i

11105 69893 CHAKMA LETTER U= bacacu u

11106 69894 CHAKMA LETTER E= lejaubaa e

Consonants11107 69895 CHAKMA LETTER KAA

= cucyaangyaa kaa

11108 69896 CHAKMA LETTER KHAA= grajaangyaa khaa

11109 69897 CHAKMA LETTER GAA= caandyaa gaa

1110A 69898 CHAKMA LETTER GHAA= tinaddaalyaa ghaa

1110B 69899 CHAKMA LETTER NGAA= cilaama ngaa

1110C 69900 CHAKMA LETTER CAA= dvibhalyaa caa

1110D 69901 CHAKMA LETTER CHAA= majaraa chaa

1110E 69902 CHAKMA LETTER JAA= dvipadalaa haa

1110F 69903 CHAKMA LETTER JHAA= uraauraa jhaa

11110 69904 CHAKMA LETTER NYAA= silaacyaa nyaa

11111 69905 CHAKMA LETTER TTAA= dviyaadaat ttaa

11112 69906 CHAKMA LETTER TTHAA= phudaadviyaat tthaa

11113 69907 CHAKMA LETTER DDAA= aadudaangaat ddaa

11114 69908 CHAKMA LETTER DDHAA= lejabharaat ddhaa

11115 69909 CHAKMA LETTER NNAA= pettttuyaa nnaa

11116 69910 CHAKMA LETTER TAA= ghangadaat taa

11117 69911 CHAKMA LETTER THAA= jagadaat thaa

11118 69912 CHAKMA LETTER DAA= dolaniit daa

11119 69913 CHAKMA LETTER DHAA= talamuyaat dhaa

1111A 69914 CHAKMA LETTER NAA= phaarabaanyaa naa

1111B 69915 CHAKMA LETTER PAA= paalyaa paa

1111C 69916 CHAKMA LETTER PHAA= ubaraphudaa phaa

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-12 9

11140Chakma1113B

1113B 69947 CHAKMA DIGIT FIVE

1113C 69948 CHAKMA DIGIT SIX

1113D 69949 CHAKMA DIGIT SEVEN

1113E 69950 CHAKMA DIGIT EIGHT

1113F 69951 CHAKMA DIGIT NINE

Punctuation11140 69952 CHAKMA PHULACIHR

= section sign

Figure 1 Chakma chart from Griersonrsquos Linguistic Survey of India 1903

10

Figure 2 Charts taken from a paper written by Mr Sugata Chakma of the Tribal Cultural Institute onldquothe Primary classification of languagesrdquo

Figure 3 Example of poetry from the book Phagadāṅ 2004

11

Figure 4 Alphabet chart from Khisa 2001

12

Figure 5 Chart of vowel signs and conjuncts from Khisa 2001

13

Figure 6 Chart with old-style conjuncts from Cāṅmā 1982

14

Figure 7 Chart with punctuation and digits from Cāṅmā 1982

15

A Administrative1 TitlePro po s al fo r enco di ng the Chakma s cri pt i n the UCS2 Requesterrsquos nameMi chael Ev ers o n3 Requester type (Member bodyLiaisonIndividual contribution)Indi v i dual co ntri buti o n4 Submission date2 0 0 8 -0 8 -2 85 Requesterrsquos reference (if applicable)6 Choose one of the following6a This is a complete proposalNo 6b More information will be provided laterYes

B Technical ndash General1 Choose one of the following1a This proposal is for a new script (set of characters)Yes 1b Proposed name of scriptChakma1c The proposal is for addition of character(s) to an existing blockNo 1d Name of the existing block2 Number of characters in proposal6 3 3 Proposed category (A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols)Categ o ry A4a Is a repertoire including character names providedYes 4b If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP documentYes 4c Are the character shapes attached in a legible form suitable for reviewYes 5a Who will provide the appropriate computerized font (ordered preference True Type or PostScript format) for publishing thestandardMi chael Ev ers o n5b If available now identify source(s) for the font (include address e-mail ftp-site etc) and indicate the tools usedMi chael Ev ers o n Fo nto g rapher6a Are references (to other character sets dictionaries descriptive texts etc) providedYes 6b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attachedYes 7 Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searchingindexing transliteration etc (if yes please enclose information)Yes 8 Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assistin correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties areCasing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contextsCompatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorgfor such informat ion on o ther scrip ts Also see Unicode Character Database h t tp www unicode org Publ icUNIDATAUnicodeCharacterDatabasehtml and associated Unicode Technical Reports for information needed for consideration by the UnicodeTechnical Committee for inclusion in the Unicode StandardSee abo v e

C Technical ndash Justification1 Has this proposal for addition of character(s) been submitted before If YES explainNo 2a Has contact been made to members of the user community (for example National Body user groups of the script or charactersother experts etc)Yes 2b If YES with whom2c If YES available relevant documents3 Information on the user community for the proposed characters (for example size demographics information technology use orpublishing use) is includedPeo pl e l i v i ng i n Bang l ades h and i n Indi a

16

4a The context of use for the proposed characters (type of use common or rare)Co mmo n4b Reference5a Are the proposed characters in current use by the user communityYes 5b If YES whereIn Bang l ades h and i n Indi a6a After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMPYes 6b If YES is a rationale providedYes 6c If YES referenceCo ntempo rary us e and acco rdance wi th the Ro admap7 Should the proposed characters be kept together in a contiguous range (rather than being scattered)Yes 8a Can any of the proposed characters be considered a presentation form of an existing character or character sequenceNo 8b If YES is a rationale for its inclusion provided8c If YES reference9a Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposedcharactersNo 9b If YES is a rationale for its inclusion provided9c If YES reference10a Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing characterNo 10b If YES is a rationale for its inclusion provided10c If YES reference11a Does the proposal include use of combining characters andor use of composite sequences (see clauses 412 and 414 in ISOIEC10646-1 2000)No 11b If YES is a rationale for such use provided11c If YES reference11d Is a list of composite sequences and their corresponding glyph images (graphic symbols) providedNo 11e If YES reference12a Does the proposal contain characters with any special properties such as control function or similar semanticsNo 12b If YES describe in detail (include attachment if necessary)13a Does the proposal contain any Ideographic compatibility character(s)No 13b If YES is the equivalent corresponding unified ideographic character(s) identified

17

Page 9: JTC1/SC2/WG2 L2/09-xxx - Evertype N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set ... 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh

Printed using UniBooktrade

(httpwwwunicodeorgunibook)

Date 2009-02-12 9

11140Chakma1113B

1113B 69947 CHAKMA DIGIT FIVE

1113C 69948 CHAKMA DIGIT SIX

1113D 69949 CHAKMA DIGIT SEVEN

1113E 69950 CHAKMA DIGIT EIGHT

1113F 69951 CHAKMA DIGIT NINE

Punctuation11140 69952 CHAKMA PHULACIHR

= section sign

Figure 1 Chakma chart from Griersonrsquos Linguistic Survey of India 1903

10

Figure 2 Charts taken from a paper written by Mr Sugata Chakma of the Tribal Cultural Institute onldquothe Primary classification of languagesrdquo

Figure 3 Example of poetry from the book Phagadāṅ 2004

11

Figure 4 Alphabet chart from Khisa 2001

12

Figure 5 Chart of vowel signs and conjuncts from Khisa 2001

13

Figure 6 Chart with old-style conjuncts from Cāṅmā 1982

14

Figure 7 Chart with punctuation and digits from Cāṅmā 1982

15

A Administrative1 TitlePro po s al fo r enco di ng the Chakma s cri pt i n the UCS2 Requesterrsquos nameMi chael Ev ers o n3 Requester type (Member bodyLiaisonIndividual contribution)Indi v i dual co ntri buti o n4 Submission date2 0 0 8 -0 8 -2 85 Requesterrsquos reference (if applicable)6 Choose one of the following6a This is a complete proposalNo 6b More information will be provided laterYes

B Technical ndash General1 Choose one of the following1a This proposal is for a new script (set of characters)Yes 1b Proposed name of scriptChakma1c The proposal is for addition of character(s) to an existing blockNo 1d Name of the existing block2 Number of characters in proposal6 3 3 Proposed category (A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols)Categ o ry A4a Is a repertoire including character names providedYes 4b If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP documentYes 4c Are the character shapes attached in a legible form suitable for reviewYes 5a Who will provide the appropriate computerized font (ordered preference True Type or PostScript format) for publishing thestandardMi chael Ev ers o n5b If available now identify source(s) for the font (include address e-mail ftp-site etc) and indicate the tools usedMi chael Ev ers o n Fo nto g rapher6a Are references (to other character sets dictionaries descriptive texts etc) providedYes 6b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attachedYes 7 Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searchingindexing transliteration etc (if yes please enclose information)Yes 8 Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assistin correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties areCasing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contextsCompatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorgfor such informat ion on o ther scrip ts Also see Unicode Character Database h t tp www unicode org Publ icUNIDATAUnicodeCharacterDatabasehtml and associated Unicode Technical Reports for information needed for consideration by the UnicodeTechnical Committee for inclusion in the Unicode StandardSee abo v e

C Technical ndash Justification1 Has this proposal for addition of character(s) been submitted before If YES explainNo 2a Has contact been made to members of the user community (for example National Body user groups of the script or charactersother experts etc)Yes 2b If YES with whom2c If YES available relevant documents3 Information on the user community for the proposed characters (for example size demographics information technology use orpublishing use) is includedPeo pl e l i v i ng i n Bang l ades h and i n Indi a

16

4a The context of use for the proposed characters (type of use common or rare)Co mmo n4b Reference5a Are the proposed characters in current use by the user communityYes 5b If YES whereIn Bang l ades h and i n Indi a6a After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMPYes 6b If YES is a rationale providedYes 6c If YES referenceCo ntempo rary us e and acco rdance wi th the Ro admap7 Should the proposed characters be kept together in a contiguous range (rather than being scattered)Yes 8a Can any of the proposed characters be considered a presentation form of an existing character or character sequenceNo 8b If YES is a rationale for its inclusion provided8c If YES reference9a Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposedcharactersNo 9b If YES is a rationale for its inclusion provided9c If YES reference10a Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing characterNo 10b If YES is a rationale for its inclusion provided10c If YES reference11a Does the proposal include use of combining characters andor use of composite sequences (see clauses 412 and 414 in ISOIEC10646-1 2000)No 11b If YES is a rationale for such use provided11c If YES reference11d Is a list of composite sequences and their corresponding glyph images (graphic symbols) providedNo 11e If YES reference12a Does the proposal contain characters with any special properties such as control function or similar semanticsNo 12b If YES describe in detail (include attachment if necessary)13a Does the proposal contain any Ideographic compatibility character(s)No 13b If YES is the equivalent corresponding unified ideographic character(s) identified

17

Page 10: JTC1/SC2/WG2 L2/09-xxx - Evertype N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set ... 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh

Figure 1 Chakma chart from Griersonrsquos Linguistic Survey of India 1903

10

Figure 2 Charts taken from a paper written by Mr Sugata Chakma of the Tribal Cultural Institute onldquothe Primary classification of languagesrdquo

Figure 3 Example of poetry from the book Phagadāṅ 2004

11

Figure 4 Alphabet chart from Khisa 2001

12

Figure 5 Chart of vowel signs and conjuncts from Khisa 2001

13

Figure 6 Chart with old-style conjuncts from Cāṅmā 1982

14

Figure 7 Chart with punctuation and digits from Cāṅmā 1982

15

A Administrative1 TitlePro po s al fo r enco di ng the Chakma s cri pt i n the UCS2 Requesterrsquos nameMi chael Ev ers o n3 Requester type (Member bodyLiaisonIndividual contribution)Indi v i dual co ntri buti o n4 Submission date2 0 0 8 -0 8 -2 85 Requesterrsquos reference (if applicable)6 Choose one of the following6a This is a complete proposalNo 6b More information will be provided laterYes

B Technical ndash General1 Choose one of the following1a This proposal is for a new script (set of characters)Yes 1b Proposed name of scriptChakma1c The proposal is for addition of character(s) to an existing blockNo 1d Name of the existing block2 Number of characters in proposal6 3 3 Proposed category (A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols)Categ o ry A4a Is a repertoire including character names providedYes 4b If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP documentYes 4c Are the character shapes attached in a legible form suitable for reviewYes 5a Who will provide the appropriate computerized font (ordered preference True Type or PostScript format) for publishing thestandardMi chael Ev ers o n5b If available now identify source(s) for the font (include address e-mail ftp-site etc) and indicate the tools usedMi chael Ev ers o n Fo nto g rapher6a Are references (to other character sets dictionaries descriptive texts etc) providedYes 6b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attachedYes 7 Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searchingindexing transliteration etc (if yes please enclose information)Yes 8 Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assistin correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties areCasing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contextsCompatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorgfor such informat ion on o ther scrip ts Also see Unicode Character Database h t tp www unicode org Publ icUNIDATAUnicodeCharacterDatabasehtml and associated Unicode Technical Reports for information needed for consideration by the UnicodeTechnical Committee for inclusion in the Unicode StandardSee abo v e

C Technical ndash Justification1 Has this proposal for addition of character(s) been submitted before If YES explainNo 2a Has contact been made to members of the user community (for example National Body user groups of the script or charactersother experts etc)Yes 2b If YES with whom2c If YES available relevant documents3 Information on the user community for the proposed characters (for example size demographics information technology use orpublishing use) is includedPeo pl e l i v i ng i n Bang l ades h and i n Indi a

16

4a The context of use for the proposed characters (type of use common or rare)Co mmo n4b Reference5a Are the proposed characters in current use by the user communityYes 5b If YES whereIn Bang l ades h and i n Indi a6a After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMPYes 6b If YES is a rationale providedYes 6c If YES referenceCo ntempo rary us e and acco rdance wi th the Ro admap7 Should the proposed characters be kept together in a contiguous range (rather than being scattered)Yes 8a Can any of the proposed characters be considered a presentation form of an existing character or character sequenceNo 8b If YES is a rationale for its inclusion provided8c If YES reference9a Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposedcharactersNo 9b If YES is a rationale for its inclusion provided9c If YES reference10a Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing characterNo 10b If YES is a rationale for its inclusion provided10c If YES reference11a Does the proposal include use of combining characters andor use of composite sequences (see clauses 412 and 414 in ISOIEC10646-1 2000)No 11b If YES is a rationale for such use provided11c If YES reference11d Is a list of composite sequences and their corresponding glyph images (graphic symbols) providedNo 11e If YES reference12a Does the proposal contain characters with any special properties such as control function or similar semanticsNo 12b If YES describe in detail (include attachment if necessary)13a Does the proposal contain any Ideographic compatibility character(s)No 13b If YES is the equivalent corresponding unified ideographic character(s) identified

17

Page 11: JTC1/SC2/WG2 L2/09-xxx - Evertype N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set ... 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh

Figure 2 Charts taken from a paper written by Mr Sugata Chakma of the Tribal Cultural Institute onldquothe Primary classification of languagesrdquo

Figure 3 Example of poetry from the book Phagadāṅ 2004

11

Figure 4 Alphabet chart from Khisa 2001

12

Figure 5 Chart of vowel signs and conjuncts from Khisa 2001

13

Figure 6 Chart with old-style conjuncts from Cāṅmā 1982

14

Figure 7 Chart with punctuation and digits from Cāṅmā 1982

15

A Administrative1 TitlePro po s al fo r enco di ng the Chakma s cri pt i n the UCS2 Requesterrsquos nameMi chael Ev ers o n3 Requester type (Member bodyLiaisonIndividual contribution)Indi v i dual co ntri buti o n4 Submission date2 0 0 8 -0 8 -2 85 Requesterrsquos reference (if applicable)6 Choose one of the following6a This is a complete proposalNo 6b More information will be provided laterYes

B Technical ndash General1 Choose one of the following1a This proposal is for a new script (set of characters)Yes 1b Proposed name of scriptChakma1c The proposal is for addition of character(s) to an existing blockNo 1d Name of the existing block2 Number of characters in proposal6 3 3 Proposed category (A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols)Categ o ry A4a Is a repertoire including character names providedYes 4b If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP documentYes 4c Are the character shapes attached in a legible form suitable for reviewYes 5a Who will provide the appropriate computerized font (ordered preference True Type or PostScript format) for publishing thestandardMi chael Ev ers o n5b If available now identify source(s) for the font (include address e-mail ftp-site etc) and indicate the tools usedMi chael Ev ers o n Fo nto g rapher6a Are references (to other character sets dictionaries descriptive texts etc) providedYes 6b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attachedYes 7 Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searchingindexing transliteration etc (if yes please enclose information)Yes 8 Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assistin correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties areCasing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contextsCompatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorgfor such informat ion on o ther scrip ts Also see Unicode Character Database h t tp www unicode org Publ icUNIDATAUnicodeCharacterDatabasehtml and associated Unicode Technical Reports for information needed for consideration by the UnicodeTechnical Committee for inclusion in the Unicode StandardSee abo v e

C Technical ndash Justification1 Has this proposal for addition of character(s) been submitted before If YES explainNo 2a Has contact been made to members of the user community (for example National Body user groups of the script or charactersother experts etc)Yes 2b If YES with whom2c If YES available relevant documents3 Information on the user community for the proposed characters (for example size demographics information technology use orpublishing use) is includedPeo pl e l i v i ng i n Bang l ades h and i n Indi a

16

4a The context of use for the proposed characters (type of use common or rare)Co mmo n4b Reference5a Are the proposed characters in current use by the user communityYes 5b If YES whereIn Bang l ades h and i n Indi a6a After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMPYes 6b If YES is a rationale providedYes 6c If YES referenceCo ntempo rary us e and acco rdance wi th the Ro admap7 Should the proposed characters be kept together in a contiguous range (rather than being scattered)Yes 8a Can any of the proposed characters be considered a presentation form of an existing character or character sequenceNo 8b If YES is a rationale for its inclusion provided8c If YES reference9a Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposedcharactersNo 9b If YES is a rationale for its inclusion provided9c If YES reference10a Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing characterNo 10b If YES is a rationale for its inclusion provided10c If YES reference11a Does the proposal include use of combining characters andor use of composite sequences (see clauses 412 and 414 in ISOIEC10646-1 2000)No 11b If YES is a rationale for such use provided11c If YES reference11d Is a list of composite sequences and their corresponding glyph images (graphic symbols) providedNo 11e If YES reference12a Does the proposal contain characters with any special properties such as control function or similar semanticsNo 12b If YES describe in detail (include attachment if necessary)13a Does the proposal contain any Ideographic compatibility character(s)No 13b If YES is the equivalent corresponding unified ideographic character(s) identified

17

Page 12: JTC1/SC2/WG2 L2/09-xxx - Evertype N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set ... 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh

Figure 4 Alphabet chart from Khisa 2001

12

Figure 5 Chart of vowel signs and conjuncts from Khisa 2001

13

Figure 6 Chart with old-style conjuncts from Cāṅmā 1982

14

Figure 7 Chart with punctuation and digits from Cāṅmā 1982

15

A Administrative1 TitlePro po s al fo r enco di ng the Chakma s cri pt i n the UCS2 Requesterrsquos nameMi chael Ev ers o n3 Requester type (Member bodyLiaisonIndividual contribution)Indi v i dual co ntri buti o n4 Submission date2 0 0 8 -0 8 -2 85 Requesterrsquos reference (if applicable)6 Choose one of the following6a This is a complete proposalNo 6b More information will be provided laterYes

B Technical ndash General1 Choose one of the following1a This proposal is for a new script (set of characters)Yes 1b Proposed name of scriptChakma1c The proposal is for addition of character(s) to an existing blockNo 1d Name of the existing block2 Number of characters in proposal6 3 3 Proposed category (A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols)Categ o ry A4a Is a repertoire including character names providedYes 4b If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP documentYes 4c Are the character shapes attached in a legible form suitable for reviewYes 5a Who will provide the appropriate computerized font (ordered preference True Type or PostScript format) for publishing thestandardMi chael Ev ers o n5b If available now identify source(s) for the font (include address e-mail ftp-site etc) and indicate the tools usedMi chael Ev ers o n Fo nto g rapher6a Are references (to other character sets dictionaries descriptive texts etc) providedYes 6b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attachedYes 7 Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searchingindexing transliteration etc (if yes please enclose information)Yes 8 Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assistin correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties areCasing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contextsCompatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorgfor such informat ion on o ther scrip ts Also see Unicode Character Database h t tp www unicode org Publ icUNIDATAUnicodeCharacterDatabasehtml and associated Unicode Technical Reports for information needed for consideration by the UnicodeTechnical Committee for inclusion in the Unicode StandardSee abo v e

C Technical ndash Justification1 Has this proposal for addition of character(s) been submitted before If YES explainNo 2a Has contact been made to members of the user community (for example National Body user groups of the script or charactersother experts etc)Yes 2b If YES with whom2c If YES available relevant documents3 Information on the user community for the proposed characters (for example size demographics information technology use orpublishing use) is includedPeo pl e l i v i ng i n Bang l ades h and i n Indi a

16

4a The context of use for the proposed characters (type of use common or rare)Co mmo n4b Reference5a Are the proposed characters in current use by the user communityYes 5b If YES whereIn Bang l ades h and i n Indi a6a After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMPYes 6b If YES is a rationale providedYes 6c If YES referenceCo ntempo rary us e and acco rdance wi th the Ro admap7 Should the proposed characters be kept together in a contiguous range (rather than being scattered)Yes 8a Can any of the proposed characters be considered a presentation form of an existing character or character sequenceNo 8b If YES is a rationale for its inclusion provided8c If YES reference9a Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposedcharactersNo 9b If YES is a rationale for its inclusion provided9c If YES reference10a Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing characterNo 10b If YES is a rationale for its inclusion provided10c If YES reference11a Does the proposal include use of combining characters andor use of composite sequences (see clauses 412 and 414 in ISOIEC10646-1 2000)No 11b If YES is a rationale for such use provided11c If YES reference11d Is a list of composite sequences and their corresponding glyph images (graphic symbols) providedNo 11e If YES reference12a Does the proposal contain characters with any special properties such as control function or similar semanticsNo 12b If YES describe in detail (include attachment if necessary)13a Does the proposal contain any Ideographic compatibility character(s)No 13b If YES is the equivalent corresponding unified ideographic character(s) identified

17

Page 13: JTC1/SC2/WG2 L2/09-xxx - Evertype N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set ... 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh

Figure 5 Chart of vowel signs and conjuncts from Khisa 2001

13

Figure 6 Chart with old-style conjuncts from Cāṅmā 1982

14

Figure 7 Chart with punctuation and digits from Cāṅmā 1982

15

A Administrative1 TitlePro po s al fo r enco di ng the Chakma s cri pt i n the UCS2 Requesterrsquos nameMi chael Ev ers o n3 Requester type (Member bodyLiaisonIndividual contribution)Indi v i dual co ntri buti o n4 Submission date2 0 0 8 -0 8 -2 85 Requesterrsquos reference (if applicable)6 Choose one of the following6a This is a complete proposalNo 6b More information will be provided laterYes

B Technical ndash General1 Choose one of the following1a This proposal is for a new script (set of characters)Yes 1b Proposed name of scriptChakma1c The proposal is for addition of character(s) to an existing blockNo 1d Name of the existing block2 Number of characters in proposal6 3 3 Proposed category (A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols)Categ o ry A4a Is a repertoire including character names providedYes 4b If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP documentYes 4c Are the character shapes attached in a legible form suitable for reviewYes 5a Who will provide the appropriate computerized font (ordered preference True Type or PostScript format) for publishing thestandardMi chael Ev ers o n5b If available now identify source(s) for the font (include address e-mail ftp-site etc) and indicate the tools usedMi chael Ev ers o n Fo nto g rapher6a Are references (to other character sets dictionaries descriptive texts etc) providedYes 6b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attachedYes 7 Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searchingindexing transliteration etc (if yes please enclose information)Yes 8 Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assistin correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties areCasing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contextsCompatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorgfor such informat ion on o ther scrip ts Also see Unicode Character Database h t tp www unicode org Publ icUNIDATAUnicodeCharacterDatabasehtml and associated Unicode Technical Reports for information needed for consideration by the UnicodeTechnical Committee for inclusion in the Unicode StandardSee abo v e

C Technical ndash Justification1 Has this proposal for addition of character(s) been submitted before If YES explainNo 2a Has contact been made to members of the user community (for example National Body user groups of the script or charactersother experts etc)Yes 2b If YES with whom2c If YES available relevant documents3 Information on the user community for the proposed characters (for example size demographics information technology use orpublishing use) is includedPeo pl e l i v i ng i n Bang l ades h and i n Indi a

16

4a The context of use for the proposed characters (type of use common or rare)Co mmo n4b Reference5a Are the proposed characters in current use by the user communityYes 5b If YES whereIn Bang l ades h and i n Indi a6a After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMPYes 6b If YES is a rationale providedYes 6c If YES referenceCo ntempo rary us e and acco rdance wi th the Ro admap7 Should the proposed characters be kept together in a contiguous range (rather than being scattered)Yes 8a Can any of the proposed characters be considered a presentation form of an existing character or character sequenceNo 8b If YES is a rationale for its inclusion provided8c If YES reference9a Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposedcharactersNo 9b If YES is a rationale for its inclusion provided9c If YES reference10a Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing characterNo 10b If YES is a rationale for its inclusion provided10c If YES reference11a Does the proposal include use of combining characters andor use of composite sequences (see clauses 412 and 414 in ISOIEC10646-1 2000)No 11b If YES is a rationale for such use provided11c If YES reference11d Is a list of composite sequences and their corresponding glyph images (graphic symbols) providedNo 11e If YES reference12a Does the proposal contain characters with any special properties such as control function or similar semanticsNo 12b If YES describe in detail (include attachment if necessary)13a Does the proposal contain any Ideographic compatibility character(s)No 13b If YES is the equivalent corresponding unified ideographic character(s) identified

17

Page 14: JTC1/SC2/WG2 L2/09-xxx - Evertype N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set ... 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh

Figure 6 Chart with old-style conjuncts from Cāṅmā 1982

14

Figure 7 Chart with punctuation and digits from Cāṅmā 1982

15

A Administrative1 TitlePro po s al fo r enco di ng the Chakma s cri pt i n the UCS2 Requesterrsquos nameMi chael Ev ers o n3 Requester type (Member bodyLiaisonIndividual contribution)Indi v i dual co ntri buti o n4 Submission date2 0 0 8 -0 8 -2 85 Requesterrsquos reference (if applicable)6 Choose one of the following6a This is a complete proposalNo 6b More information will be provided laterYes

B Technical ndash General1 Choose one of the following1a This proposal is for a new script (set of characters)Yes 1b Proposed name of scriptChakma1c The proposal is for addition of character(s) to an existing blockNo 1d Name of the existing block2 Number of characters in proposal6 3 3 Proposed category (A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols)Categ o ry A4a Is a repertoire including character names providedYes 4b If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP documentYes 4c Are the character shapes attached in a legible form suitable for reviewYes 5a Who will provide the appropriate computerized font (ordered preference True Type or PostScript format) for publishing thestandardMi chael Ev ers o n5b If available now identify source(s) for the font (include address e-mail ftp-site etc) and indicate the tools usedMi chael Ev ers o n Fo nto g rapher6a Are references (to other character sets dictionaries descriptive texts etc) providedYes 6b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attachedYes 7 Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searchingindexing transliteration etc (if yes please enclose information)Yes 8 Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assistin correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties areCasing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contextsCompatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorgfor such informat ion on o ther scrip ts Also see Unicode Character Database h t tp www unicode org Publ icUNIDATAUnicodeCharacterDatabasehtml and associated Unicode Technical Reports for information needed for consideration by the UnicodeTechnical Committee for inclusion in the Unicode StandardSee abo v e

C Technical ndash Justification1 Has this proposal for addition of character(s) been submitted before If YES explainNo 2a Has contact been made to members of the user community (for example National Body user groups of the script or charactersother experts etc)Yes 2b If YES with whom2c If YES available relevant documents3 Information on the user community for the proposed characters (for example size demographics information technology use orpublishing use) is includedPeo pl e l i v i ng i n Bang l ades h and i n Indi a

16

4a The context of use for the proposed characters (type of use common or rare)Co mmo n4b Reference5a Are the proposed characters in current use by the user communityYes 5b If YES whereIn Bang l ades h and i n Indi a6a After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMPYes 6b If YES is a rationale providedYes 6c If YES referenceCo ntempo rary us e and acco rdance wi th the Ro admap7 Should the proposed characters be kept together in a contiguous range (rather than being scattered)Yes 8a Can any of the proposed characters be considered a presentation form of an existing character or character sequenceNo 8b If YES is a rationale for its inclusion provided8c If YES reference9a Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposedcharactersNo 9b If YES is a rationale for its inclusion provided9c If YES reference10a Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing characterNo 10b If YES is a rationale for its inclusion provided10c If YES reference11a Does the proposal include use of combining characters andor use of composite sequences (see clauses 412 and 414 in ISOIEC10646-1 2000)No 11b If YES is a rationale for such use provided11c If YES reference11d Is a list of composite sequences and their corresponding glyph images (graphic symbols) providedNo 11e If YES reference12a Does the proposal contain characters with any special properties such as control function or similar semanticsNo 12b If YES describe in detail (include attachment if necessary)13a Does the proposal contain any Ideographic compatibility character(s)No 13b If YES is the equivalent corresponding unified ideographic character(s) identified

17

Page 15: JTC1/SC2/WG2 L2/09-xxx - Evertype N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set ... 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh

Figure 7 Chart with punctuation and digits from Cāṅmā 1982

15

A Administrative1 TitlePro po s al fo r enco di ng the Chakma s cri pt i n the UCS2 Requesterrsquos nameMi chael Ev ers o n3 Requester type (Member bodyLiaisonIndividual contribution)Indi v i dual co ntri buti o n4 Submission date2 0 0 8 -0 8 -2 85 Requesterrsquos reference (if applicable)6 Choose one of the following6a This is a complete proposalNo 6b More information will be provided laterYes

B Technical ndash General1 Choose one of the following1a This proposal is for a new script (set of characters)Yes 1b Proposed name of scriptChakma1c The proposal is for addition of character(s) to an existing blockNo 1d Name of the existing block2 Number of characters in proposal6 3 3 Proposed category (A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols)Categ o ry A4a Is a repertoire including character names providedYes 4b If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP documentYes 4c Are the character shapes attached in a legible form suitable for reviewYes 5a Who will provide the appropriate computerized font (ordered preference True Type or PostScript format) for publishing thestandardMi chael Ev ers o n5b If available now identify source(s) for the font (include address e-mail ftp-site etc) and indicate the tools usedMi chael Ev ers o n Fo nto g rapher6a Are references (to other character sets dictionaries descriptive texts etc) providedYes 6b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attachedYes 7 Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searchingindexing transliteration etc (if yes please enclose information)Yes 8 Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assistin correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties areCasing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contextsCompatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorgfor such informat ion on o ther scrip ts Also see Unicode Character Database h t tp www unicode org Publ icUNIDATAUnicodeCharacterDatabasehtml and associated Unicode Technical Reports for information needed for consideration by the UnicodeTechnical Committee for inclusion in the Unicode StandardSee abo v e

C Technical ndash Justification1 Has this proposal for addition of character(s) been submitted before If YES explainNo 2a Has contact been made to members of the user community (for example National Body user groups of the script or charactersother experts etc)Yes 2b If YES with whom2c If YES available relevant documents3 Information on the user community for the proposed characters (for example size demographics information technology use orpublishing use) is includedPeo pl e l i v i ng i n Bang l ades h and i n Indi a

16

4a The context of use for the proposed characters (type of use common or rare)Co mmo n4b Reference5a Are the proposed characters in current use by the user communityYes 5b If YES whereIn Bang l ades h and i n Indi a6a After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMPYes 6b If YES is a rationale providedYes 6c If YES referenceCo ntempo rary us e and acco rdance wi th the Ro admap7 Should the proposed characters be kept together in a contiguous range (rather than being scattered)Yes 8a Can any of the proposed characters be considered a presentation form of an existing character or character sequenceNo 8b If YES is a rationale for its inclusion provided8c If YES reference9a Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposedcharactersNo 9b If YES is a rationale for its inclusion provided9c If YES reference10a Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing characterNo 10b If YES is a rationale for its inclusion provided10c If YES reference11a Does the proposal include use of combining characters andor use of composite sequences (see clauses 412 and 414 in ISOIEC10646-1 2000)No 11b If YES is a rationale for such use provided11c If YES reference11d Is a list of composite sequences and their corresponding glyph images (graphic symbols) providedNo 11e If YES reference12a Does the proposal contain characters with any special properties such as control function or similar semanticsNo 12b If YES describe in detail (include attachment if necessary)13a Does the proposal contain any Ideographic compatibility character(s)No 13b If YES is the equivalent corresponding unified ideographic character(s) identified

17

Page 16: JTC1/SC2/WG2 L2/09-xxx - Evertype N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set ... 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh

A Administrative1 TitlePro po s al fo r enco di ng the Chakma s cri pt i n the UCS2 Requesterrsquos nameMi chael Ev ers o n3 Requester type (Member bodyLiaisonIndividual contribution)Indi v i dual co ntri buti o n4 Submission date2 0 0 8 -0 8 -2 85 Requesterrsquos reference (if applicable)6 Choose one of the following6a This is a complete proposalNo 6b More information will be provided laterYes

B Technical ndash General1 Choose one of the following1a This proposal is for a new script (set of characters)Yes 1b Proposed name of scriptChakma1c The proposal is for addition of character(s) to an existing blockNo 1d Name of the existing block2 Number of characters in proposal6 3 3 Proposed category (A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols)Categ o ry A4a Is a repertoire including character names providedYes 4b If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP documentYes 4c Are the character shapes attached in a legible form suitable for reviewYes 5a Who will provide the appropriate computerized font (ordered preference True Type or PostScript format) for publishing thestandardMi chael Ev ers o n5b If available now identify source(s) for the font (include address e-mail ftp-site etc) and indicate the tools usedMi chael Ev ers o n Fo nto g rapher6a Are references (to other character sets dictionaries descriptive texts etc) providedYes 6b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attachedYes 7 Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searchingindexing transliteration etc (if yes please enclose information)Yes 8 Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assistin correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties areCasing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contextsCompatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorgfor such informat ion on o ther scrip ts Also see Unicode Character Database h t tp www unicode org Publ icUNIDATAUnicodeCharacterDatabasehtml and associated Unicode Technical Reports for information needed for consideration by the UnicodeTechnical Committee for inclusion in the Unicode StandardSee abo v e

C Technical ndash Justification1 Has this proposal for addition of character(s) been submitted before If YES explainNo 2a Has contact been made to members of the user community (for example National Body user groups of the script or charactersother experts etc)Yes 2b If YES with whom2c If YES available relevant documents3 Information on the user community for the proposed characters (for example size demographics information technology use orpublishing use) is includedPeo pl e l i v i ng i n Bang l ades h and i n Indi a

16

4a The context of use for the proposed characters (type of use common or rare)Co mmo n4b Reference5a Are the proposed characters in current use by the user communityYes 5b If YES whereIn Bang l ades h and i n Indi a6a After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMPYes 6b If YES is a rationale providedYes 6c If YES referenceCo ntempo rary us e and acco rdance wi th the Ro admap7 Should the proposed characters be kept together in a contiguous range (rather than being scattered)Yes 8a Can any of the proposed characters be considered a presentation form of an existing character or character sequenceNo 8b If YES is a rationale for its inclusion provided8c If YES reference9a Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposedcharactersNo 9b If YES is a rationale for its inclusion provided9c If YES reference10a Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing characterNo 10b If YES is a rationale for its inclusion provided10c If YES reference11a Does the proposal include use of combining characters andor use of composite sequences (see clauses 412 and 414 in ISOIEC10646-1 2000)No 11b If YES is a rationale for such use provided11c If YES reference11d Is a list of composite sequences and their corresponding glyph images (graphic symbols) providedNo 11e If YES reference12a Does the proposal contain characters with any special properties such as control function or similar semanticsNo 12b If YES describe in detail (include attachment if necessary)13a Does the proposal contain any Ideographic compatibility character(s)No 13b If YES is the equivalent corresponding unified ideographic character(s) identified

17

Page 17: JTC1/SC2/WG2 L2/09-xxx - Evertype N3xxx L2/09-xxx 2009-02-12 Universal Multiple-Octet Coded Character Set ... 176,000 in India in Mizoram, Assam, Tripura, and Arunachal Pradesh

4a The context of use for the proposed characters (type of use common or rare)Co mmo n4b Reference5a Are the proposed characters in current use by the user communityYes 5b If YES whereIn Bang l ades h and i n Indi a6a After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMPYes 6b If YES is a rationale providedYes 6c If YES referenceCo ntempo rary us e and acco rdance wi th the Ro admap7 Should the proposed characters be kept together in a contiguous range (rather than being scattered)Yes 8a Can any of the proposed characters be considered a presentation form of an existing character or character sequenceNo 8b If YES is a rationale for its inclusion provided8c If YES reference9a Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposedcharactersNo 9b If YES is a rationale for its inclusion provided9c If YES reference10a Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing characterNo 10b If YES is a rationale for its inclusion provided10c If YES reference11a Does the proposal include use of combining characters andor use of composite sequences (see clauses 412 and 414 in ISOIEC10646-1 2000)No 11b If YES is a rationale for such use provided11c If YES reference11d Is a list of composite sequences and their corresponding glyph images (graphic symbols) providedNo 11e If YES reference12a Does the proposal contain characters with any special properties such as control function or similar semanticsNo 12b If YES describe in detail (include attachment if necessary)13a Does the proposal contain any Ideographic compatibility character(s)No 13b If YES is the equivalent corresponding unified ideographic character(s) identified

17