Jump to content

Indian Script Code for Information Interchange

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by UTF-8 (talk | contribs) at 16:41, 29 June 2014 (Remove a dead link. It contained a few lines of uninformative text.). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Template:IndicText Indian Standard Code for Information Interchange (ISCII) is a coding scheme for representing various writing systems of India. It encodes the main Indic scripts and a Roman transliteration. The supported scripts are: Assamese, Bengali (Bengla), Devanagari, Gujarati, Gurmukhi, Kannada, Malayalam, Oriya, Tamil, and Telugu. ISCII does not encode the writing systems of India based on Arabic, but its writing system switching codes nonetheless provide for Kashmiri, Sindhi, Urdu, Persian, Pashto and Arabic. The Arabic-based writing systems were subsequently encoded in the PASCII encoding.

The Brahmi-derived writing systems are mostly rather similar in structure, but have different letter shapes. So ISCII encodes letters with the same phonetic value at the same codepoint, overlaying the various scripts. For example, the ISCII codes 0xB3 0xDB represent [ki]. This will be rendered as कि in Devanagari, as ਕਿ in Gurmukhi, and as கி in Tamil. The writing system can be selected in rich text by markup or in plain text by means of the ATR code described below.

One motivation for the use of a single encoding is the idea that it will allow easy transliteration from one writing system to another. However, there are enough incompatibilities that this is not really a practical idea. See About ISCII.

ISCII is an 8-bit encoding. The lower 128 codepoints are plain ASCII, the upper 128 codepoints are ISCII-specific. In addition to the codepoints representing characters, ISCII makes use of a codepoint with mnemonic ATR that indicates that the following byte contains one of two kinds of information. One set of values changes the writing system until the next writing system indicator or end-of-line. Another set of values select display modes such as bold and italic. ISCII does not provide a means of indicating the default writing system.

ISCII has not been widely used outside of certain government institutions and has now been rendered largely obsolete by Unicode. Unicode uses a separate block for each Indic writing system, and largely preserves the ISCII layout within each block.

Codepage layout

The following table shows the character set for Devanagari. The code sets for Assamese, Bengali, Gujarati, Gurmukhi, Kannada, Malayalam, Oriya, Tamil, and Telugu are similar, with each Devanagari form replaced by the equivalent form in each writing system. Each character is shown with its decimal code and its Unicode equivalent.

ISCII Devanagari
—0 —1 —2 —3 —4 —5 —6 —7 —8 —9 —A —B —C —D —E —F
0_ Template:Chset-color-ctrl|NUL
0000
0
Template:Chset-color-ctrl|SOH
0001
1
Template:Chset-color-ctrl|STX
0002
2
Template:Chset-color-ctrl|ETX
0003
3
Template:Chset-color-ctrl|EOT
0004
4
Template:Chset-color-ctrl|ENQ
0005
5
Template:Chset-color-ctrl|ACK
0006
6
Template:Chset-color-ctrl|BEL
0007
7
Template:Chset-color-ctrl|BS
0008
8
Template:Chset-color-ctrl|HT
0009
9
Template:Chset-color-ctrl|LF
000A
10
Template:Chset-color-ctrl|VT
000B
11
Template:Chset-color-ctrl|FF
000C
12
Template:Chset-color-ctrl|CR
000D
13
Template:Chset-color-ctrl|SO
000E
14
Template:Chset-color-ctrl|SI
000F
15
1_ Template:Chset-color-ctrl|DLE
0010
16
Template:Chset-color-ctrl|DC1
0011
17
Template:Chset-color-ctrl|DC2
0012
18
Template:Chset-color-ctrl|DC3
0013
19
Template:Chset-color-ctrl|DC4
0014
20
Template:Chset-color-ctrl|NAK
0015
21
Template:Chset-color-ctrl|SYN
0016
22
Template:Chset-color-ctrl|ETB
0017
23
Template:Chset-color-ctrl|CAN
0018
24
Template:Chset-color-ctrl|EM
0019
25
Template:Chset-color-ctrl|SUB
001A
26
Template:Chset-color-ctrl|ESC
001B
27
Template:Chset-color-ctrl|FS
001C
28
Template:Chset-color-ctrl|GS
001D
29
Template:Chset-color-ctrl|RS
001E
30
Template:Chset-color-ctrl|US
001F
31
2_ Template:Chset-color-punct|SP
0020
32
Template:Chset-color-punct|!
0021
33
Template:Chset-color-punct|"
0022
34
Template:Chset-color-punct|#
0023
35
Template:Chset-color-punct|$
0024
36
Template:Chset-color-punct|%
0025
37
Template:Chset-color-punct|&
0026
38
Template:Chset-color-punct|'
0027
39
Template:Chset-color-punct|(
0028
40
Template:Chset-color-punct|)
0029
41
Template:Chset-color-punct|*
002A
42
Template:Chset-color-punct|+
002B
43
Template:Chset-color-punct|,
002C
44
Template:Chset-color-punct|-
002D
45
Template:Chset-color-punct|.
002E
46
Template:Chset-color-punct|/
002F
47
3_ Template:Chset-color-digit|0
0030
48
Template:Chset-color-digit|1
0031
49
Template:Chset-color-digit|2
0032
50
Template:Chset-color-digit|3
0033
51
Template:Chset-color-digit|4
0034
52
Template:Chset-color-digit|5
0035
53
Template:Chset-color-digit|6
0036
54
Template:Chset-color-digit|7
0037
55
Template:Chset-color-digit|8
0038
56
Template:Chset-color-digit|9
0039
57
Template:Chset-color-punct|:
003A
58
Template:Chset-color-punct|;
003B
59
Template:Chset-color-punct|<
003C
60
Template:Chset-color-punct|=
003D
61
Template:Chset-color-punct|>
003E
62
Template:Chset-color-punct|?
003F
63
4_ Template:Chset-color-punct|@
0040
64
Template:Chset-color-alpha|A
0041
65
Template:Chset-color-alpha|B
0042
66
Template:Chset-color-alpha|C
0043
67
Template:Chset-color-alpha|D
0044
68
Template:Chset-color-alpha|E
0045
69
Template:Chset-color-alpha|F
0046
70
Template:Chset-color-alpha|G
0047
71
Template:Chset-color-alpha|H
0048
72
Template:Chset-color-alpha|I
0049
73
Template:Chset-color-alpha|J
004A
74
Template:Chset-color-alpha|K
004B
75
Template:Chset-color-alpha|L
004C
76
Template:Chset-color-alpha|M
004D
77
Template:Chset-color-alpha|N
004E
78
Template:Chset-color-alpha|O
004F
79
5_ Template:Chset-color-alpha|P
0050
80
Template:Chset-color-alpha|Q
0051
81
Template:Chset-color-alpha|R
0052
82
Template:Chset-color-alpha|S
0053
83
Template:Chset-color-alpha|T
0054
84
Template:Chset-color-alpha|U
0055
85
Template:Chset-color-alpha|V
0056
86
Template:Chset-color-alpha|W
0057
87
Template:Chset-color-alpha|X
0058
88
Template:Chset-color-alpha|Y
0059
89
Template:Chset-color-alpha|Z
005A
90
Template:Chset-color-punct|[
005B
91
Template:Chset-color-punct|\
005C
92
Template:Chset-color-punct|]
005D
93
Template:Chset-color-punct|^
005E
94
Template:Chset-color-punct|_
005F
95
6_ Template:Chset-color-punct|`
0060
96
Template:Chset-color-alpha|a
0061
97
Template:Chset-color-alpha|b
0062
98
Template:Chset-color-alpha|c
0063
99
Template:Chset-color-alpha|d
0064
100
Template:Chset-color-alpha|e
0065
101
Template:Chset-color-alpha|f
0066
102
Template:Chset-color-alpha|g
0067
103
Template:Chset-color-alpha|h
0068
104
Template:Chset-color-alpha|i
0069
105
Template:Chset-color-alpha|j
006A
106
Template:Chset-color-alpha|k
006B
107
Template:Chset-color-alpha|l
006C
108
Template:Chset-color-alpha|m
006D
109
Template:Chset-color-alpha|n
006E
110
Template:Chset-color-alpha|o
006F
111
7_ Template:Chset-color-alpha|p
0070
112
Template:Chset-color-alpha|q
0071
113
Template:Chset-color-alpha|r
0072
114
Template:Chset-color-alpha|s
0073
115
Template:Chset-color-alpha|t
0074
116
Template:Chset-color-alpha|u
0075
117
Template:Chset-color-alpha|v
0076
118
Template:Chset-color-alpha|w
0077
119
Template:Chset-color-alpha|x
0078
120
Template:Chset-color-alpha|y
0079
121
Template:Chset-color-alpha|z
007A
122
Template:Chset-color-punct|{
007B
123
Template:Chset-color-punct||
007C
124
Template:Chset-color-punct|}
007D
125
Template:Chset-color-punct|~
007E
126
Template:Chset-color-ctrl |DEL
007F
127
8_ Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef|
9_ Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef|
A_ Template:Chset-color-undef| Template:Chset-color-intl |
0901
161
Template:Chset-color-intl |
0902
162
Template:Chset-color-intl |
0903
163
Template:Chset-color-intl |
0905
164
Template:Chset-color-intl |
0906
165
Template:Chset-color-intl |
0907
166
Template:Chset-color-intl |
0908
167
Template:Chset-color-intl |
0909
168
Template:Chset-color-intl |
090A
169
Template:Chset-color-intl |
090B
170
Template:Chset-color-intl |
090E
171
Template:Chset-color-intl |
090F
172
Template:Chset-color-intl |
0910
173
Template:Chset-color-intl |
090D
174
Template:Chset-color-intl |
0912
175
B_ Template:Chset-color-intl |
0913
176
Template:Chset-color-intl |
0914
177
Template:Chset-color-intl |
0911
178
Template:Chset-color-intl |
0915
179
Template:Chset-color-intl |
0916
180
Template:Chset-color-intl |
0917
181
Template:Chset-color-intl |
0918
182
Template:Chset-color-intl |
0919
183
Template:Chset-color-intl |
091A
184
Template:Chset-color-intl |
091B
185
Template:Chset-color-intl |
091C
186
Template:Chset-color-intl |
091D
187
Template:Chset-color-intl |
091E
188
Template:Chset-color-intl |
091F
189
Template:Chset-color-intl |
0920
190
Template:Chset-color-intl |
0921
191
C_ Template:Chset-color-intl |
0922
192
Template:Chset-color-intl |
0923
193
Template:Chset-color-intl |
0924
194
Template:Chset-color-intl |
0925
195
Template:Chset-color-intl |
0926
196
Template:Chset-color-intl |
0927
197
Template:Chset-color-intl |
0928
198
Template:Chset-color-intl |
0929
199
Template:Chset-color-intl |
092A
200
Template:Chset-color-intl |
092B
201
Template:Chset-color-intl |
092C
202
Template:Chset-color-intl |
092D
203
Template:Chset-color-intl |
092E
204
Template:Chset-color-intl |
092F
207
Template:Chset-color-intl |
095F
206
Template:Chset-color-intl |
0930
205
D_ Template:Chset-color-intl |
0931
208
Template:Chset-color-intl |
0932
209
Template:Chset-color-intl |
0933
210
Template:Chset-color-intl |
0934
211
Template:Chset-color-intl |
0935
212
Template:Chset-color-intl |
0936
213
Template:Chset-color-intl |
0937
214
Template:Chset-color-intl |
0938
215
Template:Chset-color-intl |
0939
216
Template:Chset-color-ctrl |INV
 
217
Template:Chset-color-intl |
093E
218
Template:Chset-color-intl |ि
093F
219
Template:Chset-color-intl |
0940
220
Template:Chset-color-intl |
0941
221
Template:Chset-color-intl |
0942
222
Template:Chset-color-intl |
0943
223
E_ Template:Chset-color-intl |
0946
224
Template:Chset-color-intl |
0947
225
Template:Chset-color-intl |
0948
226
Template:Chset-color-intl |
0945
227
Template:Chset-color-intl |
094A
228
Template:Chset-color-intl |
094B
229
Template:Chset-color-intl |
094C
230
Template:Chset-color-intl |
0949
231
Template:Chset-color-intl |
094D
232
Template:Chset-color-intl |
093C
233
Template:Chset-color-intl |
0964
234
Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-ctrl |ATR
 
239
F_ Template:Chset-color-ctrl |EXT
 
240
Template:Chset-color-intl |
0966
241
Template:Chset-color-intl |
0967
242
Template:Chset-color-intl |
0968
243
Template:Chset-color-intl |
0969
244
Template:Chset-color-intl |
096A
245
Template:Chset-color-intl |
096B
246
Template:Chset-color-intl |
096C
247
Template:Chset-color-intl |
096D
248
Template:Chset-color-intl |
096E
249
Template:Chset-color-intl |
096F
250
Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef|

Special code points

ii

INV character—code point D9 (217)
The INV character is used as a pseudo-consonant to display combining elements in isolation. For example, क (ka) + ् (halant) + INV = क्‍ (half ka). The Unicode equivalent is no break space 00A0 or dotted circle ◌ 25CC.
ATR character—code point EF (239)
The ATR character followed by a byte code is used to switch to a different font attribute (such as bold) or language (such as Bengali), up to the next ATR sequence or the end of the line. This has no direct Unicode equivalent, as font attributes are not part of Unicode, and each script has a distinct set of code points.
EXT character—code point F0 (240)
The EXT character followed by a byte code indicates a Vedic accent. This has no direct Unicode equivalent, as Vedic accents are assigned to distinct code points.
Halant character ़—code point E8 (232)
The halant character removes the implicit vowel from a consonant and is used between consonants to represent conjunct consonants. For example, क (ka) + ् (halant) + त (ta) = क्त (kta). The sequence ् (halant) + ् (halant) displays a conjunct with an explicit halant, for example क (ka) + ् (halant) + ् (halant) + त (ta) = क्‌त. The sequence ् (halant) + ़ (nukta) displays a conjunct with half consonants, if available, for example क (ka) + ् (halant) + ़ (nukta) + त (ta) = क्‍त.
ISCII Unicode
single halant E8 halant 094D
halant + halant E8 E8 halant + ZWNJ 094D 200C
halant + nukta E8 E9 halant + ZWJ 094D 200D
Nukta character ़—code point E9 (233)
The nukta character after another ISCII character is used for a number of rarer characters which don't exist in the main ISCII set. For example क (ka) + ़ (nukta) = क़ (qa). These characters have precomposed forms in Unicode, as shown in the following table.
ISCII
code point
Original
character
Character
with nukta
Unicode
code point
A1 (161) 0950
A6 (166) 090C
A7 (167) 0961
AA (176) 0960
B3 (179) क़ 0958
B4 (180) ख़ 0959
B5 (181) ग़ 095A
BA (186) ज़ 095B
BF (191) ड़ 095C
C0 (192) ढ़ 095D
C9 (201) फ़ 095E
DB (219) ि 0962
DC (220) 0963
DF (223) 0944
EA (234) 093D

Code pages for ISCII conversion

To convert from Unicode (UTF-8) to an ISCII / ANSI coding, the following code pages may be used:

 * 57002 Devanagari (Hindi, Marathi, Sanskrit, Konkani)
 * 57003 Bengali
 * 57004 Tamil
 * 57005 Telugu
 * 57006 Assamese (same as Bengali)
 * 57007 Oriya
 * 57008 Kannada
 * 57009 Malayalam
 * 57010 Gujarati
 * 57011 Punjabi (Gurmukhi)

Code points for all languages

Each alphabet is listed in the order of its ISCII code point. Code points with asterisks (*) indicate the code point followed by nukta, e.g. क (ka) + ़ = क़ (qa); इ (i) + ़ = ऌ (ḷ). Each character is listed along with its Unicode code point.