Unicode is a widely used international standard for encoding characters of all languages and symbols of all types. It is governed by a non-profit organization, the Unicode Consortium. See the Wikipedia article on Unicode for more information.
Unicode is now the standard way to store Devanagari, and virtually every other alphabet, on computers. Devanagari is the traditional alphabet for writing Sanskrit and other Indian languages. Before Unicode became popular, around 2000, there were many other schemes for storing Devanagari on computers. These schemes and the software that used them are now largely obsolete.
Unicode is based on characters, not sounds. Devanagari Unicode has a code for each Devanagari letter, according to Panini’s conception of a letter. Since the relationship between characters and sounds is more complex in Devanagari than in English, Unicode fonts that include Devanagari must have correct characters and information about combining simple characters into compound characters. In addition, the software you use has to be able to handle this information correctly.
Most recent software usually handles Unicode properly, but older software often does not. In Windows, for instance, there is only a small amount of Unicode support in Windows 95 and Office 97, but a very high level of Unicode support in Windows XP and Office 2003.
| पवित्र | The text at left is the Sanskrit word pavitra in Devanagari Unicode. If this displays as question marks or small boxes containing tiny letters or numbers, then your operating system has not yet been enabled to display or type Devanagari Unicode characters. If it displays as Devanagari, but the short i vowel mark is to the right of the va instead of the left, or the last syllable is written ta-halant-ra instead of conjunct tra, then your operating system can display Unicode but does not have complex script support and thus cannot render Devanagari correctly. |
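To see what is actually stored for this word, regardless of how it is drawn, one option is to list its code points. The following is a minimal sketch using Python's standard `unicodedata` module. It shows that the short i vowel sign is stored after va even though it is displayed to its left, and that the conjunct tra is stored as ta, virama, ra.

```python
import unicodedata

# pavitra, written with escapes so the stored (logical) order is explicit
word = "\u092A\u0935\u093F\u0924\u094D\u0930"

for ch in word:
    # Print each code in the x0000 notation used on this page, with its Unicode name
    print(f"x{ord(ch):04X}  {unicodedata.name(ch)}")
```

If the word renders incorrectly on screen but this listing is as expected, the problem lies in font or complex script support rather than in the stored text.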
The Wikipedia article on Indic language support describes how to enable almost any operating system to display and type Devanagari Unicode correctly.
In Windows, if you need to type in a small amount of Unicode text, some software supports the Alt-X method for entering a Unicode character: Type the hexadecimal code of the desired Unicode character, and then type Alt-X. For instance, to enter अ, type 0905 and then Alt-X. In OS X, a similar technique can be used by activating and selecting the Unicode Hex Input keyboard. See below for a chart of Devanagari Unicode characters and their hexadecimal codes.
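The same hexadecimal-code-to-character lookup can be done in a script. This is a small illustrative sketch in Python; the function name `char_from_hex` is made up for this example, and it accepts codes with or without the leading "x" used on this page.

```python
def char_from_hex(code: str) -> str:
    """Return the character for a hexadecimal code such as '0905' or 'x0905'."""
    # Drop an optional leading 'x', then interpret the rest as base 16
    return chr(int(code.lstrip("xX"), 16))

print(char_from_hex("0905"))  # the Devanagari letter a
```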
AksharaPad • AksharaPad is a simple word processing program for typing Unicode Devanāgarī and IAST transliteration with a Western keyboard.
Akshara Bridge • Akshara Bridge converts between several kinds of text that represent Devanāgarī and transliterations of Devanāgarī, including Unicode and non-Unicode.
Itranslator 2003 • omkarananda-ashram.org/Sanskrit/itranslator2003.htm • Free software for writing Devanagari with an English keyboard. Includes a high quality Unicode Devanagari font.
Open Office (.org) • openoffice.org • A free, open source alternative to Microsoft Office. Open Office can read and modify any file created with Microsoft Office, and create its own. It can convert any Office file to PDF, including Unicode documents.
Praja font • A Unicode Devanagari font developed by Dr. Peter Freund.
VedaPad and VedaType • VedaPad is a system for typing non-Unicode Devanāgarī on Windows computers. VedaType is a companion product for Macintosh computers.
Itranslator 99 • omkarananda-ashram.org/Sanskrit/itranslator99.htm • Free software for writing Devanagari with an English keyboard. Includes a high quality non-Unicode Devanagari font.
VedaTeX • VedaTeX is a system for using the fonts and keystroke transliteration of VedaPad with TeX. VedaTeX has special corrections for the VedaPad fonts that improve the way these fonts display compared to other word processing software.
Devanagari for TeX • sarovar.org/projects/devnag • This package allows you to typeset high-quality Devanāgarī text with TeX and LaTeX systems. The package supports Sanskrit, Hindi, Marathi, Nepali, and other languages typically printed in the Devanāgarī script.
The following chart shows samples of Devanagari Unicode in some common Unicode fonts. Each sample shows the Devanagari characters, the keystrokes that represent the characters in VedaPad, and numeric codes that represent the characters. These codes are given in hexadecimal notation, as indicated by the “x” before the digits.
| Font | Devanagari | VedaPad keystrokes | Codes |
|---|---|---|---|
| Default font | अग्निं ईळे | agniM ILe | x0905, x0917, x094D, x0928, x093F, x0902; x0908, x0933, x0947 |
| Mangal | अग्निं ईळे | agniM ILe | x0905, x0917, x094D, x0928, x093F, x0902; x0908, x0933, x0947 |
| unifont | अग्निं ईळे | agniM ILe | x0905, x0917, x094D, x0928, x093F, x0902; x0908, x0933, x0947 |
| Sanskrit 2003 | अग्निं ईळे | agniM ILe | x0905, x0917, x094D, x0928, x093F, x0902; x0908, x0933, x0947 |
| Code2000 | अग्निं ईळे | agniM ILe | x0905, x0917, x094D, x0928, x093F, x0902; x0908, x0933, x0947 |
This chart shows the characters in the Devanagari Unicode standard, and their numeric Unicode character codes in hexadecimal. Devanagari characters are in four different Unicode character blocks:
- Devanagari, numeric codes x0900–x097F
- Vedic Extensions, numeric codes x1CD0–x1CFF
- Common Indic Number Forms, numeric codes xA830–xA83F
- Devanagari Extended, numeric codes xA8E0–xA8FF
The Devanagari Extended and Vedic Extensions blocks consist entirely of Vedic accent characters. The main Devanagari block contains a few Vedic accent characters also.
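The block ranges listed above can be used to check which block a given character belongs to. The sketch below is illustrative only: the names `BLOCKS` and `block_of` are invented for this example, and the ranges are copied from the list above.

```python
from typing import Optional

# (block name, first code, last code), as listed above
BLOCKS = [
    ("Devanagari", 0x0900, 0x097F),
    ("Vedic Extensions", 0x1CD0, 0x1CFF),
    ("Common Indic Number Forms", 0xA830, 0xA83F),
    ("Devanagari Extended", 0xA8E0, 0xA8FF),
]

def block_of(ch: str) -> Optional[str]:
    """Return the name of the block containing ch, or None if it is in none of them."""
    cp = ord(ch)
    for name, first, last in BLOCKS:
        if first <= cp <= last:
            return name
    return None

print(block_of("\u0905"))  # Devanagari
print(block_of("\u1CD0"))  # Vedic Extensions
```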
| _ | x090_ | x091_ | x092_ | x093_ | x094_ | x095_ | x096_ | x097_ | x1CD_ | x1CE_ | x1CF_ | xA83_ | xA8E_ | xA8F_ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | ऀ x0900 | ऐ x0910 | ठ x0920 | र x0930 | ी x0940 | ॐ x0950 | ॠ x0960 | ॰ x0970 | ᳐ x1CD0 | ᳠ x1CE0 | ᳰ x1CF0 | ꠰ xA830 | ꣠ xA8E0 | ꣰ xA8F0 |
| 1 | ँ x0901 | ऑ x0911 | ड x0921 | ऱ x0931 | ु x0941 | ॑ x0951 | ॡ x0961 | ॱ x0971 | ᳑ x1CD1 | ᳡ x1CE1 | ᳱ x1CF1 | ꠱ xA831 | ꣡ xA8E1 | ꣱ xA8F1 |
| 2 | ं x0902 | ऒ x0912 | ढ x0922 | ल x0932 | ू x0942 | ॒ x0952 | ॢ x0962 | ॲ x0972 | ᳒ x1CD2 | ᳢ x1CE2 | ᳲ x1CF2 | ꠲ xA832 | ꣢ xA8E2 | ꣲ xA8F2 |
| 3 | ः x0903 | ओ x0913 | ण x0923 | ळ x0933 | ृ x0943 | ॓ x0953 | ॣ x0963 | | ᳓ x1CD3 | ᳣ x1CE3 | ᳳ x1CF3 | ꠳ xA833 | ꣣ xA8E3 | ꣳ xA8F3 |
| 4 | ऄ x0904 | औ x0914 | त x0924 | ऴ x0934 | ॄ x0944 | ॔ x0954 | । x0964 | | ᳔ x1CD4 | ᳤ x1CE4 | ᳴ x1CF4 | ꠴ xA834 | ꣤ xA8E4 | ꣴ xA8F4 |
| 5 | अ x0905 | क x0915 | थ x0925 | व x0935 | ॅ x0945 | ॕ x0955 | ॥ x0965 | | ᳕ x1CD5 | ᳥ x1CE5 | ᳵ x1CF5 | ꠵ xA835 | ꣥ xA8E5 | ꣵ xA8F5 |
| 6 | आ x0906 | ख x0916 | द x0926 | श x0936 | ॆ x0946 | | ० x0966 | | ᳖ x1CD6 | ᳦ x1CE6 | ᳶ x1CF6 | ꠶ xA836 | ꣦ xA8E6 | ꣶ xA8F6 |
| 7 | इ x0907 | ग x0917 | ध x0927 | ष x0937 | े x0947 | | १ x0967 | | ᳗ x1CD7 | ᳧ x1CE7 | | ꠷ xA837 | ꣧ xA8E7 | ꣷ xA8F7 |
| 8 | ई x0908 | घ x0918 | न x0928 | स x0938 | ै x0948 | क़ x0958 | २ x0968 | | ᳘ x1CD8 | ᳨ x1CE8 | | ꠸ xA838 | ꣨ xA8E8 | ꣸ xA8F8 |
| 9 | उ x0909 | ङ x0919 | ऩ x0929 | ह x0939 | ॉ x0949 | ख़ x0959 | ३ x0969 | ॹ x0979 | ᳙ x1CD9 | ᳩ x1CE9 | | ꠹ xA839 | ꣩ xA8E9 | ꣹ xA8F9 |
| A | ऊ x090A | च x091A | प x092A | | ॊ x094A | ग़ x095A | ४ x096A | ॺ x097A | ᳚ x1CDA | ᳪ x1CEA | | | ꣪ xA8EA | ꣺ xA8FA |
| B | ऋ x090B | छ x091B | फ x092B | | ो x094B | ज़ x095B | ५ x096B | ॻ x097B | ᳛ x1CDB | ᳫ x1CEB | | | ꣫ xA8EB | ꣻ xA8FB |
| C | ऌ x090C | ज x091C | ब x092C | ़ x093C | ौ x094C | ड़ x095C | ६ x096C | ॼ x097C | ᳜ x1CDC | ᳬ x1CEC | | | ꣬ xA8EC | |
| D | ऍ x090D | झ x091D | भ x092D | ऽ x093D | ् x094D | ढ़ x095D | ७ x096D | ॽ x097D | ᳝ x1CDD | ᳭ x1CED | | | ꣭ xA8ED | |
| E | ऎ x090E | ञ x091E | म x092E | ा x093E | ॎ x094E | फ़ x095E | ८ x096E | | ᳞ x1CDE | ᳮ x1CEE | | | ꣮ xA8EE | |
| F | ए x090F | ट x091F | य x092F | ि x093F | | य़ x095F | ९ x096F | | ᳟ x1CDF | ᳯ x1CEF | | | ꣯ xA8EF | |

Related characters outside these blocks: x200C, x200D, ◌ x25CC, ࿕ x0FD5, ࿖ x0FD6, ࿗ x0FD7, ࿘ x0FD8, 卐 x5350, 卍 x534D
This chart shows Unicode equivalents for each VedaPad keystroke in Devanagari mode. Each block shows the Unicode character on top, the VedaPad keystroke in the middle, and the numeric Unicode character code in hexadecimal on the bottom.
| Character Keystroke Code | Character Keystroke Code | Character Keystroke Code | Character Keystroke Code | Character Keystroke Code |
|---|---|---|---|---|
| अ a x0905 | आ ा A x0906 x093E | | | |
| इ ि i x0907 x093F | ई ी I x0908 x0940 | | | |
| उ ु u x0909 x0941 | ऊ ू U x090A x0942 | | | |
| ऋ ृ q x090B x0943 | ॠ ॄ Q x0960 x0944 | ऌ ॢ lq x090C x0962 | | |
| ए े e x090F x0947 | ऐ ै ai x0910 x0948 | | | |
| ओ ो o x0913 x094B | औ ौ au x0914 x094C | | | |
| ं M x0902 | ँ MM x0901 | ः H x0903 | ऽ ' x093D | |
| क ka x0915 | ख kha x0916 | ग ga x0917 | घ gha x0918 | ङ za x0919 |
| च ca x091A | छ cha x091B | ज ja x091C | झ jha x091D | ञ xa x091E |
| ट Ta x091F | ठ Tha x0920 | ड Da x0921 | ढ Dha x0922 | ण Na x0923 |
| त ta x0924 | थ tha x0925 | द da x0926 | ध dha x0927 | न na x0928 |
| प pa x092A | फ pha x092B | ब ba x092C | भ bha x092D | म ma x092E |
| य ya x092F | र ra x0930 | ल la x0932 | व va x0935 | |
| श fa x0936 | ष Sa x0937 | स sa x0938 | ह ha x0939 | |
| ॐ O x0950 | ळ La x0933 | ् x094D | | |
| । , x0964 | ॥ . x0965 | ॒ = x0952 | ॑ / x0951 | |
| १ 1 x0967 | २ 2 x0968 | ३ 3 x0969 | ४ 4 x096A | ५ 5 x096B |
| ६ 6 x096C | ७ 7 x096D | ८ 8 x096E | ९ 9 x096F | ० 0 x0966 |
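The chart above can be turned into a simple keystroke converter. The sketch below is an illustration only and is not VedaPad's actual algorithm: the function name `vedapad_to_devanagari`, the greedy two-character matching, and the rule of inserting a virama between adjacent consonants (and after a consonant with no vowel) are assumptions made for this example. The keystroke and code values are taken from the chart.

```python
VIRAMA = "\u094D"

# keystroke -> (initial vowel, vowel sign); 'a' is implicit, so it has no sign
VOWELS = {
    "a": ("\u0905", ""), "A": ("\u0906", "\u093E"),
    "i": ("\u0907", "\u093F"), "I": ("\u0908", "\u0940"),
    "u": ("\u0909", "\u0941"), "U": ("\u090A", "\u0942"),
    "q": ("\u090B", "\u0943"), "Q": ("\u0960", "\u0944"),
    "lq": ("\u090C", "\u0962"),
    "e": ("\u090F", "\u0947"), "ai": ("\u0910", "\u0948"),
    "o": ("\u0913", "\u094B"), "au": ("\u0914", "\u094C"),
}

CONSONANTS = {
    "k": "\u0915", "kh": "\u0916", "g": "\u0917", "gh": "\u0918", "z": "\u0919",
    "c": "\u091A", "ch": "\u091B", "j": "\u091C", "jh": "\u091D", "x": "\u091E",
    "T": "\u091F", "Th": "\u0920", "D": "\u0921", "Dh": "\u0922", "N": "\u0923",
    "t": "\u0924", "th": "\u0925", "d": "\u0926", "dh": "\u0927", "n": "\u0928",
    "p": "\u092A", "ph": "\u092B", "b": "\u092C", "bh": "\u092D", "m": "\u092E",
    "y": "\u092F", "r": "\u0930", "l": "\u0932", "v": "\u0935",
    "f": "\u0936", "S": "\u0937", "s": "\u0938", "h": "\u0939", "L": "\u0933",
}

OTHER = {
    "M": "\u0902", "MM": "\u0901", "H": "\u0903", "'": "\u093D", "O": "\u0950",
    ",": "\u0964", ".": "\u0965", "=": "\u0952", "/": "\u0951",
}
# Digits 0-9 map to x0966-x096F
OTHER.update({str(d): chr(0x0966 + d) for d in range(10)})


def vedapad_to_devanagari(keys: str) -> str:
    out = []
    pending = False  # True while the last consonant has not yet received a vowel
    i = 0
    while i < len(keys):
        # Prefer a two-character keystroke such as "kh", "ai" or "MM"
        token = keys[i:i + 2]
        if len(token) < 2 or not (token in VOWELS or token in CONSONANTS or token in OTHER):
            token = keys[i]
        i += len(token)

        if token in CONSONANTS:
            if pending:
                out.append(VIRAMA)  # join to the previous consonant
            out.append(CONSONANTS[token])
            pending = True
        elif token in VOWELS:
            initial, sign = VOWELS[token]
            out.append(sign if pending else initial)
            pending = False
        else:
            if pending:
                out.append(VIRAMA)  # consonant with no vowel
            pending = False
            out.append(OTHER.get(token, token))  # unknown keys pass through
    if pending:
        out.append(VIRAMA)
    return "".join(out)


print(vedapad_to_devanagari("agniM ILe"))
```

Greedy matching means that a sequence such as t followed by h is always read as the single keystroke th. A real input method would need a way to override that, which this sketch does not attempt.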
This chart shows Unicode equivalents for each VedaPad keystroke in Transliteration mode. These characters are in three different Unicode character blocks:
- Latin-1 Supplement, numeric codes x0080–x00FF
- Latin Extended-A, numeric codes x0100–x017F
- Latin Extended Additional, numeric codes x1E00–x1EFF
| Uppercase Translit. Character | Lowercase Translit. Character | Uppercase VedaPad Keystrokes | Lowercase VedaPad Keystrokes | Uppercase Translit. Code | Lowercase Translit. Code |
|---|---|---|---|---|---|
| Ā | ā | +A | A | x0100 | x0101 |
| Ī | ī | +I | I | x012A | x012B |
| Ū | ū | +U | U | x016A | x016B |
| Ṛ | ṛ | +q | q | x1E5A | x1E5B |
| Ṝ | ṝ | +Q | Q | x1E5C | x1E5D |
| Ḷ | ḷ | +lq | lq | x1E36 | x1E37 |
| Ṃ | ṃ | +M | M | x1E42 | x1E43 |
| Ṁ | ṁ | +MM | MM | x1E40 | x1E41 |
| Ḥ | ḥ | +H | H | x1E24 | x1E25 |
| Ṅ | ṅ | +z | z | x1E44 | x1E45 |
| Ñ | ñ | +x | x | x00D1 | x00F1 |
| Ṭ | ṭ | +T | T | x1E6C | x1E6D |
| Ḍ | ḍ | +D | D | x1E0C | x1E0D |
| Ṇ | ṇ | +N | N | x1E46 | x1E47 |
| Ś | ś | +f | f | x015A | x015B |
| Ṣ | ṣ | +S | S | x1E62 | x1E63 |
| Ḻ | ḻ | +L | L | x1E3A | x1E3B |
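The table above amounts to a keystroke-to-character mapping, which can be applied with a longest-match substitution. The following sketch is illustrative: the names `TRANSLIT`, `KEYMAP` and `vedapad_to_iast` are invented here, and it assumes that any keystroke not in the table (plain letters, spaces, a "+" before an unlisted letter) is passed through unchanged.

```python
# (lowercase keystroke, uppercase code, lowercase code), from the table above
TRANSLIT = [
    ("A", 0x0100, 0x0101), ("I", 0x012A, 0x012B), ("U", 0x016A, 0x016B),
    ("q", 0x1E5A, 0x1E5B), ("Q", 0x1E5C, 0x1E5D), ("lq", 0x1E36, 0x1E37),
    ("M", 0x1E42, 0x1E43), ("MM", 0x1E40, 0x1E41), ("H", 0x1E24, 0x1E25),
    ("z", 0x1E44, 0x1E45), ("x", 0x00D1, 0x00F1), ("T", 0x1E6C, 0x1E6D),
    ("D", 0x1E0C, 0x1E0D), ("N", 0x1E46, 0x1E47), ("f", 0x015A, 0x015B),
    ("S", 0x1E62, 0x1E63), ("L", 0x1E3A, 0x1E3B),
]

# The uppercase keystroke is the lowercase keystroke prefixed with "+"
KEYMAP = {}
for key, upper, lower in TRANSLIT:
    KEYMAP[key] = chr(lower)
    KEYMAP["+" + key] = chr(upper)

MAX_LEN = max(len(k) for k in KEYMAP)


def vedapad_to_iast(keys: str) -> str:
    out = []
    i = 0
    while i < len(keys):
        # Try the longest possible keystroke first, e.g. "+lq" before "+l"
        for size in range(MAX_LEN, 0, -1):
            token = keys[i:i + size]
            if token in KEYMAP:
                out.append(KEYMAP[token])
                i += len(token)
                break
        else:
            out.append(keys[i])  # not in the table: keep as typed
            i += 1
    return "".join(out)


print(vedapad_to_iast("agniM ILe"))
print(vedapad_to_iast("+fiva"))
```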
In Devanagari Unicode, each vowel except a has two character codes, one for the initial letter and one for the vowel symbol that modifies a consonant. Each consonant includes an implicit a. To give a consonant a different vowel, add the non-initial vowel code after the consonant code or codes. This applies to short i as well, even though its symbol is displayed to the left of the consonant.
Use the virama (x094D) between consonant codes to form compound consonants. Without an intervening virama, consecutive consonant codes indicate separate syllables. If there is a modifying vowel for a compound consonant, use it after all the consonants, including short i. If there is a vowel modifier, such as an anusvara, in addition to a modifying vowel, then use the vowel first and then the vowel modifier. Vedic accents should be placed at the end of a syllable.
The zero-width joiner character (x200D) after a virama that is between consonants indicates that the compound consonant should be formed using a half letter, if available, instead of a ligature. In the same situation, the zero-width non-joiner character (x200C) indicates that a virama should be used, rather than a half-letter or ligature.
Examples:
| क | कु | कुं | कि | किं | किं॒ | किं॑ | रक | र्क | र्कि | र्किं | कष | क्ष | क्ष | क्ष |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| x0915 | x0915 x0941 | x0915 x0941 x0902 | x0915 x093F | x0915 x093F x0902 | x0915 x093F x0902 x0952 | x0915 x093F x0902 x0951 | x0930 x0915 | x0930 x094D x0915 | x0930 x094D x0915 x093F | x0930 x094D x0915 x093F x0902 | x0915 x0937 | x0915 x094D x0937 | x0915 x094D x200D x0937 | x0915 x094D x200C x0937 |
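The ordering rules described above (consonants joined by virama, then the vowel sign, then a vowel modifier, then a Vedic accent) can be expressed as a pattern and checked mechanically. The sketch below is a simplified illustration, not the Unicode standard's full syllable definition: the character ranges are chosen from the chart on this page, only the accents x0951 and x0952 are recognised, and a consonant that ends a word with a bare virama is not handled. All names are hypothetical.

```python
import re

CONSONANT = "[\u0915-\u0939]"
INITIAL_VOWEL = "[\u0905-\u0914\u0960\u0961]"
VOWEL_SIGN = "[\u093E-\u094C\u0962\u0963]"
MODIFIER = "[\u0901-\u0903]"   # candrabindu, anusvara, visarga
ACCENT = "[\u0951\u0952]"      # the two accents used in the examples
VIRAMA = "\u094D"
JOINER = "[\u200C\u200D]"      # zero-width non-joiner / joiner

# consonant cluster + optional vowel sign, or an initial vowel;
# then an optional modifier, then an optional accent
SYLLABLE = (
    f"(?:(?:{CONSONANT}{VIRAMA}{JOINER}?)*{CONSONANT}{VOWEL_SIGN}?|{INITIAL_VOWEL})"
    f"{MODIFIER}?{ACCENT}?"
)
WORD = re.compile(f"(?:{SYLLABLE})+")


def follows_order(text: str) -> bool:
    """True if text consists only of syllables encoded in the order described above."""
    return WORD.fullmatch(text) is not None


print(follows_order("\u0915\u093F\u0902\u0952"))  # ka, i, anusvara, accent
print(follows_order("\u0915\u0902\u093F"))        # anusvara placed before the vowel sign
```

The first call matches the sixth example in the table; the second shows a sequence that the check rejects because the vowel modifier precedes the vowel.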
These files are published by the Unicode Consortium and are the official reference for encoding Devanagari characters in Unicode.
- The Unicode standard for Devanagari • unicode.org/charts/PDF/U0900.pdf
- The Unicode standard for Vedic Extensions • http://www.unicode.org/charts/PDF/U1CD0.pdf
- The Unicode standard for Common Indic Number Forms • http://www.unicode.org/charts/PDF/UA830.pdf
- The Unicode standard for Devanagari Extended • http://www.unicode.org/charts/PDF/UA8E0.pdf