indolipi _ manual

35
ᵫꭐꭐ π ꭐꭐ ꭐꭐ ꭐꭐ ᵫꭐ ꭐꭐ ᵫꭐꭐ ན་ད་୩་པ་ [Ȥin ʝd ʝolipi] INDOLIPI © Elmar Kniprath 2005-2010 Manual Last Update: 2009-12-31 [Ȥin ʝd ʝolipi] ན་ད་୩་པ་ ꭐꭐ ᵫꭐ ꭐꭐ ᵫꭐꭐ ᵫꭐꭐ π ꭐꭐ ꭐꭐ

description

a transliteration software freeware to install

Transcript of indolipi _ manual

Page 1: indolipi _ manual

��������� ����π �������� �� ������ ������

�������� ������� �������� ��������

����� ����� ��ན་ད�་��་པ�་

[�Ȥinʝdʝo�lipi]

INDOLIPI © Elmar Kniprath 2005-2010

Manual Last Update: 2009-12-31

[�Ȥinʝdʝo�lipi]

����� ����� ��ན་ད�་��་པ�་

�������� ������� �������� ��������

��������� ����π �������� �� ������ ������

Page 2: indolipi _ manual

INDOLIPI: Manual

- 2 -

Copyright

INDOLIPI has been developed by Elmar Kniprath. He holds the copyright for the whole package (including the fonts). INDOLIPI is distributed as freeware. It may be used free of charge for scientific and private purposes. No commercial use or distribution of the program as a whole or of any of its components (including the fonts) is allowed, except with written consent of the author.

Disclaimer

The author can not be made responsible for any possible faults of the package

and its components nor is he liable for any damages which might be caused by

them.

Page 3: indolipi _ manual

INDOLIPI: Manual

- 3 -

INDOLIPI is a multipurpose tool box for indologists and linguists. It contains

• Open Type fonts for most Indian scripts,

• a Latin font for "instant" transliteration of Indic scripts,

• a Unicode based Latin font for writing of scientific texts in a western language containing all transliteration signs used by indologists as well as all presently valid IPA signs contained in Unicode (this font is also Open Type regarding exact positioning of IPA diacritics),

• western keyboard layouts for Indic scripts for users who find it difficult to use Indian keyboard layouts,

• an MS Word template with helpful macros for easy use of OT fonts and with a converter both for transliteration of Indic scripts into Latin and for converting between different transliteration systems (ISO 15919, Library of Congress, IAST, Harvard-Kyoto, Velthuis and ITRANS).

To allow for partial downloads of the components the INDOLIPI software has been split up into smaller packages and single fonts. The fonts and the keyboard layouts may each be installed and used separately. So you may for example

• install just one of INDOLIPI's Tamil fonts and use it with an Indian keyboard layout as provided by Microsoft or others,

• or install one of INDOLIPI's western keyboard layouts and use it with an MS font (e.g. Mangal) or any Open Type font made by somebody else,

• or use the font "e-Latin Indic" for instant transliteration of Indic texts,

• or write scientific texts in a western language with the font "e-PhonTranslit UNI", allowing you to type IPA signs or signs with diacritics needed for transliteration of Indic words, all signs being Unicode based, i.e. they are compatible with any other Unicode font in the world,

• or convert a text written with one of the non-ISO transliteration systems Library of Congress, IAST, Harvard-Kyoto, Velthuis and ITRANS into ISO or Indic scripts.

Installing all components will enable you to use all features and scripts of INDOLIPI and thus give you a maximum flexibility for your work.

Page 4: indolipi _ manual

INDOLIPI: Manual

- 4 -

Contents

Introduction

1. Technical Prerequisites 1.1 Software 1.2 Keyboard 1.3 Windows 7

2. Components of INDOLIPI 2.1 Font files 2.2 Keyboard layout files 2.2.1 German layout 2.2.2 English layout

2.2.3 INSCRIPT layout (for Grantha) 2.3 Additional files 2.4 Packages

3. Installation of INDOLIPI 3.1 Download 3.2 Fonts 3.3 Enabling complex scripts features in Windows (most important!) 3.4 Keyboards 3.5 Template "INDOLIPI.dot"

4. Using the Features of INDOLIPI

4.1 No Transliteration 4.1.1 Devanagari Fonts 4.1.2 Asamiya and Bengali 4.1.3 Gujarati 4.1.4 Malayalam Fonts 4.1.5 Panjabi 4.1.6 Tamil Fonts 4.1.7 Telugu

4.2 Transliteration 4.2.0 Transliteration Systems

4.2.0.1 Systems using diacritics 4.2.0.2 Systems without diacritics 4.2.0.3 Role of the Inherent Vowel

4.2.1 "Instant" Transliteration 4.2.2 Unicode based INDOLIPI Transliteration

4.3 Working with the INDOLIPI template

Page 5: indolipi _ manual

INDOLIPI: Manual

- 5 -

4.3.1 The Menu Bar and its Functions 4.3.1.1 Asamiya + Bangla 4.3.1.2 Grantha (see 5.1) 4.3.1.3 Gujarati 4.3.1.4 Kannada 4.3.1.5 Malayalam 4.3.1.6 Nagari 4.3.1.7 Oriya 4.3.1.8 Panjabi 4.3.1.9 Sinhala 4.3.1.10 Tamil 4.3.1.11 Telugu 4.3.1.12 Instant Transliteration 4.3.1.13 Transliteration Converter 4.3.1.14 Style 4.3.1.15 Digits 4.3.1.16 Old Files

4.3.2 Shortcuts 4.3.3 From Latin Transliteration to Indic scripts

5. Instructions for other languages and scripts

5.1 Grantha 5.2 Tamil transliteration of Sanskrit

5.3 Vedic Sanskrit 5.3.1 Devanagari 5.3.2 Grantha

5.4 Oriya 5.5 Sinhalese

5.6 Tibetan 5.7 IPA

6. Additional remarks and hints

7. Examples

8. Improvements and Updates

Page 6: indolipi _ manual

INDOLIPI: Manual

- 6 -

Introduction

Microsoft applications from 2000 and later exhibit a new feature called Open Type Fonts (OT fonts). It was developed by Adobe and Microsoft and is especially suitable for complex scripts such as the descendants of Brahmi in India and SE Asia.

OT fonts have an inherent program which e.g. immediately puts all entered matra vowels to their due positions, forms ligatures automatically, governs the formation and positioning of reph ... in short: The sequence of signs on entering is the same as the sequence of sounds in speaking, and after each typed sign the user sees the final version of his text on his desktop.

A further advantage is that Open Type is Unicode based. That means, by adhering to this world wide standard, texts typed with an OT font are compatible with all other OT fonts of the same script. This greatly facilitates data exchange, e.g. writing e-mails or exchanging text documents or Excel files in Indic scripts.

To keep up with this new development the fonts of the previous writing program Elmar's Indic ("e-Fonts") have been revised to fit into Open Type. The resulting new software package has been called INDOLIPI. The names of the fonts contained in INDOLIPI were given the suffix "OT" in order to distinguish them from the fonts of Elmar's Indic. Now "e-Fonts OT" for the following languages are available:

Asamiya, Bengali, Gujarati, Hindi, Kannada, Malayalam (modern and classical spelling), Marathi, Nepali, Oriya, Panjabi, Rajasthani, Sanskrit (Devanagari and Grantha scripts), Sinhalese (modern spelling), Tamil (modern and classical spelling, also classical spelling mixed up with Grantha for Maṇipravāḷā), Telugu.

Additionaly two Latin fonts are being provided: The "instant transliteration" font e-Latin Indic, which allows immediate transliteration of "e-Fonts OT", and e-PhonTranslit UNI, a Unicode based multipurpose font suitable for writing scientific texts on indological and phonetic subjects in a language based on Latin script.

Beyond that, it is possible to transliterate Indic scripts into e-PhonTranslit UNI and vice versa using different transliteration systems and even to type an Indic text in Latin script and have it transliterated to an Indic script automatically.

1. Technical Prerequisites

1.1 Software

It is recommended to use Windows XP (+ SP2) and MS Word 2003 or later versions. Earlier versions of this software may cause problems.

For some scripts it seems to be advisable to make sure, that the version of Microsoft's Unicode Script Processor "Uniscribe" is not too old. If one of the OT fonts doesn't run properly, please look for the version number of the file USP10.dll. It should be:

Page 7: indolipi _ manual

INDOLIPI: Manual

- 7 -

1. under windows\system32: 1.420.2600.2180 or higher 2. under program files\common files\microsoft shared\office11: 1.471.4063.0 or higher

German users may find USP10.dll under:

1. Windows\system32 2. Programme\Gemeinsame Dateien\Microsoft Shared\OFFICE11

It is to be expected, that Windows 7 will be able to handle all OT fonts properly.

� Important: You must enable complex scripts features in Windows (see below under 3.3) and the respective languages in MSOffice. A language specific keyboard must be installed (see below under 3.4).

(It seems, that at least some OT fonts may also be used with Linux or others.)

1.2 Keyboard

Each Indic script needs its own Unicode based keyboard layout (not only for "e-Fonts OT", but for all Unicode based Indic fonts in the world). That means for writing Tamil you need a Tamil layout, for Sanskrit you need a Devanagari layout etc. Indian users may use their respective Indic keyboards. These are contained in MS Windows (e.g. "Devanagari - INSCRIPT").

For users of western keyboards sets of German and English (US) based keyboard layouts are provided with this software package. For Grantha an extra keyboard layout based on Bengali-INSCRIPT is contained in the Grantha Package of INDOLIPI.

Additionally keyboard layouts for IPA and Tibetan Unicode fonts are provided1.

1.3 Windows 7

Microsoft's Windows 7 comes along with considerable improvements regarding complex scripts. Now there are regional and language options available for Assamese, Oriya and Sinhalese. It's to be aspected that the OT fonts contained in INDOLIPI will be running well. Unfortunately the keyboard layouts provided with INDOLIPI can not be installed under Windows 7.

2. Components of INDOLIPI

2.1 Font files

• e-Bengali OT.ttf (for Asamiya and Bengali)

• e-Grantamil.ttf (combined font for Tamil and Grantha, to be used for Sanskrit, Tamil and Manipravala texts)

• e-Gujarati OT.ttf

1 All keyboard layouts were designed using Microsoft's "Keyboard Layout Creator".

Page 8: indolipi _ manual

INDOLIPI: Manual

- 8 -

• e-Kannada OT.ttf

• e-Malayalam OT.ttf (for modern Malayalam)

• e-Malayalam OTC.ttf (for Malayalam with classical orthography)

• e-Nagari OT.ttf (for Sanskrit and Nepali)

• e-Nagari OTH.ttf (for Hindi)

• e-Nagari OTM.ttf (for Marathi)

• e-Nagari OTR.ttf (for Rajasthani)

• e-Oriya OT.ttf

• e-Panjabi OT.ttf (for Gurmukhi script)

• e-Sinhala OT (for Sinhalese)

• e-Tamil OT.ttf (for modern Tamil)

• e-Tamil OTC.ttf (for Tamil with classical orthography)

• e-Telugu OT.ttf

• e-Latin Indic.ttf (for "instant" Latin transliteration of Indic Unicode texts)

• e-PhonTranslit UNI (for writing indological texts in a language based on Latin script, also containig all valid IPA signs contained in Unicode and a lot of arrows, mathematical and logical signs)

2.2 Keyboard layout files

2.2.1 German layout

• KBDAssDE for keyboard e-Asamiya DE

• KBDBenDE for keyboard e-Bangla DE

• KBDGraDE for keyboard e-Grantha DE

• KBDGujDE for keyboard e-Gujarati DE

• KBDipaDE for keyboard e-IPA DE (for IPA transcription)

• KBDKanDE for keyboard e-Kannada DE

• KBDMalDE for keyboard e-Malayalam DE

• KBDOriDE for keyboard e-Oriya DE

• KBDPanDE for keyboard e-Panjabi DE

• KBDSinDE for keyboard e-Sinhala DE

• KBDSktDE for keyboard e-Sanskrit DE (to be used for all Devanagari fonts)

• KBDTamDE for keyboard e-Tamil DE

• KBDTelDE for keyboard e-Telugu DE

• KBDTibDE for keyboard e-Tibetan DE

2.2.2 English layout (US)

• KBDAssEN for keyboard e-Asamiya EN

• KBDBenEN for keyboard e-Bangla EN

• KBDGraEN for keyboard e-Grantha EN

Page 9: indolipi _ manual

INDOLIPI: Manual

- 9 -

• KBDGujEN for keyboard e-Gujarati EN

• KBDipaEN for keyboard e-IPA EN (for IPA transcription)

• KBDKanEN for keyboard e-Kannada EN

• KBDMalEN for keyboard e-Malayalam EN

• KBDOriEN for keyboard e-Oriya EN

• KBDPanEN for keyboard e-Panjabi EN

• KBDSinEN for keyboard e-Sinhala EN

• KBDSktEN for keyboard e-Sanskrit EN (to be used for all Devanagari fonts)

• KBDTamEN for keyboard e-Tamil EN

• KBDTelEN for keyboard e-Telugu EN

2.2.3 INSCRIPT layout

• KBDGraIN for keyboard e-Grantha IN (based on Bengali encoding)

2.3 Additional files

• INDOLIPI.dot (a template for MS Word with helpfull macros for easy use of OT fonts and transliteration into Latin script)

• INDOLIPI Manual.pdf (this file)

• Grantha Manual.pdf

• IPA Manual.pdf

• Keyboard Layout Tables English, German and INSCRIPT

• Font Scheme.pdf

• INDOLIPI Working Scheme.pdf

• Concordance of Transliteration Systems.pdf

2.4 Packages

To allow for partial downloads of the components the INDOLIPI software has been split up into smaller packages and single fonts:

• Indic fonts and keyboard layouts are packed together separately for each Indic script.

• Additional packs contain the transliteration tools, the manuals and separate packages for Grantha and IPA.

• A detailed description of all INDOLIPI features with download and installation advice is given in the INDOLIPI Manual (this file) contained in the Manuals package.

• Because of the popularity of the Manipravala font "e-Grantamil.ttf" a separate download of this font has been provided for.

Page 10: indolipi _ manual

INDOLIPI: Manual

- 10 -

3. Installation of INDOLIPI

3.1 Download

Download the components you would like to use and unzip them.

3.2 Fonts

Install the fonts under Windows. They are immediately ready for use.

3.3 Enabling complex scripts features in Windows (most important!)

For Indic Open Type fonts to work correctly "complex scripts" must be enabled in Windows. This is to be made sure by the following steps:

1. Go to Control Panel 2. Click on Date, Time, Language, and Regional Options 3. Click on Regional and Language Options 4. Click on the Languages tab

Under the heading of "supplemental language support" there should be a check mark for "Install files for complex script and right-to-left languages". If not, make a check mark and click "OK" or "Apply". You will be prompted to insert the Windows CD-ROM. After the files are installed, you must restart your computer.

For users with German version of Windows XP:

1. Start > Systemsteuerung 2. Datums-, Zeit-, Sprach- und Regionaleinstellungen 3. Regions- und Sprachoptionen 4. Sprachen

Unter "Zusätzliche Sprachunterstützung" muss ein Häkchen bei "Dateien für Sprachen mit komplexen Schriftzeichen ... installieren" sein. Wenn nicht, muss eines eingefügt werden. Anschließend auf "OK" clicken. Man wird dann aufgefordert die Windows CD-ROM einzulegen. Nach Installieren der Dateien muss der Computer neu gestartet werden.

3.4 Keyboards

� Make sure that "complex scripts" are enabled in Windows! (see 3.3 above!)

� Note: There are special instructions for installation of the keyboards e-Grantha DE/EN/IN, e-IPA DE/EN, e-Oriya DE/EN, e-Sinhala DE/EN and e-Tibetan DE/EN. Please refer to the respective paragraphs of this Manual under 5.!

� Note: For the Grantha font only e-Grantha DE, e-Grantha EN or e-Grantha IN can be used!

Page 11: indolipi _ manual

INDOLIPI: Manual

- 11 -

If you have a Unicode based Indic keyboard installed and would like to work with it you may do so without caring about the keyboard files provided with this package (except Grantha). People used to western keyboards may prefer the German or English based layouts of this package.

If you don't like to install a German or English based keyboard layout and are not yet familiar with Indic layouts, you may use an On-Screen Keyboard. This is to be found under:

Start/All Programs/Accessories/Accessibility/On-Screen Keyboard

(for German users: Start/Alle Programme/Zubehör/Eingabehilfen/Bildschirmtastatur)

The unpacked keyboard files each contain a Windows Installer Package. Find it for the keyboard you want to install and start installation by doubleclick on the Setup file in the respective keyboard folder (e.g. "KBDSktDE.msi" in the folder "KBDSktDE", or "KBDTamEN.msi" in the folder "KBDTamEN"). You will be informed, when the installation has been successfull.

Next you must add this keyboard in Windows by the following steps:

1. Go to Control Panel 2. Click on Date, Time, Language, and Regional Options 3. Click on Regional and Language Options 4. Click on the Languages tab 5. Under "Text services and input languages," click on "Details..." 6. Under Installed Services, click "Add..." 7. In the Add Input Language dialog box, click the input language (e.g. "Sanskrit", "Hindi" or

"Marathi" for all Devanagari fonts provided with this software package, or "Tamil" for all Tamil fonts) and the keyboard layout you want to add (e.g. "e-Sanskrit DE/EN" for all Devanagari fonts and languages written with Devanagari, or "e-Tamil DE/EN" for Tamil). Click OK to exit. On the Text Services and Input Languages page, click OK again to close Regional Options.

8. You should now see a language indicator (blue square with two capital letters, e.g. EN for English) in your System Tray (located at bottom right hand corner of the desktop by default).

For users with German version of Windows XP:

1. Start > Systemsteuerung 2. Datums-, Zeit-, Sprach- und Regionaleinstellungen 3. Regions- und Sprachoptionen 4. Sprachen 5. Details ... 6. Einstellungen 7. Hinzufügen ... 8. Unter "Eingabegebietsschema hinzufügen" Eingabegebietsschema (z.B. "Sanskrit", "Hindi"

oder "Marathi" für alle Devanagari-Fonts, "Tamil" für alle Tamil-Fonts) und Tastaturlayout

Page 12: indolipi _ manual

INDOLIPI: Manual

- 12 -

(z.B. e-Sanskrit DE für alle Devanagari-Fonts oder e-Tamil DE für Tamil) wählen. Mit "OK" bestätigen

9. Auch "Textdienste und Eingabesprachen" und "Regions- und Sprachoptionen" mit "OK" bestätigen.

10. In der Taskleiste sollte jetzt unten rechts die Sprachanzeige zu sehen sein (z.B. auf blauem Grund "DE" für Deutsch)

The installed keyboards and fonts are now ready for use. They will work e.g. under MS Word, WordPad, Excel, Outlook Express, Access and others, but also for writing e-mails or searching Indic Wikipedias.

The Keyboard Layout Tables contained in each of the script specific packs inform you about the keystrokes to be applied for each sign.

3.5 Template "INDOLIPI.dot"

Copy this file into the Template folder of Microsoft Office and/or put a link to INDOLIPI.dot on your desktop.

4. Using the Features of INDOLIPI

� Note: There are additional special instructions for Grantha, Oriya, Sinhalese, Tibetan and IPA. Please refer to the respective paragraphs of this Manual under 5. and to the separate Manuals contained in the Grantha and IPS packs!

INDOLIPI gives you the following opportunities:

• to type just Indic scripts without the option of Latin transliteration (see section 4.1 below), orororor

• to type just texts in a western language with the possibility to insert Latin letters with indological diacritics and/or IPA symbols,

orororor

• to mix up Indic and western scripts in one and the same text as in the present file, orororor

• to take advantage of the multiple features provided by the template INDOLIPI.dot and have the possibility of transliterating your Indic texts into Latin script according to ISO 15919 or other popular transliteration systems as shown in the INDOLIPI Working Scheme and described in the section starting with 4.2 below,

orororor

• to type Indic texts in Latin script using any of the transliteration systems supported by INDOLIPI and transliterate them into an Indic script.

Page 13: indolipi _ manual

INDOLIPI: Manual

- 13 -

4.1 No Transliteration

To start writing choose the font you want to use (e.g. e-Nagari OT or e-Telugu OT), then click on the blue language indicator in the System Tray (located at bottom right hand corner of the desktop by default) and choose the language you want to use. If there is a keyboard symbol beside the language indicator, click on that symbol to choose the keyboard layout you want to work with, e.g. if you have chosen "Sanskrit" language and both "Devanagari - INSCRIPT", "e-Sanskrit EN" and "e-Sanskrit DE" are installed on your computer, you may choose one of them, or if you have chosen "Bengali" language and your computer has "Bengali - INSCRIPT", "e-Asamiya EN", "e-Bengali EN", "e-Asamiya DE" and "e-Bengali DE", you may choose "e-Asamiya DE" or "e-Asamiya EN" to type Asamiya, or "e-Bengali DE" or "e-Bengali EN" to type Bengali, whereas "Bengali - INSCRIPT" can be used only for Bengali.

Generally the principle of writing is the same for all Indic OT fonts: You type the letters "phonetically", i.e. in the sequence of speaking. That means for typing Devanagari "ki" you first type "k" and then "i", or for Tamil "ke" first "k" and then "e". The vowel is automatically put in the correct position, e.g. Devanagari �� or Tamil �� or ���� And that is independent of your keyboard layout, provided you are using an OT font.

A ligature is written by typing its components followed by Halant (Virama, Pulli). So �� is got by typing � + ◌ + �. Other examples are:

+ ◌ + � + ◌ + � => �

� + ◌ + + ◌ + � => ��

� + � + � =>

� + ◌ + � => �

Vowel ligatures like Devanagari � or Tamil or Telugu just need typing the consonant followed by the vowel.

With these informations you are already able to type all kinds of texts in all Indic OT fonts, not only in MS Word but also in Wordpad, Outlook Express or others.

Here are some further remarks regarding the different Fonts of INDOLIPI:

4.1.1 Devanagari Fonts

The package contains special Devanagari fonts for different languages. The reason for this is, that some of the languages written with Devanagari have slightly different orthographies or different ligatures. So Hindi usually uses less ligatures than Sanskrit, e.g. where Sanskrit written with e-Nagari OT has �� [paṅkti], Hindi written with e-Nagari OTH usually has ��� �. By using the respective font you automatically get the appropriate ligatures. To get a special Sanskrit ligature in a Hindi text there are two possibilities: Either you type that word in e-Nagari OTH and change the font of that word into e-Nagari OT, or you insert an invisible Zero Width Joiner (ZWJ) by

Page 14: indolipi _ manual

INDOLIPI: Manual

- 14 -

typing in e-Nagari OTH "ShiftCtrl 1", in the above example after �. By this latter procedure you may explicitely write homorganic nasals instead of Anusvara:

� + �◌ + � + ◌ + � + ◌� => �����

� + �◌ + � + ◌ + ZWJ + � + ◌� => ��� ��

Analogously the Marathi "eye lash ra" is typed as � + ◌ + ZWJ => � . When omitting ZWJ you will get Reph.2 The Devanagari fonts now contain also � (see Keyboard Tables!).

In Hindi Anusvara is automatically transformed to Chandrabindu in certain positions, whereas Marathi and Rajasthani do not use Chandrabindu.

For Rajasthani the initial vowels exhibit the special forms � [e] and � [ai]. The Devanagari fonts now contain also � (see Keyboard Tables!).

For Nepali the font e-Nagari OT should be used.

For the Sindhi implosives � � please refer to the Keyboard Tables!

If you don't care about transliteration you may just use e-Nagari OT for all of these languages typing e.g.

� + �◌ + ◌� + � + ◌� => �����

� + �◌ + � + ◌� + � + ◌� => �����

4.1.2 Asamiya and Bengali

To render (Khanda Ta) there are 2 possibilities: The direct method is to type it as "AltGr + t". With the indirect method you must type � + ◌� => � . At the end of a word the latter is kept as ; if followed by a consonant a ligature may be formed, e.g. � + ◌� + � => �. To enforce formation of Khanda Ta inside a word by the indirect method you have to type � + ◌� + ZWNJ => � . (ZWNJ = Zero Width Non Joiner; to be typed by "ShiftCtrl 2".). The indirect method is to be used with INDOLIPI Transliteration.

By using the keyboard layouts "e-Asamiya DE" or "e-Asamiya EN" you will get the Assamese signs � and � when typing "r" and "v", while with "e-Bengali DE" or "e-Bengali EN" you will get �

and �.

4.1.3 Gujarati

The Nukta forms have to be typed by "sign + nukta", e.g.: + ◌� => �

2 In other OT fonts (e.g. Mangal) "eye lash ra" is obtained by � + ◌ => � . As � is normally used for Devanagari transliteration of alveolar "r" from Dravidian scripts (transliterated as as `ṟb), only e-Nagari OTM uses this feature in addition to the method described above. But in this case the Latin transliteration will not be correct (see 4.2 below).

Page 15: indolipi _ manual

INDOLIPI: Manual

- 15 -

4.1.4 Malayalam Fonts

The fonts e-Malayalam OT and e-Malayalam OTC differ considerably in orthography. e-Malayalam OTC has the classical forms, e.g.:

e-Malayalam OT e-Malayalam OTC

� � � � �� �� � � ����� �� � �

Final "m + Halant" will be automatically transformed to Anusvara which will be transliterated as `mb both in Instant Transliteration and in INDOLIPI Transliteration.

To get the "unrounded u" in word final position, you have to type "u + Halant". In Instant Transliteration you will get `ub. INDOLIPI Transliteration will yield `ŭb (in accordance with ISO).

Positioning of matra-e/ē/ai/o/ō/au:

e-Malayalam OT: + ◌� + � + �◌/�◌/�◌/�◌�/�◌�/�◌� > ��/��/��/���/���/���

+ ◌� + ZWJZWJZWJZWJ + � + �◌/�◌/�◌/�◌�/�◌�/�◌� > � �� etc.

+ ◌� + ZWNJZWNJZWNJZWNJ + � + �◌/�◌/�◌/�◌�/�◌�/�◌� > � �� etc.

e-Malayalam OTC: � + ◌� + � + �◌/�◌/�◌/�◌�/�◌�/�◌� > �� etc.

� + ◌� + ZWJZWJZWJZWJ + � + �◌/�◌/�◌/�◌�/�◌�/�◌� > � �� etc.

� + ◌� + ZWNJZWNJZWNJZWNJ + � + �◌/�◌/�◌/�◌�/�◌�/�◌� > � �� etc.

When using ZWNJ the cursor may be positioned between � and �� etc.; with ZWJ this will not be possible.

Chillaksharams (= virama ligatures) may be typed directly (see Keyboard Tables!) or by typing the respective consonant followed by virama:

+ ◌� > �

� + ◌� >

+ ◌� > �

! + ◌� > "

� + ◌� > #

$ + ◌� > %

To avoid the automatic formation of Chillaksharams ZWJ must be used:

/�/ /!/�/$ + ZWJZWJZWJZWJ + ◌� > �/� �/ �/! �/� �/$ �

Page 16: indolipi _ manual

INDOLIPI: Manual

- 16 -

4.1.5 Panjabi

Addhak is automatically formed by typing "Consonant + Halant + Consonant". To write double aspirated consonants you may either type "asp. Consonant + Halant + asp. Consonant" or "unasp. Consonant + Halant + asp. Consonant". The difference will show up in transliteration.

If you don't care about transliteration you may just use the respective signs located on the keyboard (as described above for e-Nagari OT) typing e.g.

� + ◌� + � => ���

4.1.6 Tamil Fonts

The fonts e-Tamil OT and e-Tamil OTC differ in orthography. e-Tamil OTC has the classical forms for "ai" and others:

e-Tamil OT e-Tamil OTC

�� � ����

���� ���

Tamil with classical spelling may also be typed with the Manipravala font "e-Grantamil". For Tamil transliteration of Sanskrit see 5.1.5 below!

4.1.7 Telugu

There is a special way of spelling certain foreign words in Telugu as the following examples show:

������ `olimpiksb "Olympics", ����� `spōrṭsb "sports", ������ `kānpharensb "conference", ���� `kanekṭb "connect", �� ��� `neṭvarkb "network"

To get such consonant groops like "rts" in "sports" correctly one has to type:

� + ◌ + + ◌ + ! + ◌ to yield ��� The old form of vowelless `nb " as e.g. in #$" `cētanb or %&" `koṟakunb is obtained by using ZWJ:

� + ZWJZWJZWJZWJ + ◌ > "

4.2 Transliteration

For scientific or didactic purposes it may be necessary to convert Indic texts into Latin transliteration. This needs of course a 1:1 correspondence between Indic and Latin letters. INDOLIPI's transliteration program allows for automatic conversion of Indic scripts into Latin in accordance with the standard ISO 15919 and several other systems.

Page 17: indolipi _ manual

INDOLIPI: Manual

- 17 -

4.2.0 Transliteration Systems

� Please refer to the table given in "Concordance of Transliteration Systems.pdf"!

Basically there are two types of transliteration systems

• Systems using diacritics, e.g. ISO 15919, American Library Association - Library of Congress (LC), Indian Alphabet of Sanskrit Transliteration (IAST) and also INDOLIPI's "Instant Transliteration",

• Systems without diacritics, e.g. Harvard-Kyoto (HK), Velthuis (VH) and ITRANS.

With the exception of "Instant Transliteration", all of these systems are Unicode based. So they can be used for data exchange. 4.2.0.1 Systems using Diacritics

Most important is the international standard ISO 15919 covering almost all Indian scripts. It does not (yet) contain [ĕ] and [ŏ] for Devanagari � and � as well as signs for the Sindhi implosives �

� . It allows for an exact transliteration of most Indian scripts from Indic to Latin and back. For Anusvara it has two equivalent transliteration forms [ṁ] and [ṃ]. For INDOLIPI it was decided to use only [ṁ], because Latin signs with a dot below generally stand for retroflexes. But INDOLIPI's Transliteration Converter contains a optional feature for the reversible conversion of [ṁ] to [ṃ].

The Library of Congress system mostly agrees with ISO. It contains [ĕ] and [ŏ] and has special signs for the Sindhi implosives, though the latter are not represented by one glyph, but by a combination of a consonant + a combining diacritical mark. In one instance it does not agree with Devanagari Unicode: Instead of rendering the retroflex implosive � [k ] by a suitable equivalent it has [dʗ] representing a dental implosive [ǧ ]. There seems to be a discrepancy in literature regarding the nature of this sound: Whereas E. Trumpp3 states that it is clearly retroflex, C. Masica in his book on Indoarian Languages4 describes it as retroflex on p. 104, but as dental in fig. 5.1 on p. 107. - Furthermore the LC system lacks equivalents for the prenasalized consonants of Sinhala and it has a few diacritics differing from ISO usage. - Generally LC (like ISO) allows for an exact and reversible transliteration for most Indian scripts.

IAST is confined to transliteration of Sanskrit. It can not be used for New Indoarian and Dravidian languages, because it transliterates the syllabic consonants � � � � by [ṛ ṝ ḷ ḹ] instead of [ ]. In NIA [ṛ] and [ṛh] are needed for the retroflex flaps � [ȍ] and � [ȍȺ], [ḷ] stands for retroflex [ǽ ] both in Dravidian and in some NIA languages (and also for � in Vedic).

� INDOLIPI's "Instant Transliteration" follows ISO, but the signs have no Latin Unicode encoding!

3 Ernst Trumpp: Grammar of the Sindhi Language, 1872 (reprint New Delhi 1986) 4 Colin P. Masica: The Indoarian Languages, Cambridge 1991

Page 18: indolipi _ manual

INDOLIPI: Manual

- 18 -

4.2.0.2 Systems without Diacritics

These systems make use of characters to be found on every Latin keyboard.

Harvard-Kyoto in its basic version uses only vowel and consonant signs. It seems to have no equivalents for NIA � and �. So it may only be used with Sanskrit. The university of Köln (Cologne) version uses a few additional marks like [~] and [^] to render Dravidian scripts.

Velthuis and ITRANS both use some common punctuation signs serving as a kind of diacritics, but they do not use combining diacritical marks as do the systems described above under 4.2.0.1. Velthuis may be used for Indoarian languages, ITRANS for both Indoarian and Dravidian.

As may be seen from the concordance table there are some transliteration variants inside HK-Köln, Velthuis and ITRANS. 4.2.0.3 Role of the Inherent Vowel

Contrary to European scripts, one of the prominent features of the Brahmi script and all of its descendents (i.e. the south and south-east Asian scripts) is that the vowel "short a" [ǩ, Ǥ etc.], when following a consonant, has no sign of its own5. It is considered as being "inherent" in the consonant sign. So Devanagari , Tamil �, Gurmukhi �, Sinhala � or Kannada � all denote `tab. All other vowels are marked in these scripts. Therefore a compensation is needed to mark the absence of the inherent vowel. This mark is called Halant, Virama, Pulli etc. In the above examples `tb without the inherent vowel is to be written Devanagari ! , Tamil �, Sinhala , Kannada . When followed by an other consonant a ligature may be formed 6.

Latin transliteration must render by [ta] and ! by [t], i.e. the unmarked Indic form by a marked Latin form and the marked Indic form by an unmarked Latin form. Automatic insertion of [a] in a transliteration program is not a problem with Sanskrit, Oriya, Sinhala and the Dravidian languages. But in many of the modern Indoarian languages the inherent vowel has disappeared in some instances, without marking that in the script ("silent a").

Examples from Hindi are:

���! pronounced [kǩrna], transliterated [karnā], not [*karanā] �"#�! pronounced [sǩmǩǯȺna], transliterated [samajhnā], not [*samajhanā] �"#!�! pronounced [sǩmǯȺana], transliterated [samjhānā], not [*samajhānā]

Thus inconsequent modern Indic spelling has lead to an inconsequent transliteration.

An automated transliteration of Indic scripts has to take into account these facts. With Sanskrit, the Dravidian languages, Oriya and Sinhala (group A) transliteration is straightforward and fully reversible, while for other languages (group B) certain precautions have to be taken regarding the

5 i.e. it has a "zero-graph". 6 An extreme is the Gurmukhi script which normally has no Halant and makes use of very few ligatures. Therefore reading Panjabi presupposes knowledge of the language.

Page 19: indolipi _ manual

INDOLIPI: Manual

- 19 -

inherent vowel. It is a unique feature of INDOLIPI's fonts that they contain the inherent vowel "a" which can be typed as an invisible letter in the Indic text.

Group Group Group Group A languagesA languagesA languagesA languages: Typing of inherent "a" is optional7. INDOLIPI Transliteration inserts it automatically.

Group B languagesGroup B languagesGroup B languagesGroup B languages: Typing of inherent "a" is obligatory. INDOLIPI Transliteration automatically marks "silent a" by [~].

PanjabiPanjabiPanjabiPanjabi: Typing of inherent "a" is obligatory. "Silent a" is not marked.

ISO 15919 has no sign for Halant. Therefore a colon [:] has to be inserted to distinguish e.g. " [bh] from #� [b:h] or $ [ai] from %� [a:i].

If these precautions are taken an automatic and reversible transliteration to ISO is possible for all languages except Panjabiexcept Panjabiexcept Panjabiexcept Panjabi. The following table gives a rough summing-up of the above. It does not include "Instant Transliteration".

Language group

Keyboard INDOLIPI Transliteration to ISO

ISO Options Transliteration Converter

A:A:A:A: Sanskrit Kannada Malayalam Oriya Sinhala Tamil Telugu

optional: inherent "a"

inserts inherent [a] inserts [:]

B:B:B:B: Asamiya Bangla Gujarati Hindi Marathi Rajasthani

obligatory:obligatory:obligatory:obligatory: inherent "a"

inserts "silent" [~] inserts [:]

Panjabi obligatoryobligatoryobligatoryobligatory:::: inherent "a"

irreversible

reversible: [ṁ] > [ ̃] [ṁ] > [ṃ] `.b > `|b `..b > `||b irreversible: remove [~] remove [:]

reversible: a) all languages: Library of Congress ITRANS b) confined to certain languages: IAST Harvard-Kyoto Velthuis

7 If Instant Transliteration is used, typing of inherent "a" is obligatory!

Page 20: indolipi _ manual

INDOLIPI: Manual

- 20 -

4.2.1 "Instant" Transliteration

� With Instant Transliteration typing of inherent "a" is obligatory for all languages and scripts.

For this feature you have to install the transliteration font e-Latin Indic containing all Latin equivalents in the Unicode positions of the respective Indic characters, i.e. with regard to the Latin characters the text is not in accordance with Unicode. To convert Indic into Latin you may

• either select your text or a portion of it, chose the font e-Latin Indic and you will at once get the transliteration

• or run the font specific Instant Transliteration program which will convert all Indic portions in your file

%&'()*+,+&(-� ���. �/0+ -'1- ��1-�+(� �-1+ �+2-�3,-4+◌5 अस्त्युत्तऺरऺस्यां दिशि देवऺतात्मा हिमालऺयो नामऺ नऺगाधिराजऺः

�� � � ���� ���� � �

அகர முதல எழுத்தெல்லாம ஆதி

You may switch anytime between the Indic font and e-Latin Indic.

If you did type the homorganic nasals as "Nasal + Halant" (e.g. in Hindi using e-Nagari-OTH) they will be correctly transliterated; e.g. both ����� typed as "h + i + n + Halant + d + ī" and ��� ��

typed as "h + i + n + Halant + ZWJ + d + ī" will appear as `hindīb, whereas typing of "h + i + Anusvara + d + ī" will yield `hiṁdīb, which would not be correct.

The nasalized vowels typed as Anusvara (e.g. in Hindi) will not be transliterated as "vowel + tilde" (e.g. `ũb), but as "vowel + `ṁb", which could be acceptable. If you prefer the tilde you have to use the Unicode based INDOLIPI Transliteration as described below under 4.2.2.

If you try to insert hyphens or the like into a word transliterated by the "instant" method you will observe, that this may be impossible, as the latin letters are knit together as they are in Indic "writing syllables". In this case too INDOLIPI Transliteration has to be used (see 4.2.2).

4.2.2 Unicode based INDOLIPI Transliteration 8

� Make sure that the blue language indicator in the System Tray (located at bottom right

hand corner of the desktop by default) shows "DE" (German) or "EN" (English)! If you do not intend to use the inputinputinputinput----macrosmacrosmacrosmacros of INDOLIPI, you may also use any other western keyboard.

The Latin font e-PhonTranslit UNI can be used like any other Latin font (e.g. Times New Roman or Arial) for writing texts in any language written with Latin script. In addition to this it contains

• all signs for transliteration of Indic languages according to ISO 15919,

8 In former versions of INDOLIPI this process was called "PhonTranslit".

Page 21: indolipi _ manual

INDOLIPI: Manual

- 21 -

• all signs needed for the transliteration systems Library of Congress, IAST, Harvard-Kyoto, Velthuis and ITRANS,

• a lot of arrows, mathematical and logical signs,

• all presently valid IPA symbols with automatic correct positioning of diacritics according to the width of the base letters.

All these signs are at their due Unicode positions, i.e. this font is compatible with all other Unicode based fonts (provided that they contain the respective signs).

Working with this font is made easier, if you have installed the template "INDOLIPI.dot". By using this template you will also be enabled to automatically transliterate Indic script texts into "e-PhonTranslit UNI" and back to Indic script ... provided that you observe the precautions given in 4.2.0.3 above and 4.3 below.

A linguistic text in Hindi typed with "e-PhonTranslit UNI" could e.g. contain the following interlinear presentation of a Devanagari text with ISO transliteration, IPA transcription and English translation:

"$�! "$�! "$�! "$�! %!& %!& %!& %!& '(� '(� '(� '(� )(*!)(*!)(*!)(*!++++ mer=ā �āī �ar gay=ā. mer=a bȺai gȺǩr gǩj=a my=SING.MASC brother house go=PF.SING.MASC "My brother went home."

Of these lines only the first one is typed with e-Nagari OTH. The rest is e-PhonTranslit UNI. Line 2 is obtained from line 1 by running INDOLIPI's ISO transliteration feature to yield

merā bhāī ghar gayā))))

Line 3 is obtained from ISO transliteration by using the IPA feature:

mera bȺai gȺǩr gǩja

The rest as well as the separators [=] were typed directly. All five lines are Unicode based!

Automatic conversion to other transliterating systems using INDOLIPI's Transliteration Converter yields:

Library of Congress: merā bhāī ghar gayā)))) Harvard-Kyoto: merA bhAI ghar gayA)))) Velthuis: meraa bhaaii ghar gayaa)))) ITRANS: meraa bhaaii ghar gayaa))))

� So INDOLIPI's ISO-transliteration is the basic step from complex Indic scripts into Latin. For conversion to other transliteration systems ISO serves as a platform for switching between different transliteration methods (see INDOLIPI Working Scheme).

A more detailed description of INDOLIPI Transliteration and the Transliteration Converter is given under 4.3 below.

Page 22: indolipi _ manual

INDOLIPI: Manual

- 22 -

4.3 Working with the INDOLIPI template

� Important:::: If you intend to use the template INDOLIPI.dot, please pay attention to the fact, that the actual versions of MS Word contain a protection against Macro Virus attacks. But there is no distinction between "good" and "malicious" macros. Of course the macros of INDOLIPI belong to the good ones. With the highest grade of security chosen the macros will be deactivated and the INDOLIPI.dot will not run. So choose a lower grade of security!

� Advice: If you want to transliterate a portion of Indic text contained in western text, copy that Indic text into a new file, run the respective conversion commands in that new file and copy the result back to your main text. A similar procedure may be used for the reverse operation.

� Speed: The complex nature of transliteration from Indic Open Type fonts to Latin may be more or less time-consuming, depending on the speed of your computer. Therefore it is recommended to begin trying smaller portions of text.

Please refer to the INDOLIPI Working Scheme to see how the MS Word template INDOLIPI is connected with the other items of this software package.

4.3.1 The Menu Bar and its Functions

� The open eye symbol shown with some of the commands tells you that there might be restrictions for certain scripts or languages. A click on such a command will be followed by a warning indicating possible risks so that you may decide if you want to continue or not.

When the INDOLIPI template with its helpfull macros for easy use of OT fonts and transliteration is loaded, you will see a menu bar with mostly names of Indic languages and scripts. The general use of these items is explained for the Gujarati language:

� Please mind typing the inherent "a" in e-Gujarati OT!

If you click on "Gujarati" a menu will open with the following commands:

Choose Font e-Gujarati OT Transliteration Change Digits Export Shruti

These have the following functions:

Page 23: indolipi _ manual

INDOLIPI: Manual

- 23 -

• Choose Font e-Gujarati OT: Chooses this font either starting from the present coursor position or for a selected portion of text. (Don't forget to choose the blue language indicator "GU" for Gujarati in the System Tray!)

• Transliteration with the following subcommands:

o Gujarati to ISO: This feature transliterates all Gujarati portions in your file from Gujarati Unicode to e-PhonTranslit UNI.

o ISO to Gujarati: This feature transliterates from e-PhonTranslit UNI to Gujarati Unicode. The text must contain only Gujarati!

o Old Files to ISO: To be used with files based on older versions of INDOLIPI (see 4.3.1.15 below).

o Instant Transliteration: The two subcommands either change all e-Gujarati OT signs in your file to e-Latin Indic, or all e-Latin Indic portions back to e-Gujarati OT. They work only with e-Gujarati OT, not with other Gujarati fonts! For other Fonts please use the separate Instant Transliteration function on the menu bar (see 4.3.1.12 below).

• Change Digits allows you to convert Indic digits to western and vice versa.

• Export: Removes the inherent short "a" to make your text compatible with other Gujarati fonts which do not contain "a". - Warning! This operation is not undoable for Gujarati!

• Shruti: Lets you choose the Shruti font provided by Microsoft and switch between Shruti and e-Gujarati OT.

For the other scripts and languages please refer to the following informations.

4.3.1.1 Asamiya + Bangla

� Please mind typing the inherent "a" in e-Bengali OT for both languages!

The menu items let you choose the font e-Bengali OT for Assamese and Bengali, change it to Instant Transliteration and back, and run INDOLIPI's transliteration to ISO for both languages separately to ensure distinction between the two forms of "r" and "v".

Automatic insertion of inherent vowel is not possible!

"Export" removes the inherent vowel "a" to make your text compatible with other Bengali fonts which do not contain "a". - Warning! This operation is not undoable for Assamese and Bengali!

"Vrinda" lets you choose the Vrinda font provided by Microsoft and switch between Vrinda and e-Bengali OT. 4.3.1.2 Grantha

For Grantha please refer to the separate Grantha Manual!

Page 24: indolipi _ manual

INDOLIPI: Manual

- 24 -

4.3.1.3 Gujarati (see above 4.3.1)

4.3.1.4 Kannada

The menu items let you choose the font e-Kannada OT, change it to Instant Transliteration and back, and run INDOLIPI's transliteration to ISO.

"Insert Inherent Vowel" inserts "a".

"Export" removes the inherent vowel "a" to make your text compatible with other Kannada fonts which do not contain "a". - This operation is reversible for Kannada!

"Tunga" lets you choose the Tunga font provided by Microsoft and switch between Tunga and e-Kannada OT.

4.3.1.5 Malayalam

For Malayalam you may choose between modern (e-Malayalam OT) and classical (e-Malayalam OTC) orthography.

The menu items let you choose the font e-Malayalam OT, change it to Instant Transliteration and back, and run INDOLIPI's transliteration to ISO.

"Insert Inherent Vowel" inserts "a".

"Export" removes the inherent vowel "a" to make your text compatible with other Malayalam fonts which do not contain "a". - This operation is reversible for Malayalam!

"Kartika" lets you choose the Kartika font provided by Microsoft and switch between Kartika and e-Malayalam OT.

The features of "Classical Orthography" let you choose the font e-Malayalam OTC, change it to e-Malayalam OT or Instant Transliteration and back, and run INDOLIPI's transliteration to ISO.

4.3.1.6 Nagari

The menu items Sanskrit, Hindi, Marathi and Rajasthani let you choose the respective fonts, change them to Instant Transliteration and back, and run INDOLIPI's ransliteration to ISO.

� Please mind typing the inherent "a" in all languages except Sanskrit!

� For Sanskrit you may also automatically insert the inherent vowel "a".

"Export" removes the inherent vowel "a" to make your text compatible with other Devanagari fonts which do not contain "a". - Warning! This operation is not undoable for Hindi, Marathi and Rajasthani! But it is reversible for Sanskrit....

"Mangal" lets you choose the Mangal font provided by Microsoft and switch between Mangal and the e-Nagari fonts.

For transliteration of Sanskrit from Devanagari to Grantha please refer to the separate Grantha Manual!

Page 25: indolipi _ manual

INDOLIPI: Manual

- 25 -

4.3.1.7 Oriya (see also 5.3 below)

The menu items let you choose the font e-Oriya OT, change it to Instant Transliteration and back, and run INDOLIPI's transliteration to ISO.

"Insert Inherent Vowel" inserts "a".

"Export" removes the inherent vowel "a" to make your text compatible with other Oriya fonts which do not contain "a". - This operation is reversible for Oriya!

4.3.1.8 Panjabi

� Please mind typing the inherent "a" in e-Panjabi OT!

The menu items let you choose the Gurmukhi font e-Panjabi OT, change it to Instant Transliteration and back and run INDOLIPI's transliteration to ISO. But mind:

� ISO transliteration for Gurmukhi is not reversible!

Automatic insertion of inherent vowel is not possible!

"Export" removes the inherent vowel "a" to make your text compatible with other Gurmukhi fonts which do not contain "a". - Warning! This operation is not undoable for Panjabi!

"Raavi" lets you choose the Raavi font provided by Microsoft and switch between Raavi and e-Panjabi OT.

4.3.1.9 Sinhala (see also 5.4 below)

The menu items let you choose the font e-Sinhala OT, change it to Instant Transliteration and back, and run INDOLIPI's transliteration to ISO.

"Insert Inherent Vowel" inserts "a".

"Export" removes the inherent vowel "a" to make your text compatible with other Sinhala fonts which do not contain "a". - This operation is reversible for Sinhala!

4.3.1.10 Tamil

For Tamil you may choose between modern (e-Tamil OT) and classical (e-Tamil OTC) orthography.

The menu items let you choose the font e-Tamil OT, change it to Instant Transliteration and back, and run INDOLIPI's transliteration to ISO.

"Insert Inherent Vowel" inserts "a".

"Export" removes the inherent vowel "a" to make your text compatible with other Tamil fonts which do not contain "a". - This operation is reversible for Tamil!

"Latha" lets you choose the Latha font provided by Microsoft and switch between Latha and both e-Tamil OT and e-Tamil OTC.

Page 26: indolipi _ manual

INDOLIPI: Manual

- 26 -

The features of "Classical Orthography" let you choose the font e-Tamil OTC, change it to e-Tamil OT or "instant transliteration" and back, and run INDOLIPI's transliteration to ISO.

For Tamil transliteration of Sanskrit texts see 5.1.5 below! 4.3.1.11 Telugu

The menu items let you choose the font e-Telugu OT, change it to Instant Transliteration and back, and run INDOLIPI's transliteration to ISO.

"Insert Inherent Vowel" inserts "a".

"Export" removes the inherent vowel "a" to make your text compatible with other Telugu fonts which do not contain "a". - This operation is reversible for Telugu!

"Gautami" lets you choose the Gautami font provided by Microsoft and switch between Gautami and e-Telugu OT. 4.3.1.12 Instant Transliteration

"Choose Font e-Latin Indic" chooses the font e-Latin Indic either starting from the present coursor position or for a selected portion of text. 4.3.1.13 Transliteration Converter

This menu item allows for operations with the font e-PhonTranslit UNI. It lets you choose this font either starting from the present coursor position or for a selected portion of "western" text and it allows for conversion between different transliteration systems.

If you click on "Transliteration Converter" a menu will open with the following commands:

Choose Font e-PhonTranslit UNI ISO 15919 IAST Harvard-Kyoto Library of Congress Velthuis ITRANS IPA

These have the following functions:

• Choose font e-PhonTranslit UNI: Chooses this font either starting from the present coursor position or for a selected portion of text.

• ISO 15919 with the following subcommands:

o ISO to Indic scripts: Allows for direct transliteration from ISO back to Indic for all scripts and languages supported by INDOLIPI.

o ISO Anusvara: Changes [ṁ] to [ ̃] and back or [ṁ] to [ṃ] and back.

Page 27: indolipi _ manual

INDOLIPI: Manual

- 27 -

o ISO Vertical Line: By default danda and double danda are transliterated by INDOLIPI as `.b and `..b. You may change them to vertical lines `|b and `||b by using this feature.

o remove silent a: Removes the [ª] symbol from NIA texts. Please refer to 4.2.0.3 above. Warning! This operation is not undoable!

o remove colon: Removes the [ : ] symbol, but not the punctuation colon. Please refer to 4.2.0.3 above. Warning! This operation is not undoable!

• IAST with the following subcommands:

� IAST should be used only with Sanskrit!

o ISO to IAST

o IAST to ISO

o IAST to Sanskrit: Allows for direct transliteration from IAST back to Indic for Sanskrit (Devanagari and Grantha).

• Harvard-Kyoto: Should be used only with Sanskrit and Tamil (and perhaps other Dravidian languages). Subcommands:

o Sanskrit: Converts ISO to Harvard-Kyoto and Harvard-Kyoto to ISO or directly to Indic for Sanskrit (Devanagari, Grantha and Grantamil).

o Tamil: Converts ISO to Harvard-Kyoto for Tamil and Harvard-Kyoto to ISO or directly to Indic for Tamil (modern and classical spelling).

• Library of Congress with the following subcommands:

o ISO to LC

o LC to ISO

• Velthuis with the following subcommands:

o ISO to Velthuis

o Velthuis to ISO

• ITRANS: Tamil (and perhaps other Dravidian languages) needs a treatment different from the other languages:

o Tamil (ISO to ITRANS and back)

o others (ISO to ITRANS and back)

� Note! - As may be seen from the concordance table there are some transliteration variants inside HK-Köln, Velthuis and ITRANS. With Velthuis and ITRANS the INDOLIPI Converter transliterates Indic scripts to the first alternates in the table, while considering all Latin alternates when transliterating back into Indic. With HK-Köln (for Tamil) it considers only the second (i.e. Tamil specific) alternates.

Page 28: indolipi _ manual

INDOLIPI: Manual

- 28 -

• IPA: "IPA" transforms the ISO version of your text to the International Phonetic Alphabet (IPA). The result is a kind of transcription which is approximately phonetic for Sanskrit. For the other Indic languages (may be also for Sanskrit) it may need some additional manual substitutions using INDOLIPI's keyboard layouts e-IPA DE or e-IPA EN. This IPA feature is meant to spare you transcription work.

o ISO to IPA

o IPA to ISO

o Halant: In the IPA version Halant is shown as `ˆb to ensure reversibility. For a printout you may want to make `ˆb invisible. For this purpose the following subcommands a meant:

o Make Halant invisible: Replaces `ˆb by an invisible sign.

o Make Halant visible: Changes invisible Halant to visible `ˆb.

o Remove Halant: Removes Halant irreversibly in order to ensure compatibility with other fonts.

4.3.1.14 Style

This command assigns language and script specific styles (German: Formatvorlage) to the whole document. This can be done in different ways depending on your purpose of writing:

• Documents containing only one language and script (e.g. Telugu or Bengali): You may choose a style before beginning to type a new document or assign a style to an existing document.

• Documents containing more than one language and script: It is recommended to choose the style "INDOLIPI" before beginning to type a new document and change only the font when changing the language.

• The default style of INDOLIPI is "INDOLIPI".

• The font size of all styles is 14 pt. So when changing the style of a document all font sizes are put to 14 pt!

• After choosing or changing the style of a document you may change the font size and other formatting parameters as you like

4.3.1.15 Digits

You will have observed that in the menu bar each one of the language items contains a feature named "digits". It lets you change all digits in your file from Indic to western and back. With INDOLIPI's keyboard layouts the default forms are Indic for Assamiya, Bengali, Grantha, Gujarati, Gurmukhi, Devanagari and Oriya, but Western with Kannada, Malayalam, Sinhala, Tamil and Telugu.

Page 29: indolipi _ manual

INDOLIPI: Manual

- 29 -

4.3.1.16 Old Files

� Using the new transliteration commands within an old file is not possible!

If you would like to transliterate a text typed with a former version of INDOLIPI you can do that in two ways:

• Copy your text and paste it into a new file based on the new INDOLIPI update and then run the transliteration program.

or

• Transliterate your text directly using the "Old Files" feature to be found in the menu bar under each language or script.

4.3.2 Shortcuts

INDOLIPI contains shortcuts for easy typing of signs with indological diacritics according to ISO.

� For the positions of signs on an English (USA) keyboard please refer to the table "Keyboard layout Transliteration.pdf".

4.3.3 From Latin transliteration to Indic scripts

It is possible to type Indic texts in e-PhonTranslit UNI using any of the transliteration systems and to transform them afterwards to an Indic script ... if you keep in mind certain facts and adhere to certain rules:

• The text must contain words for only one Indic script.

• No western words in western orthography are allowed, because these would also be transliterated to Indic. The result would be nonsense. - But western words in a transliteration of Indic orthography (like `reḍiyob for "radio") will be acceptable.

• Capital letters are allowed only for those systems which use them to represent certain Indic letters, as do Harvard-Kyoto, Velthuis and ITRANS.

• The inherent vowel `ab must always be typed!

• With NIA languages like Bengali, Hindi etc. the "silent a" must be typed as [~].

• To convert your text to Indic, use the functions of the Transliteration Converter.

• This feature can not be used with Panjabi!

� Advice: If you want to transliterate a portion of Indic text contained in western

text, copy that Indic text into a new file, run the respective conversion commands in that new file and copy the result back to your main text. A similar procedure may be used for the reverse operation.

Page 30: indolipi _ manual

INDOLIPI: Manual

- 30 -

5. Instructions for other languages and scripts

For various reasons the following languages and scripts need a special treatment.

5.1 Grantha

For Grantha please refer to the separate Grantha Manual!

5.2 Tamil transliteration of Sanskrit

Exact transliteration of Sanskrit to Tamil script may be done in two ways:

• Using diacritic digits for rendering Sanskrit sounds not found in Tamil script. These digits are contained in the Tamil fonts e-Tamil OT, e-Tamil OTC and e-Grantamil and in INDOLIPI's Tamil keyboard layout.

• Importing the missing signs from Grantha script (see Grantha Manual under "Manipravala").

Examples:Examples:Examples:Examples:

Devanagari 6 7 2 8 9

Tamil with digits � �² �³ �⁴ �

Grantha signs � � � � �

Latin Transliteration k kh g gh ṅ

Sanskrit (Devanagari) "20+:� - bhagavadgītā

Tamil ������� pakavatkītai

Tamil with digits �⁴�³��³��³ pa⁴ka³vat³kī³tā

Grantha ������� bhagavadgītā

5.3 Vedic Sanskrit

5.3.1 Devanagari

For Vedic texts other Unicode based fonts as e.g. "Chandas.ttf" should be used. They can be typed with INDOLIPI's keyboard layouts.

5.3.2 Grantha

See separate Grantha Manual!

Page 31: indolipi _ manual

INDOLIPI: Manual

- 31 -

5.4 Oriya MS Windows XP does not yet allow for Oriya as input language. Therefore one of the keyboard layouts e-Oriya DE or e-Oriya EN has to be installed under a western language, e.g. German or English. This will be changed in Microsoft's new Windows 7.

The font e-Oriya OT is Unicode based.

5.5 Sinhala MS Windows XP does not yet allow for Sinhalese as input language. Therefore one of the keyboard layouts e-Sinhala DE or e-Sinhala EN has to be installed under a western language, e.g. German or English. This will be changed in Microsoft's new Windows 7.

The font e-Sinhala OT is Unicode based. Due to the restrictions of MS Windows, presently only the simplified modern spelling is possible.

5.6 Tibetan

INDOLIPI contains keyboard layouts for Unicode based Tibetan fonts under KBDTibDE and KBDTibEN. As MS Windows does not yet allow for Tibetan as input language, the keyboard layouts "e-Tibetan DE" and "e-Tibetan EN" have to be installed under a western language, e.g. German or English.

A well elaborated Unicode based Tibetan OT font is "Tibetan Machine Uni" which can be downloaded and used free of charge from www.thdl.org, the website of the Tibetan and Himalayan Digital Library of the University of Virginia. But INDOLIPI's Tibetan keyboard layout will also work with any other Unicode based OT font.

Automatic transliteration of Tibetan with INDOLIPI is not yet possible!

5.7 IPA

The font e-PhonTranslit UNI contains all presently valid IPA symbols contained in Unicode.

IPA signs may be entered by "Insert - Symbol". IPA diacritics will be automatically put to their correct position depending on the width of the base letter. This may be impossible with some other Unicode based fonts containing IPA signs (e.g. Arial Unicode MS).

A much easier way of typing IPA is to use the keyboards e-IPA DE or e-IPA EN, which should be installed according to 3.4 above, e.g. under "German", "English" or another western language.

"Keyboard Tables English.pdf" and "Keyboard Tables German.pdf" of INDOLIPI each contain an extra table for IPA.

When used together e-PhonTranslit UNI and the keyboards "e-IPA DE" and "e-IPA EN" will greatly facilitate typing of texts containig IPA signs: You don't have to change the font, but may just change the keyboard by mouseclick.

Page 32: indolipi _ manual

INDOLIPI: Manual

- 32 -

6. Additional remarks and hints

• e-mail: The fonts and keyboard layouts provided with INDOLIPI allow for typing of e-mails. But remember: Your text should not contain the inherent vowel "a", except if the addressee has installed the respective e-font on his PC.

• WordPad: Type whatever you like. Though transliteration is not possible.

• MS Excel: When you enter a word you will first get a "raw" version which will be transformed by pressing ENTER. Transliteration is not possible.

• Other Indic fonts: If they are Unicode based, they are compatible with e-Fonts. But they may not contain the inherent vowel "a".

• Built-in MS Indic fonts: When choosing the language in the System Tray MS Word will offer you the respective built in font by default. Of course you may use these fonts with INDOLIPI. But remember: They don't have the inherenet vowel and may contain less or other ligatures. For the fonts to be used with INDOLIPI please refer to the "Font Scheme" file. To avoid appearing of the default fonts, you may first choose the font (e.g. by choosing the respective style) and then the language.

• Sorting of Indic scripts is possible.

• Writing scientific texts in a western language mixed up with Indic fonts is possible, but you must observe the advices given under 4.3.3 above. By the way, the font hights of all e-Fonts are the same. So when you use e-PhonTranslit UNI for the western text together with Indic e-Fonts, the line distance will not differ. Other Latin fonts (e.g. Times New Roman) may show varying distances.

• INDOLIPI and the Internet: INDOLIPI's keyboard layouts are useful for typing Indic words in the Internet, e.g. when searching Indic Wikipedias. You just have to choose the language and start typing. The scripts used will normally not be e-Fonts but the built-in MS fonts.

• This present file is a good demonstration of the possibilities of INDOLIPI. The next

chapter gives some examples for the automatic conversion into different transliteration

systems,

Page 33: indolipi _ manual

INDOLIPI: Manual

- 33 -

7. Examples

7.1 Example 1

Suppose you downloaded the following Sanskrit text from Muktabodha's digital library in Velthuis transliteration: mano hi dvividha.m prokta.m "suddha.m caa"suddhameva ca |

a"suddha.m kaamasa.mkalpa.m "suddha.m kaamavivarjitam || 1 ||

Change the font to e-PhonTranslit UNI: (Select the text and run Transliteration Converter > Choose font e-PhonTranslit UNI)

or (Copy the text, paste it into a new file based on INDOLIPI and choose Style > INDOLIPI)

mano hi dvividha.m prokta.m "suddha.m caa"suddhameva ca | a"suddha.m kaamasa.mkalpa.m "suddha.m kaamavivarjitam || 1 ||

Convert to ISO: (Transliteration Converter > Velthuis > Velthuis to ISO)

mano hi dvividhaṁ proktaṁ śuddhaṁ cāśuddhameva ca . aśuddhaṁ kāmasaṁkalpaṁ śuddhaṁ kāmavivarjitam .. 1 ..

If you prefer the vertical lines instead of the periods, run:

(Transliteration Converter > ISO 15919 > ISO Vertical Line > Period to Vertical Line)

mano hi dvividhaṁ proktaṁ śuddhaṁ cāśuddhameva ca | aśuddhaṁ kāmasaṁkalpaṁ śuddhaṁ kāmavivarjitam || 1 ||

Convert to Devanagari: (Transliteration Converter > ISO 15919 > ISO to Indic scripts > ISO to Sanskrit (Devanagari)

1+�� �� �;�03+◌� �<�=+◌� .)>+◌� ?-.)>+1/0+ ?+ @

%.)>+◌� 6-1+A+◌�6+B�+◌� .)>+◌� 6-1+�00+�4C +D E F E

Or to Grantha: (Transliteration Converter > ISO 15919 > ISO to Indic scripts > ISO to Sanskrit (Grantamil)

�� � ! "#$% &'(�)% *+,% -�*+,�.�� -� / 0*+,% ����1%��23% *+,% ����#��45��6 7 8 7

Page 34: indolipi _ manual

INDOLIPI: Manual

- 34 -

7.2 Example 2

Suppose you typed the following Sanskrit text in Harvard-Kyoto:

astyuttarasyAM dizi devatAtmA himAlayo nAma nagAdhirAjaH

Convert to ISO: (Transliteration Converter > Harvard-Kyoto > Sanskrit > HK (Sanskrit) to ISO)

astyuttarasyāṁ diśi devatātmā himālayo nāma nagādhirājaḥ

Convert to Devanagari: (Transliteration Converter > ISO 15919 > ISO to Indic scripts > ISO to Sanskrit (Devanagari)

%&'()*+,+&(-� ���. �/0+ -'1- ��1-�+(� �-1+ �+2-�3,-4+◌5

Or to Grantha: (Transliteration Converter > ISO 15919 > ISO to Indic scripts > ISO to Sanskrit (Grantamil)

019:�9�;�1<�% => ����@� !��A�B� C��� C���D;�EF

Or to Malayalam script (classical spelling): (Transliteration Converter > ISO 15919 > ISO to Indic scripts > ISO to Malayalam (classical)

��������� ���� �� !�"� #���$�%� ��� �&�'���(◌)

7.3 Example 3

Suppose you want to send the following Tamil text to somebody having neither a Tamil font nor a Latin font with indological diacritics installed on his PC:

��� � � ��� �� ��!" ��#$� %&%' ()���*�(# �$+, ��� !�������+�

Run INDOLIPI's transliteration to ISO: (Tamil > Transliteration > Tamil to ISO)

paṭukkaiyai viṭṭu eḻuntatum pāccā nērē kuḻāyaṭikkuc ceṉṟu pal tulakkiṉāṉ.

Convert to Harvard-Kyoto: (Transliteration Converter > Harvard-Kyoto > Tamil > ISO to HK (Tamil))

paTukkaiyai viTTu ezuntatum pAccA nErE kuzAyaTikkuc cen2Ru pal tulakkin2An2.

Page 35: indolipi _ manual

INDOLIPI: Manual

- 35 -

7.4 Example 4

Suppose you want to generate an IPA transcription of the text of Sample 3:

Run INDOLIPI's transliteration to ISO: (Tamil > Transliteration > Tamil to ISO)

paṭukkaiyai viṭṭu eḻuntatum pāccā nērē kuḻāyaṭikkuc ceṉṟu pal tulakkiṉāṉ.

Convert to IPA: (Transliteration Converter > IPA > ISO to IPA)

pǩȘȚkˆkaǺ ʢjaǺ ʢ vǺȘˆȘȚ eȋȚnʝˆt ʝǩt ʝȚmˆ paȷˆȷa nʝere kȚȋajǩȘǺkˆkȚȷˆ ȷeṉˆṟȚ pǩlˆ t ʝȚlǩkˆkǺṉaṉˆ.

As this text still contains INDOLIPI's Halant signs [ˆ] you can make them invisible. This step is reversible, because Halant is not removed, but replaced by an invisible nonspacing sign: (Transliteration Converter > IPA > Halant > Make Halant Invisible)

pǩȘȚk�kaǺ ʢjaǺ ʢ vǺȘ�ȘȚ eȋȚnʝ�t ʝǩt ʝȚm� paȷ�ȷa nʝere kȚȋajǩȘǺk�kȚȷ� ȷeṉ�ṟȚ pǩl� t ʝȚlǩk�kǺṉaṉ�.

You may also remove both visible and invisible Halant irreversibly (you will get a warning ): (Transliteration Converter > ISO > Halant > Remove Halant)

pǩȘȚkkaǺ ʢjaǺ ʢ vǺȘȘȚ eȋȚnʝt ʝǩt ʝȚm paȷȷa nʝere kȚȋajǩȘǺkkȚȷ ȷeṉṟȚ pǩl t ʝȚlǩkkǺṉaṉ.

The result is rather a draft transcription, which you might easily improve using one of INDOLIPI's keyboard layouts e-IPA DE or e-IPA EN.

8. Improvements and Updates

All critical remarks and suggestions will be welcome. Users are especially requested to inform the author about any problems occurring with this software.

Work on improvements and enlargement of the program is in progress. Updates will be published on this website without prior notice.

If you wish to be informed automatically by e-mail about new updates, please mail your address to kniprath(-at-)online(-dot-)de. Elmar Kniprath