| Home |
Latest update: The and buttons now convert -- to — and vice versa |
|
Utility: Automatically Add Diacritics to 94,000 Different Sanskrit Words!
Tested using IE6, IE7, FireFox 3, Safari 4 and Opera 9 (FireFox 3 works best)
To see two examples of how this web page works:
1. Click the button.
2. Click the button to see the apostrophe and the quotes enhanced.
3. Click the button to see the diacritics added.
4. Click .
5. Type some Sanskrit words minus the diacritics such as "Lord Sri Krsna" (including the quotes) into the text box above.
6. Click .
7. Click .
NOTE: Clicking causes each word to be checked for the possibility of changing it to it’s Sanskrit diacritic equivalent by consulting a list of more than 94,000 Sanskrit words! This list was obtained by extracting those Sanskrit words which contain diacritics from Śrila Prabhupāda’s unchanged books, his tape transcriptions, and the Vaiṣṇava Songbook in the form of 4671 .htm files on http://causelessmercy.com/. The words were extracted using a completely automated process. No manual editing was done. Many words can have the diacritics added in more than one way. For example, Krsna could be Kṛṣṇa or Kṛṣṇā, but since Kṛṣṇa is more common, that’s the one that’s automatically used. Sometimes a particular word may appear, on occasion, in the books / tapes / songbook with diacritics, but more commonly without diacritics. Therefore, no diacritics are added in these particular cases.
The text box above is a “rich formatting” text box. The following are some of Internet Explorer’s built-in keyboard “shortcuts” which may be used within the text box:
1. “Enter” - New paragraph.
2. “<Shift>Enter” - New line.
3. “<Ctrl>B” - Toggle selected text between bold and nonbold.
4. “<Ctrl>C” - Copy selected text to the clipboard.
5. “<Ctrl>I” - Toggle selected text between italic and nonitalic.
6. “<Ctrl>K” - Make selected text into a link.
7. “<Ctrl>U” - Toggle selected text between underlined and not underlined.
8. “<Ctrl>V” - Replace selected text with the clipboard contents.
9. “<Ctrl>X” - Cut selected text after copying it to the clipboard.
NOTE: Before clicking , Unicode characters may be entered in 2 different ways. For example, the Greek letter Delta (Δ) may be entered as “[Delta],” or as “[#916].” Certain fractions may be entered in 3 different ways. For example, ¾ may be entered as “3/4,” as “[frac34],” or as “[#190].” (See “Table of HTML Special Character Designations” - http://llbest.com/HTMLSpecialCharacterTable.htm)
Examples of a few of the many other special codes are as follows: ... (3 periods) = …, "..." = “…”, '...' = ‘…’, [copy] = ©, [reg] = ®, [trade] = ™, [sup1] = ¹, [sup2] = ², [sup3] = ³, [dagger] = †, and [Dagger] = ‡.
NOTE: This web page was created using Notepad. In order to speed up the function, it was written in Perl. Unfortunately, this means that it will not work when saved to your local hard drive.
Questions, comments and suggestions are welcome.
Pratyatoṣa Dāsa ()
| Home | THIS WEB PAGE URL: http://pratyatosa.com/AddDiacritics.cgi |