| Home |
(Tested using IE7, IE8, IE11 and Firefox 25)
1. Select the SampleSanskritDiacriticsTextRT1 function and then click the corresponding (Apply selected function to selected text) button.
2. Check “Text only,” select the ToUnicode function and then click to see the apostrophe and the quotes enhanced.
3. Select the AddSanskritDiacritics function and then click to see Sanskrit diacritics added to 14 letters.
4. Select the <b>…</b> (bold) template and then click the corresponding (Apply selected template to selected text) button.
5. Select the <i>…</i> (italic) template and then click .
6. Click the (middle) button to see the italics removed.
7. Click again to see the bold removed.
8. Click again to see the Sanskrit diacritics removed.
9. Click again to see the special Unicode characters changed back to ANSI.
10. Click the (middle) button to undo the last .
Each word of the selected text is compared to either one or two word lists. If the particular word appears on a list of 1100 common English words/HTML reserved words, then no further action is taken for that particular word. If it doesn’t, then it is compared to a 2nd list of words: A list of 94,000 Sanskrit words! If it is found on the 2nd list, then diacritics are added.
The list of Sanskrit words was obtained by extracting those Sanskrit words which contain diacritics from 4671 HTML files on http://causelessmercy.com/. Those HTML files included Srila Prabhupada’s unchanged books, his tape transcriptions, and the Vaisnava Songbook. Although some additional words were added manually later on, almost all of the words were extracted using a completely automated process.
Many words can have the diacritics added in more than one way. For example, Krsna could be Krsna or Krsna, but since Krsna is more common, that’s the version which was selected. Sometimes a particular Sanskrit word appears with diacritics, but it appears more often without diacritics. In such a case, the word was not selected for inclusion in the list.
Examples of a few of the many other special codes are as follows:
... → …
"…" → “…”
'…' → ‘…’
[copy] → ©
[reg] → ®
[trade] → ™
[sup1] → ¹
[sup2] → ²
[sup3] → ³
[dagger] → †
[Dagger] → ‡
| Home | THIS WEB PAGE URL: http://Pratyatosa.com/?P=5i |