Mylai (Included through MTX files for inter-conversion)
Auto-detect encoding
Just select the text and do a "Check Encoding" and Murasu Anjal will tell you what encoding the text is in. Anjal, TSCII, TAB and Unicode are automatically recognised.
[181]>>177 の *.mtx は対応している符号化の変換表。
TSCII 1.6,
TAB (TamilNet99 Bilingual Encoding),
Mylai,
Murasu Tamil encoding,
Anjal Tamil encoding,
Anjal Roman encoding,
Tamil Unicode Character Set
Auto-detection is a feature in the converter that detects the format of the text automatically. If the text is is any of the commonly used encoding formats, namely TSCII, TAB or Unicode, the converter will automatically set the appropriate encoding parameters for the user. This feature will be very useful if the user does not know which particular encoding the document is in.
If the encoding used is not one of those mentioned above, the converter will alert the user that the document is in an unknown encoding. In this situation, the user can set the encoding manually. Most documents on the Internet today use one of the common encoding formats listed above.
The converter already supports the following encoding formats : TSCII 1.6, TSCII 1.7, TAB, TAM, Anjal, Murasu-6, Murasu-7, Murasu-8, Kanian, Vikatan, Unicode, Romanised Tamil and Mylai. To add a new encoding, all you need is an MTX file for that encoding. (see next question).
An MTX file is a file that contains information about the encoding. To create and MTX file, you may use the MTX Editor that's bundled free with Murasu Anjal-2000 as a plug-in.
This site is set up with Dynamic Fonts for Tamil and Sanskrit texts. The dynamic fonts are supported by Netscape browser 4.06 (and later releases) and Microsoft Internet Explorer 4.*. Netscape users: please make sure the option, Use document specified fonts, including Dynamic Fonts, under Edit->Preferences->Appearance->Fonts is set.
i) Invoke "x-user-defined" case for the encoding in the META header
ii) Invoke font face tags
Based on several successful trials on several of the commonly used web-browsers in different computer OS, IWC recommends that the Tamil webpages carry the tamil text as raw 8-bit text and with Meta-Headers that specify "x-user-defined" as the charset.
+# Tamil fonts (TSCII encoding : see http://www.tscii.net)
+# See also http://bugzilla.mozilla.org/show_bug.cgi?id=204039.
+# These fonts have pseudo-Unicode cmap with TSCII interpreted as Windows-1252.
+encoding.tsc_paranarpdf.ttf = x-tamilttf-0.wide
+encoding.tsc_paranbold.ttf = x-tamilttf-0.wide
+encoding.tsc_paranarho.ttf = x-tamilttf-0.wide
+encoding.tsc_kannadaasan.ttf = x-tamilttf-0.wide
+encoding.tscu_comic.ttf = x-tamilttf-0.wide
+encoding.tscu_times.ttf = x-tamilttf-0.wide
+encoding.tscu_paranar.ttf = x-tamilttf-0.wide
+encoding.tscu_paranarbold.ttf = x-tamilttf-0.wide
+
+
+# These two fonts don't have Unicode cmap but have pseudo-Apple Roman cmap
+# with TSCII assignment.
+encoding.tsc_aandaal.ttf = x-tscii-0
+encoding.tsc_aparanarpdf.ttf = x-tscii-0
It has been decided during late 1998 that Tamil Script Code for Information Interchange (TSCII) encoding developed by the Cyber Tamil community will be used for all Etexts. Hence all the Etxts (and web versions) of Tamil works released in 1999 will be in TSCII format. Etexts released in the maiden year 1998 are still available in Inaimathi/Anjal and Mylai font formats.
TSCCONVERTER is a Windows based utility that allows you to convert text files that were created in the following fonts to TSCII encoding: Amutham, Baamini, Divya, Elango, Inaimathi, Kalki, Mylai, TBoomi, Shree802, TMNews, and Marx. Several other fonts that are based on Tamil typewriter keyboard may also work.
For Tamil, apart from providing 'PhoneticTransliteration' and UserDefinedPhonetics for typing in Unicode, Azhagi+ supports typing in various other Non-Unicode font encodings and keyboards too. The full list of supported font encodings is as follows:
Unicode (யூனிகோட், ஒருங்குறி)
SaiIndira (சாய்இந்திரா)
TSCII (திஸ்கி)
Bamini (பாமினி), TamilBible (தமிழ் பைபிள்)
TAM (தாம், டேம்)
TAB (தாப், டேப்)
Baamini (பாமினி 2) (not the same as Bamini)
Vanavil (வானவில்)
STMZH (செந்தமிழ்) [same as RGB Tamil fonts]
Shreelipi (ஸ்ரீலிபி)
LT-TM (எல்.டி-டி.எம்) [same as IndoWord Tamil fonts]
Among other exquisite features (super-fast plain text conversion), Azhagi's converter can do conversion of formatted text too - directly inside 'MS Word' documents itself - thereby retaining all formatting - Bold/Italics/Underline, Color, Alignment, Tables, etc. etc. etc. - of all your Tamil text. The converter is extendable by the user himself/herself - to convert from ANY Tamil font encoding to ANY other Tamil font encoding. The 45 Tamil font encodings supported (as on March 2022) are: Unicode, SaiIndira, TSCII, TAB, TAM, Bamini & TamilBible, Vanavil, Shreelipi, STMZH, LT-TM [same as IndoWord], Gee_Tamil, DCI+Tml+Ismail, SunTommy, ELCOT-ANSI, ELCOT-Bilingual, Diamond, Amudham, Ka, Shree, Mylai Plain, TACE, Elango, Periyar, Priya, Chenet Platinum, KrutiTamil, TM-TTValluvar, Roja, MCL Kannamai, Baamini [not the same as Bamini], Needhimathi, Pandian, TBoomiS, APT-Sangam, Dev, TA-Arul, Sashi, Ganesha, Lakshmi, Tamil-Aiswarya, Adhawin, TmlCheran, Avaice Jasmine, KavipPriya, Vikatan.
If you own a font whose name (e.g. Kalaham) does not match with any of the names in the existing 'Font Encodings' list# of Azhagi, it does not mean you cannot effect to-and-fro conversion using that font. It is very much possible that your font is of the same encoding as one of the font encodings already supported by Azhagi. For instance, 'Kalaham' font is of the same encoding as 'Bamini' font.
So, if you wish to effect conversion from 'Kalaham' font to any other font encoding, then just select 'Bamini' in the 'from this font encoding' list of Azhagi's Font Converter, before effecting conversion.
Similarly, if you wish to effect conversion from any other font encoding to 'Kalaham' font, then just select 'Bamini' in the 'to this font encoding' list of Azhagi's Font Converter before effecting conversion.
Please note that eventhough Unicode has specified only 8 signs/symbols (for day, month, year, etc.), I have given provision for 23 such symbols so that even if Unicode brings in some more signs/symbols in future, they can be accomodated. Not only that. If the font you are using has some special symbols/signs, you can specify them here after the first 8 characters. Not only that. You can specify your own short forms too here. For instance, for the 9th character, if you specify "இப்படிக்கு", then when you press 'Mi', you can get 'இப்படிக்கு'. If you specify 'அடியேன்' for 10th character, then when you press 'Mh', you can get 'அடியேன்' and so on. :)
Please note that 'SaiIndira' font is actually of "Tscii" font encoding and hence, as such, including "Tscii" under 'Font Encoding' list is itself enough. But, I have still included 'SaiIndira' separately for the convenience of people who do not know that SaiIndira is of 'Tscii' encoding.
Similarly, TamilBible font has its Tamil characters in the same slots as Bamini has and hence, as such, including "Bamini" under 'Font Encoding' list is itself enough. But, I have still included 'TamilBible' separately for the convenience of people who do not know that both TamilBible and Bamini fonts hold the Tamil characters in the same slots.
In case you are still using Azhagi+ 10.45 downloaded prior to 9-October-2020, then please download Azhagi+ 10.45 afresh and install. It is necessary that you do the same since Azhagi+ 10.45 got updated on 9-October-2020 (allowing you to effect typing and conversion in 10 more font encodings - Chenet Platinum, Elango, Tace, MylaiPlain, KrutiTamil, MCLKannamai, Periyar, Priya, Roja, TmTtValluvar) and again in May-2021 (allowing you to effect typing and conversion in 14 more font encodings - Avaice Jasmine, Adhawin-Tamil, Baamini [not the same as Bamini], Needhimathi, TmlCheran, Pandian, TBoomiS, APT-Sangam, Dev, TA-Arul, Tamil-Aiswarya, Sashi, Ganesha, Lakshmi). And, the "txts-all.zip" file provided below for download will allow you to effect typing and conversion in 2 more font encodings - Vikatan and KavipPriya (and also many more fonts of this same encoding - Cauvery, Chitram, Ellachelvi, EzhilArasi, Kalaiarasi, Kannaki, Kayalvizhi, Menaka, Meenakshi, Nanthini, Nattiya, Ponni, Sakunthalai, Seethai, Sivakami, Thamarai, Thenmozhi, Ulagamai, etc.).
Software to Convert text in most known Tamil character encoding schemes from one to another.
Supports Unicode, TACE, TSCII, TAB, TAM, Bamini, Shreelipi, Diacritic, Vanavil, Softview.
Definitions in an easy XML Structure that makes NHM Converter extendable to any language, any encoding easily