Lot of times we have felt the problem with open-tamil : it has many utilities, but none of them are usually available as functions or commands out of the box. It has very much been a developer tool, and not a user or informed-layperson tool.
A quick fix is to add the example Python scripts to the default install paths along with open-tamil installation [which is still simple as ‘$ pip install –upgrade open-tamil‘].
1. tamilphonetic - convert EN input to Tamil text 2. tamilwordfilter - filter Tamil input only from all input text data 3. tamilurlfilter - filter Tamil text from the input website data 4. tamiltscii2utf8 - convert encoding from TSCII to UTF-8 for input file 5. tamilwordgrid - generate a crossword from Tamil input text and write to output.html file 6. tamilwordcount - like UNIX wc program but for Tamil
All these functions will be made available in version 0.7 of open-tamil to be released soon. Currently these functions have landed in the development branch through the commit 02810461bef216df56b10ebf09818b94dfc75574
The next step should be to really bundle these tools into a binary executable for various platforms. Also to note, the function tamilwordcount was contributed by a new member to the Open-Tamil group, Mr. Surendhar. Thanks much, and welcome!
-Muthu
Chennai, India
kudos. these are much-needed efforts that have been long due. while many Tamil software pioneers have worked on these problems, their works are not easily discoverable, remain proprietary or in their local hard drives. making these open source and listed in package libraries will make software engineering for Tamil as easy and accessible as English. wonderful work, keep it up.
A shameless plug: I wrote a function in 2004 to count Tamil letters in a string and shared it during INFITT’s TI 2004: https://venkatarangan.com/blog/2004/12/counting-letters-in-an-unicode-string/