Over the course of this year, since translating Ruby Kin, and preparing a summary of 3 years work on spell-checker for Tamil Internet Conference – 2019, I’ve been thinking of next level of interesting projects.
The following have come to mind, expressed in Twitter @ezhillang in various forms. Here they are in simply chronological order,
Translating “Data Structures and Algorithms” book in Tamil
Problem Solving with Algorithms and Data Structures Using Python, Creative Commons licensed by its authors Brad Miller and David Ranum; same book (from above) adapted at Bradfield CS https://bradfieldcs.com/algos/
Translating/Writing a “Debugging Techniques” book in Tamil: ‘கணினி செயல்முறை நிரகளில் வழுநீக்கம்‘ – பயிற்சி, நூல்
Debugging techniques are important learning milestone for any professional software/hardware developer which are usually learnt on the job and essentially skipped in academia (perhaps for practical purposes).
(Research/Proof-of-concept) Viterbi algorithm based spelling correction algorithm for Tamil
(Research/Proof-of-concept) Concordance based context ambiguity resolution for Tamil spelling correction.
Contingent on our levels and degrees of success we can share our work in forums like Tamil Internet Conference, ACL or ACM, etc.
As always collaborators are welcome: email: ezhillang -AT- gmail -DOT- com
I have been keen to understand what kind of articles are read, and written, in Tamil Wikipedia, and Wiktionary. So it was time to use data analysis and some programming – so thats what I did! Last weekend I tried to take Open-Tamil Python library for a spin with the dumps of Wikipedia data for Tamil wiki and here are results.