Now the contents of the webpage's <BODY> tag are stored in the variable tatext.
Parsing Tamil text
This is where the open-tamil library really shines: you can pull the letters out of a Tamil string stored in UTF-8 (a multi-byte encoding) in the right order. That is, you can write programs at the level of Tamil letters instead of worrying about byte ordering, uyirmei grouping, and so on.
Get the Tamil letters from the text using the ‘get_letters’ API.
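With open-tamil installed, the call is tamil.utf8.get_letters(tatext). To show what "letter-level" splitting means without depending on the library, here is a minimal standard-library sketch. It is only an approximation for illustration, not open-tamil's implementation, and the helper name split_tamil_letters is made up for this example. It attaches each combining vowel sign or pulli to the consonant before it:

```python
import unicodedata


def split_tamil_letters(text):
    """Approximate letter-level split of a Tamil string.

    Hypothetical helper for illustration only; it is NOT how open-tamil
    implements get_letters. Combining marks (vowel signs and the pulli)
    are glued onto the preceding base character.
    """
    letters = []
    for ch in text:
        # Unicode categories Mn/Mc cover Tamil vowel signs and the pulli.
        if letters and unicodedata.category(ch) in ("Mn", "Mc"):
            letters[-1] += ch
        else:
            letters.append(ch)
    return letters


word = "\u0ba4\u0bae\u0bbf\u0bb4\u0bcd"  # the word "Tamil" in Tamil script, 5 code points
letters = split_tamil_letters(word)
print(len(word), len(letters))  # code points vs. letters
print(letters)
```

The five code points collapse into three letters, which is the unit you usually want when counting, reversing or jumbling Tamil words.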
Tamil is a classical language, yet many kids cannot read or write it, and the language is held back by standards and vocabulary written by old men in the pre-computer, digital stone age. It is time for a change: we need a fun, modern way to learn the Tamil language.
Educational games in English include puzzles like Scrabble, jumbled words, matching, and word-building games, played both to pass leisure time and to meet particular learning objectives. There are many Tamil applications for Android and iPhone with quizzes and hangman-like games.
At the UrbanTamil project we believe there are modern approaches to learning the Tamil language and vocabulary. Keeping classical Tamil alive is very important, but so is keeping the popular language alive. Capturing usage and educating users is possible via the Internet at a low total cost.
Word of the day – via Twitter @urbantamil1 we publish a word and its meaning every day, so that you can learn the vocabulary easily.
Today I’m setting up a new project and writing this blog post to announce urbantamil.com, which is live now. UrbanTamil is like urbandictionary.com for Tamil: it provides a user-centered, social dictionary-building experience, with free reuse.
You can look up words in the dictionary
Use the onscreen keyboard to look up a word
Results from looking up a word
You have options for downloading or defining a word
You can download the definition as a text file; you can also define the word
Users can add words as they are used in a regional dialect (Chennai, Kovai, Madurai, Thanjai, etc.), and can tag them with period of usage and part of speech (peyar sol, vinai sol, etc.).
All entries will be contributed back to Tamil Wiktionary under CC SA 3.0.
You can define the word, and add custom information.
A success screen follows a correct upload to the website
Successful upload to website
Moreover, users can correct (edit) existing words for spelling mistakes and usage, and define new words.
Goals of the project
Language / Vernacular use
Censorship free/Contemporary natural Tamil – from ஜிகிர்தண்டா (Jigirthanda) – கலாசல் (Kalasal)
Language from Ceylon, Singapore, Canada – Tamil + Punjabi, Tamil + English, and other variants
Build an open vocabulary while having fun.
This project will be fun, and I hope a user community can engage and develop around the website urbantamil.com. I always welcome feedback, so let me know what you think of it via email or comments on the blog.