Subasa online Spell-checker for Sinhala
Subasa online Spell-checker for Sinhala is a free online application which can be used to spell-check Sinhala texts written in Sinhala Unicode fonts. This does not support for Sinhala proprietary fonts. You can fix your Sinhala spelling problems by simply copying your text into the provided space.
Font Encoding Converters for Sinhala & Tamil
Real-time font encoding converting tool that runs in your web browser. It facilitates the encoding conversion from proprietary fonts to Unicode and vice versa for both Sinhala and Tamil. This tool currently supports DL-Manel, FM Abhaya, Thibus, kaputa and Amalee Sinhala fonts and Bamini Tamil font.
UCSC-ICTA Tamil Corpus
Subasa Tamil Optical Character Recognition (OCR)
Tamil OCR is an Optical Character Recognition system for Tamil script. This enables conversion of images in Tamil text into machine-readable text. This OCR system supports for .jpg file type and output text will be in Tamil Unicode without formatting.
Si-Ta – Machine assisted translation system for official government documents written in Sinhala/Tamil
Si-Ta is a collaborative effort between the National Languages Processing Centre, the Department of Official Languages (DOL) and Ministry of National Co-existence, Dialogue & Official Languages. Si-Ta currently supports the translation of short official documents between Sinhala-to-Tamil and Tamil-to-Sinhala.
A Comprehensive Parts of Speech (POS) tag set and a POS tagger for Sinhala
A new Sinhala POS tag set that overcomes the limitations of the previous Sinhala POS tag set. A corpus of 400,000 words have been manually annotated, which in turn was used to train different supervised classifiers.
Sinhala Morphological Synthesizer
A morphological synthesizer for Sinhala nouns, which works with a reasonable accuracy.
Submit Your Project