Female voice corpus
Speech corpus with 3000 Sinhala utterances spoken by a single female speaker. This corpus was initially designed to build an Automatic Speech Recognition (ASR) system for Sinhala. The utterances were selected based on the most frequently used words in Sinhala.
Male voice corpus
Speech corpus with 625 Sinhala utterances spoken by a single male speaker. This corpus was initially designed to build a Text-to-Speech (TTS) system for Sinhala.
2000 voice corpus
Speech corpus with 74,000 Sinhala utterances spoken by various male and female speakers from different age groups. This corpus was initially designed to build a song request application for mobile phones.
Sinhala NEWS Corpus
A speech corpus with 8000 utterances of recorded Sinhala news from both male and female announcers. This project is still ongoing.
To download