1 AIT Asian Institute of Technology

Extraction of tones of speech : an application to the Thai language

AuthorRamalingam, H.
Call NumberAIT Thesis no. TC-95-5
Subject(s)Automatic speech recognition
NoteA thesis submitted in partial fulfilment of the requirement for the degree of Master of Engineering.
PublisherAsian Institute of Technology
Series StatementThesis ; no. TC-95-5
AbstractThe pitch vanat10n that convey lexical infonnation about the meaning of a word is commonly refeITed to as a tone. Automatic speech recognition of tonal languages requires the recognition of tones associated with the syllables in addition to the phonemes for proper identification of the syllable. Here, in this thesis work, a general model for the automatic recognition of tones of syllables of tonal languages is developed. The pitch contour of the syllable is initially estimated using Subharmonic Summation as the Pitch Determination Algorithm and given as input to vector quantizer for correct recognition of tones associated with the syllable. The input vector is time aligned and pitch n01malised to remove inter and intraspeaker variations before being applied to the vector quantizer. A codebook is implemented containing reference vectors corresponding to each of the tones of the tonal language. A distortion measure is computed between the test vector and each of the reference vectors. The reference vector c01Tesponding to the least distortion is identified as the tone. The perfo1mance evaluation is done in MATLAB environment. Thai language is chosen as the tonal language for the pe1formance evaluation. Four isolated syllables, uttered by four speakers for all the five tones, are used for simulation. For noise-free speech the system gave 100% correct recognition of tones for the four speakers. Recognition rates of 98, 97, 97, 98, 93 and 93 % were obtained for signal-to-noise-ratios of 40 dB, 30 dB, 20 dB, 10 dB, 5 dB and 2 dB respectively for the worst case speaker.
Year1995
Corresponding Series Added EntryAsian Institute of Technology. Thesis ; no. TC-95-5
TypeThesis
SchoolSchool of Engineering and Technology (SET)
DepartmentDepartment of Information and Communications Technologies (DICT)
Academic Program/FoSTelecommunications (TC)
Chairperson(s)MakeHiinen, Kimmo
Examination Committee(s)Ahmed, K. M.;Chindakorn Tuchinda
Scholarship Donor(s)NORAD (Norwegian Agency for International Development)
DegreeThesis (M.Eng.) - Asian Institute of Technology, 1995


Usage Metrics
View Detail0
Read PDF0
Download PDF0