1 AIT Asian Institute of Technology

A Thai text retrieval system using digital search trees and SQL

AuthorNontarat Thongpumpurksar
Call NumberAIT Thesis no.CS-93-26
Subject(s)Information retrieval

NoteA thesis submitted in partial fulfillment of the requirements for the degree of Master of Science
PublisherAsian Institute of Technology
AbstractA THAI and ENGLISH Information Retrieval (TEIR) system is developed by using an extended double-array as the retrieval table and a relational database management system (RDBMS) to implement the posting file. Extended double-arrays for digital search tree are introduced and a new updating algorithm presented. The new algorithm has many advantages over the original one: faster insertion time, support of up to 2 GB of nodes, and retrieval of keywords in sequence. The advantages of using RDBMS to implement posting files are that the file system can easily be ported to other computer systems and one has better file management. A major limitation of IR system when dealing with THAI text is that it cannot retrieve text using THAI phrases. Current THAI IR systems can retrieve text only by employing some specific THAI keywords. The TEIR presents 2 methods to correct the problem of retrieving text using THAI phrases.
Year1993
TypeThesis
SchoolSchool of Engineering and Technology (SET)
DepartmentDepartment of Information and Communications Technologies (DICT)
Academic Program/FoSComputer Science (CS)
Chairperson(s)Vilas Wuwongse;
Examination Committee(s)Yulu, Qi;Sadananda, Ramakoti;
Scholarship Donor(s)DAAD;
DegreeThesis (M.Sc.) - Asian Institute of Technology, 1993


Usage Metrics
View Detail0
Read PDF0
Download PDF0