1 AIT Asian Institute of Technology

Applying document clustering to hypertext documents on the Internet

AuthorTran Ngoc Thanh
Call NumberAIT Thesis no. IM-99-01
Subject(s)Hypertext systems

NoteA thesis submitted in partial fulfillment of the requirements for the degree of Master of Science, School of Advanced Technologies
PublisherAsian Institute of Technology
Series StatementHypertext systems
AbstractStarting from the beginning of this decade, the amount of electronic information available from the Internet has been increasing dramatically. Designing a tool to search relevant information on the Internet is not easy, because of the lack of proper organizing principles in the Internet. Dewey Decimal Classification (DDC) has been used for information organization for many years. First published in 1876, it has been continuously revised to meet evolving information access needs, both in the traditional library and in the electronic environments. Clustering techniques have been successfully applied in the field of information retrieval (IR). Applying clustering to hypertext documents and organizing them according to DDC classification scheme is the main purpose of this thesis. Topic representative, (or query in IR) is generated. Manually organized seed hyper-documents and automatically assigned topics of new documents to DDC are the main results of the thesis project. The full system was built and tested by the research to prove the effectiveness of this approach in information organization field.
Year1999
Corresponding Series Added EntryAsian Institute of Technology. Thesis ; no. IM-99-01
TypeThesis
SchoolSchool of Advanced Technologies (SAT)
DepartmentDepartment of Information and Communications Technologies (DICT)
Academic Program/FoSInformation Management (IM)
Chairperson(s)Devadason, Francis J. ;
Examination Committee(s)Phan Minh Dung ;Yulu, Qi;
Scholarship Donor(s)Asian Institute of Technology ;
DegreeThesis (M.Sc.) - Asian Institute of Technology, 1999


Usage Metrics
View Detail0
Read PDF0
Download PDF0