1
Applying document clustering to hypertext documents on the Internet | |
Author | Tran Ngoc Thanh |
Call Number | AIT Thesis no. IM-99-01 |
Subject(s) | Hypertext systems |
Note | A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science, School of Advanced Technologies |
Publisher | Asian Institute of Technology |
Series Statement | Hypertext systems |
Abstract | Starting from the beginning of this decade, the amount of electronic information available from the Internet has been increasing dramatically. Designing a tool to search relevant information on the Internet is not easy, because of the lack of proper organizing principles in the Internet. Dewey Decimal Classification (DDC) has been used for information organization for many years. First published in 1876, it has been continuously revised to meet evolving information access needs, both in the traditional library and in the electronic environments. Clustering techniques have been successfully applied in the field of information retrieval (IR). Applying clustering to hypertext documents and organizing them according to DDC classification scheme is the main purpose of this thesis. Topic representative, (or query in IR) is generated. Manually organized seed hyper-documents and automatically assigned topics of new documents to DDC are the main results of the thesis project. The full system was built and tested by the research to prove the effectiveness of this approach in information organization field. |
Year | 1999 |
Corresponding Series Added Entry | Asian Institute of Technology. Thesis ; no. IM-99-01 |
Type | Thesis |
School | School of Advanced Technologies (SAT) |
Department | Department of Information and Communications Technologies (DICT) |
Academic Program/FoS | Information Management (IM) |
Chairperson(s) | Devadason, Francis J. ; |
Examination Committee(s) | Phan Minh Dung ;Yulu, Qi; |
Scholarship Donor(s) | Asian Institute of Technology ; |
Degree | Thesis (M.Sc.) - Asian Institute of Technology, 1999 |