1
Learning document type definition from XML documents | |
Author | Phanom Slisatkorn |
Call Number | AIT Thesis no. CS-00-19 |
Subject(s) | XML (Document markup language) |
Note | A thesis submitted in partial fulfilment of the requirements for the degree of Master of Engineering, School of Advanced Technologies |
Publisher | Asian Institute of Technology |
Series Statement | Thesis ; no. CS-00-19 |
Abstract | Extensible Markup Language (XML) is a new Web standard especially designed for delivering structured data and documents over the Web and currently plays an increasingly significant role in Web applications and data interchanges. XML documents can optionally include rules to restrict the structure of elements and attributes in terms of a Document Type Definition (DTD) or an XML schema. A DTD provides a means to validate the structure of documents. However, since DTDs are not compulsory, there exist XML documents without any DTD. Identifying a DTD from such documents is useful but not readily achievable. Therefore, this research aims at developing a learning mechanism to obtain quality DTDs from given sets of XML documents. We present an innovative concept which introduces the star height of variables into our process for precisely inferring ?, +, * meta characters and enables regular expression pattern detection between input sequences. Together with the factoring, reduction and generalization steps, the concept forms a learning mechanism which can infer concise meaningful DTDs. Experiments are carried out to demonstrate the effectiveness of the mechanism and compare its efficiency with that of the other existing approaches. |
Year | 2000 |
Corresponding Series Added Entry | Asian Institute of Technology. Thesis ; no. CS-00-19 |
Type | Thesis |
School | School of Advanced Technologies (SAT) |
Department | Department of Information and Communications Technologies (DICT) |
Academic Program/FoS | Computer Science (CS) |
Chairperson(s) | Vilas Wuwongse; |
Examination Committee(s) | Aagesen, Finn Arve;Phan Minh Dung; |
Scholarship Donor(s) | H.M. the King of Thailand; |
Degree | Thesis (M.Eng.) - Asian Institute of Technology, 2000 |