AIT Asian Institute of Technology

Thaigovai : a framework to transform informal text to government writing style

Author: Arnajak Tungchoksongchai
Call Number: AIT Thesis no. DSAI-23-07
Subject(s): Natural language processing (Computer science); Artificial intelligence; Government report writing--Thailand--Data processing
Note: A thesis submitted in partial fulfillment of the requirements for the degree of Master of Engineering in Data Science and Artificial Intelligence
Publisher: Asian Institute of Technology
Abstract: This research is motivated by the observation that most Thai people, including government officers, find government-style documents difficult to write because of their distinctive style, strict rules, and uncommon vocabulary. An application that addresses this issue could save considerable time and improve efficiency. However, no dataset exists for fine-tuning language models on this task: generating samples risks hallucination, and creating a dataset manually requires significant human effort. Additionally, zero-shot learning with large language models incurs substantial costs, and existing evaluation metrics are not fully suitable for the task. To tackle these challenges, the research proposes a framework that distills knowledge from language models, filters the generated data, and fine-tunes the model, all while incorporating human evaluation. The key findings are that datasets generated from GPT-3.5 are of acceptable quality, but filtering them before fine-tuning can enhance model performance. The framework's outcomes are comparable to those of non-specialists. To the best of our knowledge, this may be the first application that paraphrases casual text into government style, making it a useful starting point for further work in this area.
Year: 2023
Type: Thesis
School: School of Engineering and Technology
Department: Department of Information and Communications Technologies (DICT)
Academic Program/FoS: Data Science and Artificial Intelligence (DSAI)
Chairperson(s): Chaklam Silpasuwanchai
Examination Committee(s): Dailey, Matthew N.; Mongkol Ekpanyapong
Scholarship Donor(s): Royal Thai Government Fellowship
Degree: Thesis (M. Eng.) - Asian Institute of Technology, 2023

