1
Thaigovai : a framework to transform into informal text to government writing style | |
Author | Arnajak Tungchoksongchai |
Call Number | AIT Thesis no.DSAI-23-07 |
Subject(s) | Natural language processing (Computer science) Artificial intelligence Government report writing--Thailand--Data processing |
Note | A thesis submitted in partial fulfillment of the requirements for the degree of Master of Engineering in Data Science and Artificial Intelligence |
Publisher | Asian Institute of Technology |
Abstract | The background of this research is rooted in the observation that most Thai people, includ ing government officers, face challenges when writing government-style documents due to the unique style, rule-following, and use of non-regular words. Developing an application to address this issue could significantly save time and improve efficiency for individu als. However, the problem arises from the absence of a dataset for fine-tuning language models, leading to risks of hallucination in sample generation and requiring significant human effort for manual dataset creation. Additionally, implementing zero-shot learning with large language models incurs substantial costs, and existing evaluation metrics are not fully suitable for the task. To tackle these challenges, the research proposes a framework that distills knowledge from language models, filters data generation, and fine-tunes the model, all while incorporating human evaluation. The key findings reveal that datasets generated from GPT3.5 are of acceptable quality, but filtering them before fine-tuning can enhance model performance. The frameworkâs outcomes are comparable to those of non-specialists. And to the best of our knowledge, it could represent the first application of paraphrasing casual text into government style, which makes it a beneficial initiator for this innovation. |
Year | 2023 |
Type | Thesis |
School | School of Engineering and Technology |
Department | Department of Information and Communications Technologies (DICT) |
Academic Program/FoS | Data Science and Artificial Intelligence (DSAI) |
Chairperson(s) | Chaklam Silpasuwanchai; |
Examination Committee(s) | Dailey, Matthew N.;Mongkol Ekpanyapong; |
Scholarship Donor(s) | Royal Thai Government Fellowship; |
Degree | Thesis (M. Eng.) - Asian Institute of Technology, 2023 |