AIT Asian Institute of Technology

ReIL : a framework for reinforced intervention-based imitation learning

Author: Rom Parnichkun
Call Number: AIT Thesis no. DSAI-22-01
Subject(s): Reinforcement learning; Neural networks (Computer science)
Note: A thesis submitted in partial fulfillment of the requirements for the degree of Master of Engineering in Data Science and Artificial Intelligence
Publisher: Asian Institute of Technology
Abstract: Compared to traditional imitation learning methods such as DAgger and DART, intervention-based imitation offers users a more convenient and sample-efficient data collection process. In this thesis, we introduce Reinforced Intervention-based Learning (ReIL), a framework consisting of a general intervention-based learning algorithm and a multi-task imitation learning model, aimed at enabling non-expert users to train agents in real environments with little supervision or fine-tuning. ReIL achieves this with an algorithm that combines the advantages of imitation learning and reinforcement learning, and with a model named MimeticSNAIL that is capable of concurrently processing demonstrations, past experience, and current observations. Experimental results from real-world mobile robot navigation challenges indicate that ReIL learns rapidly from sparse supervisor corrections without suffering the deterioration in performance that is characteristic of supervised learning-based methods such as HG-DAgger and IWR. The results also demonstrate that, in contrast to other intervention-based methods such as IARL and EGPO, ReIL can utilize an arbitrary reward function for training without any additional heuristics.
Year: 2022
Type: Thesis
School: School of Engineering and Technology
Department: Department of Information and Communications Technologies (DICT)
Academic Program/FoS: Data Science and Artificial Intelligence (DSAI)
Chairperson(s): Dailey, Matthew N.
Examination Committee(s): Mongkol Ekpanyapong
Scholarship Donor(s): His Majesty the King’s Scholarships (Thailand)
Degree: Thesis (M. Eng.) - Asian Institute of Technology, 2022

