1 AIT Asian Institute of Technology

Efficient posevit : exploring efficient vision transformers and training strategies for fast human pose estimation

AuthorLamsal, Diwas
Call NumberAIT Thesis no.CS-23-05
Subject(s)Image analysis
Human-computer interaction
NoteA thesis submitted in partial fulfillment of the requirements for the degree of Master of Engineering in Computer Science
PublisherAsian Institute of Technology
AbstractHuman pose estimation (HPE) serves as a critical component in various downstream applications, including activity recognition and human-computer interaction. However, it faces challenges such as occlusion, complex body poses, diverse clothing and environments, and low-quality images. State-of-the-art HPE methods adept at addressing these challenges often prove to be computationally expensive to run on low-cost edge devices. On the other hand, the efficient models designed to run on edge devices are not as accurate, especially when faced with occlusion. In this thesis, I explored the use of computationally efficient vision transformer (ViT) backbones for the task of HPE to produce an efficient model capable of running on edge devices while also addressing common HPE challenges. Specifically, I replaced the ViT encoder in ViTPose with effi cient ViT variants, and performed a comprehensive comparison. The findings indicate that the EdgeNeXt-Base model offers an optimal speed-accuracy trade-off among the evaluated models. Through knowledge distillation and multi-task training strategies, the resulting models exhibit improved performance, particularly in handling occlusion. Additionally, I produced a wheelchair-based human pose estimation and activity recognition dataset aimed at developing a system that effectively monitors the activities of people sitting in wheelchairs.
Year2023
TypeThesis
SchoolSchool of Engineering and Technology
DepartmentDepartment of Information and Communications Technologies (DICT)
Academic Program/FoSComputer Science (CS)
Chairperson(s)Dailey, Matthew N.
Examination Committee(s)Mongkol Ekpanyapong;Chaklam Silpasuwanchai
Scholarship Donor(s)AIT Scholarships
DegreeThesis (M. Eng.) - Asian Institute of Technology, 2023


Usage Metrics
View Detail0
Read PDF0
Download PDF0