AIT Asian Institute of Technology

1 AIT Asian Institute of Technology

> > >

Model-based target tracking from a moving monocular camera
Author	Basit, Abdul
Call Number	AIT Diss. no.CS-14-03
Subject(s)	Vision, Monocular Computer vision
Note	A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Computer Science, School of Engineering and Technology
Publisher	Asian Institute of Technology
Series Statement	Dissertation ; no. CS-14-03
Abstract	Small unmanned ground vehicles (SUGVs) and Unmanned Aerial Vehicles (UAVs) are use- ful for gathering information about environments where access by human beings is either impossible or dangerous. They are portable, lightweight, and inexpensive. Such robots are becoming increasingly common in military applications and disaster areas, where they are generally teleoperated, as access by human beings is either restricted or dangerous. One intriguing application of the small autonomous vehicle is pursuit . Robot pursuit applica- tions include following and monitoring important people and pursuing suspicious people in security or military contexts. One possibility for the main sensor of a pursuit robot is a monocular camera. Although a sin- gle camera simplifies the design and lowers the cost of the robot, it also presents challenges. First, tracking an object during target pursuit requires a tracker that is both sufficiently accu- rate and sufficiently fast to keep track of the target in real time. Second, since depth estimates based on monocular cues will necessarily be extremely noisy, to obtain usable target position estimates, sensor modeling and state filtering will be required. In this thesis, I focus on the use of noisy sensor measurements while tracking a target with the pursuit robot’s monocular camera. Noise makes target tracking difficult. The noisy data may lead the pursuit robot to track a false target or abruptly shut down the visual tracking process. However, in addition to camera data, we also receive noisy odometry data from the robots‘s encoders. I propose a method to obtain smooth tracking and trajectories of robot and target jointly with a moving monocular camera by coupling the robot’s kinematics and the target’s dynamics in a joint state space model. I propose a novel joint localization model to reduce integrated robot and target position esti- mation error caused by noisy monocular depth cues. The method fuses information from the 2D visual tracker and the SUGV’s wheel encoders with knowledge of the robot’s kinematics in an extended Kalman filter to obtain superior state estimation accuracy. The model main- tains an estimate of the state of the target, assuming a simple linear dynamical model, as well as an estimate of the pursuit robot’s state, assuming differential drive robot kinematics. The joint localization model significantly improves estimation accuracy compared to simple sensor-based position estimates as well as compared to filters not incorporating pursuit robot kinematics. I use joint localization or joint state estimation for the proposed method in this thesis. Additionally, I propose a fast visual tracking method using color histogram backprojection and an adaptive histogram similarity threshold. In the first phase, I use a CAMSHIFT tracker for monocular target tracking and suspend the tracking process when the target is occluded or lost in a cluttered environment. The suspension decision uses an adaptive histogram simi- larity threshold. This helps prevent the visual tracker from tracking an incorrect object. Once the target is reported absent from the scene, we need a fast method to correctly reinitialize the CAMSHIFT tracker in order to restart the tracking process. The second part of visual sensor is the redetection phase. The proposed redetection method swiftly searches the entire image in real time for the target, reducing false detection and correctly reinializing the visual tracker. The results show that the proposed visual tracking method is fast and easily recovers the target once it reappears after being occluded in a cluttered environment. Furthermore, the proposed estimation model produces a smoother trajectory and is more tolerant to noise than alternative methods
Year	2014
Corresponding Series Added Entry	Asian Institute of Technology. Dissertation ; no. CS-14-03
Type	Dissertation
School	School of Engineering and Technology (SET)
Department	Department of Information and Communications Technologies (DICT)
Academic Program/FoS	Computer Science (CS)
Chairperson(s)	Daily, Matthew N.;
Examination Committee(s)	Mongkol Ekpanyapong ;Hyun Trung Luong;
Scholarship Donor(s)	University of Balochistan, Quetta, Pakistan;
Degree	Thesis (Ph.D.) - Asian Institute of Technology, 2014