Towards robust visual object tracking : proposal selection and occlusion reasoning

Abstract : In this dissertation we address the problem of visual object tracking, whereinthe goal is to localize an object and determine its trajectory over time. Inparticular, we focus on challenging scenarios where the object undergoessignificant transformations, becomes occluded or leaves the field of view. Tothis end, we propose two robust methods which learn a model for the object ofinterest and update it, to reflect its changes over time.Our first method addresses the tracking problem in the context of objectsundergoing severe geometric transformations, such as rotation, change in scale.We present a novel proposal-selection algorithm, which extends the traditionaldiscriminative tracking-by-detection approach. This method proceeds in twostages -- proposal followed by selection. In the proposal stage, we compute acandidate pool that represents the potential locations of the object byrobustly estimating the geometric transformations. The best proposal is thenselected from this candidate set to localize the object precisely usingmultiple appearance and motion cues.Second, we consider the problem of model update in visual tracking, i.e.,determining when to update the model of the target, which may become occludedor leave the field of view. To address this, we use motion cues to identify thestate of the object in a principled way, and update the model only when theobject is fully visible. In particular, we utilize long-term trajectories incombination with a graph-cut based technique to estimate parts of the objectsthat are visible.We have evaluated both our approaches extensively on several trackingbenchmarks, notably, recent online tracking benchmark and the visual objecttracking challenge datasets. Both our approaches compare favorably to thestate of the art and show significant improvement over several other recenttrackers. Specifically, our submission to the visual object tracking challengeorganized in 2015 was the winner in one of the competitions.
Complete list of metadatas

Cited literature [198 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-01394943
Contributor : Abes Star <>
Submitted on : Thursday, November 10, 2016 - 11:00:08 AM
Last modification on : Wednesday, September 26, 2018 - 3:36:44 AM
Long-term archiving on : Wednesday, March 15, 2017 - 3:51:46 AM

File

HUA_2016_diffusion.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-01394943, version 1

Collections

Citation

Yang Hua. Towards robust visual object tracking : proposal selection and occlusion reasoning. Computer Vision and Pattern Recognition [cs.CV]. Université Grenoble Alpes, 2016. English. ⟨NNT : 2016GREAM012⟩. ⟨tel-01394943⟩

Share

Metrics

Record views

1732

Files downloads

6847