Training Deformable Models for Localization in Images

Deva Ramanan

Toyota Technical Institute

We present a new method for training deformable models. Assume that we have training images of objects (say, a horse) where parts have been labeled. Typically, one fits a model by maximizing the likelihood of the part labels. Alternatively, one could fit a model such that, when the model is run on the training images, it finds the parts. We show this criteria leads to very different models because the objective is discriminative (rather than generative) and model parameters are jointly learned (rather than independently). We formulate model-learning as parameter estimation in a conditional random field (CRF); this means we can learn globally optimal parameters by simple gradient ascent. We present results on three established vision datasets of motorcycles, people, and horses that achieve or surpass the state-of-the-art. This is joint work with Cristian Sminchesescu.
Deepak Ramachandran
Last modified: Thu Jan 19 14:25:50 CST 2006