We develop and
demonstrate an object recognition system capable of accurately
detecting, localizing, and recovering the kinematic configuration of
textured animals in real images. We build a deformation model of shape
automatically from videos of animals and an appearance model of texture
from a labeled collection of animal images, and combine the two models
automatically. We develop a simple texture descriptor that outperforms
the state of the art. We test our animal models on two datasets; images
taken by professional photographers from the Corel collection, and
assorted images from the web returned by Google. We demonstrate quite
good performance on both datasets. Comparing our results with simple
baselines, we show that for the Google set, we can recognize objects
from a collection demonstrably hard for object recognition. Ramanan, D., Forsyth, D. A., Barnard, K. "Detecting, Localizing, and Recovering Kinematics of Textured Animals." Computer Vision and Pattern Recognition (CVPR), San Diego, CA, June 2005 [pdf] Ramanan, D., Forsyth, D. A., and Barnard, K. "Building Models of Animals from Video." IEEE Pattern Analysis and Machine Intelligence (PAMI), Accepted for publication [pdf] |
Back to home page