We address the problem of 2D-3D pose estimation in difficult viewing conditions, such as low illumination, cluttered background, and large highlights and shadows that appear on the object of interest. In such challenging conditions conventional features used for establishing correspondence are unreliable. We show that under the assumption of a dominant light source, specular highlights produced by a known object can be used to establish correspondence between its image and the 3D model, and to verify the hypothesized pose. These ideas are incorporated in an efficient method for pose estimation from a monocular image of an object using only highlights produced by the object as its input. The proposed method uses no knowledge of lighting direction and no calibration object for estimating the lighting in the scene. The evaluation of the method shows good accuracy on numerous synthetic images and good robustness on real images of complex, shiny objects, with shadows and difficult backgrounds.
The original 3D model files can be found here in .OBJ format.
These are 3D renderings of the original models, that were used for the synthetic tests in this work.
Each image was rendered with a randomly generated pose and light direction.
The entire set can be found here.
Some sample images:
These are pictures of the manufactured models, that were taken outdoors.
The entire set can be found here.
Some sample images:
These are pictures of the manufactured models, that were taken indoors.
The entire set can be found here.
Some sample images: