Búsqueda | BVS CLAP/SMR-OPS/OMS

Vid2CAD: CAD Model Alignment Using Multi-View Constraints From Videos.

Maninis, Kevis-Kokitsi; Popov, Stefan; Niesner, Matthias; Ferrari, Vittorio.

IEEE Trans Pattern Anal Mach Intell ; 45(1): 1320-1327, 2023 Jan.

Artículo en Inglés | MEDLINE | ID: mdl-35077362

RESUMEN

We address the task of aligning CAD models to a video sequence of a complex scene containing multiple objects. Our method can process arbitrary videos and fully automatically recover the 9 DoF pose for each object appearing in it, thus aligning them in a common 3D coordinate frame. The core idea of our method is to integrate neural network predictions from individual frames with a temporally global, multi-view constraint optimization formulation. This integration process resolves the scale and depth ambiguities in the per-frame predictions, and generally improves the estimate of all pose parameters. By leveraging multi-view constraints, our method also resolves occlusions and handles objects that are out of view in individual frames, thus reconstructing all objects into a single globally consistent CAD representation of the scene. In comparison to the state-of-the-art single-frame method Mask2CAD that we build on, we achieve substantial improvements on the Scan2CAD dataset (from 11.6% to 30.7% class average accuracy).

Ver mas detalles

ENVIAR RESULTADO:

Exportar

Imprimir

RSS

XML

RESUMEN

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA