IEEE Transactions on Pattern Analysis and Machine Intelligence
High-Resolution Open-Vocabulary Object 6D Pose Estimation.
Jaime Corsetti, Davide Boscaini, Francesco Giuliari, Changjae Oh, Andrea Cavallaro, Fabio Poiesi
Published: 2026
DOI: 10.1109/TPAMI.2025.3624589
Abstract
The generalisation to unseen objects in the 6D pose estimation task is very challenging. While Vision-Language Models (VLMs) enable using natural language descriptions to support 6D pose estimation of unseen objects, these solutions underperform comp…