OPFormer: Object Pose Estimation leveraging foundation model with geometric encoding

Figure 1. We illustrate the three major blocks of our pipeline and their respective computational times: (a) object onboarding from a CAD model or a set of images takes less than a minute per object, (b) object detection with CNOS [54] takes less than 0.5 seconds per image, (c) pose estimation with our methodology takes less than 0.05 seconds per instance.

