Movatterモバイル変換

Part of the book series:Lecture Notes in Computer Science ((LNCS,volume 14356))

Included in the following conference series:

International Conference on Image and Graphics

555Accesses

Abstract

6D object pose estimation calculates the rotation and translation matrices from the object coordinate system to the camera coordinate system and plays an important role in tasks such as robotic grasping. The voting-based 6D pose estimation method PVNet votes on a set of hypotheses to determine one as the estimation for real keypoint, and uses Perspective-n-Point (PnP) algorithm to calculate 6D pose based on the estimated keypoints. For improving the accuracy of estimated keypoints, the accuracy of hypotheses should be improved firstly. Since each hypothesis is an intersection computed with extended lines of two predicted unit vectors, three factors should be considered for improving its accuracy. The deviation of angle between predicted vector and real vector should be as small as possible. The angular deviation for predicted vectors of pixels farther away from keypoints should be smaller than that of those nearer. Any two approximately parallel or coincident predicted vectors should be prohibited to compute intersection. In light of the three points, this paper predicts vector-field instead of unit vector-field to take into account the distance from pixel to real keypoint, and proposes a distance-aware vector-field prediction loss which requires that the farther pixels from keypoints, the smaller the angular deviation for predicted vectors, and suggests a strategy for preventing approximately parallel or coincident predicted vectors from computing hypothesis. Experiments on LINEMOD and OCC-LINEMOD datasets show that our method achieves 5.9% and 8.4% improvement for the average accuracy of pose estimation in terms of ADD(-S) respectively compared with PVNet.

This research was supported by The National Key R & D Program of China (No.2021ZD0111902), NSFC(U21B2038, 61876012), Foundation for China university Industry-university Research Innovation (No.2021JQR023).

This is a preview of subscription content,log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: JPY 3498; Price includes VAT (Japan)

eBook: JPY 9151; Price includes VAT (Japan)

Softcover Book: JPY 11439; Price includes VAT (Japan)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Pose Estimation of 3D Objects Based on Point Pair Feature and Weighted Voting

6D object pose estimation based on dense convolutional object center voting with improved accuracy and efficiency

Article25 November 2023

NMPose: Leveraging Normal Maps for 6D Pose Estimation

References

Bay, H., Tuytelaars, T., Van, Gool. L.: Surf: Speeded up robust features. In: Proceedings of the European Conference on Computer Vision, pp. 404–417 (2006)
Google Scholar
Lowe, D. G.: Object recognition from local scale-invariant features. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1150–1157 (1999)
Google Scholar
Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2564–2571 (2011)
Google Scholar
Rosten, E., Drummond, T.: Fusing points and lines for high performance tracking. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1508–1515 (2005)
Google Scholar
Peng, S., Liu, Y., Huang, Q., Zhou, X., Bao, H.: Pvnet: pixel-wise voting network for 6dof pose estimation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4561–4570 (2019)
Google Scholar
Yu, X., Zhuang, Z., Koniusz, P., Li, H.: 6dof object pose estimation via differentiable proxy voting regularizer. In: British Machine Vision Conference, pp. 1–12 (2020)
Google Scholar
Song, C., Song, J., Huang, Q.: HybridPose: 6d object pose estimation under hybrid representations. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 431–440 (2020)
Google Scholar
Liu, X., Iwase, S., Kitani, K. M.: KdfNet: Learning keypoint distance field for 6d object pose estimation. In: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 4631–4638 (2021)
Google Scholar
Xiong, F., Liu, C., Chen, Q.: Region pixel voting network (RPVNet) for 6d pose estimation from monocular image. Appl. Sci.11(2), 743–757 (2021)
Article Google Scholar
Rad, M., Lepetit, V.: Bb8: a scalable, accurate, robust to partial occlusion method for predicting the 3d poses of challenging objects without using depth. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3828–3836 (2017)
Google Scholar
Tekin, B., Sinha, S. N., Fua, P.: Real-time seamless single shot 6d object pose prediction. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 292–301 (2018)
Google Scholar
Zhao, W., Zhang, S., Guan, Z., Zhao, W., Peng, J., Fan, J.: Learning deep network for detecting 3d object keypoints and 6d poses. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 14134–14142 (2020)
Google Scholar
Pavlakos, G., Zhou, X., Chan, A., Derpanis, K. G., Daniilidis, K.: 6-dof object pose from semantic keypoints. In: 2017 IEEE International Conference on Robotics and Automation, pp. 2011–2018 (2017)
Google Scholar
Hu, Y., Hugonot, J., Fua, P., Salzmann, M.: Segmentation-driven 6d object pose estimation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3385–3394 (2019)
Google Scholar
Oberweger, M., Rad, M., Lepetit, V.: Making deep heatmaps robust to partial occlusions for 3d object pose estimation. In: Proceedings of the European Conference on Computer Vision, pp. 119–134 (2018)
Google Scholar
Di, Y., Manhardt, F., Wang, G., Ji, X., Navab, N., Tombari, F.: So-pose: exploiting self-occlusion for direct 6d pose estimation. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 12396–12405 (2021)
Google Scholar
Hodan, T., Barath, D., Matas, J.: Epos: Estimating 6d pose of objects with symmetries. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 11703–11712 (2020)
Google Scholar
Li, Z., Wang, G., Ji, X.: Cdpn: Coordinates-based disentangled pose network for real-time RGB-based 6-DOF object pose estimation. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 7678–7687 (2019)
Google Scholar
Wang, G., Manhardt, F., Tombari, F., Ji, X.: GDR-net: geometry-guided direct regression network for monocular 6d object pose estimation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 16611–16621 (2021)
Google Scholar
Zakharov, S., Shugurov, I., Ilic, S.: Dpod: 6d pose object detector and refiner. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1941–1950 (2019)
Google Scholar
Chen, B., Parra, A., Cao, J., Li, N., Chin, T. J.: End-to-end learnable geometric vision by backpropagating pnp optimization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8100–8109 (2020)
Google Scholar
Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7263–7271 (2017)
Google Scholar
Xiang, Y., Schmidt, T., Narayanan, V., Fox, D.: PoseCNN: a convolutional neural network for 6d object pose estimation in cluttered scenes. In: Robotics: Science and Systems Conference, pp. 19–32 (2018)
Google Scholar
Shotton, J., Glocker, B., Zach, C., Izadi, S., Criminisi, A., Fitzgibbon, A.: Scene coordinate regression forests for camera relocalization in RGB-D images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2930–2937 (2013)
Google Scholar
Nigam, A., Penate-Sanchez, A., Agapito, L.: Detect globally, label locally: Learning accurate 6-DOF object pose estimation by joint segmentation and coordinate regression. IEEE Robot. Autom. Lett.3(4), 3960–3967 (2018)
Article Google Scholar
Lepetit, V., Moreno-Noguer, F., Fua, P.: EPnP: an accurate O(n) solution to the PnP problem. Int. J. Comput. Vision81(2), 155–166 (2009)
Article Google Scholar
Brachmann, E., Krull, A., Michel, F., Gumhold, S., Shotton, J., Rother, C.: Learning 6d object pose estimation using 3d object coordinates. In: Proceedings of the European Conference on Computer Vision, pp. 536–551 (2014)
Google Scholar
Michel, F., et al.: Global hypothesis generation for 6D object pose estimation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 462–471 (2017)
Google Scholar
Brachmann, E., Michel, F., Krull, A., Yang, M. Y., Gumhold, S.: Uncertainty-driven 6d pose estimation of objects and scenes from a single RGB image. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3364–3372 (2016)
Google Scholar
Hinterstoisser, S., et al.: Model based training, detection and pose estimation of texture-less 3d objects in heavily cluttered scenes. In: Asian Conference on Computer Vision, pp. 548–562 (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
Lichun Wang, Chao Yang, Jianjia Xin & Baocai Yin
Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing University of Technology, Beijing, 100124, China
Lichun Wang, Chao Yang, Jianjia Xin & Baocai Yin

Authors

Lichun Wang
View author publications
You can also search for this author inPubMed Google Scholar
Chao Yang
View author publications
You can also search for this author inPubMed Google Scholar
Jianjia Xin
View author publications
You can also search for this author inPubMed Google Scholar
Baocai Yin
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence toLichun Wang.

Editor information

Editors and Affiliations

Dalian University of Technology, Dalian, China
Huchuan Lu
University of Sydney, Sydney, NSW, Australia
Wanli Ouyang
Shenzhen University, Shenzhen, China
Hui Huang
Tsinghua University, Beijing, China
Jiwen Lu
Dalian University of Technology, Dalian, China
Risheng Liu
Institute of Automation, CAS, Beijing, China
Jing Dong
University of Technology Sydney, Sydney, NSW, Australia
Min Xu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wang, L., Yang, C., Xin, J., Yin, B. (2023). Distance-Aware Vector-Field and Vector Screening Strategy for 6D Object Pose Estimation. In: Lu, H.,et al. Image and Graphics. ICIG 2023. Lecture Notes in Computer Science, vol 14356. Springer, Cham. https://doi.org/10.1007/978-3-031-46308-2_31

Download citation

DOI:https://doi.org/10.1007/978-3-031-46308-2_31
Published:30 October 2023
Publisher Name:Springer, Cham
Print ISBN:978-3-031-46307-5
Online ISBN:978-3-031-46308-2
eBook Packages:Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Movatterモバイル変換

Distance-Aware Vector-Field and Vector Screening Strategy for 6D Object Pose Estimation

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Pose Estimation of 3D Objects Based on Point Pair Feature and Weighted Voting

6D object pose estimation based on dense convolutional object center voting with improved accuracy and efficiency

NMPose: Leveraging Normal Maps for 6D Pose Estimation

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Access this chapter

Subscribe and save

Buy Now