Movatterモバイル変換


[0]ホーム

URL:


Skip to main content
Springer Nature Link
Log in

Underwater target detection and embedded deployment based on lightweight YOLO_GN

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

In order to solve the problem of missing various targets due to the limited memory and computing power of underwater equipment and also the complexity of the underwater environment, a lightweight and efficient underwater target detection algorithm YOLO_GN (YOLO with Ghost network) is proposed. Based on the basic framework of YOLOv5s, the algorithm designs a new backbone using GhostNetV2 and proposes Ghost_BottleneckV2 combined with dynamic sparse attention BiFormer to reduce computational costs and improve detection accuracy. The lightweight multi-scale convolutional LW-GSConv combined with VOV-GSCSP is used to capture the complex features of the input data more accurately and improve the network expression ability. In view of the imbalance of a large number of detection samples in underwater targets, the SlideLoss function is introduced and the optimizer of original model is updated to Sophia, so that the algorithm model has better generalization ability. Finally, the YOLO_GN algorithm is equipped with the Raspberry Pi 4B development board, and the camera is called up to realize real-time detection of underwater targets. Simulation results show that the proposed method can achieve 85.35% detection accuracy on the URPC dataset, which is 2.43% higher than the most common architecture in underwater scenarios, and computational complexity is reduced by 46.47%. Moreover, it can achieve better object detection effect in embedded terminals.

This is a preview of subscription content,log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Japan)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23
Fig. 24
Fig. 25
Fig. 26

Similar content being viewed by others

Data availability

Datasets (URPC2020) used in this work are publicly available in the Internet.

References

  1. Limin H (2022) Developing deep blue fisheries and building a maritime power. J Ocean Univ China 3:5–8

    Google Scholar 

  2. Ge Z, Liu S, Wang F, et al. (2021) Yolox: exceeding yolo series in 2021.arXiv:2107.08430

  3. Rahmati M, Pompili D (2017) UNISeC: inspection, separation, and classification of underwater acoustic noise point sources. IEEE J Ocean Eng 43(3):777–791

    Article  Google Scholar 

  4. Redmon J, Divvala S. Girshick R, et al. (2016) You only look once: unified, real-time object detection. In: Computer Vision and Pattern Recognition. IEEE, Las Vegas, pp 779–788

  5. Shi P, Xu X, Ni J et al (2021) Underwater biological detection algorithm based on improved faster R-CNN. Water 13(17):1–12

    Article  Google Scholar 

  6. Ribeiro MT, Singh S, Guestrin G (2018) Anchors: high-precision model-agnostic explanations. In: Proceedings of the AAAI Conference, vol 32, no 1

  7. Peng F, Miao Z, Li F et al (2021) S-FPN: A shortcut feature pyramid network for sea cucumber detection in underwater images. Expert Syst Appl 182(11):1–13

    Google Scholar 

  8. Sang J, Wu Z, Guo P et al (2018) An improved YOLOv2 for vehicle detection. Sensors 18(12):4272

    Article  Google Scholar 

  9. Zhao L, Li S (2020) Object detection algorithm based on improved YOLOv3. Electronics 9(3):537

    Article  Google Scholar 

  10. Gai R, Chen N, Yuan H (2023) A detection algorithm for cherry fruits based on the improved YOLO-v4 model. Neural Comput Appl 35(19):13895–13906

    Article  Google Scholar 

  11. Zheng Z, Wang P, Liu W, et al. (2020) Distance-IoU loss: faster and better learning for bounding box regression. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol 34, no 7, pp 12993–13000

  12. Zhang Y, Zhou Y (2019) Overview of clustering algorithms. Comput Appl 39(7):1869–2188

    Google Scholar 

  13. Ying Z, Lin Z, Wu Z et al (2022) A modified-YOLOv5s model for detection of wire braided hose defects. Measurement 190:110683

    Article  Google Scholar 

  14. Güney E, Bayilmiş C, Çakan B (2022) An implementation of real-time traffic signs and road objects detection based on mobile GPU platforms. IEEE Access 10:86191–86203

    Article  Google Scholar 

  15. Güney E, Bayılmış C, Çakar S, et al. Autonomous control of shore robotic charging systems based on computer vision[J]. Expert Systems with Applications, 2023:122116.

    Article  Google Scholar 

  16. Güney E, Sahin IH, Cakar S, et al. (2022) Electric shore-to-ship charging socket detection using image processing and YOLO. In: 2022 International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT). IEEE, pp 1069–1073.

  17. Li C, Li L, Jiang H, et al. (2022) YOLOv6: a single-stage object detection framework for industrial applications.arXiv:2209.02976

  18. Wang CY, Bochkovskiy A, Liao HYM (2023) YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 7464–7475

  19. Talaat FM, ZainEldin H (2023) An improved fire detection approach based on YOLO-v8 for smart cities. Neural Comput Appl 35(28):20939–20954

    Article  Google Scholar 

  20. Zhang Q, Jiang Z, Lu Q, et al. (2020) Split to be slim: an overlooked redundancy in vanilla convolution. In: International Joint Conference on Artificial Intelligence. Morgan Kaufmann, Yokohama, pp 3167–3173

  21. Zhu L, Wang X, Ke Z, et al. (2023) BiFormer: vision transformer with bi-level routing attention. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 10323–10333

  22. Frischholz RW, Dieckmann U (2000) BiolD: a multimodal biometric identification system. Computer 33(2):64–68

    Article  Google Scholar 

  23. Li H, Li J, Wei H, et al. Slim-neck by GSConv: A better design paradigm of detector architectures for autonomous vehicles. arXiv preprintarXiv:2206.02424, 2022.

  24. Zhao X, Song Y (2023) Improved ship detection with YOLOv8 enhanced with MobileViT and GSConv. Electronics 12(22):4666

    Article  Google Scholar 

  25. Wei P (2021) Design of software defined radio platform based on Raspberry Pi. Lanzhou University, Lanzhou

    Google Scholar 

  26. Misra D (2019) Mish: a self regularized non-monotonic activation function.arXiv:1908.08681

  27. Mahaur B, Mishra KK (2023) Small-object detection based on YOLOv5 in autonomous driving systems. Pattern Recogn Lett 168:115–122

    Article  Google Scholar 

  28. Dai X, Chen Y, Xiao B, et al. (2021) Dynamic head: unifying object detection heads with attentions. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 7373–7382

  29. Li Z, Fang X, Zhen T et al (2023) Detection of wheat yellow rust disease severity based on improved GhostNetV2. Appl Sci 13(17):9987

    Article  Google Scholar 

  30. Tang Y, Han K, Guo J et al (2022) GhostNetv2: enhance cheap operation with long-range attention. Adv Neural Inf Process Syst 35:9969–9982

    Google Scholar 

  31. Wu Z, Shen C, Van Den Hengel A (2019) Wider or deeper: revisiting the resnet model for visual recognition. Pattern Recogn 90:119–133

    Article  Google Scholar 

  32. He F, Liu T, Tao D (2020) Why resnet works? Residuals generalize. IEEE Trans Neural Netw Learn Syst 31(12):5349–5362

    Article MathSciNet  Google Scholar 

  33. Findlater L, McGrenere J (2010) Beyond performance: Feature awareness in personalized interfaces. Int J Hum Comput Stud 68(3):121–137

    Article  Google Scholar 

  34. Kruse R, Mostaghim S, Borgelt C, et al (2022) Multi-layer perceptrons. Computational intelligence: a methodological introduction. Springer International Publishing, Cham, pp 53–124

  35. Zhao Z, Xu S, Kang BH et al (2015) Investigation and improvement of multi-layer perceptron neural networks for credit scoring. Expert Syst Appl 42(7):3508–3516

    Article  Google Scholar 

  36. Jha N K, Saini R, Nag S, et al. (2020) E2GC: Energy-efficient group convolution in deep neural networks. In: 2020 33rd International Conference on VLSI Design and 2020 19th International Conference on Embedded Systems (VLSID). IEEE, pp 155–160

  37. Babicki S, Arndt D, Marcu A et al (2016) Heatmapper: web-enabled heat mapping for all. Nucleic Acids Res 44(W1):W147–W153

    Article  Google Scholar 

  38. Ren S, He K, Girshick R et al (2016) Object detection networks on convolutional feature maps. IEEE Trans Pattern Anal Mach Intell 39(7):1476–1481

    Article  Google Scholar 

  39. Yu Z, Huang H, Chen W, et al. (2022) Yolo-facev2: a scale and occlusion aware face detector. arXiv preprintarXiv:2208.02019

  40. Sun X, Wu P, Hoi SCH (2018) Face detection using deep learning: an improved faster RCNN approach. Neurocomputing 299:42–50

    Article  Google Scholar 

  41. Jiang D, Li G, Tan C et al (2021) Semantic segmentation for multiscale target based on object recognition using the improved Faster-RCNN model. Futur Gener Comput Syst 123:94–104

    Article  Google Scholar 

  42. Zhai S, Shang D, Wang S et al (2020) DF-SSD: An improved SSD object detection algorithm based on DenseNet and feature fusion. IEEE Access 8:24344–24357

    Article  Google Scholar 

  43. Chen X, Yuan M, Yang Q et al (2023) Underwater-YCC: underwater target detection optimization algorithm based on YOLOv7. J Mar Sci Eng 11(5):995

    Article  Google Scholar 

  44. Zhao S, Yuh J (2005) Experimental study on advanced underwater robot control. IEEE Trans Rob 21(4):695–703

    Article  Google Scholar 

Download references

Acknowledgments

This work was supported by the Key project of Natural Science Foundation of Shaanxi Province, Grant. 2024JC-YBQN-0724 and National Natural Science Foundation of China, 62031021, Grant Number 62031021.

Author information

Authors and Affiliations

  1. Shaanxi University of Science and Technology, Xi’an City, 710021, Shaanxi Province, China

    Xiao Chen, Chenye Fan, Jingjing Shi, Haiyan Wang & Haiyang Yao

  2. Northwestern Polytechnical University, Xi’an City, 710072, Shaanxi Province, China

    Haiyan Wang

Authors
  1. Xiao Chen

    You can also search for this author inPubMed Google Scholar

  2. Chenye Fan

    You can also search for this author inPubMed Google Scholar

  3. Jingjing Shi

    You can also search for this author inPubMed Google Scholar

  4. Haiyan Wang

    You can also search for this author inPubMed Google Scholar

  5. Haiyang Yao

    You can also search for this author inPubMed Google Scholar

Contributions

Conceptualization, Xiao Chen and Chenye Fan; Formal analysis, Jingjing Shi, Xiao Chen and Chenye Fan; Funding acquisition, Haiyan Wang; Investigation, Haiyang Yao; Methodology, Xiao Chen and Chenye Fan; Resources, Xiao Chen and Chenye Fan; Software, Chenye Fan; Validation, Chenye Fan; Writing – original draft, Xiao Chen and Chenye Fan; Writing – review & editing, Jingjing Shi, Xiao Chen, Chenye Fan, Haiyang Yao and Haiyan Wang. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence toHaiyan Wang.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chen, X., Fan, C., Shi, J.et al. Underwater target detection and embedded deployment based on lightweight YOLO_GN.J Supercomput80, 14057–14084 (2024). https://doi.org/10.1007/s11227-024-06020-0

Download citation

Keywords

Access this article

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Japan)

Instant access to the full article PDF.

Advertisement


[8]ページ先頭

©2009-2025 Movatter.jp