Movatterモバイル変換

Qianyou Zhao ORCID:orcid.org/0000-0003-2874-1081¹,
Le Gao³,
Duidi Wu¹,
Yihao Lei¹,
Lingyu Wang¹,
Jin Qi¹ &
…
Jie Hu^1,2

176Accesses
Explore all metrics

Abstract

The automation of excavator operations entails the development and implementation of systems that allow excavators to execute tasks autonomously, thereby significantly reducing the need for human intervention. By integrating advanced sensors and artificial intelligence algorithms, these systems aim to increase operational efficiency, safety, and precision in construction and mining. However, previously developed methods have two weaknesses. First, existing automated excavator systems struggle with adapting to diverse and complex environmental conditions and with precision in control mechanisms. Second, operating an excavator involves multiple, repeated decisions that need to be modeled, planned, and executed in real time. However, there is a significant lack of comprehensive datasets that reflect real-world excavation operations to support this process. In this paper, we present an innovative system named E-GCDT. This system integrates the DoppelGANger module, which generates action time series by emulating human-mined trajectories through a generative adversarial mechanism and replays them in a simulation environment, ultimately expanding the dataset to 155 continuous mining trajectories. Furthermore, E-GCDT integrates terrain features into the decision-making process with the contrastive language-image pre-training model (CLIP), in which the decision transformer optimizes trajectory planning for efficient and accurate continuous excavation tasks. E-GCDT uniquely integrates advanced data augmentation and terrain awareness, developing an advanced Markov decision framework (DT) for continuous excavation tasks. The experimental results of a bulldozer verify that the efficiency of E-GCDT surpasses human efficiency. This system sets a new standard for continuous autonomous mining, and this study provides a new perspective on the application of reinforcement learning in industrial environments.

This is a preview of subscription content,log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price includes VAT (Japan)

Instant access to the full article PDF.

Institutional subscriptions

TNES: terrain traversability mapping, navigation and excavation system for autonomous excavators on worksite

Article04 July 2023

Physics-Informed Neural Networks-Based Online Excavation Trajectory Planning for Unmanned Excavator

ArticleOpen access08 November 2024

Adaptive spatial discretization using reinforcement learning

ArticleOpen access09 January 2023

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Data Availability

Data are available on request to the authors. For more information, please contact us at 1025034345@sjtu.edu.cn

References

Hemami A, Hassani F (2009) An overview of autonomous loading of bulk material. In: 26th International symposium on automation and robotics in construction, pp 405–411. International Association for Automation and Robotics in Construction (IAARC …
Safety O, Administration H, et al (2015) Excavation: Hazard recognition in trenching and shoring. OSHA Technical Manual (Chapter 2). US Department of Labor Occupational Safety and Health Administration. Retrieved on 10, November 2017
Lew J, Abraham D, Wirahadikusumah R, Irizarry J, Arboleda C (2002) Excavation and trenching safety: existing standards and challenges. Implement. Safety Health Construct, Sites, CIB
Akinosho T, Oyedele LO, Bilal M, Ajayi A, Delgado M, AkinadéOO Ahmed AA (2020) Deep learning in the construction industry: A review of present status and future innovations. J Build Eng 32:101827.https://doi.org/10.1016/J.JOBE.2020.101827
Zhang L, Zhao J, Long P, Wang L, Qian L, Lu F, Song X, Manocha D (2021) An autonomous excavator system for material loading tasks. Sci Robot 6(55):3164
Article MATH Google Scholar
Afanuh S, Gillen M, Lentz T (2011) Preventing worker deaths from trench cave-ins
Bauerle T, Dugdale Z, Poplin G (2018) Mineworker fatigue: A review of what we know and future decisions. Min Engg 70(3):33
Google Scholar
Dadhich S, Bodin U, Andersson U (2016) Key challenges in automation of earth-moving machines. Autom Constr 68:212–222
Article MATH Google Scholar
François-Lavet V, Henderson P, Islam R, Bellemare MG, Pineau J, et al (2018) An introduction to deep reinforcement learning. Foundations and Trends® in Machine Learning 11(3-4):219–354
Schulman J, Wolski F, Dhariwal P, Radford A, Klimov O (2017) Proximal policy optimization algorithms.arXiv:1707.06347
Liu Y, Wang Y, Zhou Z (2024) Simulation of coherent excavator operations in earthmoving tasks based on reinforcement learning. Buildings.https://doi.org/10.3390/buildings14103270
Casas N (2017) Deep deterministic policy gradient for urban traffic light control.arXiv:1703.09035
Azulay O, Shapiro A (2021) Wheel loader scooping controller using deep reinforcement learning. IEEE Access 9:24145–24154.https://doi.org/10.1109/ACCESS.2021.3056625
Article MATH Google Scholar
Paduraru C, Mankowitz D, Dulac-Arnold G, Li J, Levine N, Gowal S, Hester T (2021) Challenges of real-world reinforcement learning: definitions, benchmarks and analysis. Mach Learn 110:2419–2468.https://doi.org/10.1007/s10994-021-05961-4
Article MathSciNet MATH Google Scholar
Hodel BJ (2018) Learning to operate an excavator via policy optimization. Procedia Comput Sci 140:376–382
Article MATH Google Scholar
Prudencio RF, Maximo M, Colombini E (2022) A survey on offline reinforcement learning: Taxonomy, review, and open problems. IEEE transactions on neural networks and learning systems PP.https://doi.org/10.1109/TNNLS.2023.3250269
Bansal MA, Sharma DR, Kathuria DM (2022) A systematic review on data scarcity problem in deep learning: solution and applications. ACM Comput Surv (CSUR) 54(10s):1–29
Article MATH Google Scholar
Babu A, Kirchner F (2021) Terrain adaption controller for a walking excavator robot using deep reinforcement learning. 2021 20th International Conference on Advanced Robotics (ICAR), pp 64–70.https://doi.org/10.1109/ICAR53236.2021.9659399
Babu A, Danter L, Willenbrock P, Natarajan S, Kuehn D, Kirchner F (2022) Arter: a walking excavator robot for autonomous and remote operations. at - Automatisierungstechnik 70, 876–887.https://doi.org/10.1515/auto-2022-0056
Chen L, Lu K, Rajeswaran A, Lee K, Grover A, Laskin M, Abbeel P, Srinivas A, Mordatch I (2021) Decision transformer: Reinforcement learning via sequence modeling. Adv Neural Inf Process Syst 34:15084–15097
Google Scholar
Gui J, Sun Z, Wen Y, Tao D, Ye J (2021) A review on generative adversarial networks: Algorithms, theory, and applications. IEEE Trans Knowl Data Eng 35(4):3313–3332
Article MATH Google Scholar
Radford A, Kim J.W, Hallacy C, Ramesh A, Goh G, Agarwal S, Sastry G, Askell A, Mishkin P, Clark J, et al (2021) Learning transferable visual models from natural language supervision. In: International conference on machine learning, pp 8748–8763. PMLR
Lin Z, Jain A, Wang C, Fanti G, Sekar V (2020) Using gans for sharing networked time series data: Challenges, initial promise, and open questions. In: Proceedings of the ACM internet measurement conference, pp 464–483
Stentz A, Bares J, Singh S, Rowe P (1999) A robotic excavator for autonomous truck loading. Auto Robot 7:175–186
Article Google Scholar
Singh S, Cannon H (1998) Multi-resolution planning for earthmoving. In: Proceedings. 1998 IEEE international conference on robotics and automation (Cat. No. 98CH36146), vol 1, pp 121–126. IEEE
Chang PH, Lee S-J (2002) A straight-line motion tracking control of hydraulic excavator system. Mechatronics 12(1):119–138
Article MATH Google Scholar
Schmidt D, Proetzsch M, Berns K (2010) Simulation and control of an autonomous bucket excavator for landscaping tasks. In: 2010 IEEE international conference on robotics and automation, pp 5108–5113. IEEE
Quanchen Z, Tao Z, Runpu W, Jinxin M, Meicong L, Chen C, Changyu H (2018) From automation to intelligence: Survey of research on vulnerability discovery techniques. J Tsinghua Univ (Sci Technol) 58(12):1079–1094
Google Scholar
Afshar RR, Zhang Y, Vanschoren J, Kaymak U (2022) Automated reinforcement learning: An overview.arXiv:2201.05000
Dadhich S, Bodin U, Sandin F, Andersson U (2016) Machine learning approach to automatic bucket loading. In: 2016 24th Mediterranean conference on control and automation (MED), pp 1260–1265. IEEE
Fukui R, Niho T, Nakao M, Uetake M (2017) Imitation-based control of automated ore excavator: improvement of autonomous excavation database quality using clustering and association analysis processes. Adv Robot 31(11):595–606
Article Google Scholar
Kurinov I, Orzechowski G, Hämäläinen P, Mikkola A (2020) Automated excavator based on reinforcement learning and multibody system dynamics. IEEE access 8:213998–214006
Article MATH Google Scholar
Osa T, Aizawa M (2022) Deep reinforcement learning with adversarial training for automated excavation using depth images. IEEE Access 10:4523–4535
Article MATH Google Scholar
Kalashnikov D, Irpan A, Pastor P, Ibarz J, Herzog A, Jang E, Quillen D, Holly E, Kalakrishnan M, Vanhoucke V, et al (2018) Scalable deep reinforcement learning for vision-based robotic manipulation. In: Conference on robot learning, pp 651–673. PMLR
Xu J, Yoon H-S (2016) A review on mechanical and hydraulic system modeling of excavator manipulator system. J Constr Eng 1:9409370
Google Scholar
SY750H | Large Excavator. (2024)https://www.sanyglobal.com/product/excavator/large_excavator/115/847/, Last accessed on 2024-3-30
SKT90S (Automatic) Diesel Off-highway Mining Truck (2024)https://www.sanyglobal.com/product/truck/off-highway_mining_truck/75/445/, Last accessed on 2024-3-30
Bernardes E, Viollet S (2022) Quaternion to euler angles conversion: A direct, general and computationally efficient method. Plos one 17(11):0276302
Article MATH Google Scholar
Van Erven T, Harremos P (2014) Rényi divergence and kullback-leibler divergence. IEEE Trans Inf Theory 60(7):3797–3820
Article MATH Google Scholar
Yin H, Vahdat A, Alvarez J.M, Mallya A, Kautz J, Molchanov P (2022) A-vit: Adaptive tokens for efficient vision transformer. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 10809–10818
Radford A, Wu J, Child R, Luan D, Amodei D, Sutskever I et al (2019) Language models are unsupervised multitask learners. OpenAI blog 1(8):9
Google Scholar
Kumar A, Zhou A, Tucker G, Levine S (2020) Conservative q-learning for offline reinforcement learning. Adv Neural Inf Process Syst 33:1179–1191
Torabi F, Warnell G, Stone P (2018) Behavioral cloning from observation.arXiv:1805.01954
Hout MC, Papesh MH, Goldinger SD (2013) Multidimensional scaling. Wiley Interdiscip Rev: Cogn Sci 4(1):93–103
Article MATH Google Scholar
Anh DT, Thanh LH (2015) An efficient implementation of k-means clustering for time series data with dtw distance. Int J Bus Intell Data Mining 10(3):213–232

Download references

Acknowledgements

This research is supported by the National Key R&D Program of China (2022YFB3402001), and the National Natural Science Foundation of China (Grant Nos. 52475270, 52375254). We would also like to extend our sincere thanks to Sany Company for their support and contribution to this research.

Author information

Authors and Affiliations

School of Mechanical Engineering, Shanghai Jiao Tong University, 201100, Shanghai, China
Qianyou Zhao, Duidi Wu, Yihao Lei, Lingyu Wang, Jin Qi & Jie Hu
School of Design, Shanghai Jiao Tong University, 201100, Shanghai, China
Jie Hu
Sany heavy machinery Co.ltd, Kunshan, 215300, Jiangsu, China
Le Gao

Authors

Qianyou Zhao
View author publications
You can also search for this author inPubMed Google Scholar
Le Gao
View author publications
You can also search for this author inPubMed Google Scholar
Duidi Wu
View author publications
You can also search for this author inPubMed Google Scholar
Yihao Lei
View author publications
You can also search for this author inPubMed Google Scholar
Lingyu Wang
View author publications
You can also search for this author inPubMed Google Scholar
Jin Qi
View author publications
You can also search for this author inPubMed Google Scholar
Jie Hu
View author publications
You can also search for this author inPubMed Google Scholar

Contributions

Conceptualization: Qianyou Zhao; Methodology: Qianyou Zhao, Duidi Wu and Yihao Lei; Formal analysis and investigation: Qianyou Zhao; Writing - original draft preparation: Qianyou Zhao; Writing - review and editing: Lingyu Wang, Jin Qi and Jie Hu; Funding acquisition: Le Gao and Jie Hu; Resources: Le Gao; Supervision: Jie Hu.

Corresponding authors

Correspondence toJin Qi orJie Hu.

Ethics declarations

Competing Interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Ethical Approval

This study did not involve any human participants, human data, or animals, and therefore did not require any ethical approval or informed consent.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhao, Q., Gao, L., Wu, D.et al. E-GCDT: advanced reinforcement learning with GAN-enhanced data for continuous excavation system.Appl Intell55, 413 (2025). https://doi.org/10.1007/s10489-025-06308-5

Download citation

Accepted:28 January 2025
Published:07 February 2025
DOI:https://doi.org/10.1007/s10489-025-06308-5

Movatterモバイル変換

E-GCDT: advanced reinforcement learning with GAN-enhanced data for continuous excavation system

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

TNES: terrain traversability mapping, navigation and excavation system for autonomous excavators on worksite

Physics-Informed Neural Networks-Based Online Excavation Trajectory Planning for Unmanned Excavator

Adaptive spatial discretization using reinforcement learning

Explore related subjects

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing Interests

Ethical Approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Access this article

Subscribe and save

Buy Now