CN120440311A

Movatterモバイル変換

Info

Publication number: CN120440311A
Application number: CN202510942197.9A
Authority: CN
Inventors: 郭鹏宇
Original assignee: National Defense Technology Innovation Institute PLA Academy of Military Science
Current assignee: National Defense Technology Innovation Institute PLA Academy of Military Science
Priority date: 2025-07-09
Filing date: 2025-07-09
Publication date: 2025-08-08

Abstract

本发明公开了基于进化算法和强化学习的航天器形态控制方法及系统，具体为：S1.建立初始种群，种群中的个体为不同功能模块和不同模块数量组合的不同形态的航天器；S2.对种群中的所有个体进行内环初始化学习训练，计算每个个体的适应度值；S3.选择适应度值较高的个体组成精英种群；S4.利用遗传算法中的遗传和变异操作，对精英种群进行均匀交叉、单点变异，生成精英子代；S5.内环强化学习：对步骤S4中获得精英子代进行学习训练；S6.形态评估；S7.形成最优个体。本发明充分结合空间环境和任务需求的特点，基于深度进化强化学习的内外环算法架构，通过外环形态进化和内环学习训练不断交替，实现模块化航天器的形态自主生成。

The present invention discloses a spacecraft morphology control method and system based on evolutionary algorithms and reinforcement learning, specifically: S1. Establishing an initial population, in which individuals in the population are spacecraft of different morphologies with different functional modules and different module quantity combinations; S2. Performing inner-loop initialization learning training on all individuals in the population, and calculating the fitness value of each individual; S3. Selecting individuals with higher fitness values to form an elite population; S4. Using the inheritance and mutation operations in the genetic algorithm, performing uniform crossover and single-point mutation on the elite population to generate elite offspring; S5. Inner-loop reinforcement learning: performing learning and training on the elite offspring obtained in step S4; S6. Morphology evaluation; S7. Forming the optimal individual. The present invention fully combines the characteristics of the space environment and mission requirements, and based on the inner and outer loop algorithm architecture of deep evolutionary reinforcement learning, realizes the autonomous generation of the morphology of modular spacecraft through the continuous alternation of outer-loop morphological evolution and inner-loop learning and training.