CN119322682B

Movatterモバイル変換

Info

Publication number: CN119322682B
Application number: CN202411874201.4A
Authority: CN
Inventors: 张卫平; 杨淦; 梁昊星; 刘安; 陈静婷; 李玲
Original assignee: Global Numerical Technology Co ltd
Current assignee: Global Numerical Technology Co ltd
Priority date: 2024-12-19
Filing date: 2024-12-19
Publication date: 2025-03-18
Anticipated expiration: 2044-12-19
Also published as: CN119322682A

Abstract

Translated fromChinese

本发明提供了一种用于大模型训练的自适应算力调度系统，涉及电数字数据处理领域，包括算力管理模块、任务分析模块、自适应调度模块和数据交互模块，所述算力管理模块用于负责整体算力资源的管理和监控，所述任务分析模块用于负责分析训练任务的需求与特征，所述自适应调度模块根据实时资源状况与任务需求进行算力调度，所述数据交互模块用于管理不同算力节点之间的数据交互；本系统通过分派训练任务至不同的节点，并在执行任务过程中调整分配的资源，能够有效地提高训练效率。

The present invention provides an adaptive computing power scheduling system for large model training, which relates to the field of electrical digital data processing, including a computing power management module, a task analysis module, an adaptive scheduling module and a data interaction module. The computing power management module is responsible for the management and monitoring of the overall computing power resources, the task analysis module is responsible for analyzing the needs and characteristics of training tasks, the adaptive scheduling module performs computing power scheduling according to real-time resource conditions and task requirements, and the data interaction module is used to manage data interaction between different computing power nodes. The system can effectively improve the training efficiency by dispatching training tasks to different nodes and adjusting the allocated resources during the execution of tasks.