CN111858013A

Info

Publication number: CN111858013A
Application number: CN202010865834.4A
Authority: CN
Inventors: 谭光明; 汤瑞; 邵恩; 张春明; 段勃
Original assignee: Western Institute Of Advanced Technology Institute Of Computing Chinese Academy Of Sciences
Current assignee: Western Institute Of Advanced Technology Institute Of Computing Chinese Academy Of Sciences
Priority date: 2020-06-19
Filing date: 2020-08-25
Publication date: 2020-10-30

Abstract

The invention provides a workflow job scheduling control method comprising the following steps: traversing all jobs in the workflow of the job control module and recording, for each job, its job number and the number of its predecessor dependent jobs; determining the executable jobs in the workflow; determining the priority of each executable job and sending the executable job and its priority to a job control queue; the job control queue divides the jobs into different priority levels according to their priorities, and the selected jobs within each priority level are executed in order from the high priority level to the low priority level.

Description

Workflow job scheduling control method

Technical Field

The invention relates to the field of computers, in particular to a workflow job scheduling control method.

Background

Scientific computing, deep learning, and big data jobs have become the most common job types in data centers and cloud computing centers. Job types in existing data centers and cloud computing centers are diversifying, while hardware resources are becoming increasingly diverse and heterogeneous. Besides the common x86 CPUs and GPUs, machine resources also include NPUs (neural processing units) for deep learning job training (such as AI chips for deep learning model inference), various special-purpose FPGA chips, and Arm, MIPS, and open-source RISC-V processors. Moving these different types of heterogeneous computing resources into the cloud has become the best choice for handling the diversity of job loads and the heterogeneity of computing resources. Existing container orchestration management systems (such as Kubernetes) have become the core for running and managing containerized jobs. A container orchestration management system not only provides a uniform runtime environment for diverse job loads, but also hides hardware differences from application developers. Application developers can focus solely on developing the application without attending to system environment configuration and maintenance. System developers, in turn, need not provide compatibility support for each application; they only need to adapt the unified Kubernetes application runtime container for compatibility.

However, support in container orchestration management systems for scientific computing, deep learning, and big data jobs is still not ideal. This is because existing general-purpose container orchestration management systems were originally oriented toward stateless web services, and their support for stateful services such as scientific computing, deep learning, and big data jobs, as well as caches and databases, is insufficient. Typically, a deep learning job has multiple steps, such as data acquisition, data processing, data conversion, data segmentation, model training, parameter tuning, model verification, online model monitoring, and log collection. In a load with such "workflow" features, each step must wait for the previous step to complete before it can execute.

However, the job scheduler in current container orchestration management systems still adopts a first-come, first-served scheduling policy, which does not suit the characteristics of "workflow" type job loads. For example, in a multi-tenant scenario where multiple users submit multiple workflow jobs, workflow jobs submitted later are delayed for a long time, and the quality of service of each workflow cannot be guaranteed.

Therefore, in order to solve the above technical problems, it is necessary to provide a new technical means for solving the problems.

Disclosure of Invention

In view of the above, an object of the present invention is to provide a method for controlling job scheduling of a workflow, which can dynamically adjust the execution priority of jobs in the workflow and dynamically adjust the execution order of each job in the same priority, thereby effectively improving the service quality of the jobs, improving the resource utilization rate, and reducing the overall completion time of the jobs.

The invention provides a workflow job scheduling control method, which comprises the following steps:

S1, traversing all the jobs in the workflow of the job control module, and recording the job number and the number of predecessor dependent jobs of each job;

S2, determining the executable jobs in the workflow;

S3, determining the priority of each executable job, and sending the executable job and its priority to a job control queue;

S4, the job control queue divides the jobs into different priority levels according to their priorities, and the selected jobs within each priority level are executed in order from the high priority level to the low priority level.
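Step S1 can be illustrated with a short sketch. The input format (a dictionary mapping each job number to the job numbers it depends on) and the function names `count_predecessors` and `executable_jobs` are assumptions made for illustration; the method itself does not prescribe a data structure.

```python
from collections import defaultdict


def count_predecessors(workflow):
    """Record, for each job number, how many predecessor dependent jobs it has.

    `workflow` maps a job number to the list of job numbers it depends on
    (a hypothetical input format chosen for this sketch).
    """
    pending = {job: len(deps) for job, deps in workflow.items()}
    successors = defaultdict(list)
    for job, deps in workflow.items():
        for dep in deps:
            successors[dep].append(job)  # reverse edges, used when a job finishes
    return pending, dict(successors)


def executable_jobs(pending):
    """Jobs with no remaining predecessor dependent jobs can run (step S2)."""
    return sorted(job for job, count in pending.items() if count == 0)


# Job 4 depends on jobs 2 and 3, which both depend on job 1.
workflow = {1: [], 2: [1], 3: [1], 4: [2, 3]}
pending, successors = count_predecessors(workflow)
print(pending)
print(executable_jobs(pending))
```

Keeping the reverse edges alongside the counts makes it cheap to decrement the counters of the dependent jobs whenever a predecessor completes.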

Further, in step S2, the executable job determination process in the workflow is as follows:

S21, judging from the job number of job i whether job i belongs to the workflow; if so, proceeding to the next step, and if not, ending;

S22, recording the running time of each predecessor dependent job of job i, including the job completion time $T_{receive}$ and the job start time $T_{send}$, and calculating the actual execution time $T_{execute}$ of the predecessor dependent job of job i:

$T_{execute} = T_{receive} - T_{send}$;

S23, each time a predecessor dependent job of job i finishes executing, reducing the number of dependent jobs of job i by 1 and updating the earliest executable time $T_{i,start}$ of job i:

$T_{i,start} = \max\{T_{pre,finish}\}$, where $T_{pre,finish}$ is the completion time of the remaining predecessor dependent jobs of job i, $T_{pre,finish} = T_{execute} + T_{start}$, and $T_{start}$ is the start time of the remaining predecessor dependent jobs of job i; if the number of dependent jobs of job i is 0, job i is an executable job.
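The bookkeeping in S22 and S23 can be sketched as follows. The class name `JobState` and its fields are hypothetical, and the sketch assumes that a predecessor's start time T_start equals its recorded T_send.

```python
class JobState:
    """Tracks the remaining predecessors and earliest start time of one job."""

    def __init__(self, job_id, num_predecessors):
        self.job_id = job_id
        self.remaining = num_predecessors  # predecessor dependent jobs not yet done
        self.earliest_start = 0.0          # T_i,start

    def on_predecessor_done(self, t_send, t_receive):
        """Apply S22 and S23 for one finished predecessor.

        Returns True once the job has become executable.
        """
        t_execute = t_receive - t_send     # S22: actual execution time
        t_finish = t_execute + t_send      # T_pre,finish = T_execute + T_start
        self.remaining -= 1                # S23: one dependency fewer
        self.earliest_start = max(self.earliest_start, t_finish)
        return self.remaining == 0


job = JobState(job_id=4, num_predecessors=2)
print(job.on_predecessor_done(t_send=0.0, t_receive=5.0), job.earliest_start)
print(job.on_predecessor_done(t_send=2.0, t_receive=9.0), job.earliest_start)
```

Taking the running maximum over the predecessors' finish times means the job's earliest start is set by whichever predecessor finishes last.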

Further, the priority of the executable job is determined by:

determining the difference between the actual start time $T_{i,start}$ of the executable job and the job time $T_{i,system}$ set by the system;

determining the priority $D_i$ of executable job i: $D_i = \max\{T_{i,system} - T_{i,start},\ 0\}$; wherein:

if $T_{i,system} \le T_{i,start}$, job i has not been delayed and its priority is set to 0; if $T_{i,system} > T_{i,start}$, the job has been delayed, and the longer the delay, the larger the priority.
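A minimal sketch of the priority rule, assuming the reference time in the formula is the system-set job time and that the braces denote a maximum with 0. The function name `priority` is hypothetical.

```python
def priority(t_system, t_start):
    """D_i = max{T_i,system - T_i,start, 0}.

    Returns 0 for a job that has not been delayed; otherwise the value
    grows with the size of the time difference.
    """
    return max(t_system - t_start, 0.0)


print(priority(t_system=10.0, t_start=12.0))  # not delayed
print(priority(t_system=15.0, t_start=12.0))  # delayed by 3 time units
```

Clamping at 0 keeps every non-delayed job at the same baseline priority, so only delayed jobs are ranked against each other.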

Further, in step S4, the priority level is determined according to the following method:

high priority level:

medium priority level:

low priority level:

wherein $D = D_{max} - D_{min}$; $D_{max}$ is the maximum of the priorities of all executable jobs, and $D_{min}$ is the minimum of the priorities of all executable jobs.
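The threshold formulas for the three levels do not appear in the text above, so the following sketch simply assumes that the range D is split into equal thirds above D_min. Both the thresholds and the function name `split_levels` are assumptions made for illustration, not the patented formulas.

```python
def split_levels(priorities):
    """Partition jobs into high/medium/low levels by priority.

    `priorities` maps job number -> D_i and must be non-empty.
    ASSUMPTION: levels are equal thirds of D = D_max - D_min; the source's
    actual thresholds are not available.
    """
    d_min = min(priorities.values())
    d_max = max(priorities.values())
    d = d_max - d_min
    levels = {"high": [], "medium": [], "low": []}
    for job, p in priorities.items():
        if d == 0 or p >= d_min + 2 * d / 3:
            levels["high"].append(job)    # all jobs land here when D == 0
        elif p >= d_min + d / 3:
            levels["medium"].append(job)
        else:
            levels["low"].append(job)
    return levels


print(split_levels({"a": 0, "b": 3, "c": 6, "d": 9}))
```

Because the thresholds are relative to D_min and D_max, the level boundaries shift as the set of executable jobs changes, which is one way to read the dynamic adjustment described in the method.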

Further, in step S4, the selected job is determined by:

constructing an execution profit equation:

$f(i,j) = \max\{f(i-1,j),\ f(i-1,j-v_i) + w_i\}$; where $f(i,j)$ represents the maximum benefit the whole cluster can achieve with machine resources $j$ when the first $i$ jobs are considered; $f(i-1,j)$ represents the maximum benefit the whole cluster can achieve with machine resources $j$ when the first $i-1$ jobs are considered; $f(i-1,j-v_i)$ represents the maximum benefit the whole cluster can achieve with machine resources $j-v_i$ when the first $i-1$ jobs are considered; $w_i = t_{max} - t_i$, where $t_{max}$ is the execution time of the job with the longest execution time among all jobs and $t_i$ is the execution time of the current job; $f(i-1,V-v_i)$ represents the maximum benefit the whole cluster can achieve with machine resources $V-v_i$ when the first $i-1$ jobs are considered; and $v_i$ is the amount of resources job i requests out of all the resources $V$ of the system;

traversing the job queues of the three priority levels, and judging whether $f(i,V)$ is equal to $f(i-1,V-v_i) + w_i$; if they are equal, job i is selected, otherwise job i is not selected; where $f(i,V)$ represents the maximum benefit the whole cluster can achieve with all the resources $V$ of the system when the first $i$ jobs are considered, and $f(i-1,V-v_i)$ represents the maximum benefit the whole cluster can achieve with system resources $V-v_i$ when the first $i-1$ jobs are considered.
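The profit equation has the form of the classic 0/1 knapsack recurrence, so the selection within one priority level can be sketched with a standard dynamic program followed by backtracking. The function name `select_jobs`, the integer resource units, and the reduction of the remaining capacity during backtracking (the usual knapsack reading of the equality test) are assumptions of this sketch.

```python
def select_jobs(requests, exec_times, capacity):
    """Pick jobs maximizing total gain w_i = t_max - t_i within `capacity`.

    requests[i]   : resource request v_i of job i (non-negative integer)
    exec_times[i] : execution time t_i of job i
    capacity      : total system resources V (non-negative integer)
    Returns (best_gain, list of selected job indices).
    """
    n = len(requests)
    t_max = max(exec_times)
    gains = [t_max - t for t in exec_times]

    # f[i][j]: best gain using the first i jobs with j resource units.
    f = [[0] * (capacity + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        v, w = requests[i - 1], gains[i - 1]
        for j in range(capacity + 1):
            f[i][j] = f[i - 1][j]
            if j >= v:
                f[i][j] = max(f[i][j], f[i - 1][j - v] + w)

    # Backtrack: job i is selected when f(i, j) == f(i-1, j - v_i) + w_i.
    selected, j = [], capacity
    for i in range(n, 0, -1):
        v, w = requests[i - 1], gains[i - 1]
        if j >= v and f[i][j] == f[i - 1][j - v] + w:
            selected.append(i - 1)
            j -= v
    selected.reverse()
    return f[n][capacity], selected


print(select_jobs(requests=[2, 3, 4], exec_times=[5, 3, 8], capacity=5))
```

Since the gain is t_max - t_i, shorter jobs carry larger gains, so the selection favors packing many short jobs into the available resources.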

The invention has the beneficial effects that: by the method and the device, the execution priority of the jobs in the workflow can be dynamically adjusted, and the execution sequence of each job in the same priority can be dynamically adjusted, so that the service quality of the jobs can be effectively improved, the resource utilization rate can be improved, and the total completion time of the jobs can be reduced.

Drawings

The invention is further described below with reference to the following figures and examples:

FIG. 1 is a flow chart of the present invention.

Detailed Description

The invention is described in further detail below with reference to the drawings of the specification:

S2, determining the executable jobs in the workflow;

S4, the job control queue divides the jobs into different priority levels according to their priorities, and the selected jobs within each priority level are executed in order from the high priority level to the low priority level.

In this embodiment, in step S2, the executable job determination process in the workflow is as follows:

$T_{execute} = T_{receive} - T_{send}$;

$T_{i,start} = \max\{T_{pre,finish}\}$, where $T_{pre,finish}$ is the completion time of the remaining predecessor dependent jobs of job i, $T_{pre,finish} = T_{execute} + T_{start}$, and $T_{start}$ is the start time of the remaining predecessor dependent jobs of job i; if the number of dependent jobs of job i is 0, job i is an executable job. Here, the predecessor dependent jobs of job i are the jobs i1, i2, …, im that must complete before job i can execute, where m is the number of predecessor dependent jobs of job i; each time one of them finishes executing, m is reduced by 1 until m reaches 0.

In this embodiment, the priority of the executable job is determined by the following method:

In this embodiment, in step S4, the priority level is determined according to the following method:

high priority level:

medium priority level:

low priority level:

Specifically, the method comprises the following steps: in step S4, the selected job is determined by:

constructing an execution profit equation:

traversing the job queues of the three priority levels, and judging whether $f(i,V)$ is equal to $f(i-1,V-v_i) + w_i$; if so, job i is selected, otherwise job i is not selected; where $f(i,V)$ represents the maximum benefit the whole cluster can achieve with all the resources $V$ of the system when the first $i$ jobs are considered, and $f(i-1,V-v_i)$ represents the maximum benefit the whole cluster can achieve with system resources $V-v_i$ when the first $i-1$ jobs are considered.

In prior-art job scheduling control, the job submitted first always occupies the system resources, so jobs submitted later cannot be executed and are blocked for a long time.

Finally, the above embodiments are intended only to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the preferred embodiments, those skilled in the art should understand that modifications or equivalent substitutions may be made to the technical solutions of the present invention without departing from their spirit and scope, and all such modifications and substitutions should be covered by the claims of the present invention.

Claims

1. A workflow job scheduling control method, characterized by comprising the following steps:

S2, determining the executable jobs in the workflow;

2. The workflow job scheduling control method according to claim 1, wherein: in step S2, the executable job determination process in the workflow is as follows:

$T_{execute} = T_{receive} - T_{send}$;

3. The workflow job scheduling control method according to claim 2, wherein: the priority of the executable job is determined by the following method:

4. The workflow job scheduling control method according to claim 3, wherein: in step S4, the priority level is determined according to the following method:

high priority level:

medium priority level:

low priority level:

5. The workflow job scheduling control method according to claim 4, wherein: in step S4, the selected job is determined by:

constructing an execution profit equation:

traversing the job queues of the three priority levels, and judging whether $f(i,V)$ is equal to $f(i-1,V-v_i) + w_i$; if so, job i is selected, otherwise job i is not selected; where $f(i,V)$ represents the maximum benefit the whole cluster can achieve with all the resources $V$ of the system when the first $i$ jobs are considered, and $f(i-1,V-v_i)$ represents the maximum benefit the whole cluster can achieve with system resources $V-v_i$ when the first $i-1$ jobs are considered.