CN106875090B

Movatterモバイル変換

Info

Publication number: CN106875090B
Application number: CN201710015070.8A
Authority: CN
Inventors: 裘智峰; 陈杰; 杨宁; 阳春华; 郭宇骞; 桂卫华
Original assignee: Central South University
Current assignee: Central South University
Priority date: 2017-01-09
Filing date: 2017-01-09
Publication date: 2020-05-08
Anticipated expiration: 2037-01-09
Also published as: CN106875090A

Abstract

Translated fromChinese

本发明提供一种面向动态任务的多机器人分布式任务分配形成方法，该方法包括：根据环境地图中任务点状态变化，分析任务分配需要考虑的因素和难点，任务点发生时，基于多阶段拍卖算法，生成任务分配方案，机器人根据所述任务分配方案执行任务。本发明解决了环境中动态任务分配问题，传统的拍卖算法采用的一次分配来解决给定的任务，在面对动态任务存在很大局限性。本发明通过多次拍卖，以时间最优为目的，最大程度的利用机器人的资源，将上述方法在VC++和Csharp平台上进行仿真，通过大量的实验仿真测试结果表明，改进的拍卖算法较传统的拍卖算法更能够很好的解决环境中的动态任务，通过多次分配来满足实时性的需求，并能够给出接近的最优解。

The present invention provides a multi-robot distributed task assignment formation method oriented to dynamic tasks. The method includes: analyzing the factors and difficulties that need to be considered in task assignment according to the state changes of task points in an environment map, and when task points occur, based on multi-stage auction The algorithm generates a task assignment plan, and the robot executes the task according to the task assignment plan. The invention solves the problem of dynamic task assignment in the environment. The traditional auction algorithm adopts one-time assignment to solve a given task, which has great limitations in the face of dynamic tasks. Through multiple auctions, the invention aims to optimize the time, utilizes the resources of the robot to the greatest extent, and simulates the above method on the VC++ and Csharp platforms. The auction algorithm can better solve the dynamic tasks in the environment, meet the real-time requirements through multiple allocations, and can give close to the optimal solution.

Description

Dynamic task-oriented multi-robot distributed task allocation forming method

Technical Field

The invention relates to the technical field of robot intelligent auction algorithms, in particular to a dynamic task-oriented multi-robot distributed task allocation forming method.

Background

With the deep knowledge of people on artificial intelligence and complex systems, the robot system has good application prospects in the fields of buildings, military, fire fighting, industrial production and the like. However, in the current multi-robot system, the number of tasks is large, the information contained in the task points is complex, the number of the participating robots is large, and the calculation cost of the global search algorithm is exponentially increased, so that the global search algorithm is difficult to find the optimal solution of the task allocation problem within the specified time. Therefore, the global search algorithm is not suitable for solving the large-scale multi-agent dynamic task allocation problem. In large-scale systems, task allocation must meet real-time requirements, and tends to obtain a "good" solution within a specified time, which is not necessarily the optimal solution. Therefore, the method has important significance for the research of the multi-robot dynamic task allocation problem with time constraint.

Most of the current task allocation adopts an auction algorithm, the traditional auction algorithm is static in terms of tasks, and a one-time auction algorithm is adopted. However, in the case of a large number of task points and dynamic changes, the method has a great defect, cannot perform scheduling more times, cannot meet the real-time requirement, and cannot obtain an ideal effect.

Disclosure of Invention

The present invention provides a dynamic task oriented multi-robot distributed task allocation formation method that overcomes or at least partially solves the above-mentioned problems by solving the problems of multiple dynamic tasks in an environment through cooperative cooperation between robots based on an improved auction algorithm and in conjunction with a-star algorithm path optimization.

In order to achieve the above object, the present invention provides a dynamic task-oriented multi-robot distributed task allocation forming method, which comprises:

s1: according to the information in the environment map, the factors and difficulties which need to be considered in the task allocation problem are cleared;

s2: when a task occurs, a task allocation scheme is generated based on a multi-stage auction algorithm, and the robot executes the task according to the task allocation scheme.

Further, the step S1 includes:

s11: a target environment map is given, and a plurality of robots with task execution capacity, task points with attributes changing along with time and a plurality of obstacles are distributed on the map;

s12: according to the state change of the task points, the factors and difficulties needing to be considered for cleaning task distribution

Further, in step S11, the environment is a robot work environment, a rectangular coordinate system is established for a given environment map, the coordinate system uses the x axis to the right as a positive direction and the y axis to the positive direction, the work environment is divided into a plurality of grids, and N (N belongs to Z) is spatially distributed on the environment map⁺) Task point, M (M belongs to Z)⁺) Individual robot and B (B ∈ Z)⁺) And determining coordinates of the robot, the task and the obstacle, respectively.

Where the task is denoted by the letter j, where j equals 1,2, …, N, the task approximates a point on the map. The robot is denoted by the letter i, where i ═ 1,2, …, M. The obstacle is denoted by the letter B, where B is 1,2, …, B.

The coordinates of the task point on the map are

Wherein j is 1,2, …, N; the coordinates of the robot on the map are

Wherein i is 1,2 … M; the coordinates of the obstacle on the map are

Where k is 1,2, …, B.

The distance formula between the task point and the robot is as follows:

suppose that the j coordinate of the task point and the j' coordinate of the task point are respectively

The distance between the two is expressed as:

further, in step S12, the state quantity of the task point j (j ═ 1,2, …, N) at time (t +1) changes exponentially with time, and the state quantity of the task point j (j ═ 1,2, …, N) is expressed as:

in the formula:

S_j(t+1)，S_j(t) task point state quantities of task j (j ═ 1,2, …, N) at stages (t +1) and t, respectively;

α_jthe state change rate of the jth task point;

β_iexecution capabilities of robot i and β_i＞0；

The time difference between t and (t + 1);

the task quantity threshold value is set, whether the task is finished or not is judged, and when s is reached, the task quantity threshold value is set_j(t) < ε, this indicates that the task is complete.

This is a task state distributed dynamic task allocation problem. The state change of the task point is in a continuous exponential function form (equation three). The task volume is time-distributed and is affected by how many robots are performing the task (the sum of the performance capabilities of the agent).

The completion time of task point j (j ═ 1,2, …, N) is inversely proportional to the sum of the performance capabilities acting on task point j. The execution capacity of m robots for executing the target task j at the stage t and

then, it indicates that the m robots cannot complete the task point, s_j(t) exhibits an ascending trend; when in use

Then m robots are shown to be able to complete the task point, s_j(t) shows a tendency to decline.

Further, in step S12, the system task assignment needs to consider factors including task point attribute parameters and robot attribute parameters.

Attribute parameter of task point j (j is 1,2, … N)The number of the components comprises: task quantity (i.e. state value) S of task point_j(t) indicating the task amount of the task point at time t, wherein j is 1,2, …, N, and when t is 0, indicating the initial amount S of the task point_j(0) Duty point change rate α_j(ii) a The task points j are distributed in space, and the positions of the task points j on the environment map

The attribute parameters of the robot i (i ═ 1,2, …, M) include: velocity v of movement_iThe length of the path taken by the robot i in unit time, and a robot capability value β_i(ii) a The position of the robot on the environment map

With the increase of the number of target task points and robots, the difficulty of task allocation problem increases rapidly, and the importance degree and the state change rate of each target task point, the cost and the execution capacity of each intelligent agent when executing tasks are all factors to be considered.

Assuming that the robot that is heading for the jth task point does not go to other task points until the next phase of solution generation, for a given M (M ∈ Z)⁺) Personal robot, N (N is belonged to Z)⁺) A task point, which is shared by multiple auctions

A distribution scheme having a complexity of

In step S2, the method further includes:

s21: when a task is detected, selecting a robot closest to a task point as a proxy robot; if the robot closest to the plurality of task points is the same robot i (i is 1,2, …, M), the system selects the robot i of the task point closest to the origin of the rectangular coordinate system from the plurality of task points as the proxy robot;

s22: the agent robot issues auction information to the auction robots within the communication radius to wait for feedback;

s23: the auction robot receiving the auction information selects the task point with the highest profit to bid according to the profit function of the auction robot;

s24: the agent robot collects the bidding information, selects the winning robot according to a preset rule and informs the winning robot to execute the task;

s25: and selecting the nearest task point for the auction robots not winning the bid and the robots in the communication blind areas.

Further, in step S2, when detecting that the task point and the idle robot without targets have been completed, the idle robot is re-auctioned, so that the idle robot obtains a new task point again, thereby obtaining a new allocation scheme, and the auction is stopped until all task points in the environment are executed.

The target-oriented task of the invention is dynamically changed along with time, and the task points are distributed on the completed time point, namely the task points are inconsistent on the executed time.

If a traditional auction algorithm flow is used, after an allocation scheme is generated, a dynamic target task is executed according to the scheme, and it is difficult to adapt to the dynamic change and real-time requirements of the target task.

The improved auction algorithm can track the state change of the task points in real time, once the task points are finished, the robot without targets can exist, and the algorithm can redistribute the task points, so that a new distribution scheme is obtained.

Further, in step S22, the agent robot is responsible for issuing auction information to the auction robots within the communication radius, and waits for feedback; the agent robot of each task point j (j is 1,2, …, N) grasps the relevant auction information of the task point j, and the message includes: the position of the task point j, the change rate of the task point j, the state quantity of the task point j and the number s of suitable robots needed by the task point j. The state quantities of the task points change exponentially with time.

Number of suitable robots s required for task point j_jIs defined as: suppose there are w robots such that

And is

The optimal number s of robots needed by the task point j takes the following values:

w≤s_jw +2 ≤ (four type)

The robots i (i is 1,2, …, M) all have corresponding communication radiuses r_i(the communication radius of the robot can be set to be different values), and the agent robot is responsible for issuing auction information to the auction robots within the communication radius.

Further, in step S23, the auction robot that has received z (0 ≦ z ≦ N) pieces of proxy robot auction information balances the information with the following balance criteria: the auction robot analyzes the z auction information issued by the agent robot, the traditional auction algorithm is improved, a dynamic economic benefit function is introduced, and the robot i calculates the income value E after executing a certain task point j_ij(E_ijReal number) and selects the most profitable task point to bid. The dynamic revenue function used is:

E_ij＝g(k1,S_j(t),α_j,β_i,n)-h(k2,d_ij) (formula five)

In the formula:

g, the robot i obtains benefits after executing the task point j and is an exponential function;

h is the cost consumed by the robot i to execute the task point j;

k1, k2 variable weight parameters, which can be set to their respective values under different circumstances;

d_ijthe path length between the robot i and the task point j is long;

in the above formula, k1 and S_jThe product of (t) is the scaling factor of an exponential function, α_j,β_iN is the relation between the three

Forming an index in the index function, n representing the number of robots that are executing the task point j (when t is 0, the task point is not executed by a robot, and n is 0);

h is d_ijFunction, given the parameter k2, the value of h is given by d_ijAnd (6) determining.

Each robot has a corresponding execution capacity value β for a given environment map_i(i ═ 1,2, …, M), the ability value of this robot i is a constant, and the execution ability values between robots may be set to be different.

Further, in step S24, after receiving the bidding information of the auction robots, the agent robot analyzes the information, selects the auction robot that wins the bid according to the target with the greatest overall benefit, and notifies all the bidding auction robot results.

Further, in step S25, the robots in the communication blind areas in the map (there may be robots that cannot receive the auction information issued by the agent robot due to the limitation of the communication radius in the environment map) and the auction robots that have not bid themselves to execute the task points closest to the robots.

Further, in step S24, the agent robot may select the auction robot that wins the bid according to a predetermined rule, where the specific content is:

proxy robot acceptance n for task point j_jBid message of each auction robot if n_j＞s_jThen sort by profit size, choose top ranked(s)_j-1) one auction robot acts as a middling robot and informs all bidding auction robots if n_j＜s_jThen n is_jAll the auction robots are the middle-standard robots.

Further, in the step S24, in the process of the robot traveling, the path is optimized through the a-star algorithm, so as to reasonably avoid collision with other robots and obstacles, and search out the optimal collision-free path for the robot to reach the corresponding task point.

And generating a task allocation scheme, starting to execute the target task point by the robot according to the allocated scheme, displaying the position of the robot in real time in the forward target task process by the robot, and searching the optimal collision-free path through an A-star algorithm.

In the searching process of the A-star algorithm, a method for searching the extended space node by a cost function is adopted, and the general form of the cost function is as follows:

(i) g (i) + h (i) (formula six)

In the formula:

(i) a valuation function of node i from the initial node to the target node;

g (i) the actual cost from the initial node to node i in the state space;

h (i) estimated cost of best path from node i to the target node.

The optimal collision-free path is searched through an A-star algorithm, the robot can reasonably avoid obstacles and cannot meet the obstacles in the traveling process, and the A-star is an effective method for solving the shortest path, so that the real-time and environment adaptability efficiency of path planning is improved.

The robot follows the following rules throughout the execution:

a) the optimal number of robots required for task point j is s_j；

b) The robot which is going to the jth task point does not go to other task points before the scheme of the next stage is generated;

c) the robot cannot go to the executed task point;

d) the agent robot selects the auction robot for winning the bid according to a rule established in advance;

e) the auction robot which does not bid or the communication blind area robot (the robot which can not receive the auction information sent by the agent robot in the communication radius) can execute the nearest task point;

f) the agent robot executes the task point of the self agent.

Through the technical scheme, a set of complete execution scheme is finally generated, and the time required for completing all task points is obtained. The time required to complete all task points is the time at which the last task point was executed.

Based on the technical scheme, the multi-stage auction algorithm facing the dynamic tasks is provided, the problem of dynamic task allocation in the environment is solved, the given task is solved by one-time allocation adopted by the traditional auction algorithm, and the expected effect cannot be achieved due to the fact that the dynamic tasks are limited greatly. Through the realization of multiple auctions, the aim of time optimization is fulfilled, the resources of the robot are utilized to the greatest extent, the method is simulated on VC + + and Csharp platforms, and a large number of experimental simulation test results show that the improved auction algorithm can better solve dynamic tasks in the environment compared with the traditional auction algorithm, meets the real-time requirement through multiple distribution, and can provide an approximate optimal solution.

Drawings

FIG. 1 is a diagram of an environment model according to an embodiment of the present invention.

Figure 2 is a flow diagram of a multi-stage auction algorithm according to an embodiment of the present invention.

Fig. 3 is a flow diagram of proxy robot work according to an embodiment of the present invention.

Fig. 4 is a flowchart of the operation of the auction robot according to the embodiment of the present invention.

Fig. 5 is a simulation diagram of a conventional auction algorithm (single-stage auction algorithm) according to an embodiment of the present invention.

Fig. 6 is a simulation diagram of an improved auction algorithm (multi-stage auction algorithm) according to an embodiment of the present invention.

FIG. 7 is a graph comparing time consumption of a multi-stage auction algorithm with a single-stage auction algorithm according to an embodiment of the present invention.

Detailed Description

The following detailed description of embodiments of the present invention is provided in connection with the accompanying drawings and examples. The following examples are intended to illustrate the invention but are not intended to limit the scope of the invention.

The invention provides an improved auction algorithm and a method for solving the problem of a plurality of dynamic tasks in the environment through cooperative cooperation between robots in combination with A-star algorithm path optimization.

The multi-stage auction algorithm is improved on the basis of the traditional auction algorithm, and corresponding rules and dynamic profit functions are added. The traditional auction algorithm is suitable for solving the distribution problem of static target tasks, the value of the target tasks and the execution cost of the robots are respectively measured through benefit and cost parameters, the auction algorithm of the target tasks to the robots is designed, a distribution scheme is generated, and the whole system only carries out auction once.

The robot collaborative cooperation is embodied as follows: when the robot detects that a task point occurs, the agent robot issues auction information to the auction robot, and the auction robot balances and bids according to the received information; the agent robot selects an allocation scheme according to the feedback information and a preset rule, the robot sends the known information to the robot in the communication radius of the robot in the process of executing the task, resource sharing is achieved, once the task points are completed, the idle robot obtains target task points through re-auction to assist other robots to execute the task points until all the task points are completed.

According to an embodiment of the present application, as shown in fig. 1, the environment model diagram of the present invention includes the following steps:

firstly, a rectangular coordinate system is established for a target map, the target map is divided into a plurality of map blocks with the same size, and the length and the width of each map block are equal.

Secondly, in the embodiment of the present invention, given that the length and width of the target map are all 300 units, the map contains 89401 map blocks.

Given an environment map, M (M E Z) can be set specifically⁺) Personal robot, N (N is belonged to Z)⁺) A task point, and B (B ∈ Z)⁺) An obstacle.

And calculating and storing the distance between the task points (formula two) and the distance between the task points and the robot (formula one).

Finally, the study objective of the invention was analyzed: aiming at a plurality of robots with certain task execution capacity, a plurality of tasks with the attributes changing along with time are given, a multi-robot dynamic task allocation strategy is established, the completion time of all tasks is optimized, and the completion time of the last task can be used for measurement.

When M (0 ≦ M) robots act on task point j (j ≦ 1,2, …, N), the task state quantity formula for task point j at (t +1) (t ≧ 0) is:

as shown in FIG. 2, the multi-stage auction algorithm employed in the present invention is a flow chart, detailed as follows:

in S201, when a task occurs, a proxy robot for each task point is determined, and a robot closest to the task point j is selected as a proxy robot for the task point j.

The workflow of the agent robot is shown with reference to fig. 3, and is detailed as follows:

after receiving the message of the monitoring system, the robot i (i is 1,2, …, M) closest to the task point j (j is 1,2, …, N) is the proxy robot of the task point, the robot i first knows the task specific information of the task point j and sorts the task specific information, mainly calculates the optimal number s of robots required for completing the task j and obtains the change rate α of the task point_jThe task state quantity of the task point, the geographic position of the task point and the like. And then sending auction information to the auction robots within the communication radius to wait for feedback. And the auction robots receiving the auction information select the task points with the highest profit to bid according to the profit functions of the auction robots, the agent robot selects the auction robots winning the bid with the highest overall benefit as the target, and notifies all the bidding auction robots. And if the number n of the landmark robots reaches the condition that n is larger than s, selecting s robots with top profit ranks, and reselecting the task points closest to the landmark robots.

In S203, the auction robot workflow refers to fig. 4, which is detailed as follows:

if the auction robot is not selected as the agent robot, the auction robot is automatically converted into the auction robot, and the agent robot firstly issues auction information to the auction robots within the communication radius to wait for feedback. The auction robot receiving the auction information selects the task point with the highest profit to bid according to the profit function of the auction robot, and the agent robot collects the bid information and selects the auction robot winning the bid with the highest overall benefit of the system as the target. And if the auction robot successfully wins the bid, starting to execute the task, and if the auction robot fails to win the bid, selecting to execute the nearest task point. And if the task point is completed, the idle robots are redistributed according to the current situation until the task is completed.

And generating a distribution scheme, starting to execute a task, optimizing the path by the robot through an A-Star algorithm in the process of traveling, reasonably avoiding collision with other robots and obstacles, and searching out the optimal collision-free path for the robot to reach a corresponding task point.

Once there are completed task points, there will be untargeted idle robots, and the system will re-auction them, resulting in a new allocation scheme until all task points in the environment are completed.

The robots cooperatively perform the task test.

The above-mentioned method of the improved auction algorithm (multi-stage auction algorithm) has M robots, N task points, and B obstacles. In that

(m is the number of robots performing task point j). In order to test the improvement of the auction algorithm in performing the dynamic task, the test was performed using a map having a length and a width of 300 grids, comparing the effects of the single-stage auction algorithm and the multi-stage auction algorithm.

Fig. 5 and fig. 6 are both test examples, in which the thick black small circle is a proxy robot, and therobot 3 is a communication blind area robot, specifically as follows:

fig. 5 is a simulation diagram of a test single-stage auction algorithm, where

robots

2,4, and 5 are proxy robots for task points 2,3, and 1, respectively, and the proxy robots distribute auction information corresponding to the task points to auction robots within a communication radius; after the first round of auction, the

auction robots

1,6 and 7 successfully bid for winning the bid and respectively execute the task points 2,3 and 1. Therobot 3 selects thetask point 1 with the closest execution distance for the robot with the communication blind area (which is not in the communication radius range of the proxy robot and cannot accept auction information issued by the proxy robot), the robot starts to execute the target task point after the task allocation scheme is generated, the auction is not carried out in the execution process, and once the task point is executed, the idle robot does not assist other robots to execute the unfinished task point.

Fig. 6 is a simulation diagram of a certain testing multi-stage auction algorithm, in which the initial allocation method is the same as the single-stage auction algorithm allocation method, and the multi-stage auction algorithm is characterized in that once there is a completed task point, there are targets-free robots, and the algorithm reallocates them, in the diagram, thetask point 1 is executed first, and the algorithm reallocates the

robots

3,5, and 7, so that the idle robots are not wasted, and the goal of designing the allocation scheme in stages is achieved by utilizing the resources of the robots to the maximum extent through multiple auctions.

In order to verify the efficiency of the improved auction algorithm, 8 sets of test data were selected, the communication radius was set to 200 units, k1 is 1, k2 is 0.0025; the statistical results are shown in table 1.

Table 1 is a statistical table of test times

The simulation result is drawn into a line graph, as shown in fig. 7, the comparison result is carried out, after corresponding rules and dynamic revenue functions are introduced for the dynamic tasks, the overall effect of the robot for executing the dynamic tasks is greatly improved, and the effectiveness of the robot for the dynamic tasks adopting the improved multi-stage auction algorithm is proved through simulation tests.

Finally, the method of the present application is only a preferred embodiment and is not intended to limit the scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims

Translated fromChinese

1.一种面向动态任务的多机器人分布式任务分配形成方法，其特征在于，包括：1. a multi-robot distributed task distribution formation method oriented to dynamic tasks, is characterized in that, comprises:

S1：根据环境地图中信息，理清任务分配需要考虑的因素及难点；S1: According to the information in the environment map, clarify the factors and difficulties that need to be considered in task assignment;

S2：任务发生时，基于多阶段拍卖算法，生成任务分配方案，机器人根据所述任务分配方案执行任务；S2: When a task occurs, a task allocation plan is generated based on a multi-stage auction algorithm, and the robot executes the task according to the task allocation plan;

其中，所述步骤S1中，进一步包括：Wherein, in the step S1, it further includes:

S11：给定一张目标环境地图，所述地图上分布有多个具有任务执行能力的机器人、属性随时间变化的任务点和若干个障碍物；S11: Given a target environment map, a plurality of robots with task execution capabilities, task points whose attributes change over time, and several obstacles are distributed on the map;

S12：根据任务点的状态变化，理清任务分配需要考虑的因素及难点分析；S12: According to the state changes of the task points, clarify the factors to be considered in task assignment and analysis of difficulties;

其中，所述步骤S12中，任务点状态量随时间呈现指数函数形式变化，任务点j(j＝1,2,…,N)在(t+1)时刻的状态量用方程表示为：Wherein, in the step S12, the state quantity of the task point changes with time in the form of an exponential function, and the state quantity of the task point j (j=1,2,...,N) at the time (t+1) is expressed as:

式中：where:

S_j(t+1)，S_j(t):任务j(j＝1,2,…,N)分别在(t+1)和t阶段的任务点状态量；S_j (t+1), S_j (t): task point state quantities of task j (j=1, 2, ..., N) in (t+1) and t stages respectively;

α_j:第j个任务点的状态变化率；α_j : the state change rate of the jth task point;

β_i:机器人i的执行能力且β_i＞0；β_i : the execution ability of robot i and β_i >0;

Δt:t与(t+1)之间的时间差；Δt: the time difference between t and (t+1);

ε:任务量阈值，设定任务量阈值ε判断任务是否完成，当s_j(t)＜ε时，表示任务已完成；ε: task amount threshold, set the task amount threshold ε to judge whether the task is completed, when s_j (t) <ε, it means that the task has been completed;

m：执行任务j的机器人数量；m: the number of robots performing task j;

其中，所述基于多阶段拍卖算法，生成任务分配方案的步骤，具体包括：Wherein, the step of generating a task allocation scheme based on a multi-stage auction algorithm specifically includes:

确定N个任务点的代理机器人；Determine the agent robot of N task points;

计算任务点最佳机器人需求个数S；Calculate the optimal number of robots required at the task point S;

代理机器人发布拍卖信息，判断非代理机器人是否能接收到拍卖信息；The agent robot publishes the auction information, and judges whether the non-agent robot can receive the auction information;

若非代理机器人能接收到拍卖信息，则竞拍机器人根据收益函数确定目标任务点，进行竞拍，代理机器人确定任务点投标个数n；If the non-agent robot can receive the auction information, the bidding robot determines the target task point according to the revenue function and conducts the auction, and the agent robot determines the number of bids for the task point n;

若所述任务点投标个数n小于等于所述任务点最佳机器人需求个数S，则将任务点全部接收；否则，收益函数高的n个机器人中标，竞拍失败的机器人选择就近任务点；If the number of bids n of the task point is less than or equal to the optimal number of robots required for the task point S, all the task points will be received; otherwise, the n robots with high revenue function will win the bid, and the robot that fails to bid will select the nearest task point;

确定分配方案，机器人开始执行任务，有新任务点完成时，任务点完成数目加一；Determine the allocation plan, the robot starts to perform tasks, and when a new task point is completed, the number of completed task points is increased by one;

判断任务点完成数据是否达到N，若达到，则结束，否则，回到代理机器人发布拍卖信息步骤；Determine whether the completion data of the task point reaches N, if so, end, otherwise, go back to the step of releasing the auction information by the agent robot;

在所述判断非代理机器人是否能接收到拍卖信息的步骤之后，还包括：After the step of judging whether the non-agent robot can receive the auction information, it also includes:

若非代理机器人不能接收到拍卖信息，则通讯盲区的机器人选择最近的任务点后跳转至确定分配方案步骤。If the non-agent robot cannot receive the auction information, the robot in the communication blind area selects the nearest task point and jumps to the step of determining the allocation plan.

2.根据权利要求1所述的方法，其特征在于，所述步骤S11中，所述的环境为机器人工作环境，对给定的环境地图建立直角坐标系，坐标系以x轴向右为正方向，y轴向上为正方向，并将该工作环境划分为多个栅格，环境地图在空间上分布有N(N∈Z⁺)个任务点、M(M∈Z⁺)个机器人以及B(B∈Z⁺)个障碍物，并分别确定每个任务、机器人和障碍物的坐标。2. The method according to claim 1, characterized in that, in the step S11, the environment is a robot working environment, and a Cartesian coordinate system is established for a given environment map, and the coordinate system takes the right of the x-axis as positive The working environment is divided into multiple grids. The environment map is spatially distributed with N(N∈Z⁺ ) task points, M(M∈Z⁺ ) robots and B(B ∈ Z⁺ ) obstacles and determine the coordinates of each task, robot and obstacle separately.

3.根据权利要求1所述的方法，其特征在于，所述步骤S12中，根据任务点的状态变化，建立任务点状态模型，分析任务分配需要考虑的因素包括任务点属性参数和机器人属性参数，其中，3. The method according to claim 1, wherein, in the step S12, according to the state change of the task point, a task point state model is established, and the factors that need to be considered in the analysis of task assignment include task point attribute parameters and robot attribute parameters ,in,

任务点属性参数包括：任务点的任务量、任务点的状态变化率和任务点在环境地图上所处的位置；The attribute parameters of the task point include: the task amount of the task point, the state change rate of the task point, and the position of the task point on the environment map;

机器人属性参数包括：运动速度、机器人能力值和机器人在环境地图上所处的位置。The robot attribute parameters include: movement speed, robot ability value and the robot's position on the environment map.

4.根据权利要求1所述的方法，其特征在于，所述步骤S2中，进一步包括：4. The method according to claim 1, wherein, in the step S2, further comprising:

S21：检测到任务时，选择距离任务点最近的机器人作为代理机器人，若距离数个任务点最近的机器人为同一个机器人i(i＝1,2,…,M)，系统会选择数个任务点中选择距离直角坐标系原点最近的任务点的机器人i为其代理机器人；S21: When a task is detected, select the robot closest to the task point as the proxy robot. If the robot closest to several task points is the same robot i (i=1, 2, ..., M), the system will select several tasks Among the points, the robot i that selects the task point closest to the origin of the Cartesian coordinate system is its agent robot;

S22：代理机器人发布拍卖信息给通讯半径内的竞拍机器人，等待反馈；S22: The agent robot releases auction information to the bidding robots within the communication radius, and waits for feedback;

S23:接收到拍卖信息的竞拍机器人根据自身收益函数选择收益最高的任务点进行投标；S23: The bidding robot that has received the auction information selects the task point with the highest revenue to bid according to its own revenue function;

S24：代理机器人收集投标信息，选择中标的机器人，并通知中标的机器人执行任务；S24: The agent robot collects bidding information, selects the winning robot, and notifies the winning robot to perform the task;

S25：未中标的竞拍机器人和处在通讯盲区的机器人选择执行最近的任务点。S25: The bidding robot that did not win the bid and the robot in the communication blind area choose to execute the nearest task point.

5.根据权利要求4所述的方法，其特征在于，所述步骤S22中，代理机器人负责发布拍卖信息给通信半径内的竞拍机器人，等待反馈；每个任务点的代理机器人掌握该任务点的相关拍卖信息，所述信息包括：任务点的位置、任务点的变化率、任务点的初始量和任务点所需要的合适机器人的数量。5. The method according to claim 4, wherein in the step S22, the agent robot is responsible for issuing auction information to the bidding robots within the communication radius, and waiting for feedback; the agent robot of each task point grasps the information of the task point. The relevant auction information includes: the position of the task point, the rate of change of the task point, the initial amount of the task point and the number of suitable robots required for the task point.

6.根据权利要求4所述的方法，其特征在于，所述步骤S23中，接受到z(0≤z≤N)个代理机器人发布拍卖信息的竞拍机器人对这些信息进行权衡，权衡标准为：竞拍机器人对这z个拍卖机器人发布拍卖信息进行分析，机器人计算执行完某个任务点后的收益值，并选择收益最高的任务点进行投标。6. The method according to claim 4, characterized in that, in the step S23, the bidding robots that have received z (0≤z≤N) agent robots to release the auction information weigh these information, and the weighing standard is: The bidding robot analyzes the auction information released by the z auction robots. The robot calculates the revenue value after executing a certain task point, and selects the task point with the highest profit to bid.

7.根据权利要求4所述的方法，其特征在于，所述步骤S24中，代理机器人收到竞拍机器人的投标信息后，分析其信息，根据整体效益最大的目标，选择中标的竞拍机器人，并通知所有投标的竞拍机器人结果。7. The method according to claim 4, wherein in the step S24, after the agent robot receives the bidding information of the bidding robot, it analyzes the information, and selects the bidding robot that wins the bid according to the target with the greatest overall benefit, and Notifies all bidding bots of the result.

8.根据权利要求4所述的方法，其特征在于，所述步骤S24中，机器人在行进过程中，通过A-star算法搜索到达任务点的最优无碰路径。8 . The method according to claim 4 , wherein, in the step S24 , the robot searches for the optimal collision-free path to the task point through the A-star algorithm during the traveling process. 9 .

9.根据权利要求4所述的方法，其特征在于，所述步骤S2中，当检测到已完成任务点和无目标的空闲机器人时，会对空闲机器人进行重新拍卖，使得空闲机器人重新获得新的任务点，从而得出新的分配方案，直至环境中所有的任务点被执行完后，停止拍卖。9. The method according to claim 4, characterized in that, in the step S2, when detecting the idle robot that has completed the task point and no target, the idle robot will be re-auctioned, so that the idle robot can regain new To obtain a new allocation plan, the auction will be stopped after all the task points in the environment have been executed.