Movatterモバイル変換


[0]ホーム

URL:


CN115257746B - A lane-changing decision control method for autonomous driving vehicles considering uncertainty - Google Patents

A lane-changing decision control method for autonomous driving vehicles considering uncertainty
Download PDF

Info

Publication number
CN115257746B
CN115257746BCN202210863061.5ACN202210863061ACN115257746BCN 115257746 BCN115257746 BCN 115257746BCN 202210863061 ACN202210863061 ACN 202210863061ACN 115257746 BCN115257746 BCN 115257746B
Authority
CN
China
Prior art keywords
vehicle
longitudinal
lateral
speed
function
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210863061.5A
Other languages
Chinese (zh)
Other versions
CN115257746A (en
Inventor
熊璐
李拙人
杨若霖
付志强
肖宏宇
冷搏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tongji University
Original Assignee
Tongji University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tongji UniversityfiledCriticalTongji University
Priority to CN202210863061.5ApriorityCriticalpatent/CN115257746B/en
Publication of CN115257746ApublicationCriticalpatent/CN115257746A/en
Application grantedgrantedCritical
Publication of CN115257746BpublicationCriticalpatent/CN115257746B/en
Activelegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Classifications

Landscapes

Abstract

Translated fromChinese

本发明涉及一种考虑不确定性的自动驾驶汽车换道决策控制方法,包括:构建状态空间及动作空间,建立状态转移方程;建立自车、他车的观测空间模型及信念空间模型、他车预测轨迹的不确定性模型;设定奖励函数,结合POMDP模型,求解自车的决策状态点集、并解耦为横向空间决策集及纵向时间决策集;分别确定横向、纵向可行驶边界,引入道路边界约束,车速约束及障碍物距离约束,划分出横向可行驶区域及参考路径、纵向可行驶区域及参考速度曲线;根据决策结果,车辆规划模块输出相应车辆最优轨迹,使车辆按照最优轨迹行驶。与现有技术相比,本发明充分考虑他车预测轨迹的不确定性,决策出的自车状态点集更加稳定可靠,能有效增强车辆驾驶的舒适性及安全性。

The present invention relates to a lane-changing decision control method for an autonomous driving vehicle considering uncertainty, including: constructing a state space and an action space, establishing a state transfer equation; establishing an observation space model and a belief space model of the self-vehicle and other vehicles, and an uncertainty model of the predicted trajectory of the other vehicle; setting a reward function, combining a POMDP model, solving the decision state point set of the self-vehicle, and decoupling it into a lateral space decision set and a longitudinal time decision set; respectively determining the lateral and longitudinal drivable boundaries, introducing road boundary constraints, vehicle speed constraints and obstacle distance constraints, dividing the lateral drivable area and reference path, longitudinal drivable area and reference speed curve; according to the decision result, the vehicle planning module outputs the optimal trajectory of the corresponding vehicle, so that the vehicle drives according to the optimal trajectory. Compared with the prior art, the present invention fully considers the uncertainty of the predicted trajectory of other vehicles, and the state point set of the self-vehicle determined by the decision is more stable and reliable, which can effectively enhance the comfort and safety of vehicle driving.

Description

Uncertainty-considered automatic driving automobile lane change decision control method
Technical Field
The invention relates to the technical field of automatic driving automobile control, in particular to an automatic driving automobile lane change decision control method considering uncertainty.
Background
The decision planning is one of the key parts of automatic driving, firstly, multi-sensor information is fused, then task decision is carried out according to driving requirements, then, on the premise that possible obstacles can be avoided, a plurality of selectable safety paths between two points are planned through specific constraint conditions, and an optimal path is selected from the paths to serve as a vehicle driving track.
At present, a decision module of a vehicle integrally gathers required information, and makes reasonable driving decisions through comprehensive analysis, and a planning module of the vehicle outputs safe and comfortable vehicle tracks according to decision results. In existing research, representative decision methods include finite state machines, behavioral trees, markov processes, etc. However, most decision methods consider less uncertainty problems caused by dynamic changes of environment and vehicle states in a future period, and most decision methods simply output behavior instructions, so that the rationality of decision results cannot be guaranteed, even the decision results are discontinuous in time, and therefore, a problem of difficult solution is brought to a track planning module, and poor safety and comfort of vehicles and poor user experience are easily caused.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provide the automatic driving automobile lane change decision control method taking uncertainty into consideration, and the reliability of a decision result is improved by taking uncertainty of a predicted track of the other automobile into consideration, so that the comfort and safety of driving the automobile are enhanced.
The aim of the invention can be achieved by the following technical scheme: an automatic driving automobile lane change decision control method considering uncertainty comprises the following steps:
s1, based on a perceived vehicle state and a predicted other vehicle state, a state space and an action space are built by combining a vehicle dynamics model, and a state transition equation is built;
S2, respectively establishing an observation space model and a belief space model of the own vehicle, an observation space model and a belief space model of the other vehicle and an uncertainty model of a predicted track of the other vehicle;
s3, setting a reward function, combining the steps S1 and S2 to construct and obtain a POMDP (PARTIALLY OBSERVABLE MARKOV DECISION PROCESS, part of observable Markov decision process) model, and solving a decision state point set of the vehicle;
S4, decoupling the decision state point set of the own vehicle into a transverse space decision set and a longitudinal time decision set;
S5, determining a transverse drivable boundary, introducing road boundary constraint, vehicle speed constraint and obstacle distance constraint, and dividing a transverse drivable region and a reference path;
Determining a longitudinal drivable boundary, introducing road boundary constraint, vehicle speed constraint and barrier distance constraint, and dividing a longitudinal drivable region and a reference speed curve;
s6, outputting an optimal track of the corresponding vehicle by the vehicle planning module according to the transverse drivable area and the reference path, the longitudinal drivable area and the reference speed curve which are divided in the step S5, so that the vehicle can run according to the optimal track.
Further, the step S1 specifically includes the following steps:
S11, simplifying the motion of a vehicle into the motion of particles in a Frenet coordinate system by using a simplified vehicle kinematic model so as to construct a state space and an action space of the vehicle;
S12, using the state space and the action space to establish state transfer functions of the own vehicle and the other vehicle.
Further, the step S11 specifically includes the following steps:
s111, acquiring the perceived vehicle position, longitudinal speed, longitudinal acceleration and lateral speed information of the self-vehicle to form a state space of the self-vehicle;
the method comprises the steps of collecting predicted course angle, vehicle position, longitudinal speed, longitudinal acceleration and lateral speed information of the other vehicle, and combining the length and width of the vehicle body to form a state space of the other vehicle;
s112, setting a discrete transverse acceleration sequence and a discrete longitudinal acceleration sequence as an action space.
Further, the specific process of step S12 is as follows: the evolution process of the states of the own vehicle and the other vehicle is assumed to be independent, a state transition equation of the own vehicle is obtained according to a simplified vehicle motion model, and the state transition equation of the other vehicle is expressed by probabilities of different states at the next moment.
Further, the observation function of the observation space model in the step S2 is divided into a vehicle and another vehicle, wherein the vehicle observation function is represented by a boolean value, if the next state exists, the vehicle observation function is set to 1, otherwise, the vehicle observation function is set to 0;
The observation function of the other vehicle accords with Gaussian distribution;
And in the step S2, an uncertainty model of the prediction track of the other vehicle is constructed by adopting a multi-element Gaussian distribution.
Further, the step S3 specifically includes the following steps:
S31, setting a reward function comprising a safety reward function Rsafe, a comfort reward function Rcomfort and an efficiency reward function Refficiency, wherein the safety reward function Rsafe comprises a collision reward function Rcolli and a distance reward function Rdis;
The comfort rewards function Rcomfort includes a speed dependent comfort rewards function Rspeed, a lateral speed action penalty function Rvlat, and a continuity indicator Rcontinuity;
The efficiency rewards function Refficiency includes a target task rewards function Rlane and a target speed function Rv_tar;
s32, solving a POMDP model by adopting a deterministic sparse observable tree (DETERMINED SPARSE PARTIALLY Observable Tree, DESPOT) method based on the constructed state space, action space, belief space and rewarding function to obtain a decision state point set of the own vehicle, wherein the content of the decision state point set comprises position information of the own vehicle and speed and acceleration information at different moments.
Further, the step S31 is specifically to design the collision reward function Rcolli according to the probability of collision, that is, giving a penalty when the collision probability exceeds the set threshold;
step S31 is to design a distance rewarding function Rdis according to a collision time (TTC) model, and set upper and lower limits of a maximum safe distance;
In the step S31, the influencing factors of the speed-related comfort bonus function Rspeed include the current longitudinal speed vloncur, the longitudinal acceleration aloncur, the lateral speed vlatcur, and the longitudinal speed vlonnext, the longitudinal acceleration alonnext, and the lateral speed vlatnext at the next moment;
In the step S31, the lateral velocity action penalty function Rvlat is related to the lateral velocity only;
In the step S31, the continuity indicator Rcontinuity is positively correlated with the position change amounts of the two previous and subsequent decisions, and the larger the position change amounts of the two previous and subsequent decisions, the larger Rcontinuity is.
Further, the lateral space decision set in step S4 includes location information of the own vehicle; the longitudinal time decision set includes speed and acceleration information of the own vehicle.
Further, in the step S5, when the lateral drivable boundary is selected, the distance from the node of the lateral drivable boundary to the obstacle is greater than the safety distance, and the lateral drivable boundary value does not exceed the original boundary of the structured road, and the distance of a safety threshold is maintained;
The establishment process of the transverse reference path specifically comprises the following steps:
Firstly, constructing a transverse optimization problem, designing a transverse cost function to evaluate each node, and taking a point at the minimum position of the transverse cost function as an optimal solution, wherein the constraint of the transverse optimization problem is that a course angle is between the minimum and maximum wheel corners, the objective function of the transverse optimization problem is the transverse cost function, and the transverse cost function Cnode-h is a weighted sum of a first distance cost Cd, a first safety cost Co and a continuity cost Cc;
Calculating the distance from the node to the target state point as a first distance cost; calculating a distance from the node to the obstacle as a first security cost; calculating the position change rate between the front node and the rear node as a continuity cost; combining the corresponding weight coefficients, and calculating to obtain a transverse cost function;
And finally, taking the value at the minimum of the transverse cost function, and connecting the lines to form a transverse expected reference path.
Further, the principle of determining the longitudinal drivable boundary in step S5 is as follows: the longitudinal drivable boundary coincides with the parking space coordinates of the obstacle;
the constraints of the longitudinal runnability boundary are: the s-t curve of the longitudinal drivable boundary does not exceed the s-t curve representing the highest and lowest average vehicle speed;
the establishment process of the longitudinal reference speed curve specifically comprises the following steps:
Firstly, constructing a longitudinal optimization problem, designing a longitudinal cost function to evaluate each node, and taking a point at the minimum position of the longitudinal cost function as an optimal solution, wherein the longitudinal cost function Cnode-z is a weighted sum of a second distance cost Cd-ref, a second safety cost Co-ref and a speed change cost Cv;
Calculating the distance from the node to the state point connecting line to be used as a second distance cost; calculating the square of the difference between the distance from the node to the obstacle on the s-axis and the distance threshold value from the obstacle as a second safety cost; calculating the change rate of the reference speed as the speed change cost; calculating to obtain a longitudinal cost function by combining the corresponding weight coefficients;
And finally, taking the value at the minimum of the longitudinal cost function, and forming a longitudinal expected reference speed curve by connecting the two points.
Compared with the prior art, the method fully considers the uncertainty of the predicted track of the own vehicle, establishes the uncertainty model of the predicted track of the own vehicle by establishing the observation space model and the belief space model of the own vehicle and the other vehicle, and obtains the decision state point set of the own vehicle based on the POMDP model by combining the set reward function. Therefore, the determined vehicle state point set is more stable and reliable, the comfort and the safety of driving the vehicle are enhanced, and the vehicle driving method is well applicable to unmanned driving under typical driving conditions such as vehicle following, lane changing and autonomous overtaking.
After the decision state point set of the own vehicle is obtained by solving, the decision state point set is decoupled into a transverse space decision set and a longitudinal time decision set, and the reliability of a decision result can be further ensured by respectively dividing a transverse drivable region and a reference path and dividing a longitudinal drivable region and a reference speed curve through introducing road boundary constraint, vehicle speed constraint and barrier distance constraint.
Drawings
FIG. 1 is a schematic flow chart of the method of the present invention;
FIG. 2 is a schematic diagram of a lane change decision process according to an embodiment;
FIG. 3 is a schematic diagram of uncertainty of a predicted trajectory of a host vehicle in an embodiment;
FIG. 4 is a schematic diagram of a lateral search determining a lateral travelable boundary in an embodiment;
FIG. 5 is a diagram of a lateral travelable region and a lateral desired reference path in an embodiment;
Fig. 6 is a graph of a longitudinal travelable region and a longitudinal desired reference speed profile in an embodiment.
Detailed Description
The invention will now be described in detail with reference to the drawings and specific examples.
Examples
As shown in fig. 1, an uncertainty-considered automatic driving automobile lane change decision control method comprises the following steps:
s1, based on a perceived vehicle state and a predicted other vehicle state, a state space and an action space are built by combining a vehicle dynamics model, and a state transition equation is built;
S2, respectively establishing an observation space model and a belief space model of the own vehicle, an observation space model and a belief space model of the other vehicle and an uncertainty model of a predicted track of the other vehicle;
s3, setting a reward function, combining the steps S1 and S2 to construct a POMDP model, and solving a decision state point set of the vehicle;
S4, decoupling the decision state point set of the own vehicle into a transverse space decision set and a longitudinal time decision set;
S5, determining a transverse drivable boundary, introducing road boundary constraint, vehicle speed constraint and obstacle distance constraint, and dividing a transverse drivable region and a reference path;
Determining a longitudinal drivable boundary, introducing road boundary constraint, vehicle speed constraint and barrier distance constraint, and dividing a longitudinal drivable region and a reference speed curve;
s6, outputting an optimal track of the corresponding vehicle by the vehicle planning module according to the transverse drivable area and the reference path, the longitudinal drivable area and the reference speed curve which are divided in the step S5, so that the vehicle can run according to the optimal track.
The channel change decision process of the embodiment is shown in fig. 2, and includes:
1. Based on the perceived state of the own vehicle and the predicted state of the other vehicle, constructing a state space and an action space according to the simplified vehicle dynamics model, and establishing a state transition equation;
the method specifically comprises the following steps:
11 Using simplified vehicle kinematic model to simplify the motion of vehicle into particle motion in Frenet coordinate system, to construct state space and action space of vehicle;
111 State space expression is:
S=[stateego,state1,state2,...,staten]
Wherein stateego is a state space of the vehicle, time is a time stamp, (sego,lego) is a position of the vehicle, vlonego is a longitudinal speed of the vehicle, accego is a longitudinal acceleration of the vehicle, and vlatego is lateral speed information of the vehicle;
staten is a state space of the other vehicle n, (length, width) is a length and width of the vehicle body of the other vehicle n, θ is a predicted course angle of the other vehicle, (s, l) is a position of the other vehicle, and vn is speed information of the other vehicle;
112 A discrete lateral acceleration sequence and a discrete longitudinal acceleration sequence are set as the action space:
A=[Acclon,Vellat]
Velvlat={-2.0,-1.5,-1.0,-0.5,0.0,0.5,1.0,1.5,2.0}
Acclon={-3.0,-2.5,-2.0,-1.5,-1.0,-0.5,0.0,0.5,1.0,1.5,2.0}
Wherein Velvlat is the lateral acceleration sequence; acclon is the longitudinal acceleration sequence;
12 Using the state space and the action space to establish the state transfer functions of the own vehicle and other vehicles;
the specific establishment process is as follows:
121 Assuming that the evolution processes of the states of the own vehicle and the other vehicle are mutually independent, obtaining a state transition equation of the own vehicle according to a simplified vehicle motion model, wherein the state transition equation of the other vehicle is expressed by probabilities of different states at the next moment;
the calculation formula of the state transition equation of the bicycle is as follows:
the calculation formula of the state transfer equation of the other vehicle is as follows:
2. establishing an observation space model and a belief space model of the own vehicle and the other vehicle, and establishing an uncertainty model of a prediction track of the other vehicle, wherein the observation space comprises information which can be observed by an intelligent vehicle environment perception positioning system, and comprises position, speed and course angle information of the own vehicle and position, speed and course angle information of the other vehicle;
The method comprises the following specific steps:
21 The state of the self-vehicle is completely considerable, and the observation function of the self-vehicle comprises coordinates and course angles, and accords with Gaussian distribution;
an observation function of an observation space is established, the self-vehicle observation function is represented by a Boolean value, if the next state exists and is set to be 1, otherwise, the self-vehicle observation function is set to be 0:
the observation function of the vehicle is:
the observation function of the other vehicle accords with Gaussian distribution, and the calculation formula is as follows:
where μs,l,θ represents the observed value, σs,l,θ represents the variance;
The process of calculating the position uncertainty through the observation model comprises the following steps:
22 Constructing an uncertainty model of the vehicle pose of the other vehicle by adopting a multi-element Gaussian score;
The uncertainty calculation formula of the self-vehicle pose of the other vehicle is as follows:
wherein Φ is a state transition matrix, M represents noise in a state transition process, Z is covariance of the noise, Σ is a covariance matrix of the system, X is a state quantity, and the expression mode is as follows:
the Gaussian distribution covariance of each state quantity on the prediction track point of the other vehicle is obtained through uncertainty modeling and is shown in figure 3;
3. setting a reward function, and solving a decision state point set of the vehicle based on the POMDP model;
31 The content included in the set bonus function: safety index Rsafe, comfort index Rcomfort, and efficiency index Refficiency;
32 A bonus function for designing a security indicator): considering the collision reward function Rcolli and the distance reward function Rdis, the specific calculation formula is:
Rsafe=Rcolli+Rdis
33 Designing a collision reward function Rcolli according to the probability of collision, giving a penalty when the collision probability exceeds a threshold, and specifically solving the collision reward function as follows:
Where wcolli = 100 is a weight value, Psafe is a safety threshold;
designing a reward function Rdis of the distance between the front vehicles according to a collision occurrence time (TTC) model, setting upper and lower limits of the maximum safe distance, and specifically calculating the reward function of the distance between the front vehicles as follows:
wherein dmax、dmin is the maximum safety distance and the minimum safety distance between the own vehicle and other vehicles, and wdis =20 is the weight value of the distance rewards;
34 Design comfort rewards function: considering the speed-related comfort reward function Rspeed, the lateral speed action penalty function Rvlat and the continuity indicator Rcontinuity, the comfort reward function is written:
Rcomfort=Rspeed+Rvlat+Rcontinuity
the solving formula of the speed-related comfort rewarding function Rspeed is as follows:
The current longitudinal speed is vloncur, the current longitudinal acceleration is aloncur, the current lateral speed is vlatcur, the longitudinal speed at the next moment is vlonnext, the longitudinal acceleration at the next moment is alonnext, the lateral speed at the next moment is vlatnext,wspeed =15, and the weight coefficient of the speed is the same;
The lateral velocity action penalty function is calculated as:
Rvlat=-vlat2×wvlat
wherein wvlat = 12 is the weight coefficient of the lateral velocity;
the continuity index calculation formula of the result is:
Wherein t represents the time of each step of the decision result; m and n represent the total step numbers of the front decision result and the rear decision result respectively, (slast_loop,t,llast_loop,t) and (scur_loop,t,lcur_loop,t) are the position information of the last decision result and the current decision result respectively, and wcon =12 is the weight coefficient of the continuity index;
35 Design efficiency bonus function Refficiency: considering the target task rewards function Rlane and the target speed function Rv_tar, the efficiency rewards function is written as:
Refficiency=Rlane+Rv_tar
Wherein the target task rewarding function is Rlane=-|llane-lego|×wlane;
The target speed function is Rv_tar=-|vtar-vego|×wv_tar;
wv_tar = 12 is a weight coefficient for the target speed;
36 Solving a decision action sequence, dividing the decision into a longitudinal space decision set of a transverse space decision set, and specifically solving a POMDP model by adopting a deterministic sparse observational tree method to obtain a vehicle decision state point set comprising vehicle position information and speed and acceleration information at different moments;
4. Decoupling the decision state point set into a transverse space decision set and a longitudinal time decision set, wherein the transverse space decision set comprises the position information of the own vehicle at each moment; the longitudinal time decision set comprises speed and acceleration information of each moment of the vehicle;
5. determining a transverse drivable boundary, introducing road boundary constraint, vehicle speed constraint and barrier distance constraint, and dividing a transverse drivable region and a reference path;
when the lateral drivable boundary is selected, the distance from the node of the boundary to the obstacle is larger than the safety distance, and the specific expression is as follows:
Wherein dobs is the distance from the current node to the obstacle, and when the distance is smaller than a certain safety distance, the node is regarded as unsafe, lk is the lateral coordinate of the node, and lmin≤lk≤lmax should be satisfied;
at the same time, it should also be possible to meet the boundary values lmin and lmax, at which the driving boundary value does not exceed the original boundary of the structured road, and to maintain a distance of the safety threshold dsafe:
the calculated transverse travelable area is shown in fig. 4;
establishing a transverse optimization model in a transverse drivable area to obtain a transverse reference path, wherein the transverse reference path comprises the following specific steps of:
Constructing a transverse optimization problem, designing a transverse cost function to evaluate each node, wherein the point at the minimum of the transverse cost function is the optimal point;
The constraint is heading angle constraint:
wherein, θ'min、θ'max is the minimum value and maximum value of the change of the course angle of the vehicle movement respectively;
the transverse cost function is used as an objective function, and the expression is:
Cnode-h=wdCd+woCo+wcCc
Wherein, Cd is the first distance cost of the node, Co is the first security cost, Cc is the continuity cost, and the corresponding weight functions are wd=0.35,wo=0.45,wc =0.2, respectively;
the first distance cost function Cd is expressed as:
Wherein (sstate,lstate) is the location of the target state point and (sk,lk) is the location of the extension node;
The first security cost function Co is expressed as:
Where dobs represents the distance of the node from the nearest obstacle and dmax represents the distance threshold from the obstacle;
the cost function Cc representing continuity is expressed as:
Wherein, li,li+1,li+2 is the lateral position of the front and rear three nodes;
Finally, a value connecting line at the minimum position of the transverse cost function is taken to form a transverse expected reference path;
The transverse desired reference path formed in this embodiment is represented in fig. 5 by the hollow dots;
6. Determining a longitudinal drivable boundary, introducing road boundary constraint, vehicle speed constraint and barrier distance constraint, and dividing a longitudinal drivable region and a reference speed curve;
The method for determining the longitudinal drivable boundary specifically comprises the following steps:
Ensuring that the boundary sborder in the s-axis direction satisfies the formula:
the sborder constraint is: smin_t≤sborder≤smax_t, wherein smax_t and smin_t are the upper and lower boundaries of the average speed, respectively;
the method for establishing the longitudinal reference speed curve specifically comprises the following steps:
constructing a longitudinal optimization problem, designing a longitudinal cost function to evaluate each node, wherein the point at the minimum of the longitudinal cost function is the optimal point; the longitudinal cost function is calculated as:
Cnode-z=wd-refCd-ref+wo-refCo-ref+wvCv
Wherein Cd-ref is the second distance cost from the node to the state point connection, Co-ref is the second security cost of the node, Cv is the speed change cost, and the corresponding weight coefficients are wd-ref=0.4,wo-ref=0.4,wv =0.2, respectively.
The second distance cost function is:
cd-ref=(si-sref)2
Wherein sref is the position of the node on the two-state point connection line;
the second safety cost function is:
Where sobs represents the distance along the s-axis of the node from the nearest obstacle and smax represents the distance threshold from the obstacle;
The cost function of the speed change is:
and finally, connecting values at the minimum of the longitudinal cost function to form a longitudinal expected reference speed curve.
The longitudinal reference velocity profile formed in this embodiment is shown in fig. 6.
In summary, the technical scheme considers the behavior interaction characteristics of the own vehicle and the traffic participants and the uncertainty of the predicted track of the other vehicle, the decision process is more stable, and the decided own vehicle state point set is more reasonable and reliable, so that the following vehicle planning module can accurately and rapidly solve, and the comfort and safety of vehicle driving are effectively enhanced.

Claims (6)

Translated fromChinese
1.一种考虑不确定性的自动驾驶汽车换道决策控制方法,其特征在于,包括以下步骤:1. A lane-changing decision control method for an autonomous driving vehicle considering uncertainty, characterized in that it comprises the following steps:S1、基于感知到的自车状态及预测的他车状态,结合车辆动力学模型,构建状态空间及动作空间,并建立状态转移方程;S1. Based on the perceived state of the vehicle and the predicted state of other vehicles, combined with the vehicle dynamics model, the state space and action space are constructed, and the state transfer equation is established;S2、分别建立自车的观测空间模型及信念空间模型、他车的观测空间模型及信念空间模型、他车预测轨迹的不确定性模型;S2, respectively establish the observation space model and belief space model of the own vehicle, the observation space model and belief space model of the other vehicle, and the uncertainty model of the predicted trajectory of the other vehicle;S3、设定奖励函数,结合步骤S1和S2构建得到POMDP模型,求解出自车的决策状态点集;S3, set the reward function, combine steps S1 and S2 to construct the POMDP model, and solve the decision state point set of the vehicle;S4、将自车的决策状态点集解耦为横向空间决策集及纵向时间决策集;S4, decoupling the decision state point set of the vehicle into a lateral spatial decision set and a longitudinal temporal decision set;S5、确定横向可行驶边界,引入道路边界约束,车速约束及障碍物距离约束,划分出横向可行驶区域及参考路径;S5. Determine the lateral drivable boundary, introduce road boundary constraints, vehicle speed constraints and obstacle distance constraints, and divide the lateral drivable area and reference path;确定纵向可行驶边界,引入道路边界约束,车速约束及障碍物距离约束,划分出纵向可行驶区域及参考速度曲线;Determine the longitudinal drivable boundary, introduce road boundary constraints, vehicle speed constraints and obstacle distance constraints, and divide the longitudinal drivable area and reference speed curve;S6、根据步骤S5划分出的横向可行驶区域及参考路径、纵向可行驶区域及参考速度曲线,车辆规划模块输出相应车辆最优轨迹,使车辆按照最优轨迹行驶;S6, according to the lateral drivable area and reference path, longitudinal drivable area and reference speed curve divided in step S5, the vehicle planning module outputs the optimal trajectory of the corresponding vehicle, so that the vehicle drives according to the optimal trajectory;所述步骤S1具体包括以下步骤:The step S1 specifically includes the following steps:S11、使用简化的车辆运动学模型,将车辆的运动简化为Frenet坐标系中质点的运动,以构建自车的状态空间及动作空间;S11. Use a simplified vehicle kinematic model to simplify the vehicle's motion into the motion of a mass point in the Frenet coordinate system to construct the state space and action space of the ego vehicle;S12、利用状态空间和动作空间,建立转移自车及他车的状态转移函数;S12, using the state space and action space, establish a state transfer function for transferring the self-vehicle and other vehicles;所述步骤S11具体包括以下步骤:The step S11 specifically includes the following steps:S111、采集感知的自车的车辆位置、纵向速度、纵向加速度、侧向速度信息,构成自车的状态空间;S111, collecting the perceived vehicle position, longitudinal speed, longitudinal acceleration, and lateral speed information of the ego vehicle to form a state space of the ego vehicle;采集预测的他车的航向角、车辆位置、纵向速度、纵向加速度、侧向速度信息,结合车身长宽,构成他车的状态空间;Collect the predicted heading angle, vehicle position, longitudinal speed, longitudinal acceleration, and lateral speed information of the other vehicle, and combine it with the length and width of the vehicle body to form the state space of the other vehicle;S112、设定离散的横向加速度序列及纵向加速度序列,以作为动作空间;S112, setting a discrete lateral acceleration sequence and a longitudinal acceleration sequence as an action space;所述步骤S2中观测空间模型的观测函数分为自车和他车,其中,自车观测函数由布尔值表示,若下一状态存在,则设置为1,否则设置为0;The observation function of the observation space model in step S2 is divided into the own vehicle and other vehicles, wherein the own vehicle observation function is represented by a Boolean value, which is set to 1 if the next state exists, otherwise it is set to 0;他车的观测函数符合高斯分布;The observation function of the other car conforms to the Gaussian distribution;所述步骤S2中他车预测轨迹的不确定性模型采用多元高斯分布构建;The uncertainty model of the predicted trajectory of the other vehicle in step S2 is constructed using multivariate Gaussian distribution;所述步骤S3具体包括以下步骤:The step S3 specifically comprises the following steps:S31、设定奖励函数包括安全性奖励函数Rsafe、舒适性奖励函数Rcomfort及效率性奖励函数Refficiency,其中,安全性奖励函数Rsafe包括碰撞奖励函数Rcolli及距离奖励函数RdisS31, setting the reward function to include a safety reward function Rsafe , a comfort reward function Rcomfort , and an efficiency reward function Refficiency , wherein the safety reward function Rsafe includes a collision reward function Rcolli and a distance reward function Rdis ;舒适性奖励函数Rcomfort包括速度相关舒适性奖励函数Rspeed、侧向速度动作惩罚函数Rvlat以及连续性指标RcontinuityThe comfort reward function Rcomfort includes the speed-related comfort reward function Rspeed , the lateral speed action penalty function Rvlat and the continuity index Rcontinuity ;效率性奖励函数Refficiency包括目标任务奖励函数Rlane和目标速度函数Rv_tarThe efficiency reward function Refficiency includes the target task reward function Rlane and the target speed function Rv_tar ;S32、基于构建的状态空间、动作空间、信念空间及奖励函数,采用确定性稀疏可观测树的方法,求解POMDP模型,得到自车的决策状态点集,其中,决策状态点集的内容包括自车的位置信息以及不同时刻的速度和加速度信息。S32. Based on the constructed state space, action space, belief space and reward function, the deterministic sparse observable tree method is used to solve the POMDP model and obtain the decision state point set of the vehicle, where the content of the decision state point set includes the position information of the vehicle and the speed and acceleration information at different times.2.根据权利要求1所述的一种考虑不确定性的自动驾驶汽车换道决策控制方法,其特征在于,所述步骤S12的具体过程为:假设自车和他车状态的演进过程是相互独立的,根据简化的车辆运动模型,得到自车的状态转移方程,他车的状态转移方程则由下一时刻不同状态的概率表达。2. According to the uncertainty-taking lane-changing decision control method for an autonomous driving vehicle according to claim 1, the specific process of step S12 is as follows: assuming that the evolution processes of the states of the self-vehicle and the other vehicle are independent of each other, the state transfer equation of the self-vehicle is obtained according to a simplified vehicle motion model, and the state transfer equation of the other vehicle is expressed by the probabilities of different states at the next moment.3.根据权利要求1所述的一种考虑不确定性的自动驾驶汽车换道决策控制方法,其特征在于,所述步骤S31具体是根据碰撞发生的概率设计碰撞奖励函数Rcolli,即当碰撞概率超过设定阈值时,给予惩罚;3. The lane changing decision control method for an autonomous driving vehicle considering uncertainty according to claim 1, characterized in that the step S31 specifically designs a collision reward function Rcolli according to the probability of collision, that is, when the collision probability exceeds a set threshold, a penalty is given;所述步骤S31具体是根据碰撞发生时间(TTC)模型设计距离奖励函数Rdis,并设定最大安全距离的上下界限;The step S31 specifically designs a distance reward function Rdis according to a time to collision (TTC) model, and sets upper and lower limits of the maximum safety distance;所述步骤S31中,速度相关舒适性奖励函数Rspeed的影响因素包括当前的纵向速度vloncur、纵向加速度aloncur、侧向速度vlatcur以及下一时刻的纵向速度vlonnext、纵向加速度alonnext、侧向速度vlatnextIn the step S31, the influencing factors of the speed-related comfort reward function Rspeed include the current longitudinal speed vloncur , longitudinal acceleration aloncur , lateral speed vlatcur and the longitudinal speed vlonnext , longitudinal acceleration alonnext , lateral speed vlatnext at the next moment;所述步骤S31中,侧向速度动作惩罚函数Rvlat只与侧向速度有关;In the step S31, the lateral speed action penalty function Rvlat is only related to the lateral speed;所述步骤S31中,连续性指标Rcontinuity与前后两次决策的位置变化量正相关,前后两次决策的位置变化量越大、则Rcontinuity越大。In the step S31, the continuity index Rcontinuity is positively correlated with the position change between two decisions. The greater the position change between the two decisions, the greater the Rcontinuity .4.根据权利要求1所述的一种考虑不确定性的自动驾驶汽车换道决策控制方法,其特征在于,所述步骤S4中横向空间决策集包括自车的位置信息;纵向时间决策集包括自车的速度及加速度信息。4. According to the uncertainty-taking lane changing decision control method of an autonomous driving vehicle according to claim 1, it is characterized in that the lateral spatial decision set in step S4 includes the position information of the vehicle; the longitudinal time decision set includes the speed and acceleration information of the vehicle.5.根据权利要求1所述的一种考虑不确定性的自动驾驶汽车换道决策控制方法,其特征在于,所述步骤S5中横向可行驶边界在选择时,横向可行驶边界的节点到障碍物的距离大于安全距离,同时横向可行驶边界值不超过结构化道路的原始边界,并保持一个安全阈值的距离;5. The lane change decision control method for an autonomous driving vehicle considering uncertainty according to claim 1, characterized in that, when selecting the lateral drivable boundary in step S5, the distance from the node of the lateral drivable boundary to the obstacle is greater than the safety distance, and the lateral drivable boundary value does not exceed the original boundary of the structured road, and maintains a distance of a safety threshold;横向可行驶区域参考路径的建立过程具体为:The specific process of establishing the reference path of the lateral drivable area is as follows:首先构建横向优化问题,设计横向成本函数对每个节点进行评价,取横向成本函数最小处的点作为最优解,其中,所述横向优化问题的约束为航向角在最小及最大车轮转角之间,所述横向优化问题的目标函数为横向成本函数,所述横向成本函数为第一距离成本Cd,第一安全成本Co及连续性成本Cc的加权和;First, a lateral optimization problem is constructed, and a lateral cost function is designed to evaluate each node, and the point where the lateral cost function is the minimum is taken as the optimal solution, wherein the constraint of the lateral optimization problem is that the heading angle is between the minimum and maximum wheel turning angles, and the objective function of the lateral optimization problem is the lateral cost function, which is the weighted sum of the first distance cost Cd , the first safety costCo and the continuity cost Cc ;通过计算从节点到目标状态点的距离,以作为第一距离成本;计算从节点到障碍物的距离,以作为第一安全成本;计算从前后节点之间的位置变化率,以作为连续性成本;结合对应的权重系数,计算得到横向成本函数;By calculating the distance from the node to the target state point as the first distance cost; calculating the distance from the node to the obstacle as the first safety cost; calculating the position change rate between the front and rear nodes as the continuity cost; combining the corresponding weight coefficients, the lateral cost function is calculated;最终取横向成本函数最小处的值,连线形成横向期望参考路径。Finally, the value at the minimum of the lateral cost function is taken, and the lines are connected to form the lateral expected reference path.6.根据权利要求5所述的一种考虑不确定性的自动驾驶汽车换道决策控制方法,其特征在于,所述步骤S5中纵向可行驶边界的确定原则为:纵向可行驶边界与障碍物车位置坐标重合;6. The lane-changing decision control method for an autonomous driving vehicle considering uncertainty according to claim 5, characterized in that the principle for determining the longitudinal drivable boundary in step S5 is that the longitudinal drivable boundary coincides with the position coordinates of the obstacle vehicle;纵向可行驶边界的约束条件为:纵向可行驶边界的s-t曲线不超过表示最高、最低的平均车速的s-t曲线;The constraint condition of the longitudinal drivable boundary is: the s-t curve of the longitudinal drivable boundary does not exceed the s-t curve representing the highest and lowest average vehicle speeds;纵向可行驶区域参考速度曲线的建立过程具体为:The specific process of establishing the reference speed curve of the longitudinal drivable area is as follows:首先构建纵向优化问题,设计纵向成本函数对每个节点进行评价,取纵向成本函数最小处的点作为最优解,其中,所述纵向成本函数为第二距离成本Cd-ref,第二安全成本Co-ref及速度变化成本Cv的加权和;Firstly, a longitudinal optimization problem is constructed, a longitudinal cost function is designed to evaluate each node, and the point where the longitudinal cost function is the minimum is taken as the optimal solution, wherein the longitudinal cost function is the weighted sum of the second distance cost Cd-ref , the second safety costCo-ref and the speed change cost Cv ;通过计算从节点到状态点连线的距离,以作为第二距离成本;计算节点到障碍物在s轴上的距离与距障碍物的距离阈值之差的平方,以作为第二安全成本;计算参考速度的变化率,以作为速度变化成本;结合对应的权重系数,计算得到纵向成本函数;The distance from the node to the state point is calculated as the second distance cost; the square of the difference between the distance from the node to the obstacle on the s-axis and the distance threshold to the obstacle is calculated as the second safety cost; the rate of change of the reference speed is calculated as the speed change cost; combined with the corresponding weight coefficient, the longitudinal cost function is calculated;最终取纵向成本函数最小处的值,连线形成纵向期望参考速度曲线。Finally, the value at the minimum of the longitudinal cost function is taken, and the lines are connected to form the longitudinal expected reference speed curve.
CN202210863061.5A2022-07-212022-07-21 A lane-changing decision control method for autonomous driving vehicles considering uncertaintyActiveCN115257746B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN202210863061.5ACN115257746B (en)2022-07-212022-07-21 A lane-changing decision control method for autonomous driving vehicles considering uncertainty

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN202210863061.5ACN115257746B (en)2022-07-212022-07-21 A lane-changing decision control method for autonomous driving vehicles considering uncertainty

Publications (2)

Publication NumberPublication Date
CN115257746A CN115257746A (en)2022-11-01
CN115257746Btrue CN115257746B (en)2024-11-26

Family

ID=83767878

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN202210863061.5AActiveCN115257746B (en)2022-07-212022-07-21 A lane-changing decision control method for autonomous driving vehicles considering uncertainty

Country Status (1)

CountryLink
CN (1)CN115257746B (en)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN115685852B (en)*2022-11-082025-08-22深向科技股份有限公司 Vehicle braking control method, device, electronic device, and storage medium
CN116080678A (en)*2022-11-152023-05-09清华大学 A method and device for autonomous driving obstacle avoidance planning for diamond-shaped vehicles
CN116048081B (en)*2023-01-102025-07-25吉林大学Automatic driving vehicle decision and regulation method considering safety boundary constraint
CN116767218B (en)*2023-08-182023-11-17北京理工大学Forced lane change decision method for unmanned vehicle, computer equipment and medium
CN117002536B (en)*2023-09-182025-09-16驭势科技(北京)有限公司Interactive decision method and device for vehicle, electronic equipment and storage medium
CN117666559B (en)*2023-11-072024-07-02北京理工大学前沿技术研究院Autonomous vehicle transverse and longitudinal decision path planning method, system, equipment and medium
CN117348415B (en)*2023-11-082024-06-04重庆邮电大学 An autonomous driving decision-making method based on finite state machine
CN117302224B (en)*2023-11-302024-02-23上海鉴智其迹科技有限公司Lane changing method, automatic driving method, device and vehicle
CN117870689B (en)*2024-01-122024-09-20哈尔滨工业大学(威海)Unmanned vehicle navigation decision planning system and method based on partial Markov decision process
CN117944684B (en)*2024-03-262024-05-31福瑞泰克智能系统有限公司Track changing planning method and device, storage medium and electronic equipment
CN118894134B (en)*2024-10-092024-12-13吉林大学Automatic driving automobile track planning method considering uncertainty of surrounding automobiles
CN119283877B (en)*2024-11-212025-08-22北京赛目科技股份有限公司 Quantitative evaluation method and device for controllability of potential driving hazards
CN119659609B (en)*2024-11-262025-09-23同济大学 Vehicle adaptive cruise control method with steering collision avoidance function

Citations (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN108387242A (en)*2018-02-072018-08-10西南交通大学Automatic Pilot lane-change prepares and executes integrated method for planning track
CN114013443A (en)*2021-11-122022-02-08哈尔滨工业大学 A lane-changing decision control method for autonomous vehicles based on hierarchical reinforcement learning

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US11782438B2 (en)*2020-03-172023-10-10Nissan North America, Inc.Apparatus and method for post-processing a decision-making model of an autonomous vehicle using multivariate data

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN108387242A (en)*2018-02-072018-08-10西南交通大学Automatic Pilot lane-change prepares and executes integrated method for planning track
CN114013443A (en)*2021-11-122022-02-08哈尔滨工业大学 A lane-changing decision control method for autonomous vehicles based on hierarchical reinforcement learning

Also Published As

Publication numberPublication date
CN115257746A (en)2022-11-01

Similar Documents

PublicationPublication DateTitle
CN115257746B (en) A lane-changing decision control method for autonomous driving vehicles considering uncertainty
CN112965476B (en) A high-speed unmanned vehicle trajectory planning system and method based on multi-window model
CN110834644B (en)Vehicle control method and device, vehicle to be controlled and storage medium
CN110362910B (en) Game theory-based method for establishing lane-changing conflict coordination model for autonomous vehicles
CN114013443B (en)Automatic driving vehicle lane change decision control method based on hierarchical reinforcement learning
CN111273668B (en)Unmanned vehicle motion track planning system and method for structured road
CN113291308A (en)Vehicle self-learning lane-changing decision-making system and method considering driving behavior characteristics
CN110304074A (en) A Hybrid Driving Method Based on Hierarchical State Machine
CN115214672A (en) A human-like decision-making, planning and control method for autonomous driving considering workshop interaction
CN110568841A (en)Automatic driving decision method and system
CN112269384B (en)Vehicle dynamic trajectory planning method combining obstacle behavior intention
CN117870689B (en)Unmanned vehicle navigation decision planning system and method based on partial Markov decision process
CN112577506B (en)Automatic driving local path planning method and system
CN118212808B (en)Method, system and equipment for planning traffic decision of signalless intersection
CN115140094A (en)Real-time lane change decision-making method based on longitudinal safety interval model
CN116259185B (en)Vehicle behavior decision method and device fusing prediction algorithm in parking lot scene
Tang et al.Integrated decision making and planning framework for autonomous vehicle considering uncertain prediction of surrounding vehicles
Liu et al.Impact of sharing driving attitude information: A quantitative study on lane changing
CN115273450A (en) A lane-changing method for vehicles entering a formation in a networked autonomous driving environment
CN117922612A (en)Interconnection automatic driving decision method based on collaborative awareness and self-adaptive information fusion
Xiong et al.Integrated decision making and planning based on feasible region construction for autonomous vehicles considering prediction uncertainty
CN117864168A (en) An unmanned vehicle navigation control system and method based on game Markov process safety detection
CN117111610A (en)Intelligent vehicle dynamic environment track planning system and method based on self-adaptive potential field
CN114212107B (en)Intelligent Internet-of-things driving planning method based on value graph
CN116394968B (en)Multi-agent reinforcement learning-based automatic driving vehicle control method

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
GR01Patent grant
GR01Patent grant

[8]ページ先頭

©2009-2025 Movatter.jp