Movatterモバイル変換


[0]ホーム

URL:


CN104462024A - Method and device for generating dialogue action strategy model - Google Patents

Method and device for generating dialogue action strategy model
Download PDF

Info

Publication number
CN104462024A
CN104462024ACN201410594506.XACN201410594506ACN104462024ACN 104462024 ACN104462024 ACN 104462024ACN 201410594506 ACN201410594506 ACN 201410594506ACN 104462024 ACN104462024 ACN 104462024A
Authority
CN
China
Prior art keywords
user
target task
system state
action
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410594506.XA
Other languages
Chinese (zh)
Other versions
CN104462024B (en
Inventor
焦增涛
汪冠春
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co LtdfiledCriticalBeijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201410594506.XApriorityCriticalpatent/CN104462024B/en
Publication of CN104462024ApublicationCriticalpatent/CN104462024A/en
Application grantedgrantedCritical
Publication of CN104462024BpublicationCriticalpatent/CN104462024B/en
Activelegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Landscapes

Abstract

The invention provides a method and device for generating a dialogue action strategy model. The method includes the steps that a user historical dialogue log is acquired; the user historical dialogue log is analyzed by combining the scene priori knowledge of a target task, and a plurality of first system state characteristics of the target task are dug out; a plurality of second system state characteristics of the preset target task are combined with the first system state characteristics, and a plurality of third system state characteristics are acquired; label data extracted from the user historical dialogue log are used as training samples, the third system state characteristics are used as training characteristics to establish an action decision model and to conduct model training, and the parameter vectors of all the third system state characteristics are learned. Fine granularity dialogue strategy learning is guided under a unified framework, more accurate results and the dialogue strategy which mostly meets the target task requirement are provided for users, and therefore user experience is improved.

Description

Method and device for generating dialogue action strategy model
Technical Field
The invention relates to the technical field of natural language processing, in particular to a method and a device for generating a dialogue action strategy model.
Background
In the age of rapid popularization of the internet, the instant messaging conversation system gradually enters the lives of people, great convenience is brought to the lives of people, and the conversation strategy is the key influencing the user experience.
The dialog strategies of existing dialog systems are mainly designed and applied on the basis of rules and on the basis of statistical models. However, the rule-based method requires good background knowledge for rule makers, and in addition, as the factors involved in the rule increase, the processing logic becomes complex, and the effect cannot reach the optimal state, thereby affecting the user experience; the existing dialogue systems based on the statistical model cannot reasonably utilize information related to the dialogue and cannot guide the dialogue systems to complete fine-grained dialogue strategies.
Disclosure of Invention
The invention aims to provide a method and a device for generating a dialogue action strategy model, which are used for establishing an action decision model by using rich state characteristics as training characteristics and performing model training so as to guide the learning of a fine-grained dialogue strategy under a unified framework.
According to an aspect of the present invention, there is provided a method of generating a dialogue action strategy model, the method comprising: acquiring a historical dialog log of a user; analyzing the user historical dialogue logs by combining scene prior knowledge of a target task, and excavating a plurality of first system state characteristics of the target task; combining a plurality of preset second system state features of the target task with the plurality of first system state features to obtain a plurality of third system state features; and taking the marked data extracted from the user historical dialogue logs as training samples, taking the third system state characteristics as training characteristics to establish an action decision model and carry out model training, and learning parameter vectors of the third system state characteristics.
According to another aspect of the present invention, there is provided an apparatus for generating a dialogue action strategy model, the apparatus comprising: the log acquisition unit is used for acquiring a user history conversation log; the state acquisition unit is used for analyzing the user historical conversation log by combining scene prior knowledge of a target task and excavating a plurality of first system state characteristics of the target task; the state combining unit is used for combining a plurality of preset second system state characteristics of the target task with the plurality of first system state characteristics to obtain a plurality of third system state characteristics; and the decision model generating unit is used for establishing an action decision model by taking the marking data extracted from the user historical dialogue logs as training samples and taking the third system state characteristics as training characteristics, carrying out model training and learning parameter vectors of the third system state characteristics.
According to the method and the device for generating the dialogue action strategy model, provided by the invention, the situation prior characteristics of the dialogue are effectively utilized and the user behavior characteristics are analyzed based on the dialogue log, so that rich system state characteristics are obtained for establishing and training the action decision model, and the learning of a fine-grained dialogue strategy is guided under a unified frame, so that a more accurate result and a dialogue strategy which best meets the requirement of a target task are provided for a user, and the user experience is improved.
Drawings
Fig. 1 is a flowchart illustrating a method of generating a dialogue action policy model according to an exemplary embodiment of the present invention.
FIG. 2 is example data of a user history dialog log illustrating a method of generating a dialog action policy model according to an example embodiment of the present invention.
Fig. 3 is an exemplary diagram illustrating a context prior feature of an exemplary embodiment of the present invention.
Fig. 4 is an exemplary diagram illustrating a third system status feature of an exemplary embodiment of the present invention.
FIG. 5 is an exemplary diagram of a Markov decision process based decision model illustrating an exemplary embodiment of the invention.
Fig. 6 is a logic block diagram illustrating an apparatus for generating a dialogue action policy model according to an exemplary embodiment of the present invention.
Detailed Description
The general concept of the invention is that a user historical dialogue log is analyzed by combining scene prior knowledge of a template task, a plurality of new system state features (first system state features) of the target task are mined, the new system state features are combined with the traditional state features (second system state features) to obtain rich system state features (third system state features), and an action decision model is established by using the rich system state features and training samples obtained based on labeled data extracted from the user historical dialogue log to carry out model training, so that the learning of a fine-grained dialogue strategy is guided under a unified frame, and the user experience of the dialogue task is enhanced.
A method and apparatus for generating a dialogue action strategy model according to an exemplary embodiment of the present invention will be described in detail with reference to the accompanying drawings.
Fig. 1 is a flowchart illustrating a method of generating a dialogue action policy model according to an exemplary embodiment of the present invention.
Referring to fig. 1, in step S110, a user history dialog log is acquired.
According to an exemplary embodiment of the present invention, the user history dialog log may be a number of rounds of dialog initiated in a dialog system to complete a target task.
FIG. 2 is example data of a user history dialog log illustrating a method of generating a dialog action policy model according to an example embodiment of the present invention. FIG. 2 shows a dialog record for a user performing a reservation airline ticket task.
Referring to fig. 2, the data of the user history dialog log includes, but is not limited to: the date the conversation occurred (e.g., 06-13), the time (e.g., 09:08), the USER's identity ID (e.g., USER _04E15FFC $ D261B6D2032B6316CBD36F4), the USER query word (e.g., "tickets to nanjing" in the figure), and the system return results (e.g., destination nanjing, origin Manshan, departure date 2014-. In practical application, different user history conversation logs and corresponding data in the logs can be obtained according to different conversation tasks.
In step S120, the historical dialog logs of the user are analyzed in combination with the scenario prior knowledge of the target task, and a plurality of first system state features of the target task are mined.
Specifically, the plurality of first system state features of the mining target task comprise: counting the distribution state of a plurality of user behavior characteristics on the preselected characteristics of the target task according to the user historical dialogue log; further, the situation prior characteristics of the target task are verified according to the distribution state of the user behavior characteristics on the preselected characteristics of the target task, and the first system state characteristics (new system state characteristics) are extracted from the situation prior characteristics. The preselected features are preselected dialog state features that have a close association with the performance of the task. For example, in the task of booking an airline ticket, the preselected feature may be "origin", "destination", or the like.
Wherein the plurality of user behavior characteristics include at least one of the following statistical characteristics: the proportion of the user completing the target task query, the proportion of the user not completing the target task query, the proportion of the user continuing the conversation after obtaining the query result, the proportion of the user clearly expressing the unconsciousness and the average number of interactive rounds of the conversation. Thus, for example, the distribution of the proportion of the user who completed the airline reservation query over the origin characteristic, the distribution of the proportion of the user who did not complete the airline reservation query over the origin characteristic, the proportion of the user who continued to converse (e.g., make a hotel) after completing the airline reservation, and so forth may be counted.
Fig. 3 is an exemplary diagram illustrating a context prior feature of an exemplary embodiment of the present invention.
The scene prior characteristics include various elements that may affect the effect of the system returned result, as shown in fig. 3, for example: time prior information (e.g., time, date, etc. of user's session), region prior information (e.g., city where user's session is located, size type of the city, whether it is a tourist area or an industrial city), and historical action information (e.g., last confirmed current session turn, last cleared current session turn, or last inquired current session turn, etc. during user's session), it can be understood by those skilled in the art that for a target task, there may be different factors that affect the session effect, such as ticket booking task, day of week, time of week, place of departure, destination, session turn of last specific action from current action, etc. as the context prior knowledge.
Specifically, for the time prior information, for example, a user who orders an air ticket at late night tends to completely submit ticket ordering requirement information more than a user who orders an air ticket at evening, and other user behavior characteristics at two times are also obviously different; the reason for understanding the prior information of the region, for example, the user who starts from the tourist area has a higher ratio of completing the target task query by the user than the user who is scheduled to go to the air ticket of the tourist area, a lower ratio of not completing the target task query by the user, a ratio of continuing the conversation after the user obtains the query result, a ratio of clearly expressing the unconsciousness of the user and an average number of interactive turns of the conversation, is probably that the user who starts from the tourist area generally needs to go back urgently, and the user who goes to the tourist area may only browse the result and then make a decision; in addition, in the user dialogue experience, the frequency of system actions such as information confirmation, clarification and the like affects the user experience and further affects the user behavior data, and in summary, it is also necessary to count the historical action information of the system.
In step S130, a plurality of preset second system state features of the target task are combined with the plurality of first system state features obtained in step S120 to obtain a plurality of third system state features (i.e., rich system state features, wherein the plurality of third system state features are respectively represented as feature vectors), so as to form a more complete system state vector.
According to an exemplary embodiment of the present invention, in step S130, the plurality of second system state features are conventional system state features (which may be, but is not limited to, represented by feature attribute slot states), such as filling states and filling types of attribute slots, and specifically, for example, in an airline ticket booking task, whether a destination attribute slot is filled (or assigned) or is assigned with an ambiguity, or is assigned with a high confidence level, and the like.
Fig. 4 is an exemplary diagram illustrating a third system status feature of an exemplary embodiment of the present invention.
Further, a plurality of second system state features of the preset target task are combined with the plurality of first system state features to obtain a third system state feature, the third system state feature is output in the form of a feature vector, the feature vector represents different meaning types according to different dimensions, as shown in fig. 4, the third system state feature comprises a preset traditional system state feature (second system state feature) and newly-added time prior information, region prior information and historical action information, therefore, not only can the logic of the system task be expressed, but also a conversation strategy which better accords with the individual fine granularity of the user action feature can be described, wherein the traditional system state feature, the time prior information, the region prior information and the historical action information are all expressed as feature vectors.
In step S140, an operation decision model is created using the plurality of third system state features obtained in step S130 as training features, model training is performed, a parameter vector of each third system state feature is learned, and labeled data extracted from the user history dialog log is used as a training sample.
Specifically, in step S140, the original system log is formatted into a sample format required by the training model based on the new system state feature (first system state feature) and the user behavior feature counted from the user history dialog log, wherein in this step, the training sample is used as an input, and the action decision is used as an output.
According to a preferred embodiment of the invention, the action decision model is a Markov Decision Process (MDP) based model or a Partially Observable Markov Decision Process (POMDP) based model, and each of the training samples comprises parameter values of a plurality of third system state features, action data and a reward score for labeling the action.
FIG. 5 is an exemplary diagram of a Markov decision process based decision model illustrating an exemplary embodiment of the invention.
As shown in fig. 5, a Markov Decision Process (MDP) based model is exemplified for the description. In particular, for an exemplary graph of a Markov decision process based decision model of an exemplary embodiment of the present invention, a one-wheel dialog process may be represented based on the current system state s1The system takes action a1And for the action, the external environment gives a reward r for the system action1After the user gives the next query term (i.e. user requirement), the system enters the next system state s2And the above process is repeated until the session is over (e.g., the prize r shown in the figure)3)。
Wherein the system state siCan beThe action a of the system is described by a new state vector from a dialog logiCan be directly extracted from the historical dialog log of the user, and the reward r isiThe method can be obtained by fitting according to the user behavior characteristics or by means of manual marking, and the number of the i is represented.
In particular, a Q-value function of a Markov Decision Process (MDP) model or a Partially Observable Markov Decision Process (POMDP) model is estimated by means of function approximationWherein,the method is to express a characteristic function, a pair of system states s and system actions a are combined and mapped to a K-dimensional space, theta is an action decision model, the goal of model training under the line is to learn the action decision model theta based on training corpus, model parameters are learned by time difference calculation, and the method comprises the following steps of continuously iterating (iterative formula is as follows: q(s)t,at)=Q(st,at)+α(rt+1+γQ(st+1,at+1)-Q(st,at) Where α is a learning step length, γ is a discount coefficient, rt+1Rewarding the system action by the external environment at the moment of t +1, wherein the rewarding is obtained according to the user behavior characteristics of the system), and learning the approximate optimal model parameter vector, thereby outputting the corresponding action decision model.
Further, based on the decision model θ obtained by training, the process of obtaining the system action a from the current state s is the system decision, and when the system makes the action decision, the system is firstly usedIn (1)And mapping the system state characteristics and any effective action to a K-dimensional space by the function, obtaining a Q function value corresponding to each action based on the decision model theta, and taking the action corresponding to the maximum Q value as the system action output.
In order to make the technical scheme of the present invention more understandable, the technical scheme of the present invention will be further explained by using examples of the technical scheme of the present invention, wherein the examples of the application are as follows:
scene 1: suppose a user has a conversation with the system in Beijing.
The user: and helping I order a ticket of the economy class for going to III.
The system comprises the following steps: good to help you find an economy class airline ticket tomorrow from beijing to saint, as follows, what are other needs left?
If the dialogue is performed in the scene 1, the user is in a metropolitan city (Beijing) and a destination tourist attraction (III), a radical default assignment dialogue strategy can be learned through model iterative training, a GPS address (Beijing) is automatically used as a starting place, and a starting date is set as tomorrow, so that the aim of rapidly displaying results and screening the results by the user is fulfilled.
Scene 2: suppose a user is conversing with the system at three.
The user: helping me to order an economic cabin air ticket back to Beijing.
The system comprises the following steps: good, to beijing economy class ticket, where do you go? When to want to walk?
The user: tomorrow started from san.
The system comprises the following steps: help you find the result as follows (display result), do you want to find a few tickets?
As shown in the dialog of the scene 2, after statistical learning, a strategy can be learned that a user orders an air ticket when traveling, and generally has a clear return plan, and the system can ask detailed needs (for example, "where do you go.
The application scenarios of the two regions of prior information are taken as examples for explanation, and the application scenarios of the time prior information and the historical action information can learn the fine-granularity conversation strategy which best meets the requirement of the target task through the technical scheme of the invention.
According to the method for generating the dialogue action strategy model, provided by the invention, the situation prior characteristics of the dialogue are effectively utilized and the user behavior characteristics are analyzed based on the dialogue log, so that rich system state characteristics are obtained to establish and train an action decision model, and the learning of a fine-grained dialogue strategy is guided under a unified frame, so that a more accurate result and a dialogue strategy which best meets the requirement of a target task are provided for a user, and the user experience is improved.
Fig. 6 is a logic block diagram illustrating an apparatus for generating a dialogue action policy model according to an exemplary embodiment of the present invention.
Referring to fig. 6, the apparatus for generating a dialogue action policy model according to an exemplary embodiment of the present invention includes a log obtaining unit 610, a state obtaining unit 620, a state combining unit 630, and a decision model generating unit 640.
The log obtaining unit 610 is used for obtaining a user history dialog log.
The state obtaining unit 620 is configured to analyze the user history dialog log in combination with context prior knowledge of a target task, and mine a plurality of first system state features of the target task.
According to a preferred embodiment of the present invention, the state acquiring unit 620 further includes a statistical unit (not shown in the figure) and a state feature extracting unit (not shown in the figure). The statistical unit is used for counting the distribution state of a plurality of user behavior characteristics on the preselected characteristics of the target task according to the user historical dialogue logs; the state feature extraction unit is used for verifying the situation prior feature of the target task according to the distribution state of the user behavior features on the preselected feature of the target task, and extracting the first system state features from the situation prior feature.
Wherein the plurality of user behavior characteristics include at least one of the following statistical characteristics: the proportion of the user completing the target task query, the proportion of the user not completing the target task query, the proportion of the user continuing the conversation after obtaining the query result, the proportion of the user clearly expressing the unconsciousness and the average number of interactive rounds of the conversation.
In addition, the context prior characteristics of the target task include time prior information, region prior information, and historical action information.
The state combining unit 630 is configured to combine a plurality of preset second system state features of the target task with the plurality of first system state features to obtain a plurality of third system state features.
According to another preferred embodiment of the present invention, the state combining unit comprises a state feature representing unit (not shown in the figure) for representing the plurality of third system state features as feature vectors, respectively.
The decision model generating unit 640 is configured to use the labeled data extracted from the user historical dialog log as a training sample, use the plurality of third system state features as training features to establish an action decision model, perform model training, and learn parameter vectors of the third system state features.
Further, the action decision model is a Markov Decision Process (MDP) based model or a Partially Observable Markov Decision Process (POMDP) based model, and each of the training samples applied at the decision model generating unit 640 includes parameter values of a plurality of third system state features, action data, and a reward score for labeling the action.
According to the device for generating the dialogue action strategy model, provided by the invention, the situation prior characteristics of the dialogue are effectively utilized and the user behavior characteristics are analyzed based on the dialogue log, so that rich system state characteristics are obtained for establishing and training the action decision model, and the learning of a fine-grained dialogue strategy is guided under a unified frame, so that a more accurate result and a dialogue strategy which best meets the requirement of a target task are provided for a user, and the user experience is improved.
It should be noted that, according to the implementation requirement, each step described in the present application can be divided into more steps, and two or more steps or partial operations of the steps can be combined into a new step to achieve the purpose of the present invention.
The above-described method according to the present invention can be implemented in hardware, firmware, or as software or computer code storable in a recording medium such as a CD ROM, a RAM, a floppy disk, a hard disk, or a magneto-optical disk, or as computer code originally stored in a remote recording medium or a non-transitory machine-readable medium and to be stored in a local recording medium downloaded through a network, so that the method described herein can be stored in such software processing on a recording medium using a general-purpose computer, a dedicated processor, or programmable or dedicated hardware such as an ASIC or FPGA. It will be appreciated that the computer, processor, microprocessor controller or programmable hardware includes memory components (e.g., RAM, ROM, flash memory, etc.) that can store or receive software or computer code that, when accessed and executed by the computer, processor or hardware, implements the processing methods described herein. Further, when a general-purpose computer accesses code for implementing the processes shown herein, execution of the code transforms the general-purpose computer into a special-purpose computer for performing the processes shown herein.
The above description is only for the specific embodiments of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present invention, and all the changes or substitutions should be covered within the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the appended claims.
The above description is only for the specific embodiments of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present invention, and all the changes or substitutions should be covered within the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the appended claims.

Claims (14)

CN201410594506.XA2014-10-292014-10-29The method and apparatus for generating dialogue action policy modelActiveCN104462024B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN201410594506.XACN104462024B (en)2014-10-292014-10-29The method and apparatus for generating dialogue action policy model

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN201410594506.XACN104462024B (en)2014-10-292014-10-29The method and apparatus for generating dialogue action policy model

Publications (2)

Publication NumberPublication Date
CN104462024Atrue CN104462024A (en)2015-03-25
CN104462024B CN104462024B (en)2018-07-13

Family

ID=52908097

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN201410594506.XAActiveCN104462024B (en)2014-10-292014-10-29The method and apparatus for generating dialogue action policy model

Country Status (1)

CountryLink
CN (1)CN104462024B (en)

Cited By (16)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN105788593A (en)*2016-02-292016-07-20中国科学院声学研究所Method and system for generating dialogue strategy
CN106295792A (en)*2016-08-052017-01-04北京光年无限科技有限公司Dialogue data interaction processing method based on multi-model output and device
CN106777081A (en)*2016-12-132017-05-31竹间智能科技(上海)有限公司Method and device for determining conversational system acknowledgment strategy
CN106778303A (en)*2016-12-072017-05-31腾讯科技(深圳)有限公司Delegated strategy optimization method and delegated strategy optimization device
CN107066444A (en)*2017-03-272017-08-18上海奔影网络科技有限公司Language material generation method and device based on many wheel interactions
CN107357838A (en)*2017-06-232017-11-17上海交通大学Dialog strategy canbe used on line method based on multi-task learning
CN107894829A (en)*2016-10-042018-04-10谷歌公司The classification annotation of dialogue action
CN108304489A (en)*2018-01-052018-07-20广东工业大学A kind of goal directed type personalization dialogue method and system based on intensified learning network
CN109086282A (en)*2017-06-142018-12-25杭州方得智能科技有限公司A kind of method and system for the more wheels dialogue having multitask driving capability
WO2019000905A1 (en)*2017-06-282019-01-03京东方科技集团股份有限公司Triage dialogue method, device, and system
CN109685462A (en)*2018-12-212019-04-26义橙网络科技(上海)有限公司A kind of personnel and post matching method, apparatus, system, equipment and medium
CN110413754A (en)*2019-07-222019-11-05清华大学 Dialogue (medium) reward evaluation and dialogue method, medium, apparatus and computing device
CN111341308A (en)*2020-02-122020-06-26百度在线网络技术(北京)有限公司Method and apparatus for outputting information
US11321535B2 (en)2016-10-042022-05-03Google LlcHierarchical annotation of dialog acts
CN114581162A (en)*2022-05-092022-06-03成都晓多科技有限公司Method and device for predicting order in customer service conversation process and electronic equipment
CN115309877A (en)*2022-08-032022-11-08北京百度网讯科技有限公司 Dialogue generation method, dialogue model training method and device

Citations (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US7526474B2 (en)*2005-03-142009-04-28Fuji Xerox Co., Ltd.Question answering system, data search method, and computer program
US20090327234A1 (en)*2008-06-272009-12-31Google Inc.Updating answers with references in forums
CN102262634A (en)*2010-05-242011-11-30北京大学深圳研究生院Automatic questioning and answering method and system
CN103412882A (en)*2013-07-182013-11-27百度在线网络技术(北京)有限公司Method and device for distinguishing consumption intention
CN103853842A (en)*2014-03-202014-06-11百度在线网络技术(北京)有限公司Automatic question and answer method and system
CN104008184A (en)*2014-06-102014-08-27百度在线网络技术(北京)有限公司Method and device for pushing information

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US7526474B2 (en)*2005-03-142009-04-28Fuji Xerox Co., Ltd.Question answering system, data search method, and computer program
US20090327234A1 (en)*2008-06-272009-12-31Google Inc.Updating answers with references in forums
CN102262634A (en)*2010-05-242011-11-30北京大学深圳研究生院Automatic questioning and answering method and system
CN103412882A (en)*2013-07-182013-11-27百度在线网络技术(北京)有限公司Method and device for distinguishing consumption intention
CN103853842A (en)*2014-03-202014-06-11百度在线网络技术(北京)有限公司Automatic question and answer method and system
CN104008184A (en)*2014-06-102014-08-27百度在线网络技术(北京)有限公司Method and device for pushing information

Cited By (26)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN105788593A (en)*2016-02-292016-07-20中国科学院声学研究所Method and system for generating dialogue strategy
CN105788593B (en)*2016-02-292019-12-10中国科学院声学研究所Method and system for generating conversation strategy
CN106295792B (en)*2016-08-052019-08-20北京光年无限科技有限公司Dialogue data interaction processing method and device based on multi-model output
CN106295792A (en)*2016-08-052017-01-04北京光年无限科技有限公司Dialogue data interaction processing method based on multi-model output and device
CN107894829B (en)*2016-10-042021-09-10谷歌有限责任公司Hierarchical annotation of conversational actions
CN107894829A (en)*2016-10-042018-04-10谷歌公司The classification annotation of dialogue action
US11321535B2 (en)2016-10-042022-05-03Google LlcHierarchical annotation of dialog acts
US11122089B2 (en)2016-12-072021-09-14Tencent Technology (Shenzhen) Company LimitedAuthorization policy optimization method and apparatus, and storage medium
CN106778303B (en)*2016-12-072020-03-17腾讯科技(深圳)有限公司Authorization policy optimization method and authorization policy optimization device
CN106778303A (en)*2016-12-072017-05-31腾讯科技(深圳)有限公司Delegated strategy optimization method and delegated strategy optimization device
CN106777081A (en)*2016-12-132017-05-31竹间智能科技(上海)有限公司Method and device for determining conversational system acknowledgment strategy
CN107066444A (en)*2017-03-272017-08-18上海奔影网络科技有限公司Language material generation method and device based on many wheel interactions
CN107066444B (en)*2017-03-272020-11-03上海奔影网络科技有限公司Corpus generation method and apparatus based on multi-round interaction
CN109086282A (en)*2017-06-142018-12-25杭州方得智能科技有限公司A kind of method and system for the more wheels dialogue having multitask driving capability
CN107357838A (en)*2017-06-232017-11-17上海交通大学Dialog strategy canbe used on line method based on multi-task learning
CN107357838B (en)*2017-06-232020-09-01上海交大知识产权管理有限公司On-line implementation method of conversation strategy based on multi-task learning
WO2019000905A1 (en)*2017-06-282019-01-03京东方科技集团股份有限公司Triage dialogue method, device, and system
CN108304489A (en)*2018-01-052018-07-20广东工业大学A kind of goal directed type personalization dialogue method and system based on intensified learning network
CN109685462A (en)*2018-12-212019-04-26义橙网络科技(上海)有限公司A kind of personnel and post matching method, apparatus, system, equipment and medium
CN110413754A (en)*2019-07-222019-11-05清华大学 Dialogue (medium) reward evaluation and dialogue method, medium, apparatus and computing device
CN111341308A (en)*2020-02-122020-06-26百度在线网络技术(北京)有限公司Method and apparatus for outputting information
CN111341308B (en)*2020-02-122023-07-21百度在线网络技术(北京)有限公司Method and device for outputting information
CN114581162A (en)*2022-05-092022-06-03成都晓多科技有限公司Method and device for predicting order in customer service conversation process and electronic equipment
CN114581162B (en)*2022-05-092022-09-02成都晓多科技有限公司Method and device for predicting order in customer service conversation process and electronic equipment
CN115309877A (en)*2022-08-032022-11-08北京百度网讯科技有限公司 Dialogue generation method, dialogue model training method and device
CN115309877B (en)*2022-08-032023-05-30北京百度网讯科技有限公司Dialogue generation method, dialogue model training method and device

Also Published As

Publication numberPublication date
CN104462024B (en)2018-07-13

Similar Documents

PublicationPublication DateTitle
CN104462024B (en)The method and apparatus for generating dialogue action policy model
CN106534548B (en)Voice error correction method and device
US20210073551A1 (en)Method and system for video segmentation
US20180190283A1 (en)On-board voice command identification method and apparatus, and storage medium
US11514313B2 (en)Sampling from a generator neural network using a discriminator neural network
US11887224B2 (en)Method, apparatus, and computer program for completing painting of image, and method, apparatus, and computer program for training artificial neural network
CN112488806A (en)Method and device for predicting order willingness, computer equipment and computer-readable storage medium
CN113505854A (en)Method, device, equipment and medium for constructing facial image quality evaluation model
CN114529765B (en)Data processing method, device and computer readable storage medium
CN109275047A (en) Video information processing method and device, electronic device, and storage medium
CN111291799A (en)Room window classification model construction method, room window classification method and room window classification system
CN110059174B (en)Query guiding method and device
CN112784024B (en)Man-machine conversation method, device, equipment and storage medium
CN117290605A (en)Vehicle-mounted intelligent scene recommendation method, device, equipment and medium
CN118505842B (en)Data processing method, device, equipment and readable storage medium
CN114056343A (en)Interaction method, server and storage medium
CN118298443A (en)Verification code data labeling processing method and device
CN104504104A (en)Picture material processing method and device for search engine, and search engine
CN114218288B (en)Driving route recommendation method and device, storage medium and terminal
CN117216206A (en)Session processing method and device, electronic equipment and storage medium
CN116612205A (en)Image generation method and device of multi-mode model based on double-contrast learning
CN110766435B (en)Vector training method and device, electronic equipment and computer readable storage medium
CN111401962A (en)Key opinion consumer mining method, device, equipment and medium
CN115983281B (en)Information extraction and model acquisition method and device, electronic equipment and storage medium
CN116702785B (en)Processing method and device of relational tag, storage medium and electronic equipment

Legal Events

DateCodeTitleDescription
C06Publication
PB01Publication
SE01Entry into force of request for substantive examination
GR01Patent grant
GR01Patent grant

[8]ページ先頭

©2009-2025 Movatter.jp