Disclosure of Invention
The application provides an intelligent collaborative management method and platform based on artificial intelligence driving, which at least solve the technical problem of low post management efficiency caused by the inability of related technologies to efficiently and accurately distribute posts in an exchange platform to the objects best suited to process them.
According to one aspect of the application, an intelligent collaborative management method based on artificial intelligence driving is provided, which comprises the steps of: obtaining a target post to be processed in an exchange platform, and determining a multidimensional feature vector of the target post, wherein the multidimensional feature vector comprises a theme classification feature, an emotion feature and a keyword coding feature; constructing a graph neural network, wherein nodes in the graph neural network comprise department nodes and employee nodes, edges in the graph neural network comprise inter-department collaboration edges, department-employee membership edges and inter-employee collaboration edges, department node attributes at least comprise the number of posts to be processed, the average post processing duration and a professional field code, and employee node attributes at least comprise a current state, a skill feature vector and a historical satisfaction score; analyzing the multidimensional feature vector and the graph neural network by utilizing a first deep Q network to obtain a target node in the graph neural network for processing the target post, wherein the state in the first deep Q network comprises the multidimensional feature vector of the current post and the topology structure and node attributes of the graph neural network, and the actions in the first deep Q network comprise selecting a target department node, selecting a target employee node and performing multi-hop decisions along graph edges; and forwarding the target post to the target node, obtaining a processing result of the target node for the target post, and forwarding the processing result to an information receiving area or a target communication address associated with the target post.
Optionally, the reward function of the first deep Q network is determined by: determining a first sub-reward function according to a matching score, wherein the matching score is used for representing the semantic matching degree between the post content and the node's professional field and is obtained by calculating, through natural language processing, the similarity between post keywords and preset node labels; determining a second sub-reward function according to a time delay, wherein the time delay represents the time difference between the post creation timestamp and the successful allocation timestamp; determining a third sub-reward function according to a load index, wherein the load index represents the ratio of the node's current task queue length to the node's maximum processing capacity; determining a fourth sub-reward function according to a satisfaction score, wherein the satisfaction score represents the feedback rating provided by a user after the post processing is completed; and determining the reward function according to the first sub-reward function, the second sub-reward function, the third sub-reward function and the fourth sub-reward function.
Optionally, determining the reward function according to the first sub-reward function, the second sub-reward function, the third sub-reward function and the fourth sub-reward function comprises the steps of: obtaining an initial weight coefficient for each sub-reward function; continuously monitoring a post allocation success rate index, an average processing delay index and a user satisfaction mean index; increasing the first initial weight coefficient corresponding to the first sub-reward function and the fourth initial weight coefficient corresponding to the fourth sub-reward function when a decrease in the post allocation success rate index is detected; increasing the second initial weight coefficient corresponding to the second sub-reward function and the third initial weight coefficient corresponding to the third sub-reward function when an increase in the average processing delay index is detected; increasing the fourth initial weight coefficient when a decrease in the user satisfaction mean index is detected; performing a normalization operation after each weight coefficient update so that the sum of the updated weight coefficients is 1, thereby obtaining a first weight coefficient, a second weight coefficient, a third weight coefficient and a fourth weight coefficient; and determining the reward function according to the four sub-reward functions and the first, second, third and fourth weight coefficients.
Optionally, determining the multidimensional feature vector of the target post comprises: extracting a semantic embedding vector of the target post by using a pre-trained language model; calculating, through a topic attention layer, the probability distribution over different topic classifications and normalizing the distribution to obtain a first feature component; calculating a vocabulary strength score of the target post based on a financial emotion dictionary, wherein the financial emotion dictionary comprises a plurality of preset words representing positive, neutral and negative emotions, and the vocabulary strength score quantifies the strength of the emotional tendency explicitly expressed in the target post; determining an emotion probability of the target post by using a deep learning model, wherein the emotion probability represents the probability that the target post belongs to positive, neutral or negative emotion; dynamically weighting and fusing the vocabulary strength score and the emotion probability according to a financial term density to obtain a second feature component, wherein the financial term density is the ratio of the number of financial terms appearing in the target post to the total number of words of the target post; removing general stop words and high-frequency financial function words from the target post, retaining professional terms whose term frequency-inverse document frequency values exceed a preset threshold, jointly screening core words by using a term co-occurrence graph and a centrality measure, and determining a third feature component based on the screened core words; and splicing the first feature component, the second feature component and the third feature component, and performing dimension reduction on the splicing result through an orthogonal-constraint linear transformation layer to obtain the multidimensional feature vector.
Optionally, after the target post to be processed in the communication platform is obtained, the method further comprises: encoding the target post by utilizing a pre-trained language model to obtain a semantic feature vector; determining a normalized click rate value and a poster job-level weight of the target post; splicing the semantic feature vector, the normalized click rate value and the poster job-level weight, and determining the splicing result as a state vector; and analyzing the state vector by utilizing a second deep Q network to determine whether the target post is an essence post, wherein the action space of the second deep Q network comprises marking the post as an essence post and not marking it as an essence post, and the reward function of the second deep Q network comprises a compliance score, an expert review pass rate and a false-mark penalty; the compliance score is a quantified result of risk scanning of the post content against a preset financial rule base and equals the difference between 1 and a target ratio, the target ratio being the ratio of the number of occurrences of risk keywords to the total number of keywords; the expert review pass rate is the ratio of the number of posts marked and confirmed as essence posts to the total number of marked posts; and the false-mark penalty is the penalty cost of a wrong decision and comprises a first cost and a second cost, the first cost being the post exposure multiplied by a preset unit attention cost, and the second cost being the post knowledge value coefficient multiplied by a preset decay factor.
Optionally, if the target post is determined to be an essence post by using the second deep Q network, after the processing result of the target node for the target post is obtained, the processing result is sent to the target object for rechecking, the rechecking result is obtained, and the rechecking result is forwarded to an information receiving area or a target communication address associated with the target post.
Optionally, the processing result at least comprises target content, a processing conclusion type, a user privacy level identifier and a result sensitivity label, and forwarding the processing result to an information receiving area or a target communication address associated with the target post comprises: forwarding the target content to the information receiving area when the processing conclusion type is a first type representing a public reply request; forwarding the target content to the target communication address when the processing conclusion type is a second type representing a personal transaction; forwarding the target content to the information receiving area when the user privacy level identifier is a first identifier representing public authority; forwarding the target content to the target communication address when the user privacy level identifier is a second identifier representing private authority; and forwarding the target content to the target communication address when the result sensitivity label is a first label representing private data.
According to another aspect of the application, an intelligent collaborative management platform based on artificial intelligence driving is further provided, comprising an acquisition unit, a construction unit, an analysis unit and a forwarding unit, wherein the acquisition unit is used for acquiring a target post to be processed in the communication platform and determining a multidimensional feature vector of the target post, the multidimensional feature vector comprising a topic classification feature, an emotion feature and a keyword coding feature; the construction unit is used for constructing a graph neural network, wherein nodes in the graph neural network comprise department nodes and employee nodes, edges in the graph neural network comprise inter-department collaboration edges, department-employee membership edges and inter-employee collaboration edges, department node attributes at least comprise the number of posts to be processed, the average post processing duration and a professional field code, and employee node attributes at least comprise a current state, a skill feature vector and a historical satisfaction score; the analysis unit is used for analyzing the multidimensional feature vector and the graph neural network by means of a first deep Q network to obtain a target node for processing the target post, wherein the state in the first deep Q network comprises the multidimensional feature vector of the current post and the topology structure and node attributes of the graph neural network, and the actions in the first deep Q network comprise selecting a target department node, selecting a target employee node and performing multi-hop decisions along graph edges; and the forwarding unit is used for forwarding the target post to the target node, obtaining a processing result of the target node for the target post, and forwarding the processing result to an information receiving area or a target communication address associated with the target post.
According to still another aspect of the present application, there is also provided a non-volatile storage medium, the storage medium including a stored program, wherein the program, when running, controls a device in which the storage medium is located to execute the above intelligent collaborative management method based on artificial intelligence driving.
According to still another aspect of the present application, there is also provided an electronic device including a memory and a processor, the processor being configured to run a program stored in the memory, wherein the program, when run, performs the above intelligent collaborative management method based on artificial intelligence driving.
According to still another aspect of the present application, there is also provided a computer program, wherein the computer program, when executed by a processor, implements the above intelligent collaborative management method based on artificial intelligence driving.
According to yet another aspect of the present application, there is also provided a computer program product comprising a non-volatile computer readable storage medium, wherein the non-volatile computer readable storage medium stores a computer program which, when executed by a processor, implements the above intelligent collaborative management method based on artificial intelligence driving.
The method comprises the steps of: acquiring a target post to be processed in an exchange platform, and determining a multidimensional feature vector of the target post, wherein the multidimensional feature vector comprises a theme classification feature, an emotion feature and a keyword coding feature; constructing a graph neural network, wherein nodes in the graph neural network comprise department nodes and staff nodes, edges in the graph neural network comprise inter-department collaboration edges, department-staff membership edges and inter-staff collaboration edges, department node attributes at least comprise the number of posts to be processed, the average post processing duration and a professional field code, and staff node attributes at least comprise a current state, a skill feature vector and a historical satisfaction score; analyzing the multidimensional feature vector and the graph neural network by utilizing a first deep Q network to obtain a target node in the graph neural network for processing the target post, wherein the state in the first deep Q network comprises the multidimensional feature vector of the current post and the topological structure and node attributes of the graph neural network, and the actions in the first deep Q network comprise selecting a target department node, selecting a target staff node and performing multi-hop decisions along graph edges; and forwarding the target post to the target node, obtaining the processing result of the target node for the target post, and forwarding the processing result to an information receiving area or a target communication address associated with the target post. In this way, posts in the exchange platform are efficiently and accurately distributed to the objects best suited to process them, which improves post management efficiency and solves the technical problem of low post management efficiency caused by the inability of related technologies to distribute posts efficiently and accurately.
Detailed Description
In order that those skilled in the art will better understand the present application, the technical solution in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings. It is apparent that the described embodiments are only some embodiments of the present application, not all embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present application without making any inventive effort shall fall within the scope of the present application.
It should be noted that the terms "first," "second," and the like in the description and the claims of the present application and the above figures are used for distinguishing between similar objects and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used may be interchanged where appropriate such that the embodiments of the application described herein may be implemented in sequences other than those illustrated or otherwise described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
According to an embodiment of the present application, there is provided a method embodiment of an intelligent collaborative management method based on artificial intelligence driving, it should be noted that the steps illustrated in the flowchart of the drawings may be performed in a computer system such as a set of computer executable instructions, and that although a logical order is illustrated in the flowchart, in some cases, the steps illustrated or described may be performed in an order different from that herein.
FIG. 1 is a flow chart of an intelligent collaborative management method based on artificial intelligence driving according to an embodiment of the application, as shown in FIG. 1, the method comprises the following steps:
Step S102, obtaining a target post to be processed in the communication platform, and determining a multidimensional feature vector of the target post, wherein the multidimensional feature vector comprises a theme classification feature, an emotion feature and a keyword coding feature.
The communication platform comprises, but is not limited to, an employee forum, an instant messaging platform and a document collaboration platform.
It will be appreciated that in order to more accurately understand and process the mass content of posts in an exchange platform, particularly to identify information that contains potential task value or that requires further action, each post that requires significant attention, i.e., the target post, needs to be converted into a structured mathematical representation. This mathematical representation is known as a "multi-dimensional feature vector," which is essentially a list or array of numbers, where each location (dimension) represents the quantized feature value of the post in a particular aspect.
Specifically, the multi-dimensional feature vector includes three core components. Topic classification features refer to the business domain or content category mainly discussed in the post, such as "credit product," "internal flow optimization," "IT system problem," or "customer service feedback"; these are automatically judged by natural language processing techniques (e.g., topic models or text classifiers), and the topic category to which the post belongs is represented by a specific numerical code. Emotion features reflect the emotional tendency or attitude strength expressed by the author of the post, and may be a numerical value calculated by an emotion analysis model, such as a score ranging from -1 (extremely negative) to +1 (extremely positive), or a finer-grained classification with codes corresponding to positive, neutral and negative emotion. Keyword encoding features involve identifying and quantifying core terms or phrases appearing in the post (e.g., "slow approval", "system lag", "suggest adding function X"); word embedding techniques may be used to generate a sequence of values that captures the key concepts mentioned in the post and their importance.
And calculating and combining the characteristic values of the three dimensions to finally form a multidimensional characteristic vector. This vector becomes a "digital fingerprint" of the post.
Step S104, constructing a graph neural network, wherein nodes in the graph neural network comprise department nodes and employee nodes, edges in the graph neural network comprise department cooperation edges, department-employee membership edges and employee cooperation edges, department node attributes at least comprise post numbers to be processed, post average processing time length and professional field codes, and employee node attributes at least comprise current states, skill feature vectors and historical satisfaction scores.
The graph neural network is composed of nodes (or vertices) representing entities and edges representing connection relationships between the nodes. The nodes and the edges can have respective feature vectors describing attribute information, and the whole graph can also carry global information. By iteratively passing and aggregating the feature information of nodes and their neighbors and learning representations of nodes and of the graph, the graph neural network can capture the dependency relationships and topological structure features between nodes in the graph. A skill feature vector is a vector used to represent an employee's skills: each skill dimension (e.g., type of skill, proficiency, application scenario) is quantified as a numerical value, and the skills are fully described by the vector of these values.
In the above step S104, the nodes in the graph neural network represent basic constituent units of the enterprise and are mainly divided into two types, namely department nodes and employee nodes. Department nodes, such as credit, risk control and IT support, represent functional units in the enterprise, and employee nodes represent specific personnel. The connection relations between the nodes are embodied by edges: inter-department collaboration edges describe formal or informal cooperation relationships generated between different departments by projects or processes (such as cooperation of the credit department and the risk control department on an approval process); department-employee membership edges clearly identify the subordinate relationship between employees and the departments to which they belong; and inter-employee collaboration edges reflect the actual work network formed between individual employees based on historical project cooperation, knowledge sharing or social interaction (for example, employees of two different departments frequently cooperating to solve customer problems).
Further, in order for the graph neural network to learn and perform automatic task allocation efficiently, each node is given attribute information describing its state and capabilities. The attributes of a department node at least comprise: the number of posts to be processed, which quantifies the backlog of task requests the department has currently received from the forum and is a key index for measuring the department's load; the average post processing time; and the professional field code, which describes, through a vectorized representation, the department's core responsibilities and service types (such as credit approval, system operation and maintenance, and customer complaints), making it convenient to match tasks to the most suitable department. The attributes of an employee node at least comprise: the current state (such as idle, busy or on vacation); the skill feature vector, a multidimensional vector that encodes the various skills mastered by the employee and their proficiency (such as data analysis - advanced, client communication - intermediate, Python programming - proficient), which is the core basis for accurately matching the skills required by a task; and the historical satisfaction score, calculated based on the quality, timeliness or colleague evaluations of previously completed tasks, which measures the employee's reliability and work performance level.
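As a concrete illustration of the graph described above, the following minimal sketch builds such an enterprise graph with the networkx library; all node names, attribute values and vector lengths are hypothetical examples rather than values prescribed by this application.

```python
import networkx as nx

# Minimal illustrative enterprise graph; names and values are hypothetical.
g = nx.Graph()

# Department nodes: pending post count, average processing time (hours), domain code.
g.add_node("dept_credit", kind="department", pending_posts=12,
           avg_processing_hours=6.5, domain_code=[0.9, 0.1, 0.0])
g.add_node("dept_it", kind="department", pending_posts=4,
           avg_processing_hours=2.0, domain_code=[0.0, 0.2, 0.8])

# Employee nodes: current state, skill feature vector, historical satisfaction score.
g.add_node("emp_alice", kind="employee", state="idle",
           skills=[0.8, 0.3, 0.1], satisfaction=4.6)
g.add_node("emp_bob", kind="employee", state="busy",
           skills=[0.2, 0.1, 0.9], satisfaction=4.2)

# Edges: inter-department collaboration, department-employee membership,
# inter-employee collaboration.
g.add_edge("dept_credit", "dept_it", kind="dept_collab")
g.add_edge("dept_credit", "emp_alice", kind="membership")
g.add_edge("dept_it", "emp_bob", kind="membership")
g.add_edge("emp_alice", "emp_bob", kind="emp_collab")
```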
And S106, analyzing the multidimensional feature vector and the graph neural network by using a first depth Q network to obtain a target node for processing the target post in the graph neural network, wherein the state in the first depth Q network comprises the multidimensional feature vector of the current post, the topological structure and the node attribute of the graph neural network, and the actions in the first depth Q network comprise selecting a target department node, selecting a target employee node and executing multi-hop decision along the graph edge.
Step S106 combines the multidimensional feature vector representing the core information of the post to be processed from step S102 with the graph neural network describing the internal structure, capabilities and relationships of the enterprise from step S104, and automatically and intelligently decides, through reinforcement learning (specifically, a deep Q network, DQN), which department or employee should be responsible for processing the task requirement contained in the post.
In step S106, the state of the first deep Q network includes a multidimensional feature vector of the current target post, which provides core information of task requirements, such as subject of discussion (e.g., "credit approval process optimization"), expressed emotional tendency (e.g., "negative, indicating that there is a serious problem with the process"), and key appeal vocabulary (e.g., "simplified", "accelerated", "automated"). The state also comprises the current topological structure of the whole graph neural network and the real-time attribute of all nodes in the graph neural network, wherein the current topological structure of the graph neural network reflects the cooperative relationship among departments, the membership of the departments and staff and the possible cooperative links among staff, and the real-time attribute of all nodes such as the current post count (load) to be processed of each department, the average processing time (efficiency), the professional field (capability matching degree), the current state (whether idle) of the staff, the skill vector (whether the skill required by the processing task is provided) and the historical satisfaction score (reliability and performance). It will be appreciated that the above states fully describe what the task is and what the departments and employees can do at present.
The actions mainly include three categories: selecting a target department node (directly assigning the task to a department for processing), selecting a target employee node (directly assigning the task to a particular employee for processing), and performing multi-hop decisions along the graph edges. Multi-hop decisions are particularly important for modeling the process of posts (i.e., tasks in reinforcement learning) flowing through the graph neural network. For example: "first step: select department A -> second step: move to department B along the collaboration edge of department A and department B -> third step: select employee C within department B". Or: "first step: select employee X -> second step: transfer to employee Y along the collaboration edge between employee X and employee Y". This action design enables flexible handling of situations where cross-department collaboration is required or a specific specialist is sought, greatly enhancing the rationality and adaptability of the decision.
Notably, multi-hop decisions allow the agent to make continuous, path-like movements on the graph structure made up of department and employee nodes and the various edges connecting them (inter-department collaboration edges, department-employee membership edges, inter-employee collaboration edges), ultimately reaching one or more target nodes. Specifically, each "hop" represents an action that the DQN selects in the current state; it is not the final allocation, but rather instructs the system to move along a particular edge in the graph to the next adjacent node. For example, a first hop starts from a starting point (e.g., a department node A initially associated with the subject of the post) and moves along an inter-department collaboration edge to another department node B, simulating the task being handed over from department A to department B because cross-department collaboration is needed. A second hop, after reaching department node B, moves from department node B along a department-employee membership edge to an employee node C affiliated with that department, indicating that the task is assigned to a specific employee C within department B. A third hop may then be taken at employee node C: the DQN again decides to move along an inter-employee collaboration edge to another employee node D (even though D may belong to another department). This simulates employee C judging that the task cannot be completed alone, or that employee D is a more appropriate expert, and issuing a task transfer or collaboration request to D.
This "jump" process described above may continue until the DQN considers the most appropriate final processing node (target department or target employee) to be found, or a preset maximum number of hops limit is reached. The choice of each step "jump" (which edge to walk, to which neighboring node) depends on the DQN's analysis and evaluation of the current combined state (post feature vector + real-time structure/attributes of the whole graph) with the goal of maximizing long-term jackpot.
Step S108, forwarding the target posts to the target nodes, obtaining processing results of the target nodes aiming at the target posts, and forwarding the processing results to an information receiving area or a target communication address associated with the target posts.
According to the method, a target post to be processed in an exchange platform is obtained, and a multidimensional feature vector of the target post is determined, wherein the multidimensional feature vector comprises a theme classification feature, an emotion feature and a keyword coding feature; a graph neural network is constructed, wherein nodes in the graph neural network comprise department nodes and staff nodes, edges in the graph neural network comprise inter-department collaboration edges, department-staff membership edges and inter-staff collaboration edges, department node attributes at least comprise the number of posts to be processed, the average post processing duration and a professional field code, and staff node attributes at least comprise a current state, a skill feature vector and a historical satisfaction score; the multidimensional feature vector and the graph neural network are analyzed by a first deep Q network to obtain a target node in the graph neural network for processing the target post, wherein the state in the first deep Q network comprises the multidimensional feature vector of the current post and the topological structure and node attributes of the graph neural network, and the actions in the first deep Q network comprise selecting a target department node, selecting a target staff node and performing multi-hop decisions along graph edges; the target post is forwarded to the target node, the processing result of the target node for the target post is obtained, and the processing result is forwarded to an information receiving area or a target communication address associated with the target post. In this way, posts in the communication platform are accurately distributed to the objects best suited to process them and the processing results are returned to the associated receiving area or communication address, so that the efficiency of post management is improved.
The steps shown in fig. 1 are exemplarily illustrated and explained below.
According to some alternative embodiments of the present application, the reward function of the first deep Q network in step S106 is determined by determining a first sub-reward function according to a matching score, wherein the matching score is used for representing a semantic matching degree between the post content and the node professional field, the matching score is obtained by calculating a similarity between the post keyword and the node preset label through a natural language processing technology, determining a second sub-reward function according to a time delay, wherein the time delay is used for representing a time difference from the post creation time stamp to the successful allocation time stamp, determining a third sub-reward function according to a load index, wherein the load index is used for representing a ratio of a current task queue length of the node to a maximum processing capacity of the node, determining a fourth sub-reward function according to a satisfaction score, wherein the satisfaction score is used for representing a feedback rating provided by a user after the post processing is completed, and determining the reward function according to the first sub-reward function, the second sub-reward function, the third sub-reward function, and the fourth sub-reward function.
In this embodiment, the first sub-reward function is based on a matching score. The matching score is used to accurately measure the semantic matching degree between the task demand content contained in the target post (represented by the keyword codes, topic classifications and the like in the multidimensional feature vector) and the professional capabilities of a potential processing node (department or employee). Specifically, natural language processing techniques (e.g., word vector similarity calculation, topic model matching analysis) may be utilized to calculate a similarity score between the keywords or core semantics extracted from the post and the professional domain labels preset for the node (e.g., the "professional field code" of a department or the core capability labels in an employee's "skill feature vector"). The higher the matching score, the more specialized capability the node has for handling the task, and the greater the potential for high task completion quality and efficiency.
The time delay refers to the time difference between the timestamp at which the forum post was created and the timestamp at which the task demand represented by the post was successfully assigned to the target node. This index directly reflects the efficiency of the task allocation link; too long a delay leads to lagging responses, affecting business operations and the employee experience. The load index is used to evaluate how busy a target node (department or employee) is at the time of task allocation and its resource availability. The load index is defined as the ratio of the node's current pending task queue length (e.g., the department's "number of posts to be processed" or the number of tasks currently undertaken by the employee) to the node's maximum processing capacity. The higher the ratio, the more overloaded the node, and assigning new tasks at this point may lead to processing delays, quality degradation, or even system breakdown. The satisfaction score is the final quality feedback after task closure, typically derived from the feedback rating (e.g., a five-star score or a satisfied/dissatisfied label) provided by the post initiator or related user after task processing is complete. The satisfaction score directly reflects whether the result of task execution really solves the problem and meets the requirement.
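The four sub-reward signals can be pictured with the following sketch. The text fixes what each signal measures but not its exact functional form, so the cosine similarity for matching and the specific scalings below are assumptions.

```python
import numpy as np

def sub_rewards(post_kw_vec, node_label_vec, created_ts, assigned_ts,
                queue_len, max_capacity, feedback_stars):
    """Compute the four sub-reward signals described above (illustrative forms)."""
    # r1: semantic match between post keywords and the node's domain labels.
    r_match = float(np.dot(post_kw_vec, node_label_vec) /
                    (np.linalg.norm(post_kw_vec) * np.linalg.norm(node_label_vec) + 1e-9))
    # r2: penalize allocation delay (seconds from creation to successful assignment).
    r_delay = 1.0 / (1.0 + (assigned_ts - created_ts) / 3600.0)
    # r3: penalize load = current queue length / maximum processing capacity.
    r_load = 1.0 - min(queue_len / max_capacity, 1.0)
    # r4: user feedback rating after processing, scaled to [0, 1].
    r_satisfaction = feedback_stars / 5.0
    return r_match, r_delay, r_load, r_satisfaction
```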
Determining the reward function according to the first sub-reward function, the second sub-reward function, the third sub-reward function and the fourth sub-reward function can be achieved by: obtaining an initial weight coefficient for each sub-reward function; continuously monitoring a post allocation success rate index, an average processing delay index and a user satisfaction mean index; increasing the first initial weight coefficient corresponding to the first sub-reward function and the fourth initial weight coefficient corresponding to the fourth sub-reward function when a decrease in the post allocation success rate index is detected; increasing the second initial weight coefficient corresponding to the second sub-reward function and the third initial weight coefficient corresponding to the third sub-reward function when an increase in the average processing delay index is detected; increasing the fourth initial weight coefficient when a decrease in the user satisfaction mean index is detected; performing a normalization operation after each weight coefficient update so that the sum of the updated weight coefficients is 1, thereby obtaining a first weight coefficient, a second weight coefficient, a third weight coefficient and a fourth weight coefficient; and determining the reward function according to the four sub-reward functions and the first, second, third and fourth weight coefficients.
In this embodiment, first, an initial weight coefficient is set for each sub-reward function (e.g., based on business experience or preliminary experiments: a matching coefficient w1, a delay coefficient w2, a load coefficient w3 and a satisfaction coefficient w4). Several core business operation indexes are continuously monitored: the post allocation success rate index (the proportion of posts successfully converted into tasks and allocated), the average processing delay index (the total average time from post creation to task completion), and the user satisfaction mean index (the average user feedback on task processing results).
Then, according to the change trend of the monitoring indexes, the corresponding weight coefficients are dynamically adjusted:
1. When a decrease in the post allocation success rate index is detected, it indicates that the task allocation link is problematic: either the task requirement is not matched with the capability of the processor (low matching degree), or the processing result does not satisfy users (reducing their willingness to convert subsequent posts into tasks). Therefore, the weight coefficient w1 corresponding to the first sub-reward function (matching score) and the weight coefficient w4 corresponding to the fourth sub-reward function (satisfaction score) need to be increased. The purpose is to guide the DQN, when deciding, to pay more attention to selecting nodes with high professional-ability matching and to favor allocation paths that ultimately yield high satisfaction, thereby improving the probability that tasks are successfully received and efficiently completed.
2. When an increase in the average processing delay index is detected, it indicates that the overall period from presentation to completion of tasks has become longer and efficiency has dropped. The delay may result from the allocation process itself taking too long (a large time delay) or from tasks being allocated to nodes that are already too busy (high load), causing tasks to back up in queues. Therefore, the weight coefficient w2 corresponding to the second sub-reward function (time delay) and the weight coefficient w3 corresponding to the third sub-reward function (load index) need to be increased, causing the DQN to preferentially select nodes that can respond quickly to allocation requests (low time delay) and are currently lightly loaded (low load index), so as to shorten the overall processing cycle.
3. When a decrease in the user satisfaction mean index is detected, it reflects that the final quality of task processing results is poor and users are dissatisfied. Although there may be many reasons for reduced satisfaction, increasing the weight of the fourth sub-reward is the most direct countermeasure. Therefore, the weight coefficient w4 corresponding to the fourth sub-reward function (satisfaction score) needs to be increased.
Finally, after one or more weight coefficients are adjusted according to the above rules, a normalization operation is performed, i.e., the sum of the four updated weight coefficients is guaranteed to be strictly equal to 1. The reward function of the first deep Q network is then obtained by multiplying each of the four dynamically adjusted weight coefficients by the corresponding sub-reward function value and summing the products, i.e., R = w1*r1 + w2*r2 + w3*r3 + w4*r4.
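A minimal sketch of the weight adjustment, normalization and weighted summation described above follows; the additive step size is an assumption, since the text only specifies which weights increase for which indicator change. The `sub_r` tuple can be the output of the earlier `sub_rewards` sketch.

```python
def update_weights(w, alloc_success_down, delay_up, satisfaction_down, step=0.05):
    """Adjust (w1, w2, w3, w4) per the monitored indicators, then normalize to sum to 1."""
    w = list(w)
    if alloc_success_down:    # allocation success rate fell
        w[0] += step          # w1: matching
        w[3] += step          # w4: satisfaction
    if delay_up:              # average processing delay rose
        w[1] += step          # w2: time delay
        w[2] += step          # w3: load index
    if satisfaction_down:     # user satisfaction mean fell
        w[3] += step          # w4: satisfaction
    total = sum(w)
    return [x / total for x in w]   # normalization: weights sum to 1

def reward(sub_r, w):
    """R = w1*r1 + w2*r2 + w3*r3 + w4*r4."""
    return sum(wi * ri for wi, ri in zip(w, sub_r))
```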
Within this dynamically constructed reward function, the relative importance (weight coefficients) of the dimensions varies intelligently: matching and satisfaction are emphasized more when the allocation success rate is low, fast response and load balancing are emphasized more when processing delay is high, and final satisfaction is emphasized most when users are dissatisfied. This closed-loop feedback mechanism enables the DQN to continuously learn and optimize its allocation strategy to cope with changing business demands, ultimately driving the whole intelligent task allocation system to evolve toward optimal comprehensive performance.
According to other optional embodiments of the application, the multidimensional feature vector of the target post is determined by: extracting a semantic embedding vector of the target post by using a pre-trained language model; calculating, through a topic attention layer, the probability distribution over different topic classifications and normalizing the distribution to obtain a first feature component; calculating a vocabulary strength score of the target post based on a financial emotion dictionary, wherein the financial emotion dictionary comprises a plurality of preset words representing positive, neutral and negative emotions, and the vocabulary strength score quantifies the strength of the emotional tendency explicitly expressed in the target post; determining an emotion probability of the target post by using a deep learning model, wherein the emotion probability represents the probability that the target post belongs to positive, neutral or negative emotion; dynamically weighting and fusing the vocabulary strength score and the emotion probability according to a financial term density to obtain a second feature component, wherein the financial term density is the ratio of the number of financial terms appearing in the target post to the total number of words of the target post; removing general stop words and high-frequency financial function words from the target post, retaining professional terms whose term frequency-inverse document frequency values exceed a preset threshold, jointly screening core words by using a term co-occurrence graph and a centrality measure, and determining a third feature component based on the screened core words; and splicing the first feature component, the second feature component and the third feature component, and performing dimension reduction on the splicing result through an orthogonal-constraint linear transformation layer to obtain the multidimensional feature vector.
In this embodiment, first, the original text of the target post is encoded by using a pre-training language model (such as BERT, roBERTa, etc.), so as to generate a semantic embedded vector containing overall semantic information. Next, to capture the core business category of post discussion, a topic attention layer is introduced. The layer receives the semantic embedded vector and calculates the probability distribution that the post belongs to a preset different topic classification (e.g. "credit product", "risk control", "operation and maintenance", "customer service", "internal flow", etc.). After normalization processing (ensuring that the sum of all probabilities is 1), the probability distribution forms a first characteristic component representing the main discussion direction of the post, and the theme tendency of the post content is clearly quantized.
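A minimal sketch of such a topic attention layer is given below in PyTorch; the encoder output size, the number of topics and the dot-product attention form are assumptions, and the topic names are only examples.

```python
import torch
import torch.nn as nn

class TopicAttention(nn.Module):
    """Maps a semantic embedding to a normalized topic distribution (first component)."""

    def __init__(self, embed_dim=768, num_topics=5):
        super().__init__()
        # One learnable query vector per preset topic class.
        self.topic_queries = nn.Parameter(torch.randn(num_topics, embed_dim))

    def forward(self, semantic_embedding):                   # shape: (embed_dim,)
        scores = self.topic_queries @ semantic_embedding      # (num_topics,)
        return torch.softmax(scores, dim=0)                   # probabilities sum to 1

# Illustrative topics and input; a real encoder (e.g. BERT) would supply the embedding.
topics = ["credit product", "risk control", "IT system", "customer service", "internal flow"]
first_component = TopicAttention()(torch.randn(768))
```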
Further, in emotion analysis, a dual-track strategy is adopted to improve accuracy in the financial domain context. On one hand, the vocabulary strength score of the post is calculated based on a specially constructed financial emotion dictionary (containing a large number of preset, labeled positive, neutral or negative words related to the financial domain). The score directly quantifies the strength of the emotional tendency explicitly expressed in the text by counting and analyzing the emotion words that appear in the post and their strength (possibly considering word frequency, position, modifiers, etc.). On the other hand, a deep learning emotion analysis model is used to predict the probability distribution that the post as a whole belongs to positive, neutral or negative emotion. Taking a recurrent neural network as an example, by introducing a recurrent structure into the network, information can be passed along the sequence so that sequential information in the text is captured. For a post, the model may process word by word (or character by character), with each word transformed by the embedding layer into a vector of fixed dimension that represents the word's semantic information. The recurrent neural network then lets the information of preceding words influence the processing of following words, so that the context of the text can be understood. The model gradually extracts higher-level text features through a multi-layer neural network structure. In the final stage of the model there is an output layer, which may be a fully connected layer whose output nodes correspond in number to the emotion categories, such as the three categories positive, neutral and negative, with each node outputting a probability value. These probability values are processed through a softmax function so that their sum is 1, forming a probability distribution. The probability distribution intuitively represents the probability that a post belongs to each emotion category; for example, a post may have a probability of 0.7 of belonging to positive emotion, 0.2 of belonging to neutral emotion and 0.1 of belonging to negative emotion, so the emotional tendency of the whole post can be judged from these probabilities while also reflecting the degree of uncertainty of the emotion classification.
Notably, to combine the advantages of both methods and to accommodate the nature of financial text, the financial term density (i.e., the ratio of the number of financial terms appearing in the post to the total number of words of the post) is introduced as a dynamic weighting factor. When the financial term density is higher than a threshold, the text is more domain-specific and the fusion relies more on the vocabulary strength score obtained from the financial emotion dictionary; when the density is lower than the threshold, the fusion leans more toward the emotion probability predicted by the deep learning emotion analysis model. The two results are dynamically weighted and fused according to the term density, finally generating a second feature component reflecting the emotional tendency and strength of the post.
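The dynamic weighted fusion can be pictured with the following sketch; the linear interpolation scheme, the density threshold and the way the model's probability distribution is collapsed into a signed score are assumptions.

```python
def fuse_emotion(vocab_strength, emotion_probs, term_density, threshold=0.15):
    """Dynamically weighted fusion of dictionary and model signals (second component).

    `vocab_strength` is the dictionary score scaled to [-1, 1]; `emotion_probs`
    is (p_positive, p_neutral, p_negative) from the deep learning model.
    """
    # Collapse the model distribution to a signed score for fusion.
    model_score = emotion_probs[0] - emotion_probs[2]
    # More financial terms -> trust the domain dictionary more.
    alpha = min(term_density / threshold, 1.0) if threshold > 0 else 1.0
    fused = alpha * vocab_strength + (1.0 - alpha) * model_score
    return [fused, *emotion_probs]   # second feature component

# Illustrative call: a negative post with high financial term density.
second_component = fuse_emotion(-0.4, (0.1, 0.2, 0.7), term_density=0.22)
```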
Further, to identify the core complaints and professional focus in the post, the emphasis is on extracting key terminology. The post text is preprocessed to remove general stop words and high-frequency financial function words (such as certain high-frequency but low-information financial connective words), and the professional terms whose term frequency-inverse document frequency values exceed a preset threshold are retained. These screened terms constitute a candidate keyword set. To further identify the most core and representative words among them, a term co-occurrence graph is constructed. The nodes in the term co-occurrence graph are the selected professional terms, and edges between nodes are established according to the co-occurrence frequency or semantic association degree (which can be calculated through word vectors) of the terms within the post's context window. A centrality metric (e.g., degree centrality, closeness centrality or eigenvector centrality) is then calculated for each term node in the graph. Highly central terms are considered to be at core positions in the post's semantic network and more representative of the post's core issues. These filtered core words are used to construct a third feature component, which may be a vector obtained by averaging or weighted averaging (with weights determined by centrality) their word vectors.
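A minimal sketch of this keyword screening step follows, using networkx for the co-occurrence graph. The TF-IDF scores are assumed to be computed elsewhere, and the sentence-level co-occurrence window, threshold and top-k cutoff are illustrative choices; degree centrality is used here, though closeness or eigenvector centrality would fit the description equally well.

```python
import itertools
import networkx as nx

def core_terms(sentences, tfidf, stop_words, tfidf_threshold=0.2, top_k=5):
    """Screen core terms for the third feature component.

    `sentences` is a list of sentence strings from the post and
    `tfidf` maps term -> TF-IDF score (computed elsewhere).
    """
    candidates = {t for t, s in tfidf.items()
                  if s >= tfidf_threshold and t not in stop_words}
    cooc = nx.Graph()
    cooc.add_nodes_from(candidates)
    for sent in sentences:                       # co-occurrence within one sentence
        present = [t for t in candidates if t in sent]
        for a, b in itertools.combinations(present, 2):
            w = cooc[a][b]["weight"] + 1 if cooc.has_edge(a, b) else 1
            cooc.add_edge(a, b, weight=w)
    centrality = nx.degree_centrality(cooc)      # highly central terms are kept
    return sorted(candidates, key=lambda t: centrality.get(t, 0.0), reverse=True)[:top_k]
```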
Finally, the first feature component representing the topic tendency, the second feature component representing the emotion intensity and the third feature component containing the core professional appeal are spliced to form a higher-dimensional intermediate vector. In order to compress this high-dimensional intermediate vector into a final, practical, information-dense and dimensionally controllable multidimensional feature vector while reducing redundancy between features, a linear transformation layer with an orthogonal constraint (e.g., a linear layer whose weight matrix is constrained to be approximately orthogonal) is used for dimension reduction. The orthogonal constraint helps preserve as much of the original feature-space information as possible and reduces correlation among features during dimension reduction, finally outputting a low-dimensional, high-information-retention multidimensional feature vector that jointly encodes the post's information along the three key dimensions of topic, emotion and core professional appeal.
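A minimal sketch of the splice-and-reduce step follows, using PyTorch's orthogonal parametrization (available in PyTorch 1.10 and later) to keep the projection matrix approximately orthogonal; the input and output dimensions are illustrative and must match the total size of the concatenated components.

```python
import torch
import torch.nn as nn
from torch.nn.utils import parametrizations

class OrthogonalReducer(nn.Module):
    """Dimension reduction with an (approximately) orthogonal weight matrix."""

    def __init__(self, in_dim=780, out_dim=64):
        super().__init__()
        self.proj = nn.Linear(in_dim, out_dim, bias=False)
        # Constrain the weight matrix to be (semi-)orthogonal.
        parametrizations.orthogonal(self.proj, name="weight")

    def forward(self, first, second, third):
        concat = torch.cat([first, second, third], dim=-1)  # splice the three components
        return self.proj(concat)                            # multidimensional feature vector
```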
After the target post to be processed in the communication platform is obtained, the method further comprises: encoding the target post by utilizing a pre-trained language model to obtain a semantic feature vector; determining a normalized click rate value and a poster job-level weight of the target post; splicing the semantic feature vector, the normalized click rate value and the poster job-level weight, and determining the splicing result as a state vector; and analyzing the state vector by utilizing a second deep Q network to determine whether the target post is an essence post, wherein the action space of the second deep Q network comprises marking the post as an essence post and not marking it as an essence post, and the reward function of the second deep Q network comprises a compliance score, an expert review pass rate and a false-mark penalty; the compliance score is a quantified result of risk scanning of the post content against a preset financial rule base and equals the difference between 1 and a target ratio, the target ratio being the ratio of the number of occurrences of risk keywords to the total number of keywords; the expert review pass rate is the ratio of the number of posts marked and confirmed as essence posts to the total number of marked posts; and the false-mark penalty is the penalty cost of a wrong decision and comprises a first cost and a second cost, the first cost being the post exposure multiplied by a preset unit attention cost, and the second cost being the post knowledge value coefficient multiplied by a preset decay factor.
In this embodiment, firstly, a pre-trained language model is utilized to perform deep semantic encoding of the target post, generating a semantic feature vector containing context information; this vector captures the core value of the post in terms of business insight, knowledge depth and similar aspects. Meanwhile, two key auxiliary indexes are determined: a normalized click rate value (the historical click rate of the post converted into a relative heat index in the [0,1] interval, eliminating the influence of traffic differences between boards), and a poster job-level weight (a preset weight coefficient for the poster's job level in the enterprise architecture, e.g., department director = 0.9, ordinary employee = 0.5, reflecting differences in authority). The three types of information are fused into a comprehensive state vector through a splicing operation, forming a complete input representing the value potential of the post.
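A minimal sketch of assembling this state vector follows; the min-max style click normalization and the lookup-table keys are assumptions (the 0.9 and 0.5 weights come from the example in the text).

```python
import numpy as np

# Job-level weights from the example in the text; other levels are hypothetical.
JOB_LEVEL_WEIGHTS = {"department_director": 0.9, "ordinary_employee": 0.5}

def build_state_vector(semantic_vec, clicks, max_clicks, poster_level):
    """Splice semantic features, normalized click rate and job-level weight."""
    click_norm = clicks / max_clicks if max_clicks else 0.0        # relative heat in [0, 1]
    level_weight = JOB_LEVEL_WEIGHTS.get(poster_level, 0.5)        # default is an assumption
    return np.concatenate([semantic_vec, [click_norm, level_weight]])

# Illustrative call with a random stand-in for the language-model encoding.
state = build_state_vector(np.random.rand(768), clicks=320, max_clicks=1000,
                           poster_level="department_director")
```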
The state vector is then input into the second deep Q network for decision analysis. The action space of the network comprises the binary choice of marking the post as an essence post or not. The reward function adopts a triple-constraint mechanism to ensure decision quality. The compliance score is calculated by scanning against a preset financial rule base, with the specific formula 1 - (number of occurrences of risk keywords / total number of keywords); a score approaching 1 indicates full compliance, and a negative reward is triggered when the score falls below a threshold. The expert review pass rate serves as a supervised feedback signal and is calculated as the proportion of marked posts confirmed as essence posts by domain experts to the total number marked (for example, if 80 of 100 marked posts are approved by experts, the reward is 0.8). The false-mark penalty quantifies the cost of a wrong decision and comprises two key cost items: the first cost is the post exposure multiplied by a preset unit attention cost (the unit cost being converted into monetary value according to the man-hours employees lose reading a typical document), used to penalize the attention resources occupied by low-quality posts; the second cost is the post knowledge value coefficient multiplied by a preset decay factor (the knowledge value coefficient being estimated retrospectively from indicators such as the number of knowledge base references, and the decay factor growing exponentially with the time taken to discover the false mark), used to penalize the implicit loss of missing high-value content.
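The reward terms for the second deep Q network can be pictured with the following sketch. The individual formulas follow the text, while the unweighted way the three terms are combined and the zero reward for unmarked posts are assumptions.

```python
def essence_reward(marked, risk_hits, total_keywords, expert_confirmed, total_marked,
                   exposure, unit_attention_cost, knowledge_coeff, decay_factor,
                   wrongly_marked):
    """Illustrative reward for the essence-marking decision."""
    # Compliance: 1 - (risk keyword occurrences / total keywords).
    compliance = 1.0 - (risk_hits / total_keywords if total_keywords else 0.0)
    # Expert review pass rate: confirmed essence posts / total marked posts.
    review_pass_rate = (expert_confirmed / total_marked) if total_marked else 0.0
    penalty = 0.0
    if wrongly_marked:
        penalty = (exposure * unit_attention_cost        # attention wasted on a low-quality post
                   + knowledge_coeff * decay_factor)     # implicit loss around high-value content
    return compliance + review_pass_rate - penalty if marked else 0.0
```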
Finally, intelligent decision-making is realized through dynamically balanced reward factors: when the compliance score is low, the second deep Q network avoids risky content; when the expert review pass rate decreases, the second deep Q network tightens its marking standard; and the double-cost design of the false-mark penalty ensures a balance between reducing noise interference and preventing the loss of knowledge assets. Through continuous training, the second deep Q network learns the optimal strategy, namely triggering an essence mark only when the post's comprehensive state vector predicts a positive net benefit (expected reward > 0), thereby accurately capturing knowledge value.
Further, if the target post is determined to be an essence post by using the second depth Q network, after the processing result of the target node for the target post is obtained, the processing result is sent to the target object for rechecking, the rechecking result is obtained, and the rechecking result is forwarded to an information receiving area or a target communication address associated with the target post.
As some optional embodiments of the application, the processing result at least comprises target content, processing conclusion type, user privacy level identification and result sensitivity label.
Further, forwarding the processing result to the information receiving area or the target communication address associated with the target post may be achieved by: forwarding the target content to the information receiving area if the processing conclusion type is a first type representing a public reply request; forwarding the target content to the target communication address if the processing conclusion type is a second type representing a personal transaction; forwarding the target content to the information receiving area if the user privacy level identification is a first identification representing public rights; forwarding the target content to the target communication address if the user privacy level identification is a second identification representing private rights; and forwarding the target content to the target communication address if the result sensitivity label is a first label representing private data.
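A minimal routing sketch for this embodiment follows; the field names and return labels are hypothetical, and the precedence given to the sensitivity and privacy checks over the conclusion type is an assumption, since the text does not specify an order.

```python
def route_result(result):
    """Route the processing result per conclusion type, privacy level and sensitivity.

    `result` is a dict with illustrative keys: "sensitivity_label", "privacy_level",
    "conclusion_type".
    """
    if result["sensitivity_label"] == "private_data":        # first sensitivity label
        return "target_communication_address"
    if result["privacy_level"] == "private_rights":          # second privacy identifier
        return "target_communication_address"
    if result["conclusion_type"] == "personal_transaction":  # second conclusion type
        return "target_communication_address"
    if result["conclusion_type"] == "public_reply":          # first conclusion type
        return "information_receiving_area"
    if result["privacy_level"] == "public_rights":           # first privacy identifier
        return "information_receiving_area"
    return "information_receiving_area"                      # assumed default
```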
FIG. 2 is a block diagram of an intelligent collaborative management platform based on artificial intelligence drivers, according to an embodiment of the application, as shown in FIG. 2, the platform comprising:
the obtaining unit 22 is configured to obtain a target post to be processed in the communication platform, and determine a multidimensional feature vector of the target post, where the multidimensional feature vector includes a topic classification feature, an emotion feature, and a keyword coding feature.
The construction unit 24 is configured to construct a graph neural network, where nodes in the graph neural network include department nodes and employee nodes, edges in the graph neural network include inter-department collaboration edges, department-employee membership edges, and inter-employee collaboration edges, and the department node attributes include at least a post count to be processed, a post average processing duration, and a professional field code, and the employee node attributes include at least a current state, a skill feature vector, and a historical satisfaction score.
The analysis unit 26 is configured to analyze the multidimensional feature vector and the graph neural network by using a first deep Q network, so as to obtain a target node in the graph neural network for processing the target post, where a state in the first deep Q network includes the multidimensional feature vector of the current post, a topology structure of the graph neural network, and node attributes, and an action in the first deep Q network includes selecting a target department node, selecting a target employee node, and performing a multi-hop decision along a graph edge.
And the forwarding unit 28 is configured to forward the target post to the target node, obtain a processing result of the target node for the target post, and forward the processing result to an information receiving area or a target communication address associated with the target post.
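A minimal sketch, under simplifying assumptions, of how the construction unit 24 and the analysis unit 26 could interact: a small collaboration graph is assembled with the listed node and edge types, crudely pooled into a state vector together with the post features, and scored by a first deep Q network whose actions are the candidate nodes plus a multi-hop move. Dimensions, attribute values and the pooling scheme are illustrative only and are not taken from the application.

```python
# Hedged sketch of units 24 and 26; all attribute values and sizes are assumptions.
import torch
import torch.nn as nn

# Construction unit 24: department / employee nodes with simplified attributes.
nodes = {
    "dept_risk": {"kind": "department", "pending_posts": 12, "avg_minutes": 45.0},
    "dept_it":   {"kind": "department", "pending_posts": 7,  "avg_minutes": 30.0},
    "emp_alice": {"kind": "employee", "busy": 0, "skills": [0.9, 0.1], "satisfaction": 4.6},
    "emp_bob":   {"kind": "employee", "busy": 1, "skills": [0.2, 0.8], "satisfaction": 4.1},
}
edges = [  # (u, v, edge type)
    ("dept_risk", "dept_it", "dept_collab"),
    ("dept_risk", "emp_alice", "membership"),
    ("dept_it", "emp_bob", "membership"),
    ("emp_alice", "emp_bob", "emp_collab"),
]

def pool_graph(nodes, edges) -> torch.Tensor:
    """Crude graph encoding: means of a few numeric attributes plus the edge count."""
    depts = [n for n in nodes.values() if n["kind"] == "department"]
    emps = [n for n in nodes.values() if n["kind"] == "employee"]
    return torch.tensor([
        sum(d["pending_posts"] for d in depts) / len(depts),
        sum(d["avg_minutes"] for d in depts) / len(depts),
        sum(e["satisfaction"] for e in emps) / len(emps),
        float(len(edges)),
    ])

# Analysis unit 26: the first deep Q network maps the state to one Q value per
# candidate action (a department node, an employee node, or a multi-hop move).
post_features = torch.randn(16)                    # multidimensional feature vector
state = torch.cat([post_features, pool_graph(nodes, edges)])
actions = list(nodes) + ["hop_along_edge"]
dqn = nn.Sequential(nn.Linear(state.numel(), 64), nn.ReLU(), nn.Linear(64, len(actions)))
with torch.no_grad():
    target = actions[dqn(state).argmax().item()]   # greedy choice of target node
print(target)
```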
Optionally, the reward function of the first deep Q network is determined by determining a first sub-reward function according to a matching score, wherein the matching score is used for representing the semantic matching degree between the post content and the node professional field, the matching score is obtained by calculating the similarity between a post keyword and a node preset label through a natural language processing technology, determining a second sub-reward function according to a time delay, wherein the time delay is used for representing the time difference from the creation of the post to the successful allocation of the time stamp, determining a third sub-reward function according to a load index, wherein the load index is used for representing the ratio of the current task queue length of the node to the maximum processing capacity of the node, determining a fourth sub-reward function according to a satisfaction score, wherein the satisfaction score is used for representing the feedback rating provided by a user after the post processing is completed, and determining the reward function according to the first sub-reward function, the second sub-reward function, the third sub-reward function and the fourth sub-reward function.
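As an illustrative sketch only: the exact functional forms of the four sub-rewards are not specified above, so the snippet below assumes simple monotone shapes (linear match, exponential delay decay, linear load relief, normalized rating). The scale and maximum-rating parameters are assumptions.

```python
# Hedged sketch of the four sub-rewards of the first deep Q network.
import math

def r_match(similarity: float) -> float:
    """Semantic match between post keywords and the node's preset labels (0..1)."""
    return similarity

def r_delay(delay_seconds: float, scale: float = 600.0) -> float:
    """Decays toward 0 as the creation-to-allocation delay grows."""
    return math.exp(-delay_seconds / scale)

def r_load(queue_length: int, max_capacity: int) -> float:
    """Higher reward when the node's queue-to-capacity ratio is lower."""
    return 1.0 - min(queue_length / max_capacity, 1.0)

def r_satisfaction(rating: float, max_rating: float = 5.0) -> float:
    """User feedback rating normalized to 0..1."""
    return rating / max_rating

print(r_match(0.85), r_delay(120), r_load(3, 10), r_satisfaction(4.5))
```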
Optionally, determining a reward function according to the first sub-reward function, the second sub-reward function, the third sub-reward function and the fourth sub-reward function comprises the steps of obtaining initial weight coefficients of each sub-reward function, continuously detecting post distribution success rate indexes, average processing delay indexes and user satisfaction average indexes, increasing first initial weight coefficients corresponding to the first sub-reward function and fourth initial weight coefficients corresponding to the fourth sub-reward function when detecting post distribution success rate indexes to be reduced, increasing second initial weight coefficients corresponding to the second sub-reward function and third initial weight coefficients corresponding to the third sub-reward function when detecting average processing delay indexes to be increased, increasing fourth initial weight coefficients when detecting user satisfaction average indexes to be reduced, executing normalization operation after each time of weight coefficient updating, enabling the sum of the updated weight coefficients to be 1, obtaining first weight coefficients, second weight coefficients, third weight coefficients and fourth weight coefficients, and determining a reward function according to the first sub-reward function, the first weight coefficients, the second weight coefficients, the third sub-reward function and the fourth sub-reward function.
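The adaptive weighting rule can be sketched as follows; the step size and the initial equal weights are assumptions, while the nudge-then-normalize behaviour mirrors the description above.

```python
# Hedged sketch of the adaptive weight update; step size and initial weights are assumptions.
def update_weights(w, success_rate_dropped, delay_increased, satisfaction_dropped, step=0.05):
    w = list(w)                      # [w1, w2, w3, w4] for the four sub-rewards
    if success_rate_dropped:
        w[0] += step; w[3] += step   # favour matching and satisfaction terms
    if delay_increased:
        w[1] += step; w[2] += step   # favour delay and load terms
    if satisfaction_dropped:
        w[3] += step
    total = sum(w)
    return [x / total for x in w]    # normalization keeps the weights summing to 1

def total_reward(w, r1, r2, r3, r4):
    return w[0] * r1 + w[1] * r2 + w[2] * r3 + w[3] * r4

w = update_weights([0.25, 0.25, 0.25, 0.25],
                   success_rate_dropped=True, delay_increased=False,
                   satisfaction_dropped=False)
print(w, total_reward(w, 0.9, 0.7, 0.6, 0.8))
```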
Optionally, determining the multidimensional feature vector of the target post comprises the steps of: extracting a semantic embedding vector of the target post by using a pre-trained language model; calculating, through a topic attention layer, the probability distributions corresponding to different topic classifications, and normalizing these distributions to obtain a first feature component; calculating a vocabulary intensity score of the target post based on a financial emotion dictionary, wherein the financial emotion dictionary comprises a plurality of preset words representing positive, neutral and negative emotions, and the vocabulary intensity score quantifies the strength of the emotional tendency expressed in the target post; determining the emotion probabilities of the target post by using a deep learning model, wherein the emotion probabilities represent the probabilities that the target post belongs to positive, neutral and negative emotions; dynamically weighting and fusing the vocabulary intensity score and the emotion probabilities according to the financial term density to obtain a second feature component, wherein the financial term density is the ratio of the number of financial terms appearing in the target post to the total number of words in the target post; removing general stop words and high-frequency function words from the target post, and retaining core vocabulary whose term frequency-inverse document frequency exceeds a preset threshold; determining a third feature component according to the co-occurrence relations of the core vocabulary in a co-occurrence graph; and splicing the first feature component, the second feature component and the third feature component, and performing dimension reduction on the splicing result through a linear transformation layer with orthogonal constraints to obtain the multidimensional feature vector.
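A hedged NumPy sketch of this feature construction is given below; the pre-trained language model, topic attention layer and deep sentiment model are replaced with placeholder vectors, and the orthogonal-constraint projection is approximated by a QR-orthogonalized random matrix, so the snippet only illustrates the fusion and dimension-reduction flow, not the claimed models.

```python
# Hedged sketch of the three feature components and their fusion; all models are
# replaced by placeholder vectors and all sizes are assumptions.
import numpy as np

rng = np.random.default_rng(0)

topic_logits = rng.normal(size=8)                                     # topic attention output
first_component = np.exp(topic_logits) / np.exp(topic_logits).sum()   # normalized topic distribution

lexicon_intensity = 0.6                    # score from the financial emotion dictionary
model_probs = np.array([0.7, 0.2, 0.1])    # positive / neutral / negative from a deep model
term_density = 0.3                         # financial terms / total words
second_component = np.concatenate(
    [term_density * np.array([lexicon_intensity]), (1 - term_density) * model_probs])

third_component = rng.normal(size=16)      # encoding of high-TF-IDF core vocabulary co-occurrence

spliced = np.concatenate([first_component, second_component, third_component])
q, _ = np.linalg.qr(rng.normal(size=(spliced.size, 12)))   # orthonormal columns
multidim_feature = spliced @ q                             # dimension-reduced feature vector
print(multidim_feature.shape)                              # (12,)
```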
Optionally, after the target post to be processed in the communication platform is obtained, the method further comprises: encoding the target post by using a pre-trained language model to obtain a semantic feature vector; determining a click rate standardization value and a poster job level weight of the target post; splicing the semantic feature vector, the click rate standardization value and the poster job level weight, and determining the splicing result as a state vector; and analyzing the state vector by using a second deep Q network to determine whether the target post is an essence post, wherein the action space of the second deep Q network comprises marking the post as an essence post and not marking it, and the reward function of the second deep Q network comprises a compliance score, an expert review pass rate and a wrong-mark penalty. The compliance score is the quantified result of risk scanning of the post content against a preset financial rule base and equals the difference between 1 and a target ratio, the target ratio being the ratio of the number of risk keyword occurrences to the total number of keywords. The expert review pass rate is the ratio of the number of marked posts confirmed as essence posts by experts to the total number of marked posts. The wrong-mark penalty comprises a first cost and a second cost, wherein the first cost is the post exposure amount multiplied by a preset unit attention cost, and the second cost is the post knowledge value coefficient multiplied by a preset attenuation factor.
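The state-vector splicing and the binary essence decision can be sketched as follows; the embedding dimension, the example click rate and job-level weight, and the untrained network are illustrative assumptions.

```python
# Hedged sketch of the second deep Q network's state vector and binary decision.
import torch
import torch.nn as nn

semantic = torch.randn(64)                  # pre-trained-LM encoding of the post
click_rate_norm = torch.tensor([0.42])      # click rate standardization value
job_level_weight = torch.tensor([0.8])      # poster job level weight
state = torch.cat([semantic, click_rate_norm, job_level_weight])   # spliced state vector

essence_dqn = nn.Sequential(nn.Linear(66, 64), nn.ReLU(), nn.Linear(64, 2))
with torch.no_grad():
    q_values = essence_dqn(state)           # [Q(mark as essence), Q(do not mark)]
is_essence = bool(q_values.argmax().item() == 0)
print(is_essence)
```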
Optionally, if the target post is determined to be an essence post by using the second deep Q network, after the processing result of the target node for the target post is obtained, the processing result is sent to the target object for rechecking, the rechecking result is obtained, and the rechecking result is forwarded to an information receiving area or a target communication address associated with the target post.
Optionally, the processing result at least comprises target content, a processing conclusion type, a user privacy level identification and a result sensitivity label. Forwarding the processing result to the information receiving area or the target communication address associated with the target post comprises: forwarding the target content to the information receiving area when the processing conclusion type is a first type representing a public reply request; forwarding the target content to the target communication address when the processing conclusion type is a second type representing a personal transaction; forwarding the target content to the information receiving area when the user privacy level identification is a first identification representing public rights; forwarding the target content to the target communication address when the user privacy level identification is a second identification representing private rights; and forwarding the target content to the target communication address when the result sensitivity label is a first label representing private data.
It should be noted that each module in fig. 2 may be a program module (for example, a set of program instructions for implementing a specific function) or a hardware module; for the latter, each module may be implemented, for example but without limitation, in the form of one processor, or the functions of several modules may be implemented by one processor.
It should be noted that, the preferred implementation manner of the embodiment shown in fig. 2 may refer to the related description of the embodiment shown in fig. 1, which is not repeated herein.
Fig. 3 shows a block diagram of a hardware architecture of a computer terminal for implementing the intelligent collaborative management method based on artificial intelligence driving. As shown in fig. 3, the computer terminal 30 may include one or more processors 302 (shown in the figure as 302a, 302b, ..., 302n), which may include but are not limited to processing devices such as a microprocessor (MCU) or a programmable logic device (FPGA), a memory 304 for storing data, and a transmission module 306 for communication functions. In addition, the computer terminal 30 may further include a display, an input/output interface (I/O interface), a Universal Serial Bus (USB) port (which may be included as one of the ports of the I/O interface), a network interface, a power supply, and/or a camera. It will be appreciated by those of ordinary skill in the art that the configuration shown in fig. 3 is merely illustrative and does not limit the configuration of the electronic device described above. For example, the computer terminal 30 may also include more or fewer components than shown in fig. 3, or have a different configuration than shown in fig. 3.
It should be noted that the one or more processors 302 and/or other data processing circuits described above may be referred to generally herein as a "data processing circuit". The data processing circuit may be embodied in whole or in part in software, hardware, firmware, or any combination thereof. Furthermore, the data processing circuit may be a single stand-alone processing module, or may be incorporated, in whole or in part, into any of the other elements in the computer terminal 30. In the embodiments of the application, the data processing circuit may act as a kind of processor control (for example, selection of the path of a variable resistance termination connected to the interface).
The memory 304 may be used to store software programs and modules of application software, such as program instructions/data storage devices corresponding to the intelligent collaborative management method based on artificial intelligence driving in the embodiment of the present application, and the processor 302 executes various functional applications and data processing by running the software programs and modules stored in the memory 304, that is, implements the intelligent collaborative management method based on artificial intelligence driving. Memory 304 may include high-speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory 304 may further include memory remotely located relative to the processor 302, which may be connected to the computer terminal 30 via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The transmission module 306 is used to receive or transmit data via a network. The specific examples of the network described above may include a wireless network provided by a communication provider of the computer terminal 30. In one example, the transmission module 306 includes a network adapter (Network Interface Controller, NIC) that can connect to other network devices through a base station to communicate with the internet. In one example, the transmission module 306 may be a Radio Frequency (RF) module for communicating with the internet wirelessly.
The display may be, for example, a touch screen type Liquid Crystal Display (LCD) that may enable a user to interact with a user interface of the computer terminal 30.
It should be noted here that, in some alternative embodiments, the computer terminal shown in fig. 3 may include hardware elements (including circuits), software elements (including computer code stored on a computer readable medium), or a combination of both hardware and software elements. It should be noted that fig. 3 is only one example of a specific example, and is intended to illustrate the types of components that may be present in the computer terminal described above.
It should be noted that the computer terminal shown in fig. 3 is configured to execute the intelligent collaborative management method based on artificial intelligence driving shown in fig. 1, so the above explanation of the method is also applicable to the electronic device and is not repeated herein.
An embodiment of the application also provides a nonvolatile storage medium, which comprises a stored program, wherein the program, when running, controls the device where the storage medium is located to execute the intelligent collaborative management method based on artificial intelligence driving.
The method comprises the steps of: acquiring a target post to be processed in an exchange platform, and determining a multidimensional feature vector of the target post, wherein the multidimensional feature vector comprises a topic classification feature, an emotion feature and a keyword coding feature; constructing a graph neural network, wherein nodes in the graph neural network comprise department nodes and employee nodes, edges in the graph neural network comprise inter-department collaboration edges, department-employee membership edges and inter-employee collaboration edges, the department node attributes at least comprise the post count to be processed, the post average processing duration and a professional field code, and the employee node attributes at least comprise a current state, a skill feature vector and a historical satisfaction score; analyzing the multidimensional feature vector and the graph neural network by using a first deep Q network to obtain a target node in the graph neural network for processing the target post, wherein the state in the first deep Q network comprises the multidimensional feature vector of the current post, the topology structure of the graph neural network and the node attributes, and the actions in the first deep Q network comprise selecting a target department node, selecting a target employee node and performing a multi-hop decision along a graph edge; and forwarding the target post to the target node, obtaining a processing result of the target node for the target post, and forwarding the processing result to an information receiving area or a target communication address associated with the target post.
An embodiment of the application also provides an electronic device, which comprises a memory and a processor, wherein the processor is configured to run a program stored in the memory, and the intelligent collaborative management method based on artificial intelligence driving is executed when the program runs.
The processor is configured to run a program that performs the following functions: acquiring a target post to be processed in an exchange platform, and determining a multidimensional feature vector of the target post, wherein the multidimensional feature vector comprises a topic classification feature, an emotion feature and a keyword coding feature; constructing a graph neural network, wherein nodes in the graph neural network comprise department nodes and employee nodes, edges in the graph neural network comprise inter-department collaboration edges, department-employee membership edges and inter-employee collaboration edges, the department node attributes at least comprise the post count to be processed, the post average processing duration and a professional field code, and the employee node attributes at least comprise a current state, a skill feature vector and a historical satisfaction score; analyzing the multidimensional feature vector and the graph neural network by using a first deep Q network to obtain a target node in the graph neural network for processing the target post, wherein the state in the first deep Q network comprises the multidimensional feature vector of the current post, the topology structure of the graph neural network and the node attributes, and the actions in the first deep Q network comprise selecting a target department node, selecting a target employee node and performing a multi-hop decision along a graph edge; and forwarding the target post to the target node, obtaining a processing result of the target node for the target post, and forwarding the processing result to an information receiving area or a target communication address associated with the target post.
The foregoing embodiment numbers of the present application are merely for the purpose of description, and do not represent the advantages or disadvantages of the embodiments.
In the foregoing embodiments of the present application, the descriptions of the embodiments are emphasized, and for a portion of this disclosure that is not described in detail in this embodiment, reference is made to the related descriptions of other embodiments.
In the above embodiments of the present application, the collected information is information and data authorized by the user or sufficiently authorized by all parties; the collection, storage, use, processing, transmission, provision, disclosure and application of the related data comply with the relevant laws, regulations and standards; necessary protection measures are taken without violating the public interest; and corresponding operation entries are provided for the user to choose to authorize or refuse.
In the several embodiments provided in the present application, it should be understood that the disclosed technology may be implemented in other manners. The above-described embodiments of the apparatus are merely exemplary, and the division of the units, for example, may be a logic function division, and may be implemented in another manner, for example, a plurality of units or components may be combined or may be integrated into another system, or some features may be omitted, or not performed. Alternatively, the coupling or direct coupling or communication connection shown or discussed with each other may be through some interfaces, units or modules, or may be in electrical or other forms.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional unit in the embodiments of the present application may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit. The integrated units may be implemented in hardware or in software functional units.
The integrated units, if implemented in the form of software functional units and sold or used as stand-alone products, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present application may be embodied, in essence or in the part contributing to the related art, or in whole or in part, in the form of a software product stored in a storage medium, including several instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or part of the steps of the method according to the embodiments of the present application. The storage medium includes a USB flash disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic disk, an optical disk, or any other medium that can store program code.
The foregoing is merely a preferred embodiment of the present application. It should be noted that those skilled in the art may make modifications and improvements without departing from the principles of the present application, and such modifications and improvements shall also fall within the scope of protection of the present application.