CN111598239B - Method and device for extracting process system of article based on graph neural network - Google Patents

Method and device for extracting process system of article based on graph neural network

Info

Publication number
CN111598239B
Authority
CN
China
Prior art keywords
title
level
node
article
level title
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010727219.7A
Other languages
Chinese (zh)
Other versions
CN111598239A (en)
Inventor
宋永生
王楠
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wenling Technology Beijing Co ltd
Original Assignee
Jiangsu United Industrial Ltd By Share Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu United Industrial Ltd By Share Ltd
Priority to CN202010727219.7A
Publication of CN111598239A
Application granted
Publication of CN111598239B
Anticipated expiration

Abstract

The invention provides a method and a device for extracting the process system of an article based on a graph neural network, relating to the technical field of artificial intelligence. The hierarchical structure of the titles of a first article is identified by analyzing the format information of the first article, and each title is judged as to whether it is a behavior word describing a first process. When a first-level title is a behavior word describing the first process, a time vector between the first-level title and a second-level title in the same lower layer is established, a belonging vector from the lower-layer titles to the upper-layer title is established, and a first title network graph is built from the time vectors and the belonging vectors. Unsupervised graph neural network learning is then performed on the first title network graph together with a large number of second title network graphs built from second articles, yielding the first process system and the step sequence of the first process system, thereby achieving the technical effect of maximizing the accuracy of the result of iterative graph neural network learning on article title hierarchies.

Description

Method and device for extracting process system of article based on graph neural network
Technical Field
The invention relates to the technical field of artificial intelligence, in particular to a method and a device for extracting a process system of an article based on a graph neural network.
Background
The basis of machine intelligence is the cognitive architecture of computers, which includes two broad categories. One is static conceptual systems, such as classification systems based on attribute characteristics, structural systems based on physical connection, and relationship systems based on logical relations. The other is dynamic event (process) systems: a process that occurs in a particular spatio-temporal context is an event. The identification and extraction of process systems is therefore an indispensable step for a computer to acquire machine intelligence, is the basis on which a computer judges historical events and predicts future events, and is an important direction of current machine intelligence research.
Identifying the layout and hierarchy of article titles is a mature technology in the industry, because commonly used document formats (such as Word, PDF and HTML) carry format information, and authors use title numbering, font rendering, paragraph indentation, alignment and the like to highlight the hierarchy of titles and paragraphs. A computer can therefore obtain rich information with which to identify the hierarchy of article titles. The identified title hierarchy itself reflects the relationship between a process and its steps: a title node is a step of the upper-layer title it points to, and the process name of the lower-layer titles that point to it, so when constructing the belonging vectors (edges) of a title network graph it is sufficient to rely on the hierarchical information of the article title structure. When determining how many steps a process includes and in what order, however, the information about the process and its attached steps provided by the title structure of a single article is often incomplete: even if two steps look "adjacent" in relative time within one article, other steps may in fact lie hidden between them. Traditional mathematical statistics requires similarity aggregation over a large number of article title structures, irreversibility and consistency checks on the addition and removal of sequence elements in a step sequence, and so on.
However, the applicant of the present invention finds that the prior art has at least the following technical problems:
existing mathematical statistics can only compute over steps that actually appear and has no capacity to infer unknown steps; and when step information for the same process conflicts across different articles, consistency verification causes a loss of accuracy in the final result.
Disclosure of Invention
The embodiment of the invention provides a method and a device for extracting the process system of an article based on a graph neural network, solving the technical problems in the prior art that mathematical statistics can only be performed over steps that actually appear, that there is no capacity to infer unknown steps, and that when step information for the same process conflicts across different articles, consistency verification causes a loss of accuracy in the final result. The method thereby achieves the technical effects of continuous iterative learning based on a graph neural network, a certain capacity to mine hidden steps, and maximized accuracy of the result of the iterative graph neural network learning.
In view of the above problems, the present application provides a method and an apparatus for extracting the process system of an article based on a graph neural network.
In a first aspect, the present invention provides a method for extracting the process system of an article based on a graph neural network, the method comprising: obtaining first article format information of a first article; identifying a title hierarchy of the first article according to the first article format information to obtain a first-level title, wherein the first-level title comprises a first paragraph corresponding to the first-level title; judging whether the first-level title is a behavior word describing a first process; when the first-level title is a behavior word describing the first process, determining an upper-layer title of the first-level title and a lower-layer title of the layer where the first-level title is located; obtaining a second-level title describing the first process among the lower-layer titles, wherein the second-level title contains a second paragraph corresponding to the second-level title; establishing a belonging vector according to the upper-layer title and the lower-layer title, and identifying the first paragraph and the second paragraph according to time to establish a time vector of the first-level title and the second-level title; establishing a first title network graph according to the first-level title, the second-level title, the upper-layer title, the belonging vector and the time vector; obtaining a plurality of second articles, and correspondingly establishing a plurality of second title network graphs according to the plurality of second articles, wherein the article names of the second articles and the first article are synonyms; and inputting the first title network graph and the plurality of second title network graphs into a graph neural network for deep learning to obtain a first process system and a step sequence of the first process system.
Preferably, the first article format information includes a first article text format, a first article font format, and a first article paragraph format.
Preferably, establishing the belonging vector according to the upper-layer title and the lower-layer title includes:
determining an upper-layer node according to the upper-layer title; determining the lower-layer title according to the first-level title and the second-level title; determining a lower-layer node according to the lower-layer title; and obtaining the belonging vector in which the lower-layer node points to the upper-layer node according to the lower-layer node and the upper-layer node.
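As a minimal sketch of this construction (the function name and data layout are illustrative assumptions, not the patent's), each lower-layer step node receives a directed belonging edge pointing at its upper-layer process-name node:

```python
# Sketch: derive "belonging" edges (lower-layer node -> upper-layer node)
# from a title hierarchy. All names here are illustrative assumptions.

def belonging_edges(hierarchy):
    """hierarchy maps an upper-layer title to its lower-layer titles.
    Each lower-layer node gets a directed edge to its upper-layer node."""
    edges = []
    for upper, lowers in hierarchy.items():
        for lower in lowers:
            edges.append((lower, upper))  # step title -> process-name title
    return edges

edges = belonging_edges({"litigation": ["prosecution", "trial"]})
```

Here each step of a hypothetical "litigation" process points at the process it belongs to.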
Preferably, identifying the first paragraph and the second paragraph according to time to establish the time vector of the first-level title and the second-level title includes:
obtaining a first-level title node of the first-level title; obtaining a second-level title node of the second-level title; obtaining a first time quantum according to the first paragraph corresponding to the first-level title; obtaining a second time quantum according to the second paragraph corresponding to the second-level title; judging the time order of the first time quantum and the second time quantum; when the first time quantum precedes the second time quantum, judging whether the first-level title node and the second-level title node are adjacent nodes; and when the first-level title node and the second-level title node are adjacent nodes, obtaining the time vector pointing from the first-level title node to the second-level title node.
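A hedged sketch of this time-vector rule follows. Extracting a four-digit year from each paragraph as the "time quantum" is an assumed simplification, since the text does not fix a concrete time representation; temporally adjacent sibling titles are then connected earlier-to-later:

```python
import re

# Sketch: build time edges between sibling step titles. The year regex
# stands in for the patent's unspecified "time quantum" extraction.

def time_edges(steps):
    """steps: list of (title, paragraph) pairs sharing one belonging
    vector. Returns earlier->later edges between adjacent titles."""
    timed = []
    for title, para in steps:
        m = re.search(r"\b(\d{4})\b", para)
        if m:  # only titles whose paragraph carries a time quantum
            timed.append((int(m.group(1)), title))
    timed.sort()  # order siblings by their time quantum
    # adjacent in time -> edge from the earlier node to the later node
    return [(a, b) for (_, a), (_, b) in zip(timed, timed[1:])]

edges = time_edges([
    ("trial", "The trial opened in 2021."),
    ("prosecution", "The complaint was filed in 2020."),
])
```

A real implementation would need a richer temporal tagger than a year regex, but the edge-drawing logic is the same.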
Preferably, the method further comprises:
inputting the first title network graph and the second title network graphs into the graph neural network for training to obtain a plurality of first title state functions h_v, where the first title state function is denoted h_v = f(X_v, X_co[v], h_ne[v], X_ne[v]), in which: h_v is the vectorized representation of node v and is used to judge whether node v describes the first process; f(*) is a local transfer function shared by all nodes, which updates the node state according to the input neighborhood information; X_v is the feature representation of node v; X_co[v] is the feature representation of the edges connected to node v, i.e. of the belonging vector and the time vector; h_ne[v] is the state of the neighboring nodes of v; and X_ne[v] is the feature representation of the neighbors of node v;
aggregating the plurality of first title state functions h_v into a first title state function set H, where H is expressed as H = F(H, X), F(*) is the set of local transfer functions, and X is the set of node features;
iteratively learning the first title state function set H over time to obtain the iterative function H^(t+1), expressed as H^(t+1) = F(H^t, X), where H^(t+1) is the first title state function set at time t+1 and H^t is the first title state function set at time t;
when H^(t+1) = H^t, computing the iterative function H^(t+1) to obtain the first process system and the step sequence of the first process system.
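The scheme above, iterating H^(t+1) = F(H^t, X) until the state set stops changing, is the classic fixed-point formulation of graph neural networks. The toy sketch below illustrates it; the concrete transfer function (node feature plus a damped neighbor average) and summation readout are illustrative assumptions chosen so the iteration is a contraction, not the patent's own functions:

```python
# Toy sketch of the fixed-point iteration H^{t+1} = F(H^t, X) and the
# readout O = G(H, X). The damped-average f and summation g are assumed.

def iterate_states(features, neighbors, damping=0.5, tol=1e-9):
    """Iterate h_v = X_v + damping * mean(h of neighbors) until the
    state set stops changing, i.e. H^{t+1} == H^t (within tol)."""
    h = {v: 0.0 for v in features}                      # H^0
    while True:
        h_next = {}
        for v in features:
            ne = neighbors.get(v, [])
            avg = sum(h[u] for u in ne) / len(ne) if ne else 0.0
            h_next[v] = features[v] + damping * avg     # local f
        if max(abs(h_next[v] - h[v]) for v in h) < tol:  # fixed point
            return h_next
        h = h_next

def readout(h, features):
    # local output o_v = g(h_v, X_v); the sum is an assumed choice of g
    return {v: h[v] + features[v] for v in h}

states = iterate_states({"a": 1.0, "b": 2.0}, {"a": ["b"], "b": ["a"]})
outputs = readout(states, {"a": 1.0, "b": 2.0})
```

Because the damping factor is below 1, F is a contraction and the iteration converges to a unique fixed point, which is what makes the stopping condition H^(t+1) = H^t meaningful.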
Preferably, the method further comprises:
determining, according to the plurality of first title state functions h_v, the nodes v that describe the first process as a plurality of first steps O_v, where O_v is expressed as O_v = g(h_v, X_v) and g(*) is a local output function;
aggregating the plurality of first steps O_v to obtain a first step set O of the first process system, where O is expressed as O = G(H, X) and G(*) is the set of local output functions.
In a second aspect, the present invention provides an apparatus for extracting the process system of an article based on a graph neural network, the apparatus comprising:
a first obtaining unit, configured to obtain first article format information of a first article;
a second obtaining unit, configured to identify a title hierarchy of the first article according to the first article format information to obtain a first-level title, where the first-level title includes a first paragraph corresponding to the first-level title;
the first judging unit is used for judging whether the first-level title is a behavior word describing a first process;
a first determining unit, configured to determine, when the first-level title is a behavior word describing the first process, an upper-layer title of the first-level title and a lower-layer title where the first-level title is located;
a third obtaining unit, configured to obtain a second-level title that describes the first process in the lower-level title, where the second-level title includes a second paragraph corresponding to the second-level title;
a first constructing unit, configured to establish a belonging vector according to the upper-layer title and the lower-layer title, and to identify the first paragraph and the second paragraph according to time to establish a time vector of the first-level title and the second-level title;
a second constructing unit, configured to establish a first title network graph according to the first-level title, the second-level title, the upper-layer title, the belonging vector and the time vector;
a third constructing unit, configured to obtain a plurality of second articles and correspondingly establish a plurality of second title network graphs according to the plurality of second articles, where the article names of the second articles and the first article are synonyms;
a fourth obtaining unit, configured to input the first title network graph and the plurality of second title network graphs into a graph neural network for deep learning, so as to obtain a first process system and a step sequence of the first process system.
Preferably, the first article format information includes a first article text format, a first article font format, and a first article paragraph format.
Preferably, establishing the belonging vector by the first constructing unit according to the upper-layer title and the lower-layer title includes:
a second determining unit, configured to determine an upper-layer node according to the upper-layer title;
a third determining unit, configured to determine the lower-layer title according to the first-level title and the second-level title;
a fourth determining unit, configured to determine a lower-layer node according to the lower-layer title;
a fifth obtaining unit, configured to obtain, according to the lower-layer node and the upper-layer node, the belonging vector in which the lower-layer node points to the upper-layer node.
Preferably, establishing, in the first constructing unit, the time vector of the first-level title and the second-level title by identifying the first paragraph and the second paragraph according to time includes:
a sixth obtaining unit configured to obtain a first-level title node of the first-level title;
a seventh obtaining unit configured to obtain a second-level title node of the second-level title;
an eighth obtaining unit, configured to obtain a first time quantum according to the first paragraph corresponding to the first-level title;
a ninth obtaining unit, configured to obtain a second time quantum according to the second paragraph corresponding to the second-level title;
a second judging unit, configured to judge the time order of the first time quantum and the second time quantum;
a third judging unit, configured to judge whether the first-level title node and the second-level title node are adjacent nodes when the first time quantum precedes the second time quantum;
a tenth obtaining unit, configured to obtain the time vector pointing from the first-level title node to the second-level title node when the first-level title node and the second-level title node are adjacent nodes.
Preferably, the apparatus further comprises:
a tenth obtaining unit, configured to input the first title network graph and the second title network graphs into the graph neural network for training to obtain a plurality of first title state functions h_v, where the first title state function is denoted h_v = f(X_v, X_co[v], h_ne[v], X_ne[v]), in which: h_v is the vectorized representation of node v and is used to judge whether node v describes the first process; f(*) is a local transfer function shared by all nodes, which updates the node state according to the input neighborhood information; X_v is the feature representation of node v; X_co[v] is the feature representation of the edges connected to node v, i.e. of the belonging vector and the time vector; h_ne[v] is the state of the neighboring nodes of v; and X_ne[v] is the feature representation of the neighbors of node v;
an eleventh obtaining unit, configured to aggregate the plurality of first title state functions h_v into a first title state function set H, where H is expressed as H = F(H, X), F(*) is the set of local transfer functions, and X is the set of node features;
a twelfth obtaining unit, configured to iteratively learn the first title state function set H over time to obtain the iterative function H^(t+1), expressed as H^(t+1) = F(H^t, X), where H^(t+1) is the first title state function set at time t+1 and H^t is the first title state function set at time t;
a thirteenth obtaining unit, configured to compute the iterative function H^(t+1) when H^(t+1) = H^t, so as to obtain the first process system and the step sequence of the first process system.
Preferably, the apparatus further comprises:
a fifth determining unit, configured to determine, according to the plurality of first title state functions h_v, the nodes v that describe the first process as a plurality of first steps O_v, where O_v is expressed as O_v = g(h_v, X_v) and g(*) is a local output function;
a fourteenth obtaining unit, configured to aggregate the plurality of first steps O_v to obtain a first step set O of the first process system, where O is expressed as O = G(H, X) and G(*) is the set of local output functions.
In a third aspect, the present invention provides an apparatus for extracting the process system of an article based on a graph neural network, comprising a memory, a processor and a computer program stored in the memory and executable on the processor, wherein the processor implements the steps of any one of the above methods when executing the program.
In a fourth aspect, the invention provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, performs the steps of any of the methods described above.
One or more technical solutions in the embodiments of the present application have at least one or more of the following technical effects:
the method and the device for extracting the process system of the article based on the graph neural network provided by the embodiment of the invention are characterized in that first article format information of a first article is obtained; identifying a title hierarchy of the first article according to the first article format information to obtain a first-level title, wherein the first-level title comprises a first paragraph corresponding to the first-level title; judging whether the first-level title is a behavior word describing a first process; when the first-level title is a behavior word describing the first process, determining an upper-layer title of the first-level title and a lower-layer title where the first-level title is located; obtaining a second-level title describing the first process in the lower-level title, wherein the second-level title contains a second paragraph corresponding to the second-level title; establishing a vector according to the upper layer title and the lower layer title, and identifying the first paragraph and the second paragraph according to time to establish a time vector of the first-level title and the second-level title; establishing a first title network graph according to the first-level title, the second-level title, the upper-layer title, the affiliated vector and the time vector; obtaining a plurality of second articles, and correspondingly establishing a plurality of second title network graphs according to the plurality of second articles, wherein the article names of the second articles and the first articles belong to synonyms; the first headline network graph and the plurality of second headline network graphs are input into a graph neural network for deep learning to obtain the step sequence of the first process system and the first process system, so that the technical problems that in the prior art, mathematical statistics can only be performed on the steps which appear, the capacity of unknown steps is 
not deduced, and when step information of the same process reflected by different articles conflicts, consistency verification can cause the loss of the accuracy of the final result are solved, continuous iterative learning based on the graph neural network is achieved, certain capacity of excavating hidden steps is achieved, and the technical effect of maximizing the accuracy of the result of the iterative learning of the graph neural network is ensured.
The foregoing description is only an overview of the technical solutions of the present invention; the embodiments of the invention are described below so that the technical means of the invention can be more clearly understood and the above and other objects, features and advantages of the invention become more readily apparent.
Drawings
FIG. 1 is a flowchart illustrating a method for extracting the process system of an article based on a graph neural network according to an embodiment of the present invention;
FIG. 2 is a block diagram of an apparatus for extracting the process system of an article based on a graph neural network according to an embodiment of the present invention;
FIG. 3 is a schematic structural diagram of another apparatus for extracting the process system of an article based on a graph neural network according to an embodiment of the present invention.
Description of reference numerals: a first obtaining unit 11, a second obtaining unit 12, a first judging unit 13, a first determining unit 14, a third obtaining unit 15, a first constructing unit 16, a second constructing unit 17, a third constructing unit 18, a fourth obtaining unit 19, a bus 300, a receiver 301, a processor 302, a transmitter 303, a memory 304, and a bus interface 306.
Detailed Description
The embodiment of the invention provides a method and a device for extracting the process system of an article based on a graph neural network, which are used to solve the technical problems in the prior art that mathematical statistics can only be performed over steps that actually appear, that there is no capacity to infer unknown steps, and that consistency verification causes a loss of accuracy in the final result when step information for the same process conflicts across different articles.
The general idea of the technical scheme provided by the invention is as follows: obtain first article format information of a first article; identify a title hierarchy of the first article according to the first article format information to obtain a first-level title, wherein the first-level title comprises a first paragraph corresponding to the first-level title; judge whether the first-level title is a behavior word describing a first process; when the first-level title is a behavior word describing the first process, determine an upper-layer title of the first-level title and a lower-layer title of the layer where the first-level title is located; obtain a second-level title describing the first process among the lower-layer titles, wherein the second-level title contains a second paragraph corresponding to the second-level title; establish a belonging vector according to the upper-layer title and the lower-layer title, and identify the first paragraph and the second paragraph according to time to establish a time vector of the first-level title and the second-level title; establish a first title network graph according to the first-level title, the second-level title, the upper-layer title, the belonging vector and the time vector; obtain a plurality of second articles and correspondingly establish a plurality of second title network graphs, wherein the article names of the second articles and the first article are synonyms; and input the first title network graph and the plurality of second title network graphs into a graph neural network for deep learning to obtain a first process system and the step sequence of the first process system. Continuous iterative learning based on the graph neural network is thereby achieved, a certain capacity to mine hidden steps is obtained, and the technical effect of maximizing the accuracy of the result of the iterative graph neural network learning is ensured.
The technical solutions of the present invention are described in detail below with reference to the drawings and specific embodiments. It should be understood that the specific features in the embodiments and examples of the present invention explain, rather than limit, the technical solutions of the present application, and that the technical features in the embodiments and examples of the present application may be combined with one another in the absence of conflict.
The term "and/or" herein merely describes an association between associated objects and indicates that three relationships may exist; for example, A and/or B may mean that A exists alone, that A and B exist simultaneously, or that B exists alone. In addition, the character "/" herein generally indicates that the former and latter associated objects are in an "or" relationship.
Example one
Fig. 1 is a flowchart illustrating a method for extracting the process system of an article based on a graph neural network according to an embodiment of the present invention. As shown in fig. 1, an embodiment of the present invention provides a method for extracting the process system of an article based on a graph neural network, where the method includes:
step 110: first article format information of a first article is obtained.
Step 120: and identifying the title hierarchy of the first article according to the first article format information to obtain a first-level title, wherein the first-level title comprises a first paragraph corresponding to the first-level title.
Further, the first article format information includes a first article text format, a first article font format, and a first article paragraph format.
Specifically, the first article text format, the first article font format and the first article paragraph format of the first article are analyzed, such as the title font, the title font size, the paragraph indentation and the alignment. According to the first article text format, first article font format, first article paragraph format and the like in the first article format information, the title hierarchy of the first article is identified, and the level of each title, namely the first-level title, is obtained, where the levels include a first level, a second level, a third level and so on. "First-level title" is a collective term for each title in the hierarchy identified in the first article. The first-level title comprises a corresponding first paragraph, where the first paragraph is the specific text content describing or further expanding the first-level title and belongs to the content attached to the first-level title.
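As a hedged illustration of identifying the title hierarchy from format information, the sketch below infers title levels from font size alone (larger font means a higher level); this is an assumed simplification, since real articles also signal hierarchy through numbering, indentation and alignment, and all names are hypothetical:

```python
# Sketch: infer title levels from format information. Using font size
# alone is an assumed simplification of the format cues in the text.

def title_levels(titles):
    """titles: list of (text, font_size). Larger font -> higher level,
    with level 1 as the top. Returns a {text: level} mapping."""
    sizes = sorted({size for _, size in titles}, reverse=True)
    rank = {size: i + 1 for i, size in enumerate(sizes)}  # size -> level
    return {text: rank[size] for text, size in titles}

levels = title_levels([("Litigation", 16), ("Prosecution", 12), ("Trial", 12)])
```

A production system would combine several format signals and break ties with numbering patterns, but the ranking idea is the same.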
Step 130: and judging whether the first-level title is a behavior word describing a first process.
Step 140: and when the first-level title is a behavior word describing the first process, determining an upper-layer title of the first-level title and a lower-layer title where the first-level title is located.
Step 150: obtaining a second level title describing the first process in the lower level title, wherein the second level title includes a second paragraph corresponding to the second level title.
Specifically, process identification is performed on each title in the title hierarchy identified in the first article, that is, it is determined which process each first-level title describes. When a first-level title contains a behavior word describing a first process, the upper-layer title of the first-level title and the lower-layer titles of the layer where the first-level title is located are determined, where the name of the first process is the upper-layer title of the layer where the first-level title is located, and the lower-layer titles are the titles of that layer. The same process identification is performed on all titles of the layer where the first-level title is located, and all second-level titles describing the first process among the lower-layer titles are obtained, where the second-level titles and the first-level title belong to the same layer. The second-level title comprises a corresponding second paragraph, where the second paragraph is the specific text content describing or further expanding the second-level title and belongs to the content attached to the second-level title. For example, if a title describing "prosecution" and a title describing "trial" both have the same upper-layer title "litigation", then the process is named "litigation", and both "prosecution" and "trial" are steps of the litigation process.
Step 160: and establishing a belonging vector according to the upper layer title and the lower layer title, and identifying the first paragraph and the second paragraph according to time to establish a time vector of the first-level title and the second-level title.
Further, establishing the belonging vector according to the upper-layer title and the lower-layer title includes: determining an upper-layer node according to the upper-layer title; determining the lower-layer title according to the first-level title and the second-level title; determining a lower-layer node according to the lower-layer title; and obtaining the belonging vector in which the lower-layer node points to the upper-layer node according to the lower-layer node and the upper-layer node. Further, identifying the first paragraph and the second paragraph according to time to establish the time vector of the first-level title and the second-level title includes: obtaining a first-level title node of the first-level title; obtaining a second-level title node of the second-level title; obtaining a first time quantum according to the first paragraph corresponding to the first-level title; obtaining a second time quantum according to the second paragraph corresponding to the second-level title; judging the time order of the first time quantum and the second time quantum; when the first time quantum precedes the second time quantum, judging whether the first-level title node and the second-level title node are adjacent nodes; and when the first-level title node and the second-level title node are adjacent nodes, obtaining the time vector pointing from the first-level title node to the second-level title node.
Specifically, each title in the article is taken as a node of a title network graph; the upper-layer title is determined to be an upper-layer node and the lower-layer titles are determined to be lower-layer nodes, wherein the lower-layer titles comprise the first-level title and the second-level titles. The belonging vector pointing from each lower-layer node to the upper-layer node is obtained according to the lower-layer node and the upper-layer node, that is, an edge is drawn from each step title in the lower layer to the first-process-name title in the upper layer as the belonging vector. A first-level title node of the first-level title and a second-level title node of the second-level title are obtained; a first time amount is obtained according to the first paragraph corresponding to the first-level title, and a second time amount is obtained according to the second paragraph corresponding to the second-level title. The time sequence of the first time amount and the second time amount is judged, and when the first time amount is before the second time amount, it is judged whether the first-level title node and the second-level title node are adjacent nodes, that is, whether the next step after the first-level title node is the second-level title node. When the first-level title node and the second-level title node are adjacent nodes, a time vector pointing from the first-level title node to the second-level title node is obtained. That is, a time amount is found in the text paragraph corresponding to each step title that shares the same belonging vector, adjacent nodes are found among the titles containing time amounts, and an edge connecting the two adjacent nodes is drawn in first-to-last order as the time vector.
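The edge construction just described can be sketched as below. The time amounts are passed in directly rather than extracted from paragraph text, which is a simplifying assumption; the function and field names are illustrative, not the patent's:

```python
def build_edges(process_name, steps_with_times):
    """steps_with_times: (step_title, time_amount) pairs, one per step
    heading under the process-name heading (time extraction is elided).
    Returns (belonging_edges, time_edges): belonging edges point from each
    step node up to the process-name node; time edges link adjacent steps
    in first-to-last time order."""
    belonging = [(title, process_name) for title, _ in steps_with_times]
    ordered = sorted(steps_with_times, key=lambda pair: pair[1])  # by time
    time_edges = [(a[0], b[0]) for a, b in zip(ordered, ordered[1:])]
    return belonging, time_edges

b, t = build_edges("litigation", [("court", 2), ("prosecution", 1)])
print(b)  # [('court', 'litigation'), ('prosecution', 'litigation')]
print(t)  # [('prosecution', 'court')]
```

Note that even though "court" appears first in the input, the time edge points from "prosecution" to "court" because its time amount is earlier.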
Step 170: establishing a first title network graph according to the first-level title, the second-level title, the upper-layer title, the belonging vector and the time vector.
Specifically, the first-level title, the second-level titles and the upper-layer title are taken as nodes, and the belonging vectors and time vectors are taken as edges, to establish the first title network graph. That is, a first-level title and the neighboring second-level titles linked to it by belonging vectors and time vectors form nodes of the first title network graph, and all steps included under each first process are linked in this manner to form the first title network graph of the first process.
Step 180: obtaining a plurality of second articles, and correspondingly establishing a plurality of second title network graphs according to the plurality of second articles, wherein the article names of the second articles and the first article are synonyms.
Step 190: inputting the first title network graph and the plurality of second title network graphs into a graph neural network for deep learning to obtain a first process system and a sequence of steps of the first process system.
Specifically, in any single article, the titles describing the steps of a process under a process-name title do not necessarily cover all steps of the process; they may be incomplete. To obtain the complete set of steps of the process, all steps of the same process that occur across a large number of articles need to be analyzed. To do this, the present embodiment combines the title hierarchies of a large number of articles into a data set whose elementary units are individual "process names and their subordinate steps", although each unit may contain steps that are incomplete and out of order. Thus, a large number of second articles are obtained, the article names of the second articles and the first article being synonyms, i.e. the second articles and the first article belong to the same type of article. A plurality of second title network graphs are correspondingly established from the plurality of second articles, and the first title network graph and the second title network graphs are input into a graph neural network for deep learning, adding new nodes from the second title network graphs to the first title network graph or adjusting the positions of existing nodes. Through continuous iterative learning, when the gradient function of the nodes between the second title network graphs and the first title network graph tends to zero, a first process system with extremely high integrity and the sequence of steps of the first process system are obtained. In the continuously iterated learning process, the graph neural network can deduce the definition (label) of a core node from the information of the surrounding nodes and edges, so it has a certain capacity for mining hidden nodes (steps), and thereby obtains a process system with high integrity and consistency.
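Stripped of the learned state functions, the merge loop described above can be pictured as a union of node and edge sets that stops when a pass over the second graphs adds nothing new, corresponding to the gradient tending to zero. The (nodes, edges) set representation is an illustrative assumption:

```python
def merge_title_graphs(first, seconds):
    """first: the first title network graph as a (nodes, edges) pair of sets.
    seconds: a list of second title network graphs in the same form.
    New nodes and edges found in the second graphs are added to the first
    graph; iteration stops when a full pass changes nothing (the fixed
    point at which the graph no longer grows)."""
    nodes, edges = set(first[0]), set(first[1])
    changed = True
    while changed:
        changed = False
        for n2, e2 in seconds:
            if not (n2 <= nodes and e2 <= edges):  # something new to add
                nodes |= n2
                edges |= e2
                changed = True
    return nodes, edges

nodes, edges = merge_title_graphs(
    ({"prosecution"}, set()),
    [({"prosecution", "court"}, {("prosecution", "court")})])
print(sorted(nodes))  # ['court', 'prosecution']
print(sorted(edges))  # [('prosecution', 'court')]
```

A hidden step ("court") absent from the first article is recovered here from a second article's graph, which is the point of analyzing many articles at once.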
Further, the method further comprises: inputting the first title network graph and the second title network graphs into the graph neural network for training to obtain a plurality of first title state functions h_v, wherein the first title state function h_v is denoted h_v = f(X_v, X_co[v], h_ne[v], X_ne[v]), where h_v is the vectorized representation of a node v and is used to judge whether the node v describes the first process; f(*) is the local transition function, shared by all nodes, which updates the node state according to the input domain information; X_v is the feature representation of the node v; X_co[v] is the feature representation of the edges connected to the node v, i.e. of the belonging vectors and the time vectors; h_ne[v] is the state of the neighboring nodes; X_ne[v] is the feature representation of the neighbors of the node v; aggregating the plurality of first title state functions h_v to obtain a first title state function set H, denoted H = F(H, X), where F(*) is the set of local transition functions and X is the feature set of the node v; iteratively learning the first title state function set H along time to obtain an iterative function H^(t+1), denoted H^(t+1) = F(H^t, X), where H^(t+1) is the first title state function set at time t+1 and H^t is the first title state function set at time t; and when H^(t+1) = H^t, computing the iterative function H^(t+1) to obtain the first process system and a sequence of steps of the first process system.
Further, the method further comprises: according to the plurality of first title state functions hvDetermining the node v as describing a plurality of first steps O in the first processvWherein the first step OvIs represented by Ov= g(hv,Xv) Wherein g (#) is a local output function; subjecting the plurality of first steps toProcedure OvPerforming aggregation to obtain a first step aggregation O of the first process system, wherein the first step aggregation O is represented by O = G (H, X), and G (×) is a local output function aggregation.
Specifically, the first title network graph and the second title network graphs are input into the graph neural network for training to obtain a plurality of first title state functions h_v, where h_v = f(X_v, X_co[v], h_ne[v], X_ne[v]); the first title state function converts the nodes of the graph, composed of the first-level titles, into a numerical representation of their state. The second title state functions of the plurality of second title network graphs and the first title state functions are collected to obtain the first title state function set H, i.e. the totality of all nodes in the first title network graph and the second title network graphs. The first title state function set H is iteratively learned along time, sorting all nodes in H according to time order, to obtain the iterative function H^(t+1). Concretely, a first time amount is found in the first paragraph under a first-level title and compared with the time amounts of adjacent nodes; if the time order is correct, the position is not adjusted, and if it is not, the position of the first-level title node in the graph is adjusted until the time order is correct. When H^(t+1) = H^t, the iterative function H^(t+1) is computed and the first process system and the sequence of steps of the first process system are obtained. That is, there is a relationship between adjacent states of the node set: each learning pass of the graph neural network is one iteration, and each iteration adds a new node to the graph, or adjusts the position of an existing node in the graph, or both. (H^(t+1) - H^t) can be used as the gradient function, and the objective of the iterative learning is to make this gradient function approach zero.
When no new node can be added and no existing node position needs adjusting, the gradient function is zero; that is, no matter how many further articles and titles are added, the number of processes the graph neural network can find no longer changes and the sequence of steps of each found process no longer changes, i.e. H^(t+1) = H^t, and a first process system with extremely high integrity and the sequence of steps of the first process system are obtained. In the process of iteratively learning the first title network graph and the second title network graphs, the graph neural network uses the plurality of first title state functions h_v to determine that the node v describes a plurality of first steps O_v in the first process, and aggregates the plurality of first steps O_v to obtain the first step set O of the first process system; that is, all first steps in the second title network graphs and the first steps in the first title network graph are aggregated to determine the complete set of first steps.
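As a toy illustration of iterating the state update H^(t+1) = F(H^t, X) to the fixed point H^(t+1) = H^t described above, the sketch below uses a simple averaging transition function. That contractive choice of f is an assumption made here so the iteration provably converges; the embodiment only requires a shared local transition function:

```python
def iterate_states(neighbors, features, tol=1e-9, max_iter=10_000):
    """neighbors: {node: [adjacent nodes]}; features: {node: float} (X).
    Repeats h_v <- f(X_v, h_ne[v]) for every node until the update no
    longer changes the state set, i.e. H^(t+1) ~= H^t."""
    h = {v: 0.0 for v in neighbors}  # H^0
    for _ in range(max_iter):
        new_h = {
            v: 0.5 * features[v]
               + 0.5 * sum(h[u] for u in neighbors[v]) / max(len(neighbors[v]), 1)
            for v in neighbors
        }
        if max(abs(new_h[v] - h[v]) for v in neighbors) < tol:
            return new_h  # fixed point reached: H^(t+1) = H^t
        h = new_h
    return h

h = iterate_states({"prosecution": ["court"], "court": ["prosecution"]},
                   {"prosecution": 1.0, "court": 1.0})
print(round(h["prosecution"], 6))  # 1.0
```

With both node features equal to 1.0, the fixed point solves h = 0.5 + 0.5h, so every state converges to 1.0, mirroring how the iteration stops once further passes change nothing.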
Example two
Based on the same inventive concept as the method for extracting the process system of an article based on a graph neural network in the foregoing embodiment, the present invention further provides an apparatus for extracting the process system of an article based on a graph neural network; as shown in fig. 2, the apparatus includes:
a first obtaining unit 11, where the first obtaining unit 11 is configured to obtain first article format information of a first article;

a second obtaining unit 12, where the second obtaining unit 12 is configured to identify a title hierarchy of the first article according to the first article format information to obtain a first-level title, where the first-level title includes a first paragraph corresponding to the first-level title;

a first judging unit 13, where the first judging unit 13 is configured to judge whether the first-level title is a behavior word describing a first process;

a first determining unit 14, where the first determining unit 14 is configured to determine an upper-layer title of the first-level title and a lower-layer title where the first-level title is located when the first-level title is a behavior word describing the first process;

a third obtaining unit 15, where the third obtaining unit 15 is configured to obtain a second-level title describing the first process in the lower-layer title, where the second-level title includes a second paragraph corresponding to the second-level title;

a first constructing unit 16, where the first constructing unit 16 is configured to establish a belonging vector according to the upper-layer title and the lower-layer title, and to identify the first paragraph and the second paragraph according to time to establish a time vector of the first-level title and the second-level title;

a second constructing unit 17, where the second constructing unit 17 is configured to establish a first title network graph according to the first-level title, the second-level title, the upper-layer title, the belonging vector and the time vector;

a third constructing unit 18, where the third constructing unit 18 is configured to obtain a plurality of second articles and correspondingly establish a plurality of second title network graphs according to the plurality of second articles, where the article names of the second articles and the first article are synonyms;

a fourth obtaining unit 19, where the fourth obtaining unit 19 is configured to input the first title network graph and the plurality of second title network graphs into a graph neural network for deep learning, so as to obtain a first process system and a sequence of steps of the first process system.
Further, the first article format information includes a first article text format, a first article font format, and a first article paragraph format.
Further, the establishing, by the first constructing unit, of the belonging vector according to the upper-layer title and the lower-layer title includes:
a second determining unit configured to determine an upper node according to the upper header;
a third determining unit configured to determine the lower-layer title according to the first-level title and the second-level title;
a fourth determining unit configured to determine a lower node according to the lower title;
a fifth obtaining unit, configured to obtain, according to the lower node and the upper node, the belonging vector of the lower node pointing to the upper node.
Further, the identifying, by the first constructing unit, of the first paragraph and the second paragraph according to time to establish the time vector of the first-level title and the second-level title includes:
a sixth obtaining unit configured to obtain a first-level title node of the first-level title;
a seventh obtaining unit configured to obtain a second-level title node of the second-level title;
an eighth obtaining unit, configured to obtain a first amount of time according to the first paragraph corresponding to the first level title;
a ninth obtaining unit, configured to obtain a second amount of time according to the second paragraph corresponding to the second level title;
a second judging unit, configured to judge a time sequence of the first time amount and the second time amount;
a third judging unit, configured to judge whether the first-level title node and the second-level title node are adjacent nodes when the first amount of time is before the second amount of time;
a tenth obtaining unit configured to obtain the time vector pointing from the first-level title node to the second-level title node when the first-level title node and the second-level title node are adjacent nodes.
Further, the apparatus further comprises:
a first training unit, configured to input the first title network graph and the second title network graphs into the graph neural network for training to obtain a plurality of first title state functions h_v, where h_v = f(X_v, X_co[v], h_ne[v], X_ne[v]); the first title state function h_v is the vectorized representation of a node v and is used to judge whether the node v describes the first process; f(*) is the local transition function, shared by all nodes, which updates the node state according to the input domain information; X_v is the feature representation of the node v; X_co[v] is the feature representation of the edges connected to the node v, i.e. of the belonging vector and the time vector; h_ne[v] is the state of the neighboring nodes; X_ne[v] is the feature representation of the neighbors of the node v;
an eleventh obtaining unit, configured to aggregate the plurality of first title state functions h_v to obtain a first title state function set H, where H = F(H, X), F(*) is the set of local transition functions, and X is the feature set of the node v;
a twelfth obtaining unit, configured to iteratively learn the first title state function set H along time to obtain an iterative function H^(t+1), where H^(t+1) = F(H^t, X), H^(t+1) is the first title state function set at time t+1, and H^t is the first title state function set at time t;
a thirteenth obtaining unit, configured to compute the iterative function H^(t+1) when H^(t+1) = H^t, so as to obtain the first process system and a sequence of steps of the first process system.
Further, the apparatus further comprises:
a fifth determining unit, configured to determine, according to the plurality of first title state functions h_v, that the node v describes a plurality of first steps O_v in the first process, where O_v = g(h_v, X_v) and g(*) is the local output function;
a fourteenth obtaining unit, configured to aggregate the plurality of first steps O_v to obtain a first step set O of the first process system, where O = G(H, X) and G(*) is the set of local output functions.
Various changes and specific examples of the method for extracting a process system of an article based on a graph neural network in the first embodiment of fig. 1 are also applicable to the apparatus for extracting a process system of an article based on a graph neural network in the present embodiment, and through the foregoing detailed description of the method for extracting a process system of an article based on a graph neural network, those skilled in the art can clearly know an implementation method of the apparatus for extracting a process system of an article based on a graph neural network in the present embodiment, so for the brevity of the description, detailed descriptions are not further provided here.
Example three
Based on the same inventive concept as the method for extracting the process system of an article based on a graph neural network in the foregoing embodiments, the present invention further provides an apparatus for extracting the process system of an article based on a graph neural network, as shown in fig. 3, including a memory 304, a processor 302, and a computer program stored on the memory 304 and operable on the processor 302, where the processor 302 executes the program to implement the steps of any one of the methods for extracting the process system of an article based on a graph neural network.
In fig. 3, a bus architecture (represented by bus 300) is shown; bus 300 may include any number of interconnected buses and bridges, and links together various circuits including one or more processors, represented by processor 302, and memory, represented by memory 304. The bus 300 may also link together various other circuits such as peripherals, voltage regulators and power management circuits, which are well known in the art and therefore not described further herein. A bus interface 306 provides an interface between the bus 300 and the receiver 301 and transmitter 303. The receiver 301 and the transmitter 303 may be the same element, i.e. a transceiver, providing a means for communicating with various other apparatus over a transmission medium. The processor 302 is responsible for managing the bus 300 and general processing, and the memory 304 may be used for storing data used by the processor 302 in performing operations.
Example four
Based on the same inventive concept as the method for extracting the process system of an article based on a graph neural network in the foregoing embodiments, the present invention also provides a computer-readable storage medium on which a computer program is stored, which when executed by a processor implements the following steps: obtaining first article format information of a first article; identifying a title hierarchy of the first article according to the first article format information to obtain a first-level title, wherein the first-level title comprises a first paragraph corresponding to the first-level title; judging whether the first-level title is a behavior word describing a first process; when the first-level title is a behavior word describing the first process, determining an upper-layer title of the first-level title and a lower-layer title where the first-level title is located; obtaining a second-level title describing the first process in the lower-layer title, wherein the second-level title contains a second paragraph corresponding to the second-level title; establishing a belonging vector according to the upper-layer title and the lower-layer title, and identifying the first paragraph and the second paragraph according to time to establish a time vector of the first-level title and the second-level title; establishing a first title network graph according to the first-level title, the second-level title, the upper-layer title, the belonging vector and the time vector; obtaining a plurality of second articles, and correspondingly establishing a plurality of second title network graphs according to the plurality of second articles, wherein the article names of the second articles and the first article are synonyms; and inputting the first title network graph and the plurality of second title network graphs into a graph neural network for deep learning to obtain a first process system and a sequence of steps of the first process system.
In a specific implementation, when the program is executed by a processor, any method step in the first embodiment may be further implemented.
One or more technical solutions in the embodiments of the present application have at least one or more of the following technical effects:
The method and the device for extracting the process system of an article based on a graph neural network provided by the embodiments of the present invention obtain first article format information of a first article; identify a title hierarchy of the first article according to the first article format information to obtain a first-level title, wherein the first-level title comprises a first paragraph corresponding to the first-level title; judge whether the first-level title is a behavior word describing a first process; when the first-level title is a behavior word describing the first process, determine an upper-layer title of the first-level title and a lower-layer title where the first-level title is located; obtain a second-level title describing the first process in the lower-layer title, wherein the second-level title contains a second paragraph corresponding to the second-level title; establish a belonging vector according to the upper-layer title and the lower-layer title, and identify the first paragraph and the second paragraph according to time to establish a time vector of the first-level title and the second-level title; establish a first title network graph according to the first-level title, the second-level title, the upper-layer title, the belonging vector and the time vector; obtain a plurality of second articles, and correspondingly establish a plurality of second title network graphs according to the plurality of second articles, wherein the article names of the second articles and the first article are synonyms; and input the first title network graph and the plurality of second title network graphs into a graph neural network for deep learning to obtain the first process system and the sequence of steps of the first process system. This solves the technical problems in the prior art that mathematical statistics can only be performed on steps that actually appear, that unknown steps cannot be inferred, and that consistency verification causes a loss of accuracy in the final result when step information of the same process conflicts across different articles. Continuous iterative learning based on the graph neural network thus provides a certain capacity for mining hidden steps, ensuring the technical effect of maximizing the accuracy of the result of the iterative learning of the graph neural network.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
It will be apparent to those skilled in the art that various changes and modifications may be made in the present invention without departing from the spirit and scope of the invention. Thus, if such modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include such modifications and variations.

Claims (9)

CN202010727219.7A | priority/filing date 2020-07-27 | Method and device for extracting process system of article based on graph neural network | Active | CN111598239B (en)

Priority Applications (1)

Application Number | Priority Date | Filing Date | Title
CN202010727219.7A | 2020-07-27 | 2020-07-27 | Method and device for extracting process system of article based on graph neural network


Publications (2)

Publication Number | Publication Date
CN111598239A (en) | 2020-08-28
CN111598239B (en, grant) | 2020-11-06

Family

ID=72183075





Legal Events

Code | Title
PB01 | Publication
SE01 | Entry into force of request for substantive examination
GR01 | Patent grant
TR01 | Transfer of patent right

Effective date of registration: 2022-05-13
Address after: Room 408, unit 2, building 15, courtyard 16, Yingcai North Third Street, future science city, Changping District, Beijing 102200
Patentee after: Wenling Technology (Beijing) Co.,Ltd.
Address before: Room 1502, Tongfu building, 501 Zhongshan South Road, Qinhuai District, Nanjing, Jiangsu 210006
Patentee before: Jiangsu United Industrial Limited by Share Ltd.
