Movatterモバイル変換


[0]ホーム

URL:


CN110232121B - A Semantic Web-based Control Order Classification Method - Google Patents

A Semantic Web-based Control Order Classification Method
Download PDF

Info

Publication number
CN110232121B
CN110232121BCN201910348919.2ACN201910348919ACN110232121BCN 110232121 BCN110232121 BCN 110232121BCN 201910348919 ACN201910348919 ACN 201910348919ACN 110232121 BCN110232121 BCN 110232121B
Authority
CN
China
Prior art keywords
control
neural network
control instruction
words
argument
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910348919.2A
Other languages
Chinese (zh)
Other versions
CN110232121A (en
Inventor
王煊
蒋伟煜
崔红宇
丁辉
陈平
严勇杰
王冠
徐秋程
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
CETC 28 Research Institute
Original Assignee
CETC 28 Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by CETC 28 Research InstitutefiledCriticalCETC 28 Research Institute
Priority to CN201910348919.2ApriorityCriticalpatent/CN110232121B/en
Publication of CN110232121ApublicationCriticalpatent/CN110232121A/en
Application grantedgrantedCritical
Publication of CN110232121BpublicationCriticalpatent/CN110232121B/en
Activelegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Images

Classifications

Landscapes

Abstract

The invention discloses a control instruction classification method based on a semantic network, aiming at: a computer-readable structured control instruction is constructed, and a basis is provided for the automatic processing of the control instruction; and forming a predicate logical structure to provide a basis for realizing a knowledge reasoning system. By processing the unstructured regulation instruction, the method can realize the following auxiliary functions: extracting information carried by basic control terms appearing in the control instruction for extraction; extracting and simulating the information of the aircraft such as the action and the state; computer-readable structured information is formed to provide data for knowledge reasoning. Aiming at the condition that a complex control instruction comprises a plurality of control intentions, the method judges whether different arguments in the control instruction are associated with different control intention verbs through a deep neural network, and brings the associated verbs and argument words into a semantic network for semantic role labeling.

Description

Semantic network-based control instruction classification method
Technical Field
The invention belongs to the technical field of air traffic control automation systems, and particularly relates to a control instruction classification method based on a semantic network.
Background
With the vigorous development of the civil aviation industry in the last 30 years, the requirements of air traffic management are continuously expanded, so that the potential safety hazard problem is increasingly prominent. Statistics show that human factors account for over 75% of the past flight safety incidents, and among them, incidents due to controller error account for 25%. The mainstream method for solving the conflict caused by the error of the controller at present is to strengthen the monitoring equipment of the scene, and prevent the conflict from occurring by monitoring equipment such as a radar, a multipoint positioning system sensor and the like by the scene. Meanwhile, more advanced solutions based on artificial intelligence are also proposed, such as recognizing the control voice by using a voice recognition technology and converting the control voice into a text format, and extracting key information of the control instruction by using a natural language processing technology. The semantic analysis of the regulation instruction needs to design a structural instruction template, the traditional method is mainly to design the structural template based on the ground-air conversation rule, and extract keywords such as basic regulation terms, subjects, predicates and the like in the instruction by a keyword extraction method, the method cannot describe the association among different component words, and complex regulation instructions containing a plurality of verbs with different regulation intentions cannot be understood more accurately, so that the semantic analysis and understanding process of the actual regulation instruction is limited.
Disclosure of Invention
The purpose of the invention is as follows: the method analyzes the structure of the actual control instruction from the perspective of linguistics, extracts the control instruction by combining the land-air conversation rule and the natural language processing technology to form a computer-understandable structural instruction, and focuses on carrying out fine semantic role marking on the multi-verb control instruction. The control instructions generated under the actual working environment can be mostly analyzed.
The technical scheme is as follows: the invention classifies and extracts a plurality of different regulatory intentions and corresponding argument words contained in the complex regulatory instruction by using the fully-connected neural network, and understands the different regulatory intentions and the corresponding argument words through the semantic network, thereby being helpful for a computer to understand the intention of the complex regulatory instruction. The invention can be used for auxiliary alarm for scene conflicts caused by error, forgetting and omission of controllers. Especially for understanding of complex policing instructions with multiple policing intent verbs, and combining and converting the policing intents into logical expression form triples that can be understood by the system, such as policing instructions: CES3984, down to 13000 meters for maintenance. The control instruction comprises two different control intentions, namely descending and keeping, other words and words have different relations with the two words, and the neural network is required to classify the words in the control instruction, which are respectively related to the different control intentions. When the invention is used for the application of auxiliary alarm for scene conflicts caused by error, forgetting and omission of controllers, the invention comprises the following steps:
step 1, carrying out voice recognition processing on the control voice to obtain a control instruction in a text format;
step 2, performing part-of-speech analysis on words contained in the control instruction in the text format;
step 3, extracting a control intention based on the part of speech analysis result, and simultaneously extracting other words in the control instruction to form a candidate argument set;
step 4, carrying out BIO labeling processing on the candidate argument set (BIO labeling: labeling each element as 'B-X', 'I-X' or 'O', wherein 'B-X' indicates that the segment where the element is located belongs to the X type and the element is at the beginning of the segment, 'I-X' indicates that the segment where the element is located belongs to the X type and the element is at the middle position of the segment, 'O' indicates that the element does not belong to any type), adding E label to indicate that the argument has semantic relation with a plurality of verbs in the sentence at the same time, training parameters of a fully-connected neural network by using labeled data, and obtaining a control intention control verb-element group through the neural network;
step 5, judging the number of verb regulatory intentions in the regulatory instruction, and entering step 6 if the number is more than 1;
step 6, generating an input word vector;
step 7, classifying the input word vectors by using a neural network to obtain a probability output value z;
step 8, judging the relation: if the probability output value z has higher probability in a certain category, judging that the argument has a relationship with the corresponding control intention, executing the step 10, otherwise, not processing;
step 9, because the semantic relation between the control intention verb and different argument words is univocal mapping in most cases in the control instruction, a semantic network is constructed according to experience, and the semantic network comprises the control intention verb, the argument words and semantic association triple knowledge;
step 10, substituting the verb of the control intention and the argument word into a semantic network to obtain a specific semantic relation between the verb and a corresponding argument and form a structured control instruction;
and step 11, using the structured control instruction to directly detect whether scene conflict is caused by wrong, forgotten and missed issuing of wrong control instruction by a controller. For example: after the structured control instruction is obtained, the running track of the aircraft can be obtained by utilizing the information related to the words, whether scene conflict can occur or not can be judged by comparing whether the tracks obtained from the two control instructions are intersected, and if the control instruction comprises a plurality of control intents, the running track of the aircraft can be more accurately described by combining the control intents.
In the invention, the step 2 comprises the following steps:
and 2-1, performing Chinese word segmentation operation on the control instruction by utilizing the jieba word segmentation to obtain a word sequence.
Step 2-2, part of speech tagging: and labeling each word in the word sequence according to the corresponding part of speech to obtain a part of speech analysis result. The main parts of speech include: basic control terms (sp), verbs (v/vi), other parts of speech and the like, wherein in the step, special parts of speech are set due to the particularity of the control instructions, so that different words in the computer mechanism control instructions are helped to be solved;
the step 3 comprises the following steps: based on the result of part-of-speech analysis, removing words which are basic control terms in the control instruction, wherein one control instruction after the basic control terms are removed comprises verbs and part-of-speech words of other components, judging according to the result of part-of-speech tagging, extracting the verbs to form a verb set, and simultaneously extracting the other words to form a candidate argument set, wherein elements contained in the verb set and the candidate argument set are used for judging whether correlation exists or not, namely verb-argument correlation pairs;
judging which words in the instruction are basic control terms through the part of speech, if the words with the part of speech of "sp" are the basic control terms, extracting the basic control terms, and some basic control terms need to form a phrase structure with surrounding words, such as: the ground wind is 300 meters per second, special part-of-speech tagging is carried out on the control terms, a phrase searching algorithm is designed to obtain a control term phrase, the behavior sp of the ground wind is set by taking the control terms as an example, meanwhile, a backward searching rule is designed, and finally the ground wind is 300 meters per second and extracted as a phrase;
and judging according to the part-of-speech tagging result, extracting verbs to form a verb set, and extracting words of other components to form a candidate argument set. Removing light verbs from the extracted verbs, forming linkage structural words and the like, and storing results into a verb set;
step 4 comprises the following steps:
step 4-1, converting into word vectors: the word is preprocessed by using a word2vec method, words in a word vector expression control instruction are generated, word vector representation of an input sentence is generated and is used as input of a neural network, and due to the defects that high-dimensional and sparse one-hot representation (one-hot representation) word vectors cannot well represent word similarity and dimension disasters are easily caused, low-dimensional and dense distributed expression (distributed representation) word vectors are used for representing words. A commonly used distributed word vector is a 100-dimensional array of numbers.
Step 4-2, the invention mainly uses the fully-connected neural network, trains the model parameter of the fully-connected neural network: collecting control instructions in an actual working scene and forming a corpus training set, manually labeling the corpus training set of the control instructions, and training weight parameters of neurons in all layers of the fully-connected neural network by using the corpus training set to obtain a trained neural network; when marking data, marking different categories of the governing intention verbs, such as B0, B1, … and BM, wherein M is the category of the governing intention, such as: CES3421, down to 5000 meters retention. Two regulatory intention verbs "drop" and "hold" are included, the verb "drop" is labeled B0, the verb "hold" is labeled B1, and the argument words associated with the verb are labeled the same numeric category, such as "to" is labeled B0;
and 4-3, predicting by using a neural network: judging whether the target argument in the candidate argument set and the verb set is associated with the verb through the trained neural network to obtain a verb-argument, for example: at regulatory command "CES 3984, runway 35L, take off immediately. "in, the governing-intention verb is takeoff, and the arguments include" CES3984, runway 35L ", and the verb-argument group here is respectively: takeoff-CES 3984 and takeoff-runway 35L.
In step 5, the number of verbs in the control command is determined, and the verb in the step does not include the light verb, and the interlocked structural word is regarded as one verb. If the number of verbs is 1, the detailed subsequent processing refers to a patent 'a structured regulation instruction extraction method based on natural language processing', the invention aims at the condition of a plurality of regulation intention verbs, and if the number of verbs is more than 1, the step 6 is entered;
the step 6 comprises the following steps: converting the input sentence into word vector input;
in step 7, the trained neural network is used to classify the input matrix. The neural network can capture the internal features of the input word vector well. Because the judgment of whether the target verb and the target argument have a relationship can be regarded as a two-classification problem, the output end of the neural network contains probability output values corresponding to different classes of regulatory intents: if the probability is higher, the target argument is in relation with the selected target verb, otherwise, the target argument is in relation with the selected target verb;
the neural network comprises an input layer, a hidden layer and an output layer, and is defined to have 100 × N input x, wherein N represents the length of a sentence, and a maximum length N is generally defined to be 30, that is, the maximum length contains 30 words; if there are n neurons in the input layer, defining the weight matrix as W(2w+2)×nBias matrix is b1×nThe output value of the input layer is alpha1×nThe output value of the activation function is h1×nDefining a ReLU activation function fRe Lu(t) is:
Figure BDA0002043246640000041
where t is the input value, the output value of the neuron of the hidden layer is obtained by the following formula:
αh=hhWh+bh
hh=fRe Luh),
wherein the weight matrix of the hidden layer is WhBias matrix is bhOutput value of alphahThe output value of the activation function is hhThe output value z of the neural network is obtained by:
z=hhWo+bo
wherein the weight matrix of the output layer is WoBias matrix is boThe output value is z.
Step 9 comprises: the construction idea of The Semantic net of The control instructions is derived from frame semantics (cited documents: Grigoris.A, Paul.G, Frank.v.H, Rinke.H.A Semantic Web Primer (Third Edition) [ M ]. Cambridge: The MIT Press, 2012: 1-288.), and The basic theory is The lattice syntax theory of The American linguistics' Fillmore. The definition of the verb grid in the semantic network can be used for more finely describing the relationship between the argument and the verb in the regulation instruction, and generally comprises the following steps: a stop grid, a source grid, a place grid, an end grid, etc. Because the semantic relation between the control intention verb and different argument words is unilaterally mapped in most cases in the control instruction, a semantic network is constructed according to experience, and the semantic network comprises the control intention verb, the argument words and semantic association triple knowledge;
the invention also comprises a step 12 of constructing a structured template:
the structured template herein comprises two parts: 1. a basic governing term phrase; 2. verb-argument-lattice triplets. Because the control instruction comprises a plurality of verbs, the second part is composed of verb-argument relation triples according to the appearance sequence of the verbs. And establishing a structured template with two parts of basic control terms and verb-argument-lattice triples, and filling deconstructed control instruction information into the template to form a structured instruction.
The method can be applied to semantic understanding of the control command in the air traffic control system. Since a plurality of verbs may be included in the regulation instruction issued by the regulator in actual work, judging arguments associated with different verbs and finding out the relationship between them are important for accurately understanding the regulation instruction. The method can better analyze the control command and form a structured command, and can effectively process the work from the speech recognition of the control command to the movement trend prediction based on the control command content.
Has the advantages that: the invention has the following technical effects:
1. and (4) enabling the computer to autonomously understand the semantics of the control instruction and judge the motion process of the aircraft.
2. Semantic role labeling is performed on a plurality of verbs appearing in the management instruction.
3. Basic regulation term information appearing in the regulation instruction is extracted.
4. And converting the unstructured regulatory instruction into a structured regulatory instruction.
Drawings
The foregoing and other advantages of the invention will become more apparent from the following detailed description of the invention when taken in conjunction with the accompanying drawings.
FIG. 1 is a neural network-based regulatory instruction multi-verb role classification and structured instruction extraction process.
FIG. 2 is a main flowchart of a method for extracting a structured policing instruction.
Fig. 3 is a structure of a neural network.
FIG. 4 is a generated structured policing instruction template.
Detailed Description
The invention is further explained below with reference to the drawings and the embodiments.
The present invention is described in further detail below with reference to the use cases of the policing instructions and the associated figures. First, an example of a policing instruction is given: CES1234, eastern tower, slides along taxiways d5-p4-a5 to the 18 left runway waiting point, waiting outside the 18 left runway.
For ease of illustration and description, the steps of implementation herein are divided according to the main flow diagrams shown in fig. 1 and 2.
Step 1: part of speech analysis
The method comprises two steps of treatment processes: chinese word segmentation and part-of-speech tagging. And carrying out Chinese word segmentation and part-of-speech tagging on the control instruction by using a jieba word segmentation in Python software to obtain a corresponding result. In the word segmentation process, basic control terms are added in the word segmentation dictionary, so that the precision of the word segmentation result can be improved.
Step 2: extraction of words from each component
In the last step, words of different composition are labeled as different parts of speech, wherein the part of speech of "east tower" is labeled as "sp", "slide to" and "wait" are labeled as "v". According to the part of speech, the oriental tower is a basic control term and is extracted. The original regulatory terms become: CES1234, sliding along taxiway d5-p4-a5 to the 18 left runway waiting point, waiting outside the 18 left runway, closer to natural language. Get verbs "slide to" and "wait" at the same time. Other component words in the sentence are extracted as candidate arguments.
And step 3: relationship determination
The step is taken as the core step of the invention, and is mainly introduced in three parts:
and 3-1, converting the word vector into a word vector to obtain an input vector. According to the principle of word2vec, words can train out corresponding low-dimensional and dense distributed vectors through a neural network, and the relation among similar words can be obtained through the distribution of word vectors. Word vectors for each word can be calculated from regulatory instruction text data using the gensim framework package in Python software. It should be noted that the amount of data in the text data set of the regulatory directive is large enough to contain all of the regulatory directive vocabulary (not including the basic regulatory terms).
And 3-2, training model parameters of the neural network. The corpus training set of the control instruction is labeled, and the input vector not only considers the relation between the target argument and the target verb in the sentence, but also considers the function of adjacent words of the target argument, namely the function of context information. If the target argument is associated with the target verb when in a context, the tag is defined as the same tag as the target argument. It should also be noted that the number of positive and negative samples should be equal when preparing the training set. The fully-connected neural network was written using Python, the structure is shown in fig. 3, and multiple rounds of training were performed on the parameters using the training data. The larger the amount of training data, the better the model.
And 3-3, predicting by using a neural network. In this step, two models can be used: one is a word vector model which is responsible for converting words into word vectors, and the other is a fully-connected neural network classification model which is responsible for judging whether the target argument is associated with the verb. Through this step, verb-theoretic tuples can be obtained, but at this time the relationship between the two is not known.
And 4, step 4: semantic Web construction
The step completes the relation between the associated verb and the argument by constructing a semantic network. The semantic web is constructed in the form of an ontology and can express the relationship between the verb and other entities in the control field. The lattice types of verbs in the present invention include: the site lattice, the object lattice, the place lattice, the direction lattice, the source point lattice and the end point lattice. Since the field of empty management belongs to a special field, the format type carried by the verb appearing in the regulatory instruction is not the same as that in the normal natural language environment, such as: and (4) sliding. Two example sentences are given below:
1. CES1234, sliding to the end of the runway.
2. He steps on the skateboard to glide.
In the first sentence, the regulation instruction shows that the 'slide' in the regulation instruction has no tool lattice, and in the second sentence, the 'slide' is the tool lattice of the verb 'slide'.
Because the definition between verbs and arguments in the semantic web is fixed, and the numbers such as flight number, height, speed and the like in the control instruction do not have fixed values, another advantage of using word vectors to express words is that the similarity of similar words can be expressed, and the problem can be solved by setting templates and using the similarity of word vectors.
And 5: structured instructions
Through the processing of the previous steps, the deconstructed policing instruction is obtained, wherein the verb is judged to have two, so the structured policing instruction in the example is divided into three parts:
the basic regulatory terms: an east tower.
Verb 1: slide to CES1234, schlieren, slide to d5-p4-a5, slide to wait point to end point grid.
Verb 2: wait-CES 1234-shit grid, wait-off-runway-azimuth grid.
The resulting structured template results are shown in fig. 4.
From the results, flight CES1234 performs a taxi action first, and the taxi action is performed on taxiway d5-p4-a5, and its taxi end point is a runway waiting point, and performs a second action: and waiting for action. And obtaining a sliding path and an end point of flight CES1234 according to the analysis.
Examples
For convenience of illustration and description, the steps implemented herein are divided according to the main flow chart shown in fig. 2, and are explained in conjunction with the actual policing instructions. First, an example of a policing instruction is given:
1. CQH1207, east tower, sliding along D5-P4, waiting outside runway 35L.
2. CES3984, east tower, runway 35L, may take off.
Step 1, carrying out voice recognition processing on the control voice to obtain a control instruction in a text format; the policing voice is processed using a speech recognition device resulting in unstructured policing instruction text, as shown above.
Step 2, performing part-of-speech analysis on words contained in the control instruction in the text format, for example:
1. CQH/eng,1207/m, east tower/sp, edge/P, D5-P4/m, glide/v, runway/n, 35L/m, outer/f, wait/v.
2. CES/eng,3984/m, east tower/sp, runway/n, 35L/m, and take-off/v.
The part-of-speech tagging is carried out on the words in the control instruction, so that the part-of-speech of different words can be obtained, the control-purpose words and other words can be distinguished, meanwhile, the different words are combined into word groups in a mode of using a well-defined control-instruction special word group dictionary, and if: CQH1207, CES3984, runway 35L.
And 3, extracting a control intention based on the part-of-speech analysis result, and simultaneously extracting other words in the control instruction to form a candidate argument set, wherein the word of the verb part-of-speech is usually the control intention, for example: taxiing, waiting and taking off. Other words, such as: CQH1207, CES3984, east tower, runway 35L, edge, D5-P4, outer, etc. are all argument words.
Step 4, performing BIO labeling processing on the candidate argument set, and training parameters of the fully-connected neural network by using labeled data, wherein the goal is to obtain a control intention verb-argument group through the neural network, for example: the control instruction 1 includes two verbs, which are denoted as X ═ 0 and X ═ 1, the CQH1207 is associated with both verbs, and the control instruction E is denoted, the east tower is not associated with both verbs, and the control instruction O is denoted, and the control instruction B-0 is denoted, along with the association with only the glide.
Step 5, judging the number of verb regulatory intentions in the regulatory instruction, and entering step 6 if the number is more than 1;
and 6, generating an input word vector, obtaining the number sequence word vectors of different words by using word2vec, and forming the word vectors of sentences by using the word vectors of the words.
Step 7, classifying the input word vectors by using a neural network to obtain a probability output value z;
step 8, judging the relation: if the probability output value z has a higher probability in a category, it is determined that the argument has a relationship with the corresponding regulatory intention, step 10 is executed, otherwise, no processing is performed, for example: if the edge is associated with glide only and not with wait in step 4, then its probability at marker B-0 will be much greater than at marker B-1.
Step 9, constructing a semantic network, wherein the semantic network comprises regulation intention verbs, argument words and semantic association triple knowledge, such as: the semantic relationship between the flight and the verb is the operator of the action, the relationship between the runway 35L and the takeoff capability is the starting position of the action, and the relationship between D5-P4 and the taxi is the path of the action implementation.
Step 10, substituting the verb of the control intention and the argument word into a semantic network to obtain a specific semantic relation between the verb and a corresponding argument and form a structured control instruction;
step 11, the structured regulation instruction is used to directly detect whether a scene conflict is caused by the wrong, forgotten, or missed sending of the wrong regulation instruction by the controller, for example: when the controller sends the above two instructions in sequence, the following structured instruction can be obtained from instruction 1:
CQH1207, glide: the implementer;
CQH1207, wait for: the implementer;
D5-P4, slide: an action path;
outside runway 35L, wait: an action location;
then the end of the taxi is outside the runway 35L, as can be seen from the meaning of waiting.
From instruction 2, the following structured instructions can be derived:
CES3984, can take off: the implementer;
runway 35L, which can take off: an action source point;
therefore, the paths represented by the two instructions can be obtained, and the two control instructions are judged not to conflict.
The method and the device can be used for analyzing the intention of the empty control command, and the control command containing a plurality of different verbs can be processed by using the method and the device to obtain the structured commands with the same number as the verbs. The invention can be used for auxiliary alarm for scene conflicts caused by error, forgetting and omission of controllers. The semantic lattice analyzes the action intention of the control command and the action route of the aircraft, so that whether the conflict between the aircraft can be caused by different control commands can be judged.
The present invention provides a classification method for regulatory instructions based on semantic network, and a plurality of methods and approaches for implementing the technical scheme, where the foregoing is only a preferred embodiment of the present invention, it should be noted that, for those skilled in the art, a plurality of improvements and modifications may be made without departing from the principle of the present invention, and these improvements and modifications should also be regarded as the protection scope of the present invention. All the components not specified in the present embodiment can be realized by the prior art.

Claims (1)

Translated fromChinese
1.一种基于语义网的管制指令分类方法,其特征在于,包括如下步骤:1. a kind of control instruction classification method based on semantic web, is characterized in that, comprises the steps:步骤1,对管制语音进行语音识别处理,得到文本格式的管制指令;Step 1, performing speech recognition processing on the control voice to obtain control instructions in text format;步骤2,对文本格式的管制指令中包含的词语进行词性分析;Step 2: Perform part-of-speech analysis on the words contained in the control instruction in text format;步骤3,基于词性分析结果提取出管制意图,同时对管制指令中的其他词语进行提取,形成候选论元集合;Step 3, extracting the control intention based on the part-of-speech analysis result, and at the same time extracting other words in the control instruction to form a candidate argument set;步骤4,对候选论元集合进行BIO标注处理,并用标注的数据训练全连接神经网络的参数,目标为通过神经网络得到管制意图动词-论元组;Step 4: Perform BIO annotation processing on the candidate argument set, and train the parameters of the fully connected neural network with the marked data, the goal is to obtain the control intent verb-argument group through the neural network;步骤5,对管制指令中的管制意图动词数量进行判断,如果数量大于1个,则进入步骤6;Step 5, judge the number of control intent verbs in the control instruction, if the number is greater than 1, go to step 6;步骤6,生成输入词向量;Step 6, generate the input word vector;步骤7,使用神经网络对输入的词向量进行分类,得到概率输出值z;Step 7, use the neural network to classify the input word vector, and obtain the probability output value z;步骤8,判断关系:如果概率输出值z在一个类别处出现较高的概率,则判定该论元与这个对应的管制意图存在关系,执行步骤10,否则不做处理;Step 8, determine the relationship: if the probability output value z has a high probability in a category, then determine that the argument has a relationship with the corresponding control intention, and execute step 10, otherwise no processing;步骤9,构建语义网,语义网包含了管制意图动词、论元词语、语义关联三元组知识;Step 9, build a semantic web, which includes the triple knowledge of control intent verbs, argument words, and semantic associations;步骤10,将管制意图动词与论元词语代入语义网,获取动词与相应论元间的具体语义关系,形成结构化管制指令;Step 10: Substitute control intent verbs and argument words into the Semantic Web to obtain the specific semantic relationship between verbs and corresponding arguments to form a structured control instruction;步骤11,将结构化管制指令用于直接检测是否因管制员错、忘、漏发出错误管制指令而造成了场面冲突;Step 11: Use the structured control instruction to directly detect whether a scene conflict is caused by the controller's mistake, forgetting, or omission to issue an erroneous control instruction;步骤2包括:Step 2 includes:步骤2-1,利用jieba分词对管制指令进行中文分词操作,得到词序列;Step 2-1, using jieba word segmentation to perform Chinese word segmentation operation on the control instruction to obtain a word sequence;步骤2-2,词性标注:对词序列中的每个词按对应的词性进行标注,得到词性分析结果;Step 2-2, part-of-speech tagging: tagging each word in the word sequence according to the corresponding part-of-speech to obtain the part-of-speech analysis result;步骤3包括:基于词性分析结果,去除管制指令中为基本管制术语的词语,去除基本管制术语后的一条管制指令包含动词和其他成份的词性词,通过词性标注结果进行判断,将动词提取出来形成管制意图动词集合,同时将其他成份的词提取出来,形成候选论元集合;Step 3 includes: based on the part-of-speech analysis results, remove the words that are basic regulatory terms in the control instruction, and after removing the basic regulatory terms, a control instruction that contains the part-of-speech words of verbs and other components, judges based on the part-of-speech tagging results, and extracts the verbs to form. Control the set of intent verbs, and at the same time extract the words of other components to form a set of candidate arguments;步骤4中,所述BIO标注是指:将每个元素标注为B-X、I-X或者O,其中,B-X表示此元素所在的片段属于X类型并且此元素在此片段的开头,I-X表示此元素所在的片段属于X类型并且此元素在此片段的中间位置,O表示不属于任何类型;In step 4, the BIO labeling refers to: labeling each element as B-X, I-X or O, where B-X indicates that the segment where this element is located belongs to type X and this element is at the beginning of this segment, and I-X indicates that this element is located. Fragment is of type X and this element is in the middle of this fragment, O means not of any type;步骤4包括:Step 4 includes:步骤4-1,转成词向量:使用word2vec方法对词语进行预处理,并生成输入句子的词向量表示,作为神经网络的输入;Step 4-1, convert to word vector: use the word2vec method to preprocess the words, and generate the word vector representation of the input sentence as the input of the neural network;步骤4-2,使用全连接神经网络,训练全连接神经网络的模型参数:收集实际工作场景中的管制指令并形成语料训练集,对管制指令的语料训练集进标注,用语料训练集对全连接神经网络的各层神经元的权重参数进行训练,得到训练好的神经网络;Step 4-2, use the fully connected neural network to train the model parameters of the fully connected neural network: collect the control instructions in the actual work scene and form a corpus training set, annotate the corpus training set of the control instructions, use the corpus training set to Connect the weight parameters of the neurons in each layer of the neural network for training to obtain a trained neural network;步骤4-3,使用神经网络进行预测:通过训练好的神经网络判断候选论元集和动词集合中的目标论元与动词是否存在关联,得到动词-论元组;Step 4-3, use the neural network to predict: determine whether the target argument in the candidate argument set and the verb set is related to the verb through the trained neural network, and obtain the verb-argument group;步骤6包括:将步骤4-1中得到的句子词向量作为神经网络模型的输入Step 6 includes: using the sentence word vector obtained in step 4-1 as the input of the neural network model步骤7中,神经网络包含输入层、隐藏层和输出层,定义神经网络有100×n个输入x,若输入层有n个神经元,则定义权重矩阵为W(2w+2)×n,偏置矩阵为b1×n,输入层的输出值为α1×n,激活函数的输出值为h1×n,定义ReLU激活函数fReLU(t)为:In step 7, the neural network includes an input layer, a hidden layer and an output layer. It is defined that the neural network has 100×n inputs x. If the input layer has n neurons, the weight matrix is defined as W(2w+2)×n , The bias matrix is b1×n , the output value of the input layer is α1×n , the output value of the activation function is h1×n , and the ReLU activation function fReLU (t) is defined as:
Figure FDA0002955805710000021
Figure FDA0002955805710000021
其中t是输入数值,隐藏层的神经元的输出值由下式得到:where t is the input value, and the output value of the neurons in the hidden layer is given by:αh=hhWh+bhαh =hh Wh +bh ,hh=fReLUh),hh =fReLUh ),其中隐藏层的权重矩阵为Wh,偏置矩阵为bh,输出值为αh,激活函数的输出值为hh,通过下式得到神经网络的输出值z:The weight matrix of the hidden layer is Wh , the bias matrix is bh , the output value is αh , the output value of the activation function is hh , and the output value z of the neural network is obtained by the following formula:z=hhWo+boz=hh Wo +bo ,其中输出层的权重矩阵为Wo,偏置矩阵为boThe weight matrix of the output layer is Wo and the bias matrix isbo .
CN201910348919.2A2019-04-282019-04-28 A Semantic Web-based Control Order Classification MethodActiveCN110232121B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN201910348919.2ACN110232121B (en)2019-04-282019-04-28 A Semantic Web-based Control Order Classification Method

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN201910348919.2ACN110232121B (en)2019-04-282019-04-28 A Semantic Web-based Control Order Classification Method

Publications (2)

Publication NumberPublication Date
CN110232121A CN110232121A (en)2019-09-13
CN110232121Btrue CN110232121B (en)2021-04-06

Family

ID=67860304

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN201910348919.2AActiveCN110232121B (en)2019-04-282019-04-28 A Semantic Web-based Control Order Classification Method

Country Status (1)

CountryLink
CN (1)CN110232121B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN110837731A (en)*2019-10-122020-02-25创新工场(广州)人工智能研究有限公司Word vector training method and device
CN111179926A (en)*2019-12-102020-05-19深圳微品致远信息科技有限公司Method and device for generating aircraft control command and computer equipment
CN113569545B (en)*2021-09-262021-12-07中国电子科技集团公司第二十八研究所Control information extraction method based on voice recognition error correction model
CN115470796B (en)*2022-09-142025-08-05四川大学 A method and device for generating text instructions for air traffic control simulation training

Citations (3)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US5987404A (en)*1996-01-291999-11-16International Business Machines CorporationStatistical natural language understanding using hidden clumpings
CN107656990A (en)*2017-09-142018-02-02中山大学A kind of file classification method based on two aspect characteristic informations of word and word
CN109460547A (en)*2018-09-192019-03-12中国电子科技集团公司第二十八研究所A kind of structuring control order extracting method based on natural language processing

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US5987404A (en)*1996-01-291999-11-16International Business Machines CorporationStatistical natural language understanding using hidden clumpings
CN107656990A (en)*2017-09-142018-02-02中山大学A kind of file classification method based on two aspect characteristic informations of word and word
CN109460547A (en)*2018-09-192019-03-12中国电子科技集团公司第二十八研究所A kind of structuring control order extracting method based on natural language processing

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
卢薇冰.基于CNN的陆空通话语义识别方法.《中国优秀硕士学位论文全文数据库》.2017,(第1期),第C031-283页.*
王煊.用于管制语音理解的语义分析方法.《指挥信息系统与技术》.2019,第10卷(第1期),第33-36页.*

Also Published As

Publication numberPublication date
CN110232121A (en)2019-09-13

Similar Documents

PublicationPublication DateTitle
CN110232121B (en) A Semantic Web-based Control Order Classification Method
CN109871955B (en)Aviation safety accident causal relation extraction method
CN112487206B (en) An Entity Relationship Extraction Method for Automatically Constructing Datasets
CN109684626A (en)Method for recognizing semantics, model, storage medium and device
CN107818153A (en)Data classification method and device
CN113157916A (en)Civil aviation emergency extraction method based on deep learning
CN111475650B (en)Russian semantic role labeling method, system, device and storage medium
CN107273358A (en)A kind of end-to-end English structure of an article automatic analysis method based on pipe modes
Rustamov et al.Sentence-level subjectivity detection using neuro-fuzzy models
Jing et al.BERT for aviation text classification
CN106227718A (en)Land based on CNN sky call semantic consistency method of calibration
CN114372145A (en) A scheduling method for dynamic allocation of operation and maintenance resources based on knowledge graph platform
Luo et al.Using lda2vec topic modeling to identify latent topics in aviation safety reports
CN110321434A (en)A kind of file classification method based on word sense disambiguation convolutional neural networks
CN113948091A (en)Air-ground communication voice recognition engine for civil aviation passenger plane and application method thereof
CN112200268A (en) An image description method based on encoder-decoder framework
CN108536781B (en)Social network emotion focus mining method and system
Zhao et al.A Multi-channel Character Relationship Classification Model Based on Attention Mechanism
CN110069771B (en) A method for processing control instruction information based on semantic chunking
CN113158658A (en)Knowledge embedding-based structured control instruction extraction method
CN109460547B (en)Structured control instruction extraction method based on natural language processing
CN111241233A (en) A service robot instruction parsing method based on full-density transfer of key verb features
CN106326495A (en) A Chinese Text Automatic Classification Method Based on Topic Model
CN116822956A (en) A safety risk perception method for civil aviation control operations based on topic model
CN109344251A (en) A Method for Extracting Specific Text Information Based on Layer Classifier and Template Matching

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
GR01Patent grant
GR01Patent grant

[8]ページ先頭

©2009-2025 Movatter.jp