Movatterモバイル変換


[0]ホーム

URL:


US11720629B2 - Knowledge graph construction method and device - Google Patents

Knowledge graph construction method and device
Download PDF

Info

Publication number
US11720629B2
US11720629B2US16/034,799US201816034799AUS11720629B2US 11720629 B2US11720629 B2US 11720629B2US 201816034799 AUS201816034799 AUS 201816034799AUS 11720629 B2US11720629 B2US 11720629B2
Authority
US
United States
Prior art keywords
entity
attribute
knowledge graph
attribute value
measurement
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active, expires
Application number
US16/034,799
Other versions
US20190019088A1 (en
Inventor
Yue Wang
Wei You
Weixing ZHANG
Nengwei HUA
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding LtdfiledCriticalAlibaba Group Holding Ltd
Assigned to GUANGDONG SHENMA SEARCH TECHNOLOGY CO., LTD.reassignmentGUANGDONG SHENMA SEARCH TECHNOLOGY CO., LTD.ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS).Assignors: HUA, Nengwei, WANG, YUE, YOU, Wei, ZHANG, WEIXING
Publication of US20190019088A1publicationCriticalpatent/US20190019088A1/en
Assigned to ALIBABA GROUP HOLDING LIMITEDreassignmentALIBABA GROUP HOLDING LIMITEDASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS).Assignors: GUANGDONG SHENMA SEARCH TECHNOLOGY CO., LTD.
Application grantedgrantedCritical
Publication of US11720629B2publicationCriticalpatent/US11720629B2/en
Activelegal-statusCriticalCurrent
Adjusted expirationlegal-statusCritical

Links

Images

Classifications

Definitions

Landscapes

Abstract

The present invention provides a knowledge graph construction method and device. The method includes: obtaining structured data, where the structured data includes a first entity name of a first entity and attribute information corresponding to the first entity name, and the attribute information includes a first attribute and a first attribute value; performing, based on measurement of a similarity between the first entity and a second entity in a knowledge graph, entity alignment processing on the first entity, where the measurement of the similarity includes at least one of the following types: measurement of a character similarity, measurement of a structure similarity of a classification tree on which an entity is located, and measurement of an attribute similarity; and importing the structured data into the knowledge graph according to an entity alignment processing result. Embodiments may ensure correctness of data in the knowledge graph.

Description

CROSS REFERENCE TO RELATED APPLICATION
The present application claims priority to and benefits of Chinese Application No. 201710576164.2, filed on Jul. 14, 2017, the entire content of which is incorporated herein by reference in its entirety.
TECHNICAL FIELD
The present invention relates to the field of information search technologies, and in particular, to a knowledge graph construction method and device.
BACKGROUND
With rapid development of information technologies, the current society has entered an information explosion era, and people seek, increasingly with the help of a network, information that they need. Currently, to provide a more precise search service to a user, a knowledge graph is increasingly applied to the field of search.
Knowledge graph has been developed, which aim to describe various entities existing in a real world and a relationship among them. After a knowledge graph is established, and a user enters a keyword, associated information such as an attribute of this keyword, and a relationship between this keyword and another entity may be returned to the user. In this way, needed information may be more accurately and thoroughly provided to the user. Therefore, in a knowledge graph construction process, how to ensure accuracy of data in the knowledge graph becomes especially important.
SUMMARY
Embodiments of the present invention provide a knowledge graph construction method and device, so as to improve accuracy of data in a knowledge graph.
According to a first aspect, the present invention provides a knowledge graph construction method, including obtaining structured data, where the structured data includes a first entity name of a first entity and attribute information corresponding to the first entity name, and the attribute information includes a first attribute and a first attribute value, performing, based on measurement of a similarity between the first entity and a second entity in a knowledge graph, entity alignment processing on the first entity, where the measurement of the similarity includes at least one of the following types: measurement of a character similarity, measurement of a structure similarity of a classification tree on which an entity is located, and measurement of an attribute similarity, and importing the structured data into the knowledge graph according to an entity alignment processing result.
Optionally, the performing, based on measurement of a similarity between the first entity and a second entity in a knowledge graph, entity alignment processing on the first entity includes: determining, according to a type of a data source of the structured data, a measurement type for similarity measurement between the first entity and the second entity in the knowledge graph, and performing entity alignment processing on the first entity according to the determined measurement type.
Optionally, the type of the data source is a data source based on a classification tree, and the measurement type includes measurement of a structure similarity of a classification tree on which an entity is located. The performing entity alignment processing on the first entity according to the determined measurement type includes determining, in the knowledge graph, that the data source is a second entity of a classification tree, and determining whether a child node and a parent node of the first entity are the same as a child node and a parent node of the second entity; and if yes, determining that the entities are aligned, and if not, determining that the entities are not aligned.
Optionally, the type of the data source is a data source not based on a classification tree, and the measurement type includes measurement of a character similarity. The performing entity alignment processing on the first entity according to the determined measurement type includes: determining whether a character similarity between the first entity name and the second entity name in the knowledge graph is greater than a preset threshold; and if yes, determining that the entities are aligned, and if not, determining that the entities are not aligned.
Optionally, the type of the data source is a data source not based on a classification tree, the measurement type includes measurement of an attribute similarity, and the first attribute includes a key attribute and a non-key attribute. The performing entity alignment processing on the first entity according to the determined measurement type includes: determining whether a second attribute the same as the key attribute exists in the knowledge graph, and if yes, determining whether attribute values corresponding to the key attribute and the second attribute are the same; and if yes, determining that the entities are aligned, and if not, determining that the entities are not aligned.
Optionally, the importing the structured data into the knowledge graph according to an entity alignment processing result includes: determining, when the entity alignment processing result is that the first entity is aligned with the second entity, and attribute alignment processing is performed on the first attribute of the first entity and a second attribute of the second entity, whether a second attribute the same as the first attribute exists in the knowledge graph; if not, mapping the first attribute and the first attribute value to the second entity; or if yes, when the first attribute is a single-value attribute, determining whether the first attribute value corresponding to the first attribute conflicts with a second attribute value corresponding to the second attribute, and if not, performing deduplication processing, or if yes, when a reliability degree of the first attribute value is higher than a reliability degree of the second attribute value, mapping the first attribute value to the second entity, and deleting the second attribute value; or when the first attribute is a multi-value attribute, and a plurality of first attribute values do not conflict with a second attribute value, determining, in the plurality of first attribute values, an attribute value different from the second attribute value, and mapping a plurality of different first attribute values to the second entity; and when the entity alignment processing result is that the first entity is unaligned with the second entity, newly constructing a graph item in the knowledge graph, where the newly constructed graph item includes the first entity name and the first attribute value corresponding to the first attribute.
Optionally, before the determining, according to a type of a data source of the structured data, a measurement type for performing similarity processing between the first entity and the second entity in the knowledge graph, the method further includes: obtaining a description type of each piece of attribute information, and performing cleansing and normalization processing on each piece of attribute information according to a standard description statement corresponding to the description type, so that attribute information being semantically the same has the same description.
Optionally, the method further includes: in the knowledge graph, for a second attribute used to represent a relationship between entities, determining an implied relationship between entities by using a preset chain rule, and mapping the implied relationship to the knowledge graph.
According to a second aspect, the present disclosure provides a knowledge graph construction device, including: a data obtaining module, configured to obtain structured data, where the structured data includes a first entity name of a first entity and attribute information corresponding to the first entity name, and the attribute information includes a first attribute and a first attribute value, an alignment processing module, configured to perform, based on measurement of a similarity between the first entity and a second entity in a knowledge graph, entity alignment processing on the first entity, where the measurement of the similarity includes at least one of the following types: measurement of a character similarity, measurement of a structure similarity of a classification tree on which an entity is located, and measurement of an attribute similarity, and a data importing module, configured to import the structured data into the knowledge graph according to an entity alignment processing result.
Optionally, the alignment processing module is specifically configured to determine, according to a type of a data source of the structured data, a measurement type for performing similarity processing between the first entity and the second entity in the knowledge graph, and perform entity alignment processing on the first entity according to the determined measurement type.
Optionally, the type of the data source is a data source based on a classification tree, and the measurement type includes measurement of a structure similarity of a classification tree on which an entity is located. The alignment processing module is specifically configured to determine, in the knowledge graph, that the data source is a second entity of a classification tree, and determine whether a child node and a parent node of the first entity are the same as a child node and a parent node of the second entity; and if yes, determine that the entities are aligned, and if not, determine that the entities are not aligned.
Optionally, the type of the data source is a data source not based on a classification tree, and the measurement type includes measurement of a character similarity. The alignment processing module is specifically configured to determine whether a character similarity between the first entity name and the second entity name in the knowledge graph is greater than a preset threshold; and if yes, determine that the entities are aligned, and if not, determine that the entities are not aligned.
Optionally, the type of the data source is a data source not based on a classification tree, the measurement type includes measurement of an attribute similarity, and the first attribute includes a key attribute and a non-key attribute. The alignment processing module is specifically configured to determine whether a second attribute the same as the key attribute exists in the knowledge graph, and if yes, determine whether attribute values corresponding to the key attribute and the second attribute are the same; and if yes, determine that the entities are aligned, and if not, determine that the entities are not aligned.
Optionally, the data importing module is specifically configured to determine, when the entity alignment processing result is that the first entity is aligned with the second entity, and attribute alignment processing is performed on the first attribute of the first entity and a second attribute of the second entity, whether a second attribute the same as the first attribute exists in the knowledge graph;
if not, map the first attribute and the first attribute value to the second entity; or
if yes, when the first attribute is a single-value attribute, determine whether the first attribute value corresponding to the first attribute conflicts with a second attribute value corresponding to the second attribute, and if not, perform deduplication processing, or if yes, when a reliability degree of the first attribute value is higher than a reliability degree of the second attribute value, map the first attribute value to the second entity, and delete the second attribute value; or when the first attribute is a multi-value attribute, and a plurality of first attribute values do not conflict with a second attribute value, determine, in the plurality of first attribute values, an attribute value different from the second attribute value, and map a plurality of different first attribute values to the second entity; and
when the entity alignment processing result is that the first entity is unaligned with the second entity, newly construct a graph item in the knowledge graph, where the newly constructed graph item includes the first entity name and the first attribute value corresponding to the first attribute.
Optionally, the device further includes: a cleansing processing module, configured to obtain a description type of each piece of attribute information, and perform cleansing and normalization processing on each piece of attribute information according to a standard description statement corresponding to the description type, so that attribute information being semantically the same has the same description.
Optionally, the device further includes: a reasoning module, configured to: in the knowledge graph, for a second attribute used to represent a relationship between entities, determine an implied relationship between entities by using a preset chain rule, and map the implied relationship to the knowledge graph.
According a third aspect, the present disclosure further provides a knowledge graph construction device, comprising a processor and a non-transitory computer-readable storage medium storing instructions that, when execute by the processor, cause the processor to perform a method for constructing a knowledge graph. The method comprises obtaining structured data, wherein the structured data comprises a first entity name of a first entity and attribute information corresponding to the first entity name, and the attribute information comprises a first attribute and a first attribute value, performing, based on measurement of a similarity between the first entity and a second entity in a knowledge graph, entity alignment processing on the first entity, wherein the measurement of the similarity comprises at least one of the following types: measurement of a character similarity, measurement of a structure similarity of a classification tree on which an entity is located, and measurement of an attribute similarity, and importing the structured data into the knowledge graph according to an entity alignment processing result.
According a fourth aspect, the present disclosure further provides a non-transitory computer-readable storage medium storing instructions that, when execute by a processor, cause the processor to perform a method for constructing a knowledge graph. The method comprises obtaining structured data, wherein the structured data comprises a first entity name of a first entity and attribute information corresponding to the first entity name, and the attribute information comprises a first attribute and a first attribute value, performing, based on measurement of a similarity between the first entity and a second entity in a knowledge graph, entity alignment processing on the first entity, wherein the measurement of the similarity comprises at least one of the following types: measurement of a character similarity, measurement of a structure similarity of a classification tree on which an entity is located, and measurement of an attribute similarity, and importing the structured data into the knowledge graph according to an entity alignment processing result.
Based on the knowledge graph construction method and device provided in the embodiments, structured data is obtained, and entity alignment processing is performed, based on measurement of a similarity between a first entity and a second entity in a knowledge graph, on the first entity, where the measurement of the similarity includes at least one of the following types: measurement of a character similarity, measurement of a structure similarity of a classification tree on which an entity is located, and measurement of an attribute similarity. By using measurement types of these similarities, correctness of an entity alignment result may be ensured, correct structured data can be imported into the knowledge graph, to prevent erroneous or redundant data from being imported into the knowledge graph, and improve accuracy of the knowledge graph, and an accurate search result can be provided to a user.
BRIEF DESCRIPTION OF THE DRAWINGS
To describe the technical solutions in the embodiments of the present invention or in the prior art more clearly, the following briefly introduces the accompanying drawings required for describing the embodiments of the present invention or the prior art. Apparently, the accompanying drawings in the following description show some embodiments of the present invention, and a person of ordinary skill in the art may derive other drawings from these accompanying drawings without creative efforts.
FIG.1 is a schematic structural diagram of a knowledge graph according to one embodiment of the present invention;
FIG.2 is a diagram of an overall framework for constructing a knowledge graph according to one embodiment of the present invention;
FIG.3 is a flowchart of a knowledge graph construction method according to one embodiment of the present invention;
FIG.4 is a flowchart of a knowledge graph construction method according to one embodiment of the present invention;
FIG.5 is a schematic structural diagram of a knowledge graph construction device according to one embodiment of the present invention; and
FIG.6 is a schematic structural diagram of a knowledge graph construction device according to one embodiment of the present invention.
DETAILED DESCRIPTION
To make the objectives, technical solutions, and advantages of the embodiments of the present invention clearer, the following clearly and completely describes the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Apparently, the described embodiments are some rather than all of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative efforts shall fall within the protection scope of the present invention.
FIG.1 is a schematic structural diagram of a knowledge graph according to one embodiment of the present invention. The knowledge graph is used to describe various entities existing in a real world and a relationship among them. The knowledge graph includes attributes and attribute values of entities and a relationship among the entities. The knowledge graph may be considered as a huge diagram, where a node indicates an entity, and a line is formed by an attribute or a relationship. For example,FIG.1 shows an attribute and an attribute value of an entity A, and an entity B, an entity C, and an entity D that have a relationship with the entity A.
FIG.2 is a diagram of an overall framework for constructing a knowledge graph according to an embodiment of the present invention. As shown inFIG.2, in a process of constructing the knowledge graph shown inFIG.1, the overall framework includes three parts: a data source, data obtaining, and data importing.
The data source is a source of data for constructing the knowledge graph. The data source includes structured data, for example, structured data provided by a vertical website or the like; semi-structured data, for example, an encyclopedia or a list; and non-structured data, for example, web page data in a text form.
The data obtaining is obtaining data from the data source, and an obtaining manner includes directly obtaining information about a vertical website from a partner's interface; obtaining, in a manner of crawling in combination with parsing, data from a vertical website, a semi-structured website, or the like; extracting encyclopedic information; generally extracting web page data in a text form; and the like. Data obtained in this embodiment includes structured data, semi-structured data, and non-structured data. Semi-structured data and non-structured data are converted into structured data in such a manner as template matching. A specific implementation of the structured data is not particularly limited in this embodiment.
In this embodiment, the data imported into the knowledge graph is structured data. The data importing process relates to processes on the structured data such as pre-processing, entity alignment, attribute alignment, and attribute screening, and structured data satisfying a condition is finally imported into the knowledge graph. Subsequently, continuable update and extension of the knowledge graph are ensured in such a manner as incremental iteration. In the following embodiments, for description convenience, entities are distinguished, an entity in obtained structured data is referred to as a first entity that corresponds to a first attribute and a first attribute value, and an entity existing in a knowledge graph is referred to as a second entity that corresponds to a second attribute and a second attribute value. A process of constructing the knowledge graph is described in detail below by using detailed embodiments.
FIG.3 is a flowchart of a knowledge graph construction method according to an embodiment of the present invention. As shown inFIG.3, the method includes the following steps.
S301. Obtain structured data, where the structured data includes a first entity name of a first entity and attribute information corresponding to the first entity name, and the attribute information includes a first attribute and a first attribute value.
For a specific implementation of obtaining structured data, refer to the embodiment shown inFIG.2. Details are not described again in this embodiment. The structured data includes the first entity name of the first entity and the attribute information corresponding to the first entity name. The attribute information includes the first attribute and the first attribute value. In this embodiment, the first attribute has two meanings.
One meaning is: the first attribute is a self attribute of the first entity, the self attribute may be a height, a weight, or the like of the first entity, and the corresponding first attribute value is an attribute value corresponding to the self attribute, for example, a specific height value and weight value. In the knowledge graph, line information is used to indicate the self attribute of the first entity. There may be a plurality of self attributes of the first entity, and each self attribute may also correspond to a plurality of attribute values.
The other meaning is: the first attribute is an attribute of a relationship between the first entity and the second entity, the relationship attribute may be couple, children, parent, friend, or the like, and the corresponding first attribute value is another entity having a relationship with the first entity. In the knowledge graph, line information is used to indicate a relationship between the first entity and another entity. The first entity may correspond to a plurality of relationship attributes, and each relationship attribute may correspond to a plurality of other entities.
S302. Perform, based on measurement of a similarity between the first entity and a second entity in a knowledge graph, entity alignment processing on the first entity, where the measurement of the similarity includes at least one of the following types: measurement of a character similarity, measurement of a structure similarity of a classification tree on which an entity is located, and measurement of an attribute similarity.
After the structured data is obtained, at the beginning of creation, if the knowledge graph has no data, the structured data is directly imported into the knowledge graph, so as to form the knowledge graph. When the knowledge graph has data, entity alignment processing is performed on the first entity based on the measurement of the similarity between the first entity and the second entity in the knowledge graph. The entity alignment processing includes determining entities that have different IDs but represent the same object in a real world, and classifying these entities as an entity that has a globally unique identifier, that is, finding the first entity and the second entity that are the same, so as to map the first attribute of the first entity and the corresponding first attribute value to the second entity, that is, adding the first attribute and the corresponding first attribute value on the basis of the second entity.
Optionally, before entity alignment processing is performed, the method further includes a data pre-processing process. Specifically, a description type of each piece of attribute information is obtained; and cleansing and normalization processing is performed on each piece of attribute information according to a standard description statement corresponding to the description type, so that attribute information being semantically the same has the same description.
Specifically, for the same attribute value, data of different sources may be described in different manners. For example, the height is 1.75 meter, and different descriptions include 1.75 m, 1.75 M, 1 meter and 75 centimeters, 175 cm, 175 centimeters, and the like. Cleansing and normalization aim to remove noise in raw data and uniformize data expression forms. In current data cleansing and normalization, different standard description statements are configured for attributes corresponding to different description types, that is, each description type corresponds to a standard description statement. For example, each of date, time, value, nation, city and the like may correspond to a standard description statement.
By pre-processing data, a uniform data description may be ensured, and when entity alignment and attribute alignment are performed, alignment accuracy is ensured.
In this embodiment, the measurement of the similarity includes at least one of the following types: measurement of a character similarity, measurement of a structure similarity of a classification tree on which an entity is located, and measurement of an attribute similarity, that are separately described sequentially below.
1. Measurement of a character similarity. Specifically, whether a character similarity between the first entity name and the second entity name in the knowledge graph is greater than a preset threshold is determined; and if yes, the entities are aligned, and if not, the entities are not aligned.
In a specific implementation process, entities having the same expression may be the same entity. For example, from different vertical house property websites, entities such as Shanghai Greenland Haipo Xinghui, Greenland Haipo Xinghui, Haipo Xinghui, and Haipo Xinghui residence area may be collected, and it may be determined, according to a character similarity between names, that the entities may be the same entity. For example, if the character similarity is greater than a preset threshold, it is determined that the entities are the same entity. Optionally, before the character similarity is determined, a prefix, a suffix, or the like may be further filtered out. For example, a prefix place name and a prefix property developer name are filtered out, and a suffix “residence area” is filtered out.
2. Measurement of a structure similarity of a classification tree on which an entity is located. It is determined, in the knowledge graph, that the data source is a second entity of a classification tree; and whether a child node and a parent node of the first entity are the same as a child node and a parent node of the second entity is determined; and if yes, the entities are aligned, and if not, the entities are not aligned.
In a specific implementation process, entities having the same neighboring node may point to the same entity. When the data source is a classification tree, measurement may be performed by using measurement of a structure similarity. For example, when botanical classification trees are respectively collected from Flora of China and Aihuhua, a parent node and a child node of Fragrant flower and a parent node and a child node ofOsmanthus fragransare all the same, and therefore it is determined that the two are the same entity.
3. Measurement of an attribute similarity. In this embodiment, the first attribute includes a key attribute and a non-key attribute; and whether a second attribute the same as the key attribute exists in the knowledge graph is determined, and if yes, whether attribute values corresponding to the key attribute and the second attribute are the same is determined; and if yes, the entities are aligned, and if not, the entities are not aligned.
In a specific implementation process, entities having the same attribute and the same attribute value may represent the same entity. Such similarity measurement manner is usually used to determine alignment of entities having the same name. In an attribute alignment process, alignment is performed by using a key attribute of an entity, the key attribute may be a preset attribute representing uniqueness of the entity, and a non-key attribute is not used as an alignment determining standard. For example, for two “Baolong Squares”, if key attributes: detailed addresses, that is, attribute values are the same, it is determined that the two “Baolong Squares” are the same building; and for two “Li Na”, if key attributes: birthday information are consistent, it is determined that the two “Li Na” are the same person. A person skilled in the art may understand that, there may be one or more key attributes, and a quantity of key attributes is not particularly limited in this embodiment.
In this embodiment, which similarity is specifically used for measurement may depend on an actual situation, and in different scenarios, there are different applications. For example, only the character similarity may be used, only the attribute similarity may be used, or only the structure similarity may be used.
For another example, two similarities may be further combined to perform implementation. For example, the character similarity is combined with the attribute similarity, measurement of the character similarity may be first performed, and for entities satisfying the character similarity, verification may be performed with reference to measurement of the attribute similarity. For example, characters are similar, but attribute values corresponding to key attributes are different, the entities are not the same entity; and the character similarity may be further combined with the structure similarity, for entities not satisfying the character similarity, measurement of the structure similarity may be performed. For another embodiment in which two similarities are combined, details are not described again in this embodiment.
For another example, three similarities may be further combined to perform implementation. That is, measurement of the structure similarity is first performed, then measurement of the character similarity is performed, then measurement of the attribute similarity is performed, and entity alignment processing is performed by means of mutual supplement and mutual verification between the three similarities.
In a possible implementation, a measurement type for performing similarity processing between the first entity and the second entity in the knowledge graph may be determined according to a type of a data source of the structured data; and entity alignment processing is performed on the first entity according to the determined measurement type.
Specifically, when the type of the data source is a data source based on a classification tree, the measurement type is measurement of a structure similarity of a classification tree on which an entity is located. When the type of the data source is a data source not based on a classification tree, the measurement type is measurement of a character similarity and/or the measurement type is measurement of an attribute similarity.
That is, the type of the data source may be first determined, and the measurement type is determined according to the type of the data source. When the type of the data source is a data source based on a classification tree, measurement of a structure similarity of a classification tree on which an entity is located is first used. Optionally, subsequently, supplement and verification may be further performed by using the character similarity and/or the attribute similarity. When the type of the data source is a data source not based on a classification tree, measurement of a structure similarity is not considered, and measurement of a character similarity and/or measurement of an attribute similarity is selected.
S303. Import the structured data into the knowledge graph according to an entity alignment processing result.
After the entity alignment processing result is obtained, an attribute alignment operation may be further performed on the first entity and the second entity on which entity alignment is performed. Attribute alignment means determining attributes having different descriptions but representing the same attribute in a real world, and classifying these attributes as one attribute. For example, two attributes: birth date and birthday are classified as one attribute that may be indicated by using the birth date.
After entity alignment and attribute alignment, entities may be divided into an aligned entity and an unaligned entity. For the unaligned entity, importing into the graph is performed in an addition manner, and for the aligned entity, it is imported into the knowledge graph by modifying the knowledge graph in different manners such as replacement and supplement, thereby constructing the knowledge graph.
Based on the knowledge graph construction method provided in the embodiments, structured data is obtained, and entity alignment processing is performed, based on measurement of a similarity between a first entity and a second entity in a knowledge graph, on the first entity, where the measurement of the similarity includes at least one of the following types: measurement of a character similarity, measurement of a structure similarity of a classification tree on which an entity is located, and measurement of an attribute similarity. By using measurement types of these similarities, accuracy of an entity alignment result may be ensured, correct structured data can be imported into the knowledge graph, to prevent erroneous or redundant data from being imported into the knowledge graph, and improve accuracy of the knowledge graph, and an accurate search result can be provided to a user.
An implementation process of importing structured data into a knowledge graph by using an attribute screening process after entity alignment and attribute alignment is mainly described below by using a detailed embodiment.
FIG.4 is a flowchart of a knowledge graph construction method according to an embodiment of the present invention. As shown inFIG.4, the method includes the following steps.
S401. Determine, when an entity alignment processing result is that a first entity is aligned with a second entity, and attribute alignment processing is performed on a first attribute of the first entity and a second attribute of the second entity, whether a second attribute the same as the first attribute exists in a knowledge graph; and if not, perform S402, or if yes, perform S403 or S404.
S402. Map the first attribute and a first attribute value to the second entity.
If the first attribute of the first entity is different from the second attribute of the second entity when the first entity is aligned with the second entity, the first attribute and the first attribute value are mapped to the second entity. For example, if the first entity is Zhang San, the second entity: Zhang San aligned with the Zhang San exists in the knowledge graph, the first attribute is the birth date, and the second attribute of the second entity has no birth date, the first attribute: birth date and a corresponding attribute value are mapped to the second entity in the knowledge graph.
S403. When the first attribute is a multi-value attribute, and a plurality of first attribute values do not conflict with a second attribute value, determine, in the plurality of first attribute values, an attribute value different from the second attribute value, and map a plurality of different first attribute values to the second entity.
When the first attribute is a multi-value attribute, for example, the first attribute is friends or children, when the first entity is aligned with the second entity, and the first attribute of the first entity is the same as the second attribute of the second entity, an attribute value corresponding to the second attribute may be supplemented.
For example, for children of Zhang San, attribute values corresponding to the second attribute are Zhang Yiming and Zhang Erming, and attribute values corresponding to the first attribute are Zhang Yimin, Zhang Sanming, Zhang Siming, and Zhang Xiaoming. Zhang Yiming and Zhang Yimin are two attribute values conflicting with each other, it is determined that Zhang Yiming is a correct attribute value and Zhang Yimin is an erroneous attribute value, Zhang Yimin is deleted, then it is determined that Zhang Sanming, Zhang Siming, and Zhang Xiaoming are attribute values different from the second attribute value, and Zhang Sanming, Zhang Siming, and Zhang Xiaoming are mapped to children of the second attribute: Zhang San.
S404. When the first attribute is a single-value attribute, determine whether the first attribute value corresponding to the first attribute conflicts with the second attribute value corresponding to the second attribute; and if not, perform S405, or if yes, perform S406.
S405. Perform deduplication processing.
S406. When a reliability degree of the first attribute value is higher than a reliability degree of the second attribute value, map the first attribute value to the second entity, and delete the second attribute value.
When the first attribute is a single-value attribute, that is, the first attribute has only one attribute value, whether the first attribute value corresponding to the first attribute conflicts with the second attribute value corresponding to the second attribute is determined. If the two attribute values do not conflict, it indicates that the two attribute values are both correct attribute values, merging and deduplication processing may be performed, to obtain a unique attribute value. When the two attribute values conflict, reliability of the first attribute value and reliability of the second attribute value are determined. If the reliability of the first attribute value is higher than that of the second attribute value, the second attribute value is deleted, or if a reliability degree of the first attribute value is less than that of the second attribute value, the first attribute value is deleted.
The reliability degree may be determined according to frequencies that the first attribute value occurs on different data sources, a preset reliability degree rank of a data source, and the like. A specific implementation of determining a reliability degree is not particularly limited in this embodiment.
S407. When the entity alignment processing result is that the first entity is unaligned with the second entity, newly construct a graph item in the knowledge graph, where the newly constructed graph item includes the first entity name and the first attribute value corresponding to the first attribute.
In this embodiment, a process of importing the structured data into the knowledge graph is fully considered in different situations, to prevent erroneous and repetitive data from being imported into the knowledge graph, and also prevent data from being omitted, thereby ensuring accuracy of the knowledge graph.
On the basis of the foregoing embodiments, this embodiment further provides incremental iteration and continuable construction on the knowledge graph.
In the knowledge graph, for a second attribute used to represent a relationship between entities, an implied relationship between entities can be determined by using a preset chain rule, and the implied relationship can be mapped to the knowledge graph.
For example, a chain rule may be defined in advance: the father-in-law is the father of the wife. By using this rule, when the wife (Ye Li) of Yao Ming and the father (Ye Fa) of Ye Li are known, it may be inferred that the father-in-law of Yao Ming is Ye Fa. Another chain rule is similar to this chain rule, and details are not described again in this embodiment.
In this embodiment, by performing incremental iteration processing on the knowledge graph, knowledge coverage of the knowledge graph is further increased, to ensure that construction of the knowledge graph is a verifiable and continuable increasing process.
FIG.5 is a schematic structural diagram of a knowledge graph construction device according to an embodiment of the present invention. As shown inFIG.5, the device includes:
adata obtaining module501, configured to obtain structured data, where the structured data includes a first entity name of a first entity and attribute information corresponding to the first entity name, and the attribute information includes a first attribute and a first attribute value;
analignment processing module502, configured to perform, based on measurement of a similarity between the first entity and a second entity in a knowledge graph, entity alignment on the first entity, where the measurement of the similarity includes at least one of the following types: measurement of a character similarity, measurement of a structure similarity of a classification tree on which an entity is located, and measurement of an attribute similarity; and
adata importing module503, configured to import the structured data into the knowledge graph according to an entity alignment processing result.
The device provided in this embodiment may be configured to perform the foregoing method embodiment, and the implementation principle and technical effect of the device are similar to those of the method embodiment. Details are not described again in this embodiment.
FIG.6 is a schematic structural diagram of a knowledge graph construction device according to an embodiment of the present invention. This embodiment is on the basis of the embodiment inFIG.5.
Thealignment processing module502 is specifically configured to determine, according to a type of a data source of the structured data, a measurement type for performing similarity processing between the first entity and the second entity in the knowledge graph; and
perform entity alignment processing on the first entity according to the determined measurement type.
Optionally, the type of the data source is a data source based on a classification tree, and the measurement type includes measurement of a structure similarity of a classification tree on which an entity is located; and
thealignment processing module502 is specifically configured to determine, in the knowledge graph, that the data source is a second entity of a classification tree; and
determine whether a child node and a parent node of the first entity are the same as a child node and a parent node of the second entity; and if yes, determine that the entities are aligned, and if not, determine that the entities are not aligned.
Optionally, the type of the data source is a data source not based on a classification tree, and the measurement type includes measurement of a character similarity; and
thealignment processing module502 is specifically configured to determine whether a character similarity between the first entity name and the second entity name in the knowledge graph is greater than a preset threshold; and if yes, determine that the entities are aligned, and if not, determine that the entities are not aligned.
Optionally, the type of the data source is a data source not based on a classification tree, the measurement type includes measurement of an attribute similarity, and the first attribute includes a key attribute and a non-key attribute; and
thealignment processing module502 is specifically configured to determine whether a second attribute the same as the key attribute exists in the knowledge graph, and if yes, determine whether attribute values corresponding to the key attribute and the second attribute are the same; and if yes, determine that the entities are aligned, and if not, determine that the entities are not aligned.
Optionally, thedata importing module503 is specifically configured to:
determine, when the entity alignment processing result is that the first entity is aligned with the second entity, and attribute alignment processing is performed on the first attribute of the first entity and a second attribute of the second entity, whether a second attribute the same as the first attribute exists in the knowledge graph;
if not, map the first attribute and the first attribute value to the second entity; or
if yes, when the first attribute is a single-value attribute, determine whether the first attribute value corresponding to the first attribute conflicts with a second attribute value corresponding to the second attribute, and if not, perform deduplication processing, or if yes, when a reliability degree of the first attribute value is higher than a reliability degree of the second attribute value, map the first attribute value to the second entity, and delete the second attribute value; or when the first attribute is a multi-value attribute, and a plurality of first attribute values do not conflict with a second attribute value, determine, in the plurality of first attribute values, an attribute value different from the second attribute value, and map a plurality of different first attribute values to the second entity; and
when the entity alignment processing result is that the first entity is unaligned with the second entity, newly construct a graph item in the knowledge graph, where the newly constructed graph item includes the first entity name and the first attribute value corresponding to the first attribute.
Optionally, the device further includes: a cleansingprocessing module504, configured to obtain a description type of each piece of attribute information; and
perform cleansing and normalization processing on each piece of attribute information according to a standard description statement corresponding to the description type, so that attribute information being semantically the same has the same description.
Optionally, the device further includes: areasoning module505, configured to: in the knowledge graph, for a second attribute used to represent a relationship between entities, determine an implied relationship between entities by using a preset chain rule, and map the implied relationship to the knowledge graph.
The device provided in this embodiment may be configured to perform the foregoing method embodiment, and an implementation principle and a technical effect of the device are similar to those of the method embodiment. Details are not described again in this embodiment.
Persons of ordinary skill in the art may understand that all or a part of the steps of the method embodiments may be implemented by a program instructing relevant hardware. The program may be stored in a computer readable storage medium. When the program runs, the steps of the method embodiments are performed. The foregoing storage medium includes: any medium that can store program code, such as a ROM, a RAM, a magnetic disk, or an optical disc.
Finally, it should be noted that the foregoing embodiments are merely intended for describing the technical solutions of the present invention other than limiting the present invention. Although the present invention is described in detail with reference to the foregoing embodiments, persons of ordinary skill in the art should understand that they may still make modifications to the technical solutions described in the foregoing embodiments or make equivalent replacements to a part or all of the technical features thereof. Such modifications or replacements do not cause the essence of corresponding technical solutions to depart from the scope of the technical solutions of the embodiments of the present invention.

Claims (18)

What is claimed is:
1. A computer-implemented knowledge graph construction method, comprising:
obtaining structured data, wherein the structured data comprises a first entity name of a first entity and attribute information corresponding to the first entity name, and the attribute information comprises a first attribute and a first attribute value;
performing, based on a measurement of similarity between the first entity and a second entity in a knowledge graph, entity alignment on the first entity, wherein the measurement of similarity comprises at least one of the following types: measurement of a character similarity, and measurement of an attribute similarity; and
importing, the structured data into the knowledge graph based on the entity alignment, wherein the importing comprises:
when the entity alignment indicates that the first entity is aligned with the second entity, and attribute alignment is performed on the first attribute of the first entity and a second attribute of the second entity, determining whether the second attribute exists in the knowledge graph;
if the second attribute does not exist in the knowledge graph, importing the first attribute and the first attribute value to the second entity; and
if the second attribute exists in the knowledge graph:
when the first attribute is a single-value attribute, determining whether the first attribute value corresponding to the first attribute conflicts with a second attribute value corresponding to the second attribute, and
if the first attribute value does not conflict with the second attribute value, performing deduplication processing;
if the first attribute value conflicts with the second attribute value, when a reliability degree of the first attribute value is higher than a reliability degree of the second attribute value, importing the first attribute value to the second entity, and deleting the second attribute value; or
when the first attribute is a multi-value attribute, and comprises a plurality of first attribute values that do not conflict with the second attribute value, determining, in the plurality of first attribute values, an attribute value different from the second attribute value, and importing the determined attribute value to the second entity.
2. The computer-implemented knowledge graph construction method according toclaim 1, wherein the performing, based on the measurement of similarity between the first entity and the second entity in the knowledge graph, entity alignment processing on the first entity comprises:
determining, according to a type of a data source of the structured data, a measurement type for performing similarity processing between the first entity and the second entity in the knowledge graph; and
performing entity alignment processing on the first entity according to the determined measurement type.
3. The computer-implemented knowledge graph construction method according toclaim 2, wherein
the performing entity alignment processing on the first entity according to the determined measurement type comprises:
determining whether a child node and a parent node of the first entity are the same as a child node and a parent node of the second entity; and if yes, determining that the entities are aligned, and if not, determining that the entities are not aligned.
4. The computer-implemented knowledge graph construction method according toclaim 2, wherein
the performing entity alignment processing on the first entity according to the determined measurement type comprises:
determining whether a character similarity between the first entity name and the second entity name in the knowledge graph is greater than a preset threshold; and if yes, determining that the entities are aligned, and if not, determining that the entities are not aligned.
5. The computer-implemented knowledge graph construction method according toclaim 2, wherein the first attribute comprises a key attribute and a non-key attribute; and
the performing entity alignment processing on the first entity according to the determined measurement type comprises:
determining whether the second attribute exists in the knowledge graph, and if yes, determining whether attribute values corresponding to the key attribute and the second attribute are the same; and if yes, determining that the entities are aligned, and if not, determining that the entities are not aligned.
6. The computer-implemented knowledge graph construction method according toclaim 1, wherein before the determining, according to a type of a data source of the structured data, a measurement type for performing similarity processing between the first entity and the second entity in the knowledge graph, the method further comprises:
obtaining a description type of each piece of attribute information; and
performing cleansing and normalization processing on each piece of attribute information according to a standard description statement corresponding to the description type, so that attribute information being semantically the same has the same description.
7. The computer-implemented knowledge graph construction method according toclaim 1, wherein the method further comprises:
in the knowledge graph, for a second attribute used to represent a relationship between entities, determining an implied relationship between entities by using a preset chain rule, and mapping the implied relationship to the knowledge graph.
8. A knowledge graph construction device, comprising a processor and a non-transitory computer-readable storage medium storing instructions that, when execute by the processor, cause the processor to perform a method comprising:
obtaining structured data, wherein the structured data comprises a first entity name of a first entity and attribute information corresponding to the first entity name, and the attribute information comprises a first attribute and a first attribute value;
performing, based on a measurement of similarity between the first entity and a second entity in a knowledge graph, entity alignment processing on the first entity, wherein the measurement of similarity comprises at least one of the following types: measurement of a character similarity, and measurement of an attribute similarity; and
importing the structured data into the knowledge graph based on the entity alignment, wherein the importing comprises:
when the entity alignment indicates that the first entity is aligned with the second entity, and attribute alignment is performed on the first attribute of the first entity and a second attribute of the second entity, determining whether the second attribute exists in the knowledge graph;
if the second attribute does not exist in the knowledge graph, importing the first attribute and the first attribute value to the second entity; and
if the second attribute exists in the knowledge graph:
when the first attribute is a single-value attribute, determining whether the first attribute value corresponding to the first attribute conflicts with a second attribute value corresponding to the second attribute, and
if the first attribute value does not conflict with the second attribute value, performing deduplication processing, or if the first attribute value conflicts with the second attribute value, when a reliability degree of the first attribute value is higher than a reliability degree of the second attribute value, importing the first attribute value to the second entity, and deleting the second attribute value; or
when the first attribute is a multi-value attribute, and comprises a plurality of first attribute values that do not conflict with the second attribute value, determining, in the plurality of first attribute values, an attribute value different from the second attribute value, and importing the determined attribute value to the second entity.
9. The knowledge graph construction device according toclaim 8, wherein the method further comprises: determining, according to a type of a data source of the structured data, a measurement type for performing similarity processing between the first entity and the second entity in the knowledge graph; and
performing entity alignment processing on the first entity comprises performing entity alignment processing on the first entity according to the determined measurement type.
10. The knowledge graph construction device according toclaim 9, wherein the performing entity alignment processing on the first entity comprises:
determining whether a child node and a parent node of the first entity are the same as a child node and a parent node of the second entity; and if yes, determining that the first entity and second entity are aligned, and if not, determining that the first entity and second entity are not aligned.
11. The knowledge graph construction device according toclaim 9, wherein the performing entity alignment processing on the first entity comprises:
determining whether a character similarity between the first entity name and the second entity name in the knowledge graph is greater than a preset threshold; and if yes, determining that the first entity and second entity are aligned, and if not, determining that the first entity and second entity are not aligned.
12. The knowledge graph construction device according toclaim 9, wherein the first attribute comprises a key attribute and a non-key attribute, and the performing entity alignment processing on the first entity comprises:
determining whether a second attribute the same as the key attribute exists in the knowledge graph, and if yes, determine whether attribute values corresponding to the key attribute and the second attribute are the same; and if yes, determining that the first entity and second entity are aligned, and if not, determining that the first entity and second entity are not aligned.
13. The knowledge graph construction device according toclaim 8, wherein the method further comprising:
obtaining a description type of each piece of attribute information; and
performing cleansing and normalization processing on each piece of attribute information according to a standard description statement corresponding to the description type, so that attribute information being semantically the same has the same description.
14. The knowledge graph construction device according toclaim 8, wherein the method further comprising: in the knowledge graph, for a second attribute used to represent a relationship between entities, determining an implied relationship between entities by using a preset chain rule, and map the implied relationship to the knowledge graph.
15. A non-transitory computer-readable storage medium storing instructions that, when execute by a processor, cause the processor to perform a method for constructing a knowledge graph, the method comprising:
obtaining structured data, wherein the structured data comprises a first entity name of a first entity and attribute information corresponding to the first entity name, and the attribute information comprises a first attribute and a first attribute value;
performing, based on a measurement of similarity between the first entity and a second entity in a knowledge graph, entity alignment processing on the first entity, wherein the measurement of similarity comprises at least one of the following types: measurement of a character similarity, and measurement of an attribute similarity; and
importing the structured data into the knowledge graph based on the entity alignment, wherein the importing comprises:
when the entity alignment indicates that the first entity is aligned with the second entity, and attribute alignment is performed on the first attribute of the first entity and a second attribute of the second entity, determining whether the second attribute exists in the knowledge graph;
if the second attribute does not exist in the knowledge graph, importing the first attribute and the first attribute value to the second entity; and
if the second attribute exists in the knowledge graph:
when the first attribute is a single-value attribute, determining whether the first attribute value corresponding to the first attribute conflicts with a second attribute value corresponding to the second attribute, and
if the first attribute value does not conflict with the second attribute value, performing deduplication processing, or if the first attribute value conflicts with the second attribute value, when a reliability degree of the first attribute value is higher than a reliability degree of the second attribute value, importing the first attribute value to the second entity, and deleting the second attribute value; or
when the first attribute is a multi-value attribute, and comprises a plurality of first attribute values that do not conflict with the second attribute value, determining, in the plurality of first attribute values, an attribute value different from the second attribute value, and importing the determined attribute value to the second entity.
16. The non-transitory computer-readable storage medium according toclaim 15, wherein the performing, based on the measurement of similarity between the first entity and the second entity in the knowledge graph, entity alignment processing on the first entity comprises:
determining, according to a type of a data source of the structured data, a measurement type for performing similarity processing between the first entity and the second entity in the knowledge graph; and
performing entity alignment processing on the first entity according to the determined measurement type.
17. The non-transitory computer-readable storage medium according toclaim 15, wherein the performing entity alignment processing on the first entity according to the determined measurement type comprises:
determining whether a child node and a parent node of the first entity are the same as a child node and a parent node of the second entity; and if yes, determining that the first entity and second entity are aligned, and if not, determining that the first entity and second entity are not aligned.
18. The non-transitory computer-readable storage medium according toclaim 17, wherein the performing entity alignment processing on the first entity according to the determined measurement type comprises:
determining whether a character similarity between the first entity name and the second entity name in the knowledge graph is greater than a preset threshold; and if yes, determining that the first entity and second entity are aligned, and if not, determining that the first entity and second entity are not aligned.
US16/034,7992017-07-142018-07-13Knowledge graph construction method and deviceActive2042-04-06US11720629B2 (en)

Applications Claiming Priority (2)

Application NumberPriority DateFiling DateTitle
CN201710576164.2ACN108268581A (en)2017-07-142017-07-14The construction method and device of knowledge mapping
CN201710576164.22017-07-14

Publications (2)

Publication NumberPublication Date
US20190019088A1 US20190019088A1 (en)2019-01-17
US11720629B2true US11720629B2 (en)2023-08-08

Family

ID=62771706

Family Applications (1)

Application NumberTitlePriority DateFiling Date
US16/034,799Active2042-04-06US11720629B2 (en)2017-07-142018-07-13Knowledge graph construction method and device

Country Status (2)

CountryLink
US (1)US11720629B2 (en)
CN (1)CN108268581A (en)

Families Citing this family (54)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US10824675B2 (en)*2017-11-172020-11-03Microsoft Technology Licensing, LlcResource-efficient generation of a knowledge graph
US11170046B2 (en)*2018-05-292021-11-09American Express Travel Related Services Company, Inc.Network node consolidation
CN109145003B (en)*2018-08-242022-05-27联动数科(北京)科技有限公司Method and device for constructing knowledge graph
CN109408645B (en)*2018-10-152021-02-26国信优易数据股份有限公司Entity type determination method and device
CN109344262B (en)*2018-10-312020-09-01百度在线网络技术(北京)有限公司Knowledge system establishing method and device and storage medium
CN111209407B (en)*2018-11-212023-06-16北京嘀嘀无限科技发展有限公司Data processing method, device, electronic equipment and computer readable storage medium
CN110020957A (en)*2019-01-312019-07-16阿里巴巴集团控股有限公司Damage identification method and device, the electronic equipment of maintenance objects
CN109885697B (en)*2019-02-012022-02-18北京百度网讯科技有限公司Method, apparatus, device and medium for constructing data model
CN109960810B (en)*2019-03-282020-05-19科大讯飞(苏州)科技有限公司Entity alignment method and device
CN109960722B (en)*2019-03-312021-10-22联想(北京)有限公司Information processing method and device
CN110008353A (en)*2019-04-092019-07-12福建奇点时空数字科技有限公司A kind of construction method of dynamic knowledge map
CN110008355A (en)*2019-04-112019-07-12华北科技学院 Disaster scene information fusion method and device based on knowledge graph
CN110245131A (en)*2019-06-052019-09-17江苏瑞中数据股份有限公司 Method, system and storage medium for entity alignment in knowledge graph
CN110489560A (en)*2019-06-192019-11-22民生科技有限责任公司The little Wei enterprise portrait generation method and device of knowledge based graphical spectrum technology
CN110263083B (en)*2019-06-202022-04-05北京百度网讯科技有限公司 Processing method, device, equipment and medium for knowledge graph
CN110275966B (en)*2019-07-012021-10-01科大讯飞(苏州)科技有限公司Knowledge extraction method and device
CN110457543B (en)*2019-07-252021-10-22中国科学院软件研究所 A method and system for entity resolution based on end-to-end multi-view matching
CN110532399A (en)*2019-08-072019-12-03广州多益网络股份有限公司Knowledge mapping update method, system and the device of object game question answering system
CN110457502B (en)*2019-08-212023-07-18京东方科技集团股份有限公司 Constructing knowledge map method, human-computer interaction method, electronic equipment and storage medium
CN110543573A (en)*2019-08-282019-12-06珠海格力电器股份有限公司knowledge graph-based garbage classification method and device
CN110909168B (en)*2019-09-232023-02-24腾讯科技(深圳)有限公司Knowledge graph updating method and device, storage medium and electronic device
CN110928894B (en)*2019-11-182023-05-02北京秒针人工智能科技有限公司Entity alignment method and device
CN112015792B (en)*2019-12-112023-12-01天津泰凡科技有限公司Material repeated code analysis method and device and computer storage medium
CN111177409A (en)*2019-12-272020-05-19北京明略软件系统有限公司Method and device for realizing data processing, computer storage medium and terminal
CN111241212B (en)*2020-01-202023-10-24京东方科技集团股份有限公司Knowledge graph construction method and device, storage medium and electronic equipment
CN111488461A (en)*2020-03-242020-08-04腾讯科技(深圳)有限公司Data processing method and device, electronic equipment and storage medium
CN111475657B (en)*2020-03-302023-10-03海信集团有限公司Display equipment, display system and entity alignment method
CN113535968B (en)*2020-04-202025-02-21北京沃东天骏信息技术有限公司 Method and device for extracting key attributes of data
CN111597788B (en)*2020-05-182023-11-14腾讯科技(深圳)有限公司Attribute fusion method, device, equipment and storage medium based on entity alignment
CN111597355A (en)*2020-05-222020-08-28北京明略软件系统有限公司Information processing method and device
CN112115271B (en)*2020-06-012024-05-03泰康保险集团股份有限公司Knowledge graph construction method and device
CN111831830B (en)*2020-07-012023-10-20腾讯科技(深圳)有限公司Knowledge graph entity field conflict detection method, device and related equipment
CN112116965B (en)*2020-07-202022-06-14上海大学 Material Process Matching Method Based on Embedding Attribute Similarity
CN113971217B (en)*2020-07-242025-06-27北京金山数字娱乐科技有限公司 A method and device for information error correction based on knowledge graph
CN112148735B (en)*2020-09-232023-05-26上海爱数信息技术股份有限公司Construction method for structured form data knowledge graph
CN112612902B (en)*2020-12-232023-07-14国网浙江省电力有限公司电力科学研究院 A method and device for constructing a knowledge map of power grid main equipment
CN112597315B (en)*2020-12-282023-07-14中国航天系统科学与工程研究院System model map construction method based on SysML meta-model ontology
CN112598563B (en)*2020-12-292023-11-17中国科学技术大学 A smart city data construction method based on knowledge graph
CN112784065B (en)*2021-02-012023-07-14东北大学 Unsupervised Knowledge Graph Fusion Method and Device Based on Multi-Order Neighborhood Attention Network
CN112765370B (en)*2021-03-292021-07-06腾讯科技(深圳)有限公司Entity alignment method and device of knowledge graph, computer equipment and storage medium
CN112948566B (en)*2021-04-212024-02-02华东理工大学Construction method and device of chemical knowledge graph and intelligent question-answering method and device
CN113656592B (en)*2021-07-222022-09-27北京百度网讯科技有限公司 Data processing method, device, electronic device and medium based on knowledge graph
CN113824707A (en)*2021-09-132021-12-21厦门吉快科技有限公司Website performance dial testing measurement method and device based on knowledge graph
CN114817553B (en)*2021-11-232024-12-13科大讯飞股份有限公司 Knowledge graph construction method, knowledge graph construction system and computing device
CN114329010B (en)*2021-12-292025-04-29杭州电子科技大学 A method for generating image scene graph based on knowledge graph
CN114596151A (en)*2022-04-072022-06-07中信百信银行股份有限公司 Product management method, apparatus, electronic device and storage medium
CN114691889B (en)*2022-04-152024-04-12中北大学 A method for constructing knowledge graph for fault diagnosis of turnout machine
CN115114443B (en)*2022-04-272025-03-11腾讯科技(深圳)有限公司 Training method, device, electronic device and storage medium for multimodal coding model
CN114780756B (en)*2022-06-072022-09-16国网浙江省电力有限公司信息通信分公司Entity alignment method and device based on noise detection and noise perception
CN115658711A (en)*2022-11-012023-01-31支付宝(杭州)信息技术有限公司 Method and device for inserting data into graph database
CN116150929B (en)*2023-04-172023-07-07中南大学Construction method of railway route selection knowledge graph
CN116187446B (en)*2023-05-042023-07-04中国人民解放军国防科技大学Knowledge graph completion method, device and equipment based on self-adaptive attention mechanism
CN117195000A (en)*2023-09-152023-12-08上海中亦图灵数字科技有限公司 Configuration item attribute entity alignment processing method
CN119990135A (en)*2024-12-112025-05-13北京北大英华科技有限公司 A generative multi-hop legal rule chain mining and evaluation method

Citations (38)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20020052858A1 (en)*1999-10-312002-05-02Insyst Ltd.Method and tool for data mining in automatic decision making systems
US6687404B1 (en)*1997-06-202004-02-03Xerox CorporationAutomatic training of layout parameters in a 2D image model
US7542969B1 (en)*2004-11-032009-06-02Microsoft CorporationDomain knowledge-assisted information processing
US20090299999A1 (en)2009-03-202009-12-03Loui Alexander CSemantic event detection using cross-domain knowledge
US20100189316A1 (en)*2009-01-272010-07-29Gannon Technologies Group, LlcSystems and methods for graph-based pattern recognition technology applied to the automated identification of fingerprints
US20110119243A1 (en)*2009-10-302011-05-19Evri Inc.Keyword-based search engine results using enhanced query strategies
US20110137919A1 (en)*2009-12-092011-06-09Electronics And Telecommunications Research InstituteApparatus and method for knowledge graph stabilization
US20120016877A1 (en)*2010-07-142012-01-19Yahoo! Inc.Clustering of search results
US8122047B2 (en)*2007-05-252012-02-21Kit Digital Inc.Recommendation systems and methods using interest correlation
US20120143911A1 (en)*2010-12-012012-06-07Google Inc.Recommendations based on topic clusters
US20120246154A1 (en)*2011-03-232012-09-27International Business Machines CorporationAggregating search results based on associating data instances with knowledge base entities
US20120290571A1 (en)*2011-04-152012-11-15IP StreetEvaluating Intellectual Property
US8356036B2 (en)*2005-02-042013-01-15Accenture Global ServicesKnowledge discovery tool extraction and integration
US20130166547A1 (en)*2011-12-272013-06-27Business Objects Software Ltd.Generating dynamic hierarchical facets from business intelligence artifacts
US20140046921A1 (en)*2010-12-302014-02-13Google Inc.Context-based person search
US8713021B2 (en)*2010-07-072014-04-29Apple Inc.Unsupervised document clustering using latent semantic density analysis
US20140156634A1 (en)*2012-11-302014-06-05Daniel BuchmannUnification of search and analytics
US8805861B2 (en)*2008-12-092014-08-12Google Inc.Methods and systems to train models to extract and integrate information from data sources
US8873813B2 (en)*2012-09-172014-10-28Z Advanced Computing, Inc.Application of Z-webs and Z-factors to analytics, search engine, learning, recognition, natural language, and other utilities
US8930178B2 (en)*2007-01-042015-01-06Children's Hospital Medical CenterProcessing text with domain-specific spreading activation methods
CN104462501A (en)2014-12-192015-03-25北京奇虎科技有限公司Knowledge graph construction method and device based on structural data
US9081501B2 (en)*2010-01-082015-07-14International Business Machines CorporationMulti-petascale highly efficient parallel supercomputer
US9104979B2 (en)2011-06-162015-08-11Microsoft Technology Licensing, LlcEntity recognition using probabilities for out-of-collection data
US20150235143A1 (en)*2003-12-302015-08-20Kantrack LlcTransfer Learning For Predictive Model Development
US20150286709A1 (en)*2014-04-022015-10-08Samsung Electronics Co., Ltd.Method and system for retrieving information from knowledge-based assistive network to assist users intent
US20150286713A1 (en)*2014-04-042015-10-08University Of Southern CaliforniaSystem and method for fuzzy ontology matching and search across ontologies
US20150324464A1 (en)*2014-05-062015-11-12Baidu Online Network Technology (Beijing) Co., LtdSearching method and apparatus
US20160098433A1 (en)*2013-12-022016-04-07Qbase, LLCMethod for facet searching and search suggestions
CN105574098A (en)2015-12-112016-05-11百度在线网络技术(北京)有限公司 Method and device for generating knowledge map, entity comparison method and device
CN106156082A (en)2015-03-312016-11-23华为技术有限公司A kind of body alignment schemes and device
US9535902B1 (en)*2013-06-282017-01-03Digital Reasoning Systems, Inc.Systems and methods for entity resolution using attributes from structured and unstructured data
US9548799B2 (en)*2013-04-072017-01-17Ziv A Corp.Distributed co-operating nodes using time reversal
US9558265B1 (en)*2016-05-122017-01-31Quid, Inc.Facilitating targeted analysis via graph generation based on an influencing parameter
CN106447346A (en)2016-08-292017-02-22北京中电普华信息技术有限公司Method and system for construction of intelligent electric power customer service system
US9721563B2 (en)*2012-06-082017-08-01Apple Inc.Name recognition system
US9916538B2 (en)*2012-09-152018-03-13Z Advanced Computing, Inc.Method and system for feature detection
US11080336B2 (en)*2012-08-292021-08-03Dennis Van DusenSystem and method for fuzzy concept mapping, voting ontology crowd sourcing, and technology prediction
US11170306B2 (en)*2017-03-032021-11-09International Business Machines CorporationRich entities for knowledge bases

Patent Citations (39)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US6687404B1 (en)*1997-06-202004-02-03Xerox CorporationAutomatic training of layout parameters in a 2D image model
US20020052858A1 (en)*1999-10-312002-05-02Insyst Ltd.Method and tool for data mining in automatic decision making systems
US20150235143A1 (en)*2003-12-302015-08-20Kantrack LlcTransfer Learning For Predictive Model Development
US7542969B1 (en)*2004-11-032009-06-02Microsoft CorporationDomain knowledge-assisted information processing
US8356036B2 (en)*2005-02-042013-01-15Accenture Global ServicesKnowledge discovery tool extraction and integration
US8930178B2 (en)*2007-01-042015-01-06Children's Hospital Medical CenterProcessing text with domain-specific spreading activation methods
US8122047B2 (en)*2007-05-252012-02-21Kit Digital Inc.Recommendation systems and methods using interest correlation
US8615524B2 (en)2007-05-252013-12-24Piksel, Inc.Item recommendations using keyword expansion
US8805861B2 (en)*2008-12-092014-08-12Google Inc.Methods and systems to train models to extract and integrate information from data sources
US20100189316A1 (en)*2009-01-272010-07-29Gannon Technologies Group, LlcSystems and methods for graph-based pattern recognition technology applied to the automated identification of fingerprints
US20090299999A1 (en)2009-03-202009-12-03Loui Alexander CSemantic event detection using cross-domain knowledge
US20110119243A1 (en)*2009-10-302011-05-19Evri Inc.Keyword-based search engine results using enhanced query strategies
US20110137919A1 (en)*2009-12-092011-06-09Electronics And Telecommunications Research InstituteApparatus and method for knowledge graph stabilization
US9081501B2 (en)*2010-01-082015-07-14International Business Machines CorporationMulti-petascale highly efficient parallel supercomputer
US8713021B2 (en)*2010-07-072014-04-29Apple Inc.Unsupervised document clustering using latent semantic density analysis
US20120016877A1 (en)*2010-07-142012-01-19Yahoo! Inc.Clustering of search results
US20120143911A1 (en)*2010-12-012012-06-07Google Inc.Recommendations based on topic clusters
US20140046921A1 (en)*2010-12-302014-02-13Google Inc.Context-based person search
US20120246154A1 (en)*2011-03-232012-09-27International Business Machines CorporationAggregating search results based on associating data instances with knowledge base entities
US20120290571A1 (en)*2011-04-152012-11-15IP StreetEvaluating Intellectual Property
US9104979B2 (en)2011-06-162015-08-11Microsoft Technology Licensing, LlcEntity recognition using probabilities for out-of-collection data
US20130166547A1 (en)*2011-12-272013-06-27Business Objects Software Ltd.Generating dynamic hierarchical facets from business intelligence artifacts
US9721563B2 (en)*2012-06-082017-08-01Apple Inc.Name recognition system
US11080336B2 (en)*2012-08-292021-08-03Dennis Van DusenSystem and method for fuzzy concept mapping, voting ontology crowd sourcing, and technology prediction
US9916538B2 (en)*2012-09-152018-03-13Z Advanced Computing, Inc.Method and system for feature detection
US8873813B2 (en)*2012-09-172014-10-28Z Advanced Computing, Inc.Application of Z-webs and Z-factors to analytics, search engine, learning, recognition, natural language, and other utilities
US20140156634A1 (en)*2012-11-302014-06-05Daniel BuchmannUnification of search and analytics
US9548799B2 (en)*2013-04-072017-01-17Ziv A Corp.Distributed co-operating nodes using time reversal
US9535902B1 (en)*2013-06-282017-01-03Digital Reasoning Systems, Inc.Systems and methods for entity resolution using attributes from structured and unstructured data
US20160098433A1 (en)*2013-12-022016-04-07Qbase, LLCMethod for facet searching and search suggestions
US20150286709A1 (en)*2014-04-022015-10-08Samsung Electronics Co., Ltd.Method and system for retrieving information from knowledge-based assistive network to assist users intent
US20150286713A1 (en)*2014-04-042015-10-08University Of Southern CaliforniaSystem and method for fuzzy ontology matching and search across ontologies
US20150324464A1 (en)*2014-05-062015-11-12Baidu Online Network Technology (Beijing) Co., LtdSearching method and apparatus
CN104462501A (en)2014-12-192015-03-25北京奇虎科技有限公司Knowledge graph construction method and device based on structural data
CN106156082A (en)2015-03-312016-11-23华为技术有限公司A kind of body alignment schemes and device
CN105574098A (en)2015-12-112016-05-11百度在线网络技术(北京)有限公司 Method and device for generating knowledge map, entity comparison method and device
US9558265B1 (en)*2016-05-122017-01-31Quid, Inc.Facilitating targeted analysis via graph generation based on an influencing parameter
CN106447346A (en)2016-08-292017-02-22北京中电普华信息技术有限公司Method and system for construction of intelligent electric power customer service system
US11170306B2 (en)*2017-03-032021-11-09International Business Machines CorporationRich entities for knowledge bases

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
First Examination Report dated Jan. 20, 2021, issued in related Indian Application No. 201844025680 (7 pages).
First Office Action dated Jun. 3, 2020, issued in related Chinese Application No. 201710576164.2, with English machine translation (18 pages).
First Search dated May 26, 2020, issued in related Chinese Application No. 201710576164.2 (2 pages).
Office Action dated Oct. 20, 2020, issued in related Indonesian Application No. PID201805069, with English translation (4 pages).
Second Examination Report dated Dec. 20, 2021, issued in related Indonesian Application No. PID201805069, with English machine translation (4 pages).
Second Office Action dated Dec. 14, 2020, issued in related Chinese Application No. 201710576164.2, with English machine translation (18 pages).

Also Published As

Publication numberPublication date
US20190019088A1 (en)2019-01-17
CN108268581A (en)2018-07-10

Similar Documents

PublicationPublication DateTitle
US11720629B2 (en)Knowledge graph construction method and device
US10642938B2 (en)Artificial intelligence based method and apparatus for constructing comment graph
US10725836B2 (en)Intent-based organisation of APIs
US10282606B2 (en)Organizational logo enrichment
CN109582799B (en) Method, device and electronic device for determining knowledge sample data set
US9239875B2 (en)Method for disambiguated features in unstructured text
US11397855B2 (en)Data standardization rules generation
CN101986296B (en)Noise data cleaning method based on semantic ontology
US20160275196A1 (en)Semantic search apparatus and method using mobile terminal
US11500876B2 (en)Method for duplicate determination in a graph
CN104699730A (en)Identifying and displaying relationships between candidate answers
US20130311517A1 (en)Representing Incomplete and Uncertain Information in Graph Data
CN104866498A (en)Information processing method and device
US11275772B2 (en)Method and system for managing network of field-specific entity records
US20140222791A1 (en)Authority based content filtering
US20170228402A1 (en)Inconsistency Detection And Correction System
US8990170B2 (en)Method and apparatus for detecting an address update
CN111913963B (en)Method and system for storing interface data on demand
CN105488165B (en)Data retrieval method and system based on index database
US11409773B2 (en)Selection device, selection method, and non-transitory computer readable storage medium
US10296585B2 (en)Assisted free form decision definition using rules vocabulary
KR101521112B1 (en)Apparatus and method for data linking and merging
CN113032518A (en)Information processing apparatus, storage medium, and information processing method
US20240152565A1 (en)Information processing system, information processing method and information processing program
JP7273888B2 (en) Decision device, decision method and decision program

Legal Events

DateCodeTitleDescription
FEPPFee payment procedure

Free format text:ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

ASAssignment

Owner name:GUANGDONG SHENMA SEARCH TECHNOLOGY CO., LTD., CHINA

Free format text:ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:WANG, YUE;YOU, WEI;ZHANG, WEIXING;AND OTHERS;SIGNING DATES FROM 20180930 TO 20181016;REEL/FRAME:047187/0089

Owner name:GUANGDONG SHENMA SEARCH TECHNOLOGY CO., LTD., CHIN

Free format text:ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:WANG, YUE;YOU, WEI;ZHANG, WEIXING;AND OTHERS;SIGNING DATES FROM 20180930 TO 20181016;REEL/FRAME:047187/0089

STPPInformation on status: patent application and granting procedure in general

Free format text:DOCKETED NEW CASE - READY FOR EXAMINATION

ASAssignment

Owner name:ALIBABA GROUP HOLDING LIMITED, CAYMAN ISLANDS

Free format text:ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:GUANGDONG SHENMA SEARCH TECHNOLOGY CO., LTD.;REEL/FRAME:053066/0864

Effective date:20200428

STPPInformation on status: patent application and granting procedure in general

Free format text:NON FINAL ACTION MAILED

STPPInformation on status: patent application and granting procedure in general

Free format text:RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPPInformation on status: patent application and granting procedure in general

Free format text:FINAL REJECTION MAILED

STPPInformation on status: patent application and granting procedure in general

Free format text:RESPONSE AFTER FINAL ACTION FORWARDED TO EXAMINER

STPPInformation on status: patent application and granting procedure in general

Free format text:PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED

STCFInformation on status: patent grant

Free format text:PATENTED CASE


[8]ページ先頭

©2009-2025 Movatter.jp