









Technical Field
The present invention relates to the technical field of data desensitization, and in particular to a label data desensitization method, apparatus, electronic device and storage medium.
Background Art
As big data spreads across industries, building label data through labeling has gained importance as a core technique for studying and analyzing key business objects; user-profile labels are the most representative example. The large-scale use of label data also places higher demands on protecting the related data.
In the related art, data is desensitized by annotating the original data with sensitive labels to determine whether sensitive fields are present. No specific processing is applied to the label features of the labeled object itself, so the desensitization offers limited security and generality and is poorly suited to large-scale, systematic desensitization of label data.
Summary of the Invention
In view of the above shortcomings of the prior art, the present invention provides a label data desensitization method, apparatus, electronic device and storage medium, to solve the technical problem of limited security and generality when desensitizing large-scale, systematic label data.
The present invention provides a label data desensitization method, comprising: acquiring initial label data; grading and classifying the initial label data to obtain graded and classified data, and determining a label desensitization rule based on the graded and classified data and a preset level-rule correspondence table, wherein the preset level-rule correspondence table characterizes the matching relationship between preset desensitization levels and preset desensitization rules; and desensitizing the initial label data based on the label desensitization rule to obtain target desensitized data.
In one embodiment of the present invention, the initial label data is classified to obtain classified label data;
the initial label data is graded based on a preset grading standard to obtain graded label data; and the classified label data and the graded label data are taken as the graded and classified data.
In one embodiment of the present invention, a first desensitization level and a first desensitization rule are obtained from the classified label data; a second desensitization level and a second desensitization rule are obtained from the graded label data; and the label desensitization rule is determined from among the first desensitization level, the first desensitization rule, the second desensitization level and the second desensitization rule based on the preset level-rule correspondence table.
In one embodiment of the present invention, if the initial label data has no parent label, the label data is classified based on the business type of the initial label data to obtain first classified data; if the initial label data has a parent label, the classification of the parent label is inherited to obtain second classified data; and the first classified data and the second classified data are taken as the classified label data.
In one embodiment of the present invention, if the label desensitization rule contains multiple preset sub-desensitization rules, a target desensitization rule is determined according to the data type of the initial label data; if the label desensitization rule contains a single preset sub-desensitization rule, that preset sub-desensitization rule is taken as the target desensitization rule; and the initial label data is desensitized according to the target desensitization rule to obtain the target desensitized data.
In one embodiment of the present invention, if the initial label data is a data set, the initial label data is aggregated to obtain aggregated data, and the aggregated data is desensitized according to the target desensitization rule to obtain first desensitized data; if the data set has multiple labels, the data set is allocated to different desensitization nodes for desensitization according to preset desensitization computing resources to obtain second desensitized data; if the initial label data contains different entity objects under the same label, the initial label data is allocated to different desensitization nodes for desensitization based on the key values of the initial label data to obtain third desensitized data; if the initial label data contains nested data, the initial label data is desensitized from the inside out to obtain fourth desensitized data; and the target desensitized data is obtained based on the first, second, third and fourth desensitized data.
In one embodiment of the present invention, the target desensitized data is distributed to a target resource location, the target resource location being used to store or to call the target desensitized data.
The present invention provides a label data desensitization apparatus, comprising: an acquisition module, configured to acquire initial label data;
a rule determination module, configured to grade and classify the initial label data to obtain graded and classified data, and to determine a label desensitization rule based on the graded and classified data and a preset level-rule correspondence table, wherein the preset level-rule correspondence table characterizes the matching relationship between preset desensitization levels and preset desensitization rules; and a desensitization module, configured to desensitize the initial label data based on the label desensitization rule to obtain target desensitized data.
In one embodiment of the present invention, the label data desensitization apparatus further comprises a label data resource module, a label management and control system, a label service module and a label application module. The label data resource module stores the initial label data. The label management and control system comprises a label management module and a label desensitization module: the label management module acquires the initial label data through the acquisition module, manages the grading and classification of the initial label data, and determines the label desensitization rule based on that grading and classification management; the label desensitization module desensitizes the initial label data based on the label desensitization rule to obtain the target desensitized data. The label service module distributes the target desensitized data to the label application module for use.
The present invention provides an electronic device, comprising: one or more processors; and a storage device for storing one or more programs which, when executed by the one or more processors, cause the electronic device to implement the label data desensitization method described in any of the above embodiments.
The present invention provides a computer-readable storage medium storing a computer program which, when executed by a processor of a computer, causes the computer to perform the label data desensitization method described in any of the above embodiments.
Beneficial effects of the present invention: the present invention provides a label data desensitization method, apparatus, electronic device and storage medium. For initial label data carrying big data analysis labels, the invention grades and classifies the initial label data and uses the preset level-rule correspondence table to determine the label desensitization rule quickly, then desensitizes the initial label data according to that rule to obtain target desensitized data. Determining the label desensitization rule from the labels of the initial label data improves the security and generality of data desensitization and facilitates the desensitization of large-scale, systematic label data.
It should be understood that the foregoing general description and the following detailed description are exemplary and explanatory only and do not limit the present invention.
Brief Description of the Drawings
The drawings herein are incorporated into and form part of this specification, illustrate embodiments consistent with the present invention, and together with the specification serve to explain its principles. The drawings described below show only some embodiments of the present invention; those of ordinary skill in the art can derive other drawings from them without creative effort. In the drawings:
FIG. 1 is a schematic diagram of an exemplary system architecture to which the technical solution of an embodiment of the present invention can be applied;
FIG. 2 is a schematic flowchart of a label data desensitization method according to an embodiment of the present invention;
FIG. 3 is a schematic flowchart of obtaining graded and classified data according to an embodiment of the present invention;
FIG. 4 is a schematic flowchart of obtaining classified label data according to an embodiment of the present invention;
FIG. 5 is a schematic flowchart of determining a label desensitization rule according to an embodiment of the present invention;
FIG. 6 is a schematic flowchart of desensitizing initial label data according to an embodiment of the present invention;
FIG. 7 is a schematic flowchart of obtaining target desensitized data according to an embodiment of the present invention;
FIG. 8 is a schematic flowchart of the overall label data desensitization process according to an embodiment of the present invention;
FIG. 9 is a block diagram of a label data desensitization apparatus according to an embodiment of the present invention;
FIG. 10 is a schematic structural diagram of a computer system of an electronic device suitable for implementing an embodiment of the present invention.
Detailed Description
Embodiments of the present invention are described below with reference to the drawings and preferred embodiments. Those skilled in the art can readily understand other advantages and effects of the invention from the contents disclosed in this specification. The invention can also be implemented or applied through other specific embodiments, and the details in this specification can be modified or changed from different viewpoints and for different applications without departing from the spirit of the invention. It should be understood that the preferred embodiments serve only to illustrate the invention and not to limit its scope of protection.
It should be noted that the illustrations provided in the following embodiments only illustrate the basic concept of the invention schematically. The drawings therefore show only the components related to the invention and are not drawn according to the number, shape and size of components in an actual implementation. In an actual implementation the form, number and proportion of the components may vary freely, and the component layout may be more complex.
In the following description, numerous details are discussed to provide a more thorough explanation of the embodiments of the invention. It will nevertheless be apparent to those skilled in the art that the embodiments can be practiced without these specific details. In other embodiments, well-known structures and devices are shown in block diagram form rather than in detail, to avoid obscuring the embodiments of the invention.
Referring to FIG. 1, FIG. 1 is a schematic diagram of an exemplary system architecture to which the technical solution of an embodiment of the present invention can be applied. As shown in FIG. 1, the system architecture includes a label data resource module, a label management and control system, a label service module and a label application module. The label data resource module stores the initial label data. The label management and control system includes a label management module and a label desensitization module. After acquiring the initial label data, the label management module manages its grading and classification and determines the label desensitization rule based on that management; the label desensitization module desensitizes the initial label data based on the label desensitization rule to obtain the target desensitized data. The label service module distributes the target desensitized data to the label application module, which uses the desensitized data according to business needs.
To solve the above technical problem, the present invention provides a label data desensitization method, apparatus, electronic device and storage medium. The implementation details of the technical solution of the embodiments are described below.
Referring to FIG. 2, FIG. 2 is a schematic flowchart of a label data desensitization method according to an embodiment of the present invention. As shown in FIG. 2, in an exemplary embodiment the label data desensitization method includes at least steps S2100 to S2300, described in detail as follows:
Step S2100: acquire initial label data.
In one embodiment of the present invention, the initial label data is acquired by reading a data set from a data source into memory. Data sources include big data platforms, relational databases and non-relational databases (mainly key-value stores). For a big data platform: mainstream big data platforms are built on Hadoop, and the data set can be read through the client-side data reading capabilities of SQL-on-Hadoop components such as Impala (query system), Hive (data warehouse) and Trino (interactive analytical query engine). For a relational database, the corresponding label is located by field and the data is read through Java Database Connectivity (JDBC). For a key-value database, the corresponding data is located by key through a client or an API. The data read is held in memory as a two-dimensional table data set object consisting of entity identifiers and other label features.
Step S2200: grade and classify the initial label data to obtain graded and classified data, and determine the label desensitization rule based on the graded and classified data and the preset level-rule correspondence table.
The preset level-rule correspondence table characterizes the matching relationship between preset desensitization levels and preset desensitization rules.
In one embodiment of the present invention, the desensitization level and desensitization rule are loaded from the label metadata. The label management module maintains the description of the initial label data, including the label definition, the processing rules and the corresponding data security grading and classification. Data security grading and classification means managing data in user-defined, multi-level, multi-category classifications according to actual business conditions and national regulations. Data classification supports the management of sensitivity levels, descriptions and identification rules: sensitivity ratings can be assigned to classified content, and rule labels can be added to a classification as the basis for desensitization.
In one embodiment of the present invention, the initial label data is graded according to national regulations. The national grading standard measures how far the data can be restored after desensitization. From the highest desensitization level to the lowest, the six levels are: 0, completely irreversible; 1, statistically meaningful only; 2, partially restorable; 3, fully restorable under conditions (non-encrypted); 4, restorable under conditions (encrypted); and 5, fully plaintext. The national grading standard is shown in Table 1:
Table 1
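The six levels listed above can be captured as an enumeration. The following is a minimal sketch rather than part of the described method: the member names and the `stricter` helper are illustrative assumptions, and only the numeric values and their meanings come from the text.

```python
from enum import IntEnum


class DesensitizationLevel(IntEnum):
    """Restorability levels as listed in the text; member names are assumed."""
    IRREVERSIBLE = 0                # completely irreversible
    STATISTICAL_ONLY = 1            # statistically meaningful only
    PARTIALLY_RESTORABLE = 2        # partially restorable
    CONDITIONAL_NON_ENCRYPTED = 3   # fully restorable under conditions (non-encrypted)
    CONDITIONAL_ENCRYPTED = 4       # restorable under conditions (encrypted)
    PLAINTEXT = 5                   # fully plaintext


def stricter(a: DesensitizationLevel, b: DesensitizationLevel) -> DesensitizationLevel:
    """Return the stricter level; a lower number leaves less data restorable."""
    return min(a, b)
```

Because level 0 is the highest desensitization level, comparing two levels "strictly" amounts to taking the numerically smaller one.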
In one embodiment of the present invention, the preset level-rule correspondence table records the matching relationship among the algorithm, the algorithm description, the applicable data types, the level and the concrete implementation of the algorithm. For example, if the preset desensitization rule is relational mapping, the corresponding desensitization level is 3 or 4, and the concrete implementations include fixed mapping, random mapping and hashing. The relationships in the preset level-rule correspondence table are shown in Table 2:
Table 2
In one embodiment of the present invention, referring to FIG. 3, FIG. 3 is a schematic flowchart of obtaining graded and classified data according to an embodiment of the present invention. As shown in FIG. 3, grading and classifying the initial label data to obtain graded and classified data includes at least steps S3210 to S3230, described in detail as follows:
Step S3210: classify the initial label data to obtain classified label data.
In one embodiment of the present invention, classification desensitization levels and classification desensitization rules can be preset in the system according to business type.
In one embodiment of the present invention, referring to FIG. 4, FIG. 4 is a schematic flowchart of obtaining classified label data according to an embodiment of the present invention. As shown in FIG. 4, classifying the initial label data to obtain classified label data includes at least steps S4211 to S4213, described in detail as follows:
Step S4211: if the initial label data has no parent label, classify the label data based on the business type of the initial label data to obtain first classified data.
In one embodiment of the present invention, if the initial label data has no parent label, the initial label data can be classified by business type. Through classification, the system can quickly find and reuse the desensitization levels and desensitization rules of similar classifications, and it supports manual revision and confirmation.
Step S4212: if the initial label data has a parent label, inherit the classification of the parent label to obtain second classified data.
In one embodiment of the present invention, for a child label derived from parent label data, the child label follows the strictest-wins rule and automatically inherits the highest sensitivity level among its parent labels.
Step S4213: take the first classified data and the second classified data as the classified label data.
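Steps S4211 to S4213 can be sketched as follows. This is an illustration under stated assumptions, not the implementation described in the specification: the `Label` class, the `CATEGORY_BY_BUSINESS_TYPE` mapping and its contents are hypothetical, and "highest sensitivity level" is taken to mean the numerically lowest level, following the 0-to-5 ordering given earlier.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Hypothetical preset: business type -> (category, desensitization level).
CATEGORY_BY_BUSINESS_TYPE = {
    "identity": ("personal_identity", 2),
    "behavior": ("usage_behavior", 4),
}


@dataclass
class Label:
    name: str
    business_type: str
    parents: List["Label"] = field(default_factory=list)
    category: Optional[str] = None
    level: Optional[int] = None


def classify(label: Label) -> Label:
    """Classify a label by business type, or inherit from its strictest parent."""
    if not label.parents:
        # S4211: no parent label, so classify by business type.
        label.category, label.level = CATEGORY_BY_BUSINESS_TYPE[label.business_type]
    else:
        # S4212: classify the parents first, then inherit from the strictest one
        # (assumption: a lower level number means a stricter level).
        for parent in label.parents:
            classify(parent)
        strictest = min(label.parents, key=lambda p: p.level)
        label.category, label.level = strictest.category, strictest.level
    return label
```

A label derived from both an identity label and a behavior label would inherit the identity label's category and level, since that parent is the stricter of the two in this sketch.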
Step S3220: grade the initial label data based on a preset grading standard to obtain graded label data.
In one embodiment of the present invention, the preset grading standard is the national grading standard prescribed by national regulations; the initial label data is graded according to the national grading standard to obtain the graded label data.
Step S3230: take the classified label data and the graded label data as the graded and classified data.
In one embodiment of the present invention, referring to FIG. 5, FIG. 5 is a schematic flowchart of determining a label desensitization rule according to an embodiment of the present invention. As shown in FIG. 5, determining the label desensitization rule based on the graded and classified data and the preset level-rule correspondence table includes at least steps S5240 to S5260, described in detail as follows:
Step S5240: obtain a first desensitization level and a first desensitization rule from the classified label data.
In one embodiment of the present invention, the first desensitization level and the first desensitization rule are obtained from the preset desensitization level and preset desensitization rule corresponding to the classified label data.
Step S5250: obtain a second desensitization level and a second desensitization rule from the graded label data.
In one embodiment of the present invention, the second desensitization level and the second desensitization rule are obtained from the preset desensitization level and preset desensitization rule corresponding to the graded label data.
Step S5260: determine the label desensitization rule from among the first desensitization level, the first desensitization rule, the second desensitization level and the second desensitization rule based on the preset level-rule correspondence table.
In one embodiment of the present invention, the label desensitization rule is determined on a strictest-wins basis. First, the highest desensitization level is determined from the first and second desensitization levels, and the highest desensitization rule is determined from the first and second desensitization rules. It is then checked whether the highest desensitization level and the highest desensitization rule match. If they do not match, the desensitization rule corresponding to the highest desensitization level is taken as the label desensitization rule.
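One way to read the strictest-wins selection is sketched below. Several points are assumptions made for illustration: the table is modeled as a mapping from rule name to the set of levels it is matched with, only the `relational_mapping` entry (levels 3 or 4) comes from the text, the other rule names are hypothetical, a rule's strictness is taken as the lowest level it is matched with, and the matching case (which the text does not spell out) is taken to keep the highest rule.

```python
from typing import Dict, Set, Tuple

# Rule name -> levels it is matched with. Only "relational_mapping" is from
# the text; the other two entries are hypothetical placeholders.
LEVEL_RULE_TABLE: Dict[str, Set[int]] = {
    "relational_mapping": {3, 4},
    "partial_masking": {2},
    "aggregation": {1},
}


def rule_strictness(rule: str) -> int:
    """Assumed ordering: a rule is as strict as the lowest level it is matched with."""
    return min(LEVEL_RULE_TABLE[rule])


def determine_label_rule(first: Tuple[int, str], second: Tuple[int, str]) -> Tuple[int, str]:
    """Pick the strictest level and rule; on mismatch, follow the level."""
    (level_1, rule_1), (level_2, rule_2) = first, second
    top_level = min(level_1, level_2)                    # lower number = stricter
    top_rule = min((rule_1, rule_2), key=rule_strictness)
    if top_level in LEVEL_RULE_TABLE[top_rule]:
        return top_level, top_rule                       # level and rule match
    for rule, levels in LEVEL_RULE_TABLE.items():        # mismatch: look up by level
        if top_level in levels:
            return top_level, rule
    raise LookupError(f"no rule registered for level {top_level}")
```

With this table, a classification result of level 1 paired with `relational_mapping` does not match, so the lookup falls back to the rule registered for level 1.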
In one embodiment of the present invention, the data that must be desensitized is determined by checking identifier labels. Such data differs from other data in that it carries identifying features of a specific object, such as a name or an ID card number. These features make it very easy to single out a particular individual, and a leak can cause serious harm. Identifier labels therefore undergo a thorough, mandatory check of their label desensitization rules, and plaintext is not allowed. Identifier labels include, but are not limited to, unique identity identifiers and identity feature identifiers. A unique identity identifier uniquely identifies a specific entity object, for example a mobile phone number, a device number or an ID card number. Identity feature identifiers are a small set of features that together can quickly locate an entity object; for example, a residential address combined with a name can quickly locate a person.
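The identifier check can be illustrated with a small guard function. This is a sketch under assumptions: the label names in the two sets are hypothetical stand-ins for the examples in the text, and "plaintext is not allowed" is interpreted as rejecting level 5 for any identifier label.

```python
PLAINTEXT = 5  # level 5 in the grading described earlier: fully plaintext

# Hypothetical label names standing in for the examples given in the text.
UNIQUE_IDENTIFIERS = {"mobile_number", "device_id", "id_card_number"}
FEATURE_IDENTIFIERS = {"name", "residential_address"}


def must_desensitize(label_name: str) -> bool:
    """True for unique identity identifiers and identity feature identifiers."""
    return label_name in UNIQUE_IDENTIFIERS or label_name in FEATURE_IDENTIFIERS


def check_rule(label_name: str, level: int) -> None:
    """Raise ValueError if an identifier label would be released in plaintext."""
    if must_desensitize(label_name) and level >= PLAINTEXT:
        raise ValueError(f"label {label_name!r} must not be released in plaintext")
```

A non-identifier label at level 5 passes the check, while an identifier label at level 5 is rejected before any data leaves the system.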
Step S2300: desensitize the initial label data based on the label desensitization rule to obtain target desensitized data.
In one embodiment of the present invention, referring to FIG. 6, FIG. 6 is a schematic flowchart of desensitizing initial label data according to an embodiment of the present invention. As shown in FIG. 6, desensitizing the initial label data based on the label desensitization rule to obtain the target desensitized data includes at least steps S6310 to S6330, described in detail as follows:
Step S6310: if the label desensitization rule contains multiple preset sub-desensitization rules, determine the target desensitization rule according to the data type of the initial label data.
In one embodiment of the present invention, as shown in Table 2, if the label desensitization rule contains multiple preset sub-desensitization rules, the target desensitization rule can be determined according to the data type of the initial label data. For example, if the label desensitization rule is offset rounding, a time value may be rounded up; as another example, rounding down with a rounding digit count of 2 desensitizes 1988.65 to 1900. These are examples only. The preset level-rule correspondence table also includes the matching relationship between common desensitization algorithms and application scenarios, as shown in Table 3:
Table 3
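The two offset-rounding examples mentioned above (rounding a number down by two digits, and rounding a time value up) can be sketched as below. The function names and the one-hour step in the usage note are assumptions; only the 1988.65 to 1900 example comes from the text.

```python
import math
from datetime import datetime, timedelta


def round_down(value: float, digits: int) -> int:
    """Zero out the last `digits` integer digits, rounding toward negative infinity."""
    step = 10 ** digits
    return math.floor(value / step) * step


def round_time_up(ts: datetime, step: timedelta) -> datetime:
    """Round a naive datetime up to the next multiple of `step`."""
    remainder = (ts - datetime(1970, 1, 1)) % step
    return ts if not remainder else ts + (step - remainder)
```

Here `round_down(1988.65, 2)` reproduces the example in the text, and `round_time_up(datetime(2024, 1, 1, 10, 15), timedelta(hours=1))` moves the timestamp to 11:00 on the same day.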
Step S6320: if the label desensitization rule contains a single preset sub-desensitization rule, take that preset sub-desensitization rule as the target desensitization rule.
Step S6330: desensitize the initial label data according to the target desensitization rule to obtain the target desensitized data.
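Steps S6310 to S6330 can be sketched as a small dispatcher. The representation is an assumption: sub-rules are modeled as a dictionary keyed by data type, and the two example sub-rules are hypothetical placeholders rather than entries from Table 2.

```python
from typing import Callable, Dict, List

SubRule = Callable[[object], object]


def select_target_rule(sub_rules: Dict[str, SubRule], data_type: str) -> SubRule:
    """S6320: a single sub-rule is used as is; S6310: otherwise pick by data type."""
    if len(sub_rules) == 1:
        return next(iter(sub_rules.values()))
    return sub_rules[data_type]


def desensitize(values: List[object], sub_rules: Dict[str, SubRule], data_type: str) -> List[object]:
    """S6330: apply the selected target rule to every value."""
    rule = select_target_rule(sub_rules, data_type)
    return [rule(v) for v in values]


# Hypothetical sub-rules for illustration only.
EXAMPLE_SUB_RULES: Dict[str, SubRule] = {
    "number": lambda v: int(v // 10) * 10,            # coarsen to the tens
    "string": lambda s: s[:1] + "*" * (len(s) - 1),   # keep the first character
}
```

Keying the sub-rules by data type keeps the selection step a plain lookup, which is one simple way to realize "determine the target rule according to the data type".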
In one embodiment of the present invention, desensitizing the initial label data is implemented by orchestrating algorithm-function computations into a Spark-based distributed big data computing task, which is submitted to a fixed computing cluster for execution. A desensitization algorithm is implemented by one or more desensitization algorithm functions; a complex desensitization algorithm may involve nested calls of multiple functions.
In one embodiment of the present invention, referring to FIG. 7, FIG. 7 is a schematic flowchart of obtaining target desensitized data according to an embodiment of the present invention. As shown in FIG. 7, desensitizing the initial label data according to the target desensitization rule to obtain the target desensitized data includes at least steps S7331 to S7335, described in detail as follows:
Step S7331: if the initial label data is a data set, aggregate the initial label data to obtain aggregated data, and desensitize the aggregated data according to the target desensitization rule to obtain first desensitized data.
In one embodiment of the present invention, an aggregate function is called on the data set as a whole, for example to compute a mean or a distribution. Such aggregation-type computations must first be pre-aggregated on each distributed node, after which the desensitization computation is performed on the control node.
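The pre-aggregate-then-desensitize pattern can be illustrated for a mean without any cluster framework. This is a single-process sketch under assumptions: each inner list stands in for the data held by one node, and rounding the final mean is used as a stand-in desensitization step.

```python
from typing import Iterable, List, Tuple


def partial_aggregate(values: Iterable[float]) -> Tuple[float, int]:
    """Per-node pre-aggregation: return (sum, count) for one partition."""
    total, count = 0.0, 0
    for v in values:
        total += v
        count += 1
    return total, count


def desensitized_mean(partitions: List[List[float]], digits: int = 0) -> float:
    """Combine the partial results on the control node, then coarsen the mean."""
    partials = [partial_aggregate(p) for p in partitions]
    total = sum(t for t, _ in partials)
    count = sum(c for _, c in partials)
    if count == 0:
        raise ValueError("cannot aggregate an empty data set")
    return round(total / count, digits)  # assumed desensitization step: rounding
```

Only the `(sum, count)` pairs travel to the control node, so the raw per-entity values never need to be gathered in one place before the aggregate is desensitized.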
Step S7332: if the data set contains multiple labels, the data set is allocated to different desensitization nodes for desensitization according to preset desensitization computing resources, to obtain second desensitized data.
In one embodiment of the present invention, the function call operations on different labels can be allocated, and the desensitization computed, according to the usage of the preset desensitization computing resources.
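One possible way to allocate per-label work according to resource usage is a greedy least-loaded assignment, sketched below. The patent does not specify the allocation policy; the cost estimates, node names, and the `assign_labels` helper are hypothetical.

```python
import heapq


def assign_labels(label_costs: dict[str, float],
                  node_loads: dict[str, float]) -> dict[str, str]:
    """Assign each label's desensitization work to the currently
    least-loaded node, handling the most expensive labels first."""
    heap = [(load, node) for node, load in node_loads.items()]
    heapq.heapify(heap)
    assignment: dict[str, str] = {}
    for label, cost in sorted(label_costs.items(), key=lambda kv: -kv[1]):
        load, node = heapq.heappop(heap)       # least-loaded node so far
        assignment[label] = node
        heapq.heappush(heap, (load + cost, node))
    return assignment


# Assumed cost estimates per label and current load per node.
assignment = assign_labels(
    {"age": 4.0, "income": 3.0, "city": 2.0},
    {"node-1": 0.0, "node-2": 1.0},
)
```

In a real cluster the scheduler (for example Spark's own task scheduling) would normally make this decision; the sketch only illustrates the idea of balancing labels across nodes by load.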
Step S7333: if the initial label data contains different entity objects under the same label, the initial label data is distributed to different desensitization nodes based on its key values for desensitization, to obtain third desensitized data.
In one embodiment of the present invention, different entity objects under the same label are partitioned by the key values of the initial label data and, once distributed to different desensitization nodes, undergo the desensitization computation there.
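Key-based distribution can be illustrated with simple hash partitioning. The sketch below assumes string keys, a CRC32-based partitioner, and a hypothetical masking rule that keeps only the first character; none of these details come from the patent.

```python
import zlib
from collections import defaultdict


def node_for_key(key: str, num_nodes: int) -> int:
    """Map a record key to a node index with a stable hash."""
    return zlib.crc32(key.encode("utf-8")) % num_nodes


def partition_by_key(records: list[tuple[str, str]],
                     num_nodes: int) -> dict[int, list[tuple[str, str]]]:
    """Group (key, value) records by the node that will process them."""
    buckets: dict[int, list[tuple[str, str]]] = defaultdict(list)
    for key, value in records:
        buckets[node_for_key(key, num_nodes)].append((key, value))
    return dict(buckets)


def mask_value(value: str) -> str:
    """Hypothetical rule: keep the first character, mask the rest."""
    return value[:1] + "*" * (len(value) - 1)


# One label ("city") with values for several entity objects.
records = [("user-001", "Shanghai"), ("user-002", "Beijing"),
           ("user-001", "Shenzhen")]
buckets = partition_by_key(records, num_nodes=3)
masked = {node: [(k, mask_value(v)) for k, v in rows]
          for node, rows in buckets.items()}
```

Because the hash is deterministic, all records sharing a key land on the same node, so each node can desensitize its share independently.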
Step S7334: if the initial label data contains nested data, the initial label data is desensitized from the inside out to obtain fourth desensitized data.
In one embodiment of the present invention, nested data is desensitized through nested functions, that is, the inner function is evaluated first. For example, a nested call such as funcA(funcB(Object)) executes the inner funcB first and then the outer funcA, and likewise for deeper levels of nesting.
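The inside-out evaluation order can be shown with a small sketch. The names funcA and funcB come from the text, but what each function does here (trimming whitespace, masking all but the last four characters) is an assumption chosen only to make the example concrete.

```python
from functools import reduce
from typing import Callable


def func_b(value: str) -> str:
    """Inner step (hypothetical): strip surrounding whitespace."""
    return value.strip()


def func_a(value: str) -> str:
    """Outer step (hypothetical): mask all but the last four characters."""
    return "*" * max(len(value) - 4, 0) + value[-4:]


def apply_nested(value: str,
                 funcs_outer_to_inner: list[Callable[[str], str]]) -> str:
    """Evaluate funcs[0](funcs[1](...funcs[-1](value))): the innermost
    (last-listed) function runs first, the outermost runs last."""
    return reduce(lambda acc, f: f(acc), reversed(funcs_outer_to_inner), value)


# Equivalent to func_a(func_b("  13812345678 ")).
result = apply_nested("  13812345678 ", [func_a, func_b])
```

Listing the functions in outer-to-inner order mirrors how the nested call is written, while `reversed` ensures the innermost function is executed first.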
Step S7335: the target desensitized data is obtained based on the first desensitized data, the second desensitized data, the third desensitized data and the fourth desensitized data.
In one embodiment of the present invention, once the target desensitized data is obtained, the label service module distributes it to a target resource location, which is used to store the target desensitized data or to serve calls for it. The target desensitized data is stored in a database or file system, so that the label application module can retrieve it as a data file, as a byte stream, or through an API.
In one embodiment of the present invention, referring again to FIG. 1, the label management module is also used to manage basic label information, including the basic definition of the label data, its physical storage location (database, table, field, key, etc.), and the corresponding data security level and desensitization information. The label desensitization module can also invoke desensitization function algorithms in parallel and in series on multiple computing nodes and monitor the running status of tasks in real time.
In one embodiment of the present invention, please refer to FIG. 8, which shows a schematic flowchart of the overall label data desensitization process according to one embodiment of the present invention. As shown in FIG. 8:
Step S8100, acquire the initial label data: the data set of the data source is read into memory.
Step S8200, grade and classify the initial label data to obtain multiple desensitization levels and desensitization rules: a first desensitization level and a first desensitization rule are obtained from the classified initial label data, and a second desensitization level and a second desensitization rule are obtained from the graded initial label data.
Step S8300, check how the desensitization levels and desensitization rules match: applying the strictest standard, the highest desensitization level and the highest desensitization rule are determined, and the match between them is checked to obtain the label desensitization rule.
Step S8400, check identification labels: this guards against a label carrying the identifying features of a specific object being assigned too low a desensitization level.
Step S8500, call the desensitization processing function and execute the desensitization task: the initial label data is desensitized according to the label desensitization rule to obtain the target desensitized data.
Step S8600, distribute the desensitized data to the target resource location, so that the label application module can use the target desensitized data.
The present invention provides automatic discovery of sensitive data, system-driven desensitization workflows, support for a wide range of data sources, an ample library of desensitization algorithms, a rich set of sensitive data types, and support for distributed computing, thereby reducing labor costs, improving efficiency, and meeting the basic requirements of data desensitization. It can be applied more broadly to data desensitization in scenarios that use label data as raw material, such as data mining, intelligent recommendation, and personalized user services.
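Steps S8200 to S8300 can be sketched as a lookup against the preset level-rule correspondence table. The table contents, the numeric level scale (higher means stricter), and the behavior of returning a match flag are assumptions for illustration; the patent does not give concrete values.

```python
# Hypothetical preset level-rule correspondence table: a higher level is stricter.
LEVEL_RULE_TABLE = {1: "keep", 2: "partial_mask", 3: "full_mask"}


def resolve_label_rule(candidates: list[tuple[int, str]],
                       table: dict[int, str]) -> tuple[int, str, bool]:
    """Pick the strictest (level, rule) candidate, then check it against
    the correspondence table.

    Returns (level, rule required by the table, whether the candidate's
    own rule already matched the table)."""
    level, rule = max(candidates, key=lambda c: c[0])
    expected = table[level]
    return level, expected, rule == expected


# First candidate comes from classification, second from grading (assumed values).
from_classification = (2, "partial_mask")
from_grading = (3, "partial_mask")
level, label_rule, matched = resolve_label_rule(
    [from_classification, from_grading], LEVEL_RULE_TABLE
)
```

Here the grading result demands level 3, so the table's rule for level 3 is used even though the candidate carried a weaker rule; the `matched` flag records that the mismatch was corrected.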
Please refer to FIG. 9, which shows a block diagram of a label data desensitization device according to one embodiment of the present invention. The device can be applied to the implementation environment shown in FIG. 1. The device can also be applied to other exemplary implementation environments and be configured in other equipment; this embodiment does not limit the implementation environment to which the device applies.
As shown in FIG. 9, the exemplary label data desensitization device includes an acquisition module 901, a rule determination module 902 and a desensitization module 903.
The acquisition module 901 is used to acquire the initial label data;
the rule determination module 902 is used to grade and classify the initial label data to obtain graded and classified data, and to determine the label desensitization rule based on the graded and classified data and a preset level-rule correspondence table, the preset level-rule correspondence table characterizing the matching relationship between preset desensitization levels and preset desensitization rules;
the desensitization module 903 is used to desensitize the initial label data based on the label desensitization rule to obtain the target desensitized data.
The label data desensitization device further includes a label data resource module, a label management and control system, a label service module and a label application module.
The label data resource module is used to store the initial label data;
the label management and control system includes a label management module and a label desensitization module. The label management module is used to acquire the initial label data through the acquisition module, to manage the grading and classification of the initial label data, and to determine the label desensitization rule based on that grading and classification management. The label desensitization module is used to desensitize the initial label data based on the label desensitization rule to obtain the target desensitized data;
the label service module is used to distribute the target desensitized data to the label application module for use.
It should be noted that the label data desensitization device provided in the above embodiment and the label data desensitization method provided in the above embodiments belong to the same concept. The specific manner in which each module and unit performs its operations has been described in detail in the method embodiments and is not repeated here. In practical applications, the functions of the label data desensitization device provided in the above embodiment can be assigned to different functional modules as needed, that is, the internal structure of the device can be divided into different functional modules to complete all or part of the functions described above; this is not limited here.
An embodiment of the present invention also provides an electronic device, including: one or more processors; and a storage device for storing one or more programs which, when executed by the one or more processors, cause the electronic device to implement the label data desensitization method provided in the above embodiments.
Please refer to FIG. 10, which shows a schematic structural diagram of a computer system of an electronic device suitable for implementing embodiments of the present invention. It should be noted that the computer system 1000 of the electronic device shown in FIG. 10 is only an example and should not impose any limitation on the functions or scope of use of the embodiments of the present invention.
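As shown in FIG. 10, the computer system 1000 includes a central processing unit (CPU) 1001, which can perform various appropriate actions and processing, such as executing the methods of the above embodiments, according to a program stored in a read-only memory (ROM) 1002 or a program loaded from a storage portion 1008 into a random access memory (RAM) 1003. The RAM 1003 also stores various programs and data required for system operation. The CPU 1001, the ROM 1002 and the RAM 1003 are connected to one another via a bus 1004. An input/output (I/O) interface 1005 is also connected to the bus 1004.
The following components are connected to the I/O interface 1005: an input portion 1006 including a keyboard, a mouse and the like; an output portion 1007 including a display such as a cathode ray tube (CRT) or liquid crystal display (LCD), a speaker and the like; a storage portion 1008 including a hard disk and the like; and a communication portion 1009 including a network interface card such as a local area network (LAN) card or a modem. The communication portion 1009 performs communication processing via a network such as the Internet. A drive 1010 is also connected to the I/O interface 1005 as needed. A removable medium 1011, such as a magnetic disk, an optical disk, a magneto-optical disk or a semiconductor memory, is mounted on the drive 1010 as needed, so that a computer program read from it can be installed into the storage portion 1008 as needed.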
In particular, according to embodiments of the present invention, the processes described above with reference to the flowcharts can be implemented as computer software programs. For example, an embodiment of the present invention includes a computer program product comprising a computer program carried on a computer-readable medium, the computer program containing program code for executing the method shown in the flowchart. In such an embodiment, the computer program can be downloaded and installed from a network through the communication portion 1009, and/or installed from the removable medium 1011. When the computer program is executed by the central processing unit (CPU) 1001, the various functions defined in the system of the present invention are performed.
It should be noted that the computer-readable medium described in the embodiments of the present invention may be a computer-readable signal medium, a computer-readable storage medium, or any combination of the two. A computer-readable storage medium may be, for example, an electrical, magnetic, optical, electromagnetic, infrared or semiconductor system, apparatus or device, or any combination of the above. More specific examples of computer-readable storage media include, but are not limited to: an electrical connection with one or more wires, a portable computer disk, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM), a flash memory, an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above. In the present invention, a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, carrying a computer-readable computer program. Such a propagated data signal may take many forms, including but not limited to an electromagnetic signal, an optical signal, or any suitable combination of the above. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate or transmit a program for use by or in connection with an instruction execution system, apparatus or device. A computer program contained on a computer-readable medium may be transmitted over any appropriate medium, including but not limited to wireless, wired, or any suitable combination of the above.
The flowcharts and block diagrams in the accompanying drawings illustrate the architecture, functions and operations of possible implementations of systems, methods and computer program products according to various embodiments of the present invention. Each block in a flowchart or block diagram may represent a module, a program segment, or a portion of code that contains one or more executable instructions for implementing the specified logical function. It should also be noted that, in some alternative implementations, the functions noted in the blocks may occur in an order different from that noted in the drawings. For example, two blocks shown in succession may in fact be executed substantially in parallel, or sometimes in the reverse order, depending on the functions involved. It should also be noted that each block in the block diagrams or flowcharts, and combinations of such blocks, can be implemented by a dedicated hardware-based system that performs the specified functions or operations, or by a combination of dedicated hardware and computer instructions.
The units described in the embodiments of the present invention may be implemented in software or in hardware, and the described units may also be provided in a processor. In some cases, the names of these units do not limit the units themselves.
Another aspect of the present invention provides a computer-readable storage medium on which a computer program is stored. When executed by a processor of a computer, the computer program causes the computer to execute the label data desensitization method provided in the above embodiments. The computer-readable storage medium may be included in the electronic device described in the above embodiments, or may exist on its own without being assembled into the electronic device.
Another aspect of the present invention provides a computer program product or computer program that includes computer instructions stored in a computer-readable storage medium. A processor of a computer device reads the computer instructions from the computer-readable storage medium and executes them, so that the computer device performs the label data desensitization method provided in the above embodiments.
In the above embodiments, unless otherwise specified, ordinal terms such as "first" and "second" used to describe a common object merely indicate that different instances of the same object are being referred to; they do not imply that the objects described must follow a given order, whether temporally, spatially, in ranking, or in any other manner.
The above embodiments merely illustrate the principles and effects of the present invention and are not intended to limit it. Anyone skilled in the art may modify or change the above embodiments without departing from the spirit and scope of the present invention. Accordingly, all equivalent modifications or changes made by a person of ordinary skill in the art without departing from the spirit and technical ideas disclosed by the present invention shall still be covered by the claims of the present invention.
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202310298859.4A (published as CN116226925A) | 2023-03-24 | 2023-03-24 | Label data desensitizing method and device, electronic equipment and storage medium |
| Publication Number | Publication Date |
|---|---|
| CN116226925A | 2023-06-06 |
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202310298859.4A (pending; published as CN116226925A) | Label data desensitizing method and device, electronic equipment and storage medium | 2023-03-24 | 2023-03-24 |
| Country | Link |
|---|---|
| CN (1) | CN116226925A (en) |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116545776A (en)* | 2023-07-06 | 2023-08-04 | 中航金网(北京)电子商务有限公司 | Data transmission method, device, computer equipment and storage medium |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102110101A (en)* | 2009-12-29 | 2011-06-29 | 北大方正集团有限公司 | Method and device used for indexing digital content |
| CN107480549A (en)* | 2017-06-28 | 2017-12-15 | 银江股份有限公司 | A kind of shared sensitive information desensitization method of data-oriented and system |
| CN109977222A (en)* | 2019-03-05 | 2019-07-05 | 广州海晟科技有限公司 | The recognition methods of data sensitive behavior |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116545776A (en)* | 2023-07-06 | 2023-08-04 | 中航金网(北京)电子商务有限公司 | Data transmission method, device, computer equipment and storage medium |
| CN116545776B (en)* | 2023-07-06 | 2023-10-03 | 中航金网(北京)电子商务有限公司 | Data transmission method, device, computer equipment and storage medium |
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||