Movatterモバイル変換


[0]ホーム

URL:


CN110084046A - Mask method, detection method and device, the electronic equipment of structural data watermark - Google Patents

Mask method, detection method and device, the electronic equipment of structural data watermark
Download PDF

Info

Publication number
CN110084046A
CN110084046ACN201910199779.7ACN201910199779ACN110084046ACN 110084046 ACN110084046 ACN 110084046ACN 201910199779 ACN201910199779 ACN 201910199779ACN 110084046 ACN110084046 ACN 110084046A
Authority
CN
China
Prior art keywords
data
structured data
data set
structured
identification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910199779.7A
Other languages
Chinese (zh)
Inventor
安宁钰
梁潇
赵保华
高昆仑
郑晓崑
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Global Energy Interconnection Research Institute Co Ltd
State Grid Corp of China SGCC
Original Assignee
Global Energy Interconnection Research Institute Co Ltd
State Grid Corp of China SGCC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Global Energy Interconnection Research Institute Co Ltd, State Grid Corp of China SGCCfiledCriticalGlobal Energy Interconnection Research Institute Co Ltd
Priority to CN201910199779.7ApriorityCriticalpatent/CN110084046A/en
Publication of CN110084046ApublicationCriticalpatent/CN110084046A/en
Pendinglegal-statusCriticalCurrent

Links

Classifications

Landscapes

Abstract

The invention discloses mask method, detection method and device, the electronic equipment of a kind of structural data watermark, which includes: the transmission property information for obtaining structured data sets and the structured data sets to be sent;Mark data is generated according to transmission property information;The initial data in structural data concentration preset range is extracted, identifies the data type of initial data;Initial data is converted according to the data type of preset watermarking algorithm and the initial data, mark data, obtains the structured data sets of insertion mark data.By applying the present invention, concentrating transmission property information as digital watermark embedding structural data, the transmission property information so as to the subsequent structured data sets according to leakage is called to account.

Description

Translated fromChinese
结构化数据水印的标注方法、检测方法及装置、电子设备Marking method, detection method and device of structured data watermark, electronic equipment

技术领域technical field

本发明涉及数字水印技术领域,具体涉及一种结构化数据水印的标注 方法、检测方法及装置、电子设备。The invention relates to the technical field of digital watermarking, in particular to a marking method, detection method and device, and electronic equipment for structured data watermarking.

背景技术Background technique

信息技术发展迅猛,以大数据分析、新一代人工智能技术为代表,已 在国家治理、机构精益化管理、提升客户服务等方面发挥了重要作用。数 据的充分融合和共享已成为大势所趋,必将为经济社会发展带来深远影响。 数据已成为国家和组织机构的重要资产。然而,数据安全问题却日益突出, 数据的盗用和滥用问题日趋严重,是目前妨碍数据进一步进行融合共享的 首要问题。The rapid development of information technology, represented by big data analysis and new-generation artificial intelligence technology, has played an important role in national governance, lean management of institutions, and improvement of customer service. The full integration and sharing of data has become an irresistible trend and will definitely have a profound impact on economic and social development. Data has become an important asset for countries and organizations. However, the problem of data security is becoming more and more prominent, and the problem of data theft and abuse is becoming more and more serious, which is the primary problem currently hindering the further integration and sharing of data.

数字水印(Digital Watermarking)技术是将一些标识信息(即数字水印) 直接嵌入数字载体当中(包括多媒体、文档、软件等),且不影响原载体的 使用价值,也不容易被探知和再次修改,但可以被生产方识别和辨认。数 字水印是实现数据防伪追责、版权保护的有效办法。结构化数据具有价值 密度高、易于分析等特点,但由于结构化数据本身的特点,其可供嵌入水 印的空间很少,因而数字水印技术在结构化数据中应用较少,且目前缺少对于结构化数据的泄露追责方法。Digital watermarking (Digital Watermarking) technology is to embed some identification information (that is, digital watermark) directly into digital carriers (including multimedia, documents, software, etc.), without affecting the use value of the original carrier, and it is not easy to be detected and modified again. But it can be identified and identified by the producer. Digital watermarking is an effective way to realize data anti-counterfeiting accountability and copyright protection. Structured data has the characteristics of high value density and easy analysis, but due to the characteristics of structured data itself, there is very little space for embedding watermarks, so digital watermarking technology is rarely used in structured data, and there is currently a lack of structural data. Accountability methods for data breaches.

发明内容SUMMARY OF THE INVENTION

有鉴于此,本发明实施例提供了一种结构化数据水印的标注方法、检 测方法及装置、电子设备,以解决现有技术中缺少对于结构化数据的泄露 后追责泄露人的问题。In view of this, the embodiment of the present invention provides a structured data watermark marking method, detection method and device, and electronic equipment to solve the problem in the prior art that there is no accountability for the leakage of structured data.

根据第一方面,本发明实施例提供了一种结构化数据水印的标注方法, 包括:获取待发送的结构化数据集及所述结构化数据集的传输属性信息; 提取所述结构化数据集中预设范围内的原始数据,识别所述原始数据的数 据类型;根据所述传输属性信息生成标识数据;根据预设的水印算法及所 述原始数据的数据类型、所述标识数据对原始数据进行转换,得到嵌入标 识数据的结构化数据集。According to the first aspect, an embodiment of the present invention provides a structured data watermarking method, including: acquiring a structured data set to be sent and transmission attribute information of the structured data set; extracting the structured data set The original data within the preset range, identifying the data type of the original data; generating identification data according to the transmission attribute information; performing a process on the original data according to the preset watermark algorithm, the data type of the original data, and the identification data Transformation, resulting in a structured dataset with embedded identity data.

结合第一方面,在第一方面第一实施方式中,所述传输属性信息至少 包括:接收方的ID及传输日期信息,根据所述传输属性信息生成标识数据, 包括:提取所述接收方的ID及传输日期信息中的数字信息;根据预设排序 方式对所述数字信息进行排序,生成所述标识数据。With reference to the first aspect, in the first implementation manner of the first aspect, the transmission attribute information includes at least: the recipient’s ID and transmission date information, and generating identification data according to the transmission attribute information includes: extracting the recipient’s The digital information in the ID and transmission date information; sorting the digital information according to a preset sorting method to generate the identification data.

结合第一方面,在第一方面第二实施方式中,根据预设的水印算法及 所述原始数据的数据类型、所述标识数据将所述标识数据对原始数据进行 转换,包括:将所述标识数据进行格式转换,生成结构化标识数据;根据 所述结构化标识数据及原始数据对应位的数据及所述数据类型确定数据嵌 入方式;根据所述预设的水印算法及数据嵌入方式将所述原始数据转换为 所述嵌入标识数据的结构化数据集。With reference to the first aspect, in the second implementation manner of the first aspect, converting the identification data to the original data according to the preset watermark algorithm, the data type of the original data, and the identification data includes: converting the Converting the format of the identification data to generate structured identification data; determining the data embedding method according to the structured identification data and the corresponding bit data of the original data and the data type; according to the preset watermark algorithm and data embedding method The raw data is converted into the structured data set with embedded identification data.

根据第二方面,本发明实施例提供了一种结构化数据水印的检测方法, 包括:获取待测结构化数据集及原始数据集;使用如权利要求1-3中任一项 所述的结构化数据的水印标注方法根据所述原始数据集生成第一结构化数 据集;判断所述待测结构化数据集与所述第一结构化数据集是否匹配;当 所述待测结构化数据集与所述第一结构化数据集匹配时,对所述原始数据 集的标识数据进行还原,得到所述待测结构化数据集的传输属性信息。According to the second aspect, an embodiment of the present invention provides a method for detecting a structured data watermark, comprising: obtaining a structured data set to be tested and an original data set; using the structure described in any one of claims 1-3 The watermarking method for structured data generates a first structured data set according to the original data set; judges whether the structured data set to be tested matches the first structured data set; when the structured data set to be tested When matching with the first structured data set, restore the identification data of the original data set to obtain the transmission attribute information of the structured data set to be tested.

根据第三方面,本发明实施例提供了一种结构化数据水印的检测方法, 包括:获取待测结构化数据集;对所述待测结构化数据集进行排序;通过 水印提取算法从排序后的待测结构化数据集提取第一标识数;判断所述第 一标识数据与数据库中的预设标识数据是否匹配;当所述标识数据与所述 数据库中的标识数据匹配时,对所述第一标识数据进行还原,得到所述待 测结构化数据集的传输属性信息。According to the third aspect, the embodiment of the present invention provides a structured data watermark detection method, including: obtaining the structured data set to be tested; sorting the structured data set to be tested; Extracting the first identification number from the structured data set to be tested; judging whether the first identification data matches the preset identification data in the database; when the identification data matches the identification data in the database, the The first identification data is restored to obtain the transmission attribute information of the structured data set to be tested.

根据第四方面,本发明实施例提供了一种结构化数据水印的标注装置, 包括:第一获取模块,用于获取待发送的结构化数据集及所述结构化数据 集的传输属性信息;第一生成模块,用于根据所述传输属性信息生成标识 数据;识别模块,用于提取所述结构化数据集中预设范围内的原始数据, 识别所述原始数据的数据类型;第一处理模块,用于根据预设的水印算法 及所述原始数据的数据类型、所述标识数据对原始数据进行转换,得到嵌入标识数据的结构化数据集。According to a fourth aspect, an embodiment of the present invention provides a watermarking device for structured data, including: a first acquisition module, configured to acquire a structured data set to be sent and transmission attribute information of the structured data set; The first generation module is used to generate identification data according to the transmission attribute information; the identification module is used to extract the original data within the preset range in the structured data set, and identify the data type of the original data; the first processing module , for converting the original data according to a preset watermark algorithm, the data type of the original data, and the identification data, to obtain a structured data set embedded with identification data.

根据第五方面,本发明实施例提供了一种结构化数据水印的检测装置, 包括:第二获取模块,用于获取待测结构化数据集及原始数据集;第二生 成模块,用于使用如第一方面或第一方面任意一个实施方式中的结构化数 据的水印标注方法根据所述原始数据集生成第一结构化数据集;第一匹配 模块,用于判断所述待测结构化数据集与所述第一结构化数据集是否匹配; 第二处理模块,用于当所述待测结构化数据集与所述第一结构化数据集匹 配时,对所述待测原始数据集的标识数据进行还原,得到所述待测结构化 数据集的传输属性信息。According to the fifth aspect, an embodiment of the present invention provides a structured data watermark detection device, including: a second acquisition module, configured to acquire a structured data set to be tested and an original data set; a second generation module, configured to use As in the first aspect or in any implementation of the first aspect, the watermarking method for structured data generates a first structured data set according to the original data set; the first matching module is used to judge the structured data to be tested whether the set matches the first structured data set; the second processing module is used to process the original data set to be tested when the structured data set to be tested matches the first structured data set The identification data is restored to obtain the transmission attribute information of the structured data set to be tested.

根据第六方面,本发明实施例提供了一种结构化数据水印的检测装置, 包括:第三获取模块,用于获取待测结构化数据集;排序模块,用于对所 述待测结构化数据集进行排序;提取模块,用于通过水印提取算法从排序 后的待测结构化数据集提取第一标识数;第二匹配模块,用于判断所述第 一标识数据与数据库中的预设标识数据是否匹配;第三处理模块,用于当 所述标识数据与所述数据库中的标识数据匹配时,对所述第一标识数据进行还原,得到所述待测结构化数据集的传输属性信息。According to the sixth aspect, an embodiment of the present invention provides a structured data watermark detection device, including: a third acquisition module, configured to acquire a structured data set to be tested; a sorting module, configured to sort the structured data set to be tested The data set is sorted; the extraction module is used to extract the first identification number from the sorted structured data set to be tested through the watermark extraction algorithm; the second matching module is used to judge the first identification data and the preset in the database Whether the identification data matches; a third processing module, configured to restore the first identification data when the identification data matches the identification data in the database, and obtain the transmission attribute of the structured data set to be tested information.

根据第七方面,本发明实施例提供了一种电子设备,包括:存储器和 处理器,所述存储器和所述处理器之间互相通信连接,所述存储器中存储 有计算机指令,所述处理器通过执行所述计算机指令,从而执行如权利要 求第一方面或第一方面任一实施方式中所述的结构化数据水印的标注方 法;或,执行如第二方面或第三方面中所述的结构化数据水印的检测方法。According to a seventh aspect, an embodiment of the present invention provides an electronic device, including: a memory and a processor, the memory and the processor are connected to each other in communication, the memory stores computer instructions, and the processor By executing the computer instructions, the structured data watermarking method as described in the first aspect of the claim or any implementation manner of the first aspect is executed; or, the method as described in the second aspect or the third aspect is executed A method for detecting watermarks in structured data.

根据第八方面,本发明实施例提供了一种计算机可读存储介质,所述 计算机可读存储介质存储有计算机指令,所述计算机指令用于使所述计算 机执行如权利要求第一方面或第一方面任一个实施方式中所述的结构化数 据水印的标注方法;或,执行如第二方面或第三方面中所述的结构化数据 水印的检测方法。According to an eighth aspect, an embodiment of the present invention provides a computer-readable storage medium, the computer-readable storage medium stores computer instructions, and the computer instructions are used to enable the computer to execute the first aspect or the second aspect of the claim. On the one hand, the structured data watermark marking method described in any one of the implementation manners; or, execute the structured data watermark detection method described in the second aspect or the third aspect.

与现有技术相比,本发明具有以下的有益效果:将结构化数据集的传 输属性信息作为标识信息嵌入结构化数据集中,对结构化数据可用性影响 较小;对泄露的结构化数据集提取标识信息,并还原对应的传输属性信息, 就可以根据传输属性信息找到泄露结构化数据集的责任人,并进行追责。Compared with the prior art, the present invention has the following beneficial effects: the transmission attribute information of the structured data set is embedded in the structured data set as identification information, which has little impact on the usability of the structured data; Identify the information and restore the corresponding transmission attribute information, then the person responsible for leaking the structured data set can be found and held accountable according to the transmission attribute information.

附图说明Description of drawings

通过参考附图会更加清楚的理解本发明的特征和优点,附图是示意性 的而不应理解为对本发明进行任何限制,在附图中:The features and advantages of the present invention will be more clearly understood by referring to the accompanying drawings, which are schematic and should not be construed as limiting the invention in any way, in the accompanying drawings:

图1示出了本发明实施例中结构化数据水印的标注方法流程框图;Fig. 1 shows a flow chart of a method for labeling structured data watermarks in an embodiment of the present invention;

图2示出了本发明实施例中结构化数据水印的检测方法流程框图;Fig. 2 shows a flow chart of a method for detecting a structured data watermark in an embodiment of the present invention;

图3示出了本发明实施例中结构化数据水印的检测方法流程框图;Fig. 3 shows a flow chart of a method for detecting a structured data watermark in an embodiment of the present invention;

图4示出了本发明实施例中结构化数据水印的标注装置的结构框图;FIG. 4 shows a structural block diagram of a structured data watermark marking device in an embodiment of the present invention;

图5示出了本发明实施例中结构化数据水印的检测装置的结构框图;FIG. 5 shows a structural block diagram of a device for detecting a structured data watermark in an embodiment of the present invention;

图6示出了本发明实施例中结构化数据水印的检测装置的结构框图。Fig. 6 shows a structural block diagram of an apparatus for detecting a structured data watermark in an embodiment of the present invention.

具体实施方式Detailed ways

为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本 发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描 述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。 基于本发明中的实施例,本领域技术人员在没有作出创造性劳动前提下所 获得的所有其他实施例,都属于本发明保护的范围。In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without creative work belong to the protection scope of the present invention.

在本发明的描述中,需要说明的是,术语“中心”、“上”、“下”、“左”、 “右”、“竖直”、“水平”、“内”、“外”等指示的方位或位置关系为基于附 图所示的方位或位置关系,仅是为了便于描述本发明和简化描述,而不是 指示或暗示所指的装置或元件必须具有特定的方位、以特定的方位构造和 操作,因此不能理解为对本发明的限制。此外,术语“第一”、“第二”、“第 三”仅用于描述目的,而不能理解为指示或暗示相对重要性。In the description of the present invention, it should be noted that the terms "center", "upper", "lower", "left", "right", "vertical", "horizontal", "inner", "outer" etc. The indicated orientation or positional relationship is based on the orientation or positional relationship shown in the drawings, and is only for the convenience of describing the present invention and simplifying the description, rather than indicating or implying that the referred device or element must have a specific orientation, or in a specific orientation. construction and operation, therefore, should not be construed as limiting the invention. In addition, the terms "first", "second", and "third" are used for descriptive purposes only, and should not be construed as indicating or implying relative importance.

在本发明的描述中,需要说明的是,除非另有明确的规定和限定,术 语“安装”、“相连”、“连接”应做广义理解,例如,可以是固定连接,也 可以是可拆卸连接,或一体地连接;可以是机械连接,也可以是电连接; 可以是直接相连,也可以通过中间媒介间接相连,可以是两个元件内部的 连通。对于本领域的普通技术人员而言,可以具体情况理解上述术语在本 发明中的具体含义。In the description of the present invention, it should be noted that unless otherwise specified and limited, the terms "installation", "connection" and "connection" should be understood in a broad sense, for example, it can be a fixed connection or a detachable connection. Connected, or integrally connected; it can be mechanically connected or electrically connected; it can be directly connected or indirectly connected through an intermediary, and it can be the internal communication of two components. For those of ordinary skill in the art, the specific meanings of the above terms in the present invention can be understood in specific situations.

本发明实施例提供了一种结构化数据数水印的标注方法,该方法可以 应用于对各电力公司待分发的结构化数据集嵌入数字水印,如图1所示, 该方法包括以下步骤:The embodiment of the present invention provides a method for marking structured data digital watermarks, which can be applied to embed digital watermarks in structured data sets to be distributed by various electric power companies, as shown in Figure 1, the method includes the following steps:

步骤S101:获取待发送的结构化数据集及结构化数据集的传输属性信 息,在实际应用中,该待分发的结构化数据集为某电力公司的客户用电数 据,该结构化数据内容如表1所示:Step S101: Obtain the structured data set to be sent and the transmission attribute information of the structured data set. In practical applications, the structured data set to be distributed is the customer electricity consumption data of a power company. The structured data content is as follows: Table 1 shows:

表1Table 1

CONS_NOCONS_NONAMENAMEPOWER1POWER1POWER2POWER2POWER3POWER3POWER4POWER4POWER5POWER5POWER6POWER6POWER7POWER7MOBI_NOMOBI_NO00000276100000027610邵天表Shao Tianbiao0.03230.03230.03600.03600.03320.03320.01030.01030.13600.13600.13180.13180.13180.1318189313372661893133726600001847460000184746白忌bogey0.02320.02320.02690.02690.00100.00100.00060.00060.00060.00060.00060.00060.00060.0006151392587621513925876200001847480000184748贾貌微Jia Maungwei0.00200.00200.00210.00210.02510.02510.00260.00260.00260.00260.00260.00260.00410.0041188336411501883364115000001859140000185914武罢strike0.17680.17680.18910.18910.17630.17630.17570.17570.01660.01660.01570.01570.01570.0157130851624991308516249900001859150000185915樊天Fan Tian0.09000.09000.09000.09000.09000.09000.09000.09000.42500.42500.43620.43620.36490.3649184031426161840314261600001859160000185916吴宗Wu Zong0.03190.03190.03190.03190.03220.03220.03240.03240.01000.01000.07000.07000.02000.0200147615382761476153827600001859230000185923廖标独Liao Biaodu0.03760.03760.03760.03760.03760.03760.03760.03760.14580.14580.15030.15030.13130.1313176967469061769674690600001868210000186821长孙鸯丰Grandson Yangfeng0.10710.10710.11940.11940.10640.10640.10560.10560.13120.13120.14330.14330.11740.1174138523886491385238864900001874820000187482邓鹰Deng Ying0.01000.01000.01000.01000.01000.01000.01000.01000.45590.45590.84510.84510.58560.5856177689051121776890511200001874830000187483毛封边wool edge0.19890.19890.20070.20070.20010.20010.20140.20140.06000.06000.49000.49000.37000.37001310692097813106920978

该结构化数据集的传输属性信息可以包括:结构化数据集的分发日期 和接收者的用户信息,本实施例只是举例说明,并不限定传输属性信息的 种类。The transmission attribute information of the structured data set may include: the distribution date of the structured data set and the user information of the recipient. This embodiment is only an example and does not limit the type of transmission attribute information.

步骤S102:提取结构化数据集中预设范围内的原始数据,识别原始数 据的数据类型;在具体应用中,对与上述步骤S101中结构化数据集,预设 范围设置为POWER1、POWER2、POWER3、POWER4、POWER5、POWER6、 POWER7和MOBI_NO,该预设范围内的原始数据如表2灰色部分所示:Step S102: extract the original data within the preset range in the structured data set, and identify the data type of the original data; in specific applications, for the structured data set in the above step S101, the preset range is set to POWER1, POWER2, POWER3, POWER4, POWER5, POWER6, POWER7 and MOBI_NO, the original data within the preset range is shown in the gray part of Table 2:

表2Table 2

识别结构化数据集中的POWER1、POWER2、POWER3、POWER4、 POWER5、POWER6、POWER7和MOBI_NO范围内的原始数据类型,, MOBI_NO(即用户的手机号)为字符型数据,POWER1、POWER2、 POWER3、POWER4、POWER5、POWER6、POWER7为数值型数据;Identify the original data types in the range of POWER1, POWER2, POWER3, POWER4, POWER5, POWER6, POWER7 and MOBI_NO in the structured data set, MOBI_NO (that is, the user's mobile phone number) is character data, POWER1, POWER2, POWER3, POWER4, POWER5, POWER6, POWER7 are numerical data;

步骤S103:根据传输属性信息生成标识数据,例如,将上述步骤S101 中的结构化数据集的分发日期和接收者的用户信息生成标识数据,该标识 数据后续可用于寻找结构化数据集的负责人。。Step S103: Generate identification data according to the transmission attribute information, for example, generate identification data from the distribution date of the structured data set in the above step S101 and the user information of the recipient, and the identification data can be used to find the person in charge of the structured data set later . .

步骤S104:根据预设的水印算法及所述原始数据的数据类型、标识数 据对原始数据进行转换,得到嵌入标识数据的结构化数据集;将数字水印 嵌入数据中有很多种算法,例如,空域算法、Patchwork算法、变换域算法、 压缩域算法、NEC算法和生理模型算法等,可以根据不同的原始数据的数 据类型和标识数据采用不同水印算法,将标识数据嵌入结构化数据集中, 例如,对于本实施例中原始数据的数据类型,由于字符型数据(MOBI_NO) 是手机号以及数值型数据(POWER1、POWER2、POWER3、POWER4、 POWER5、POWER6、POWER7)是小数,所以采用手机号前缀水印算法 和时空域LSB(±1)算法作为水印算法。Step S104: Convert the original data according to the preset watermark algorithm, the data type of the original data, and the identification data to obtain a structured data set embedded with identification data; there are many algorithms for embedding digital watermarks into data, for example, airspace Algorithm, Patchwork algorithm, transform domain algorithm, compressed domain algorithm, NEC algorithm and physiological model algorithm, etc., can adopt different watermarking algorithms according to different data types of original data and identification data, and embed identification data into structured data sets, for example, for The data type of the original data in this embodiment, because the character data (MOBI_NO) is the mobile phone number and the numerical data (POWER1, POWER2, POWER3, POWER4, POWER5, POWER6, POWER7) is a decimal, so adopt the mobile phone number prefix watermark algorithm and The time-space domain LSB(±1) algorithm is used as the watermarking algorithm.

通过实施本发明实施例中的一种结构化数据水印的标注方法,将结构 化数据集的传输属性信息作为标识数据嵌入结构化数据集中,后续根据泄 露的结构化数据集提取标识数据,就能根据传输属性信息找到泄露结构化 数据的负责人。By implementing a structured data watermarking method in the embodiment of the present invention, the transmission attribute information of the structured data set is embedded in the structured data set as identification data, and the identification data is subsequently extracted according to the leaked structured data set. Find those responsible for leaking structured data based on transport attribute information.

可选地,在本发明一些实施例中,传输属性信息至少包括:接收发的 ID及传输日期信息,上述实施例中的步骤S103根据传输属性信息生成标识 数据是通过以下过程实现的:提取接收方的ID及传输日期信息中的数字信 息,根据预设排序方式对所述数字信息进行排序,生成所述标识数据;例 如,结构化数据集的接受者的ID为4231,传输日期为2018年6月21日, 排列方式为接收者的ID在前,传输日期在后,则提取到的数字信息为4231180621。Optionally, in some embodiments of the present invention, the transmission attribute information includes at least: the ID sent by the receiver and the transmission date information. Step S103 in the above embodiment generates identification data according to the transmission attribute information through the following process: extracting the receiving The ID of the party and the digital information in the transmission date information, sort the digital information according to the preset sorting method, and generate the identification data; for example, the recipient ID of the structured data set is 4231, and the transmission date is 2018 On June 21, the arrangement is such that the receiver's ID is first and the transmission date is last, so the extracted digital information is 4231180621.

可选地,在本发明一些实施例中,上述步骤S104中根据预设的水印算 法及所述原始数据的数据类型、标识数据对原始数据进行转换,得到嵌入 标识数据的结构化数据集还包括:将所述标识数据进行格式转换,生成结 构化标识数据;根据结构化标识数据及原始数据对应位的数据及数据类型 确定数据嵌入方式;在实际应用中,将通过上述实施例提取到的数据信息 生成如表3所示的美国信息交换标准代码,再生成如表4所示的结构化标 识数据。Optionally, in some embodiments of the present invention, in the above step S104, the original data is converted according to the preset watermark algorithm, the data type of the original data, and the identification data, and the structured data set embedded with the identification data is obtained. : Convert the format of the identification data to generate structured identification data; determine the data embedding method according to the data and data type of the corresponding bit of the structured identification data and the original data; in practical applications, the data extracted by the above embodiments The information generates the American Standard for Information Interchange code shown in Table 3, and then generates the structured identification data shown in Table 4.

表3table 3

表4Table 4

根据表4所示的结构化标识数据选择上述实施例中所述的手机号前缀 水印算法和时空域LSB(±1)算法,该手机号前缀水印算法为当嵌入比特 为0时,手机号不做变动;当嵌入比特为1时,手机号前增加前缀“+86”, 时空域LSB(±1)算法是对数值型数据进行加1减1操作,为极大减少对 原始数据的影响,所以对数据中的万分位进行加1减1操作,其具体算法 如表5所示:According to the structured identification data shown in Table 4, the mobile phone number prefix watermark algorithm and the space-time domain LSB (±1) algorithm described in the above-mentioned embodiments are selected. The mobile phone number prefix watermark algorithm is that when the embedded bit is 0, the mobile phone number does not Make changes; when the embedded bit is 1, the prefix "+86" is added before the mobile phone number. The time-space domain LSB (±1) algorithm is to add 1 and subtract 1 to the numerical data. In order to greatly reduce the impact on the original data, Therefore, the operation of adding 1 and subtracting 1 to the ten-thousandth place in the data is performed, and the specific algorithm is shown in Table 5:

表5table 5

通过上述手机号前缀水印算法和时空域LSB(±1)算法将表4灰色部 分所示的标识数据嵌入表2灰色部分所示的原始数据中,得到表6所示的 嵌入标识数据的结构化数据:Embed the identification data shown in the gray part of Table 4 into the original data shown in the gray part of Table 2 through the above mobile phone number prefix watermarking algorithm and the time-space domain LSB (±1) algorithm, and obtain the structure of the embedded identification data shown in Table 6 data:

表6Table 6

在本发明实施例中还提供一种结构化数据水印的检测方法,该检测方 法应用于检测上述实施例中已嵌入标识数据的某电力公司分发的结构化数 据集,如图2所示,该方法包括:In an embodiment of the present invention, a method for detecting a structured data watermark is also provided. The detection method is applied to detect a structured data set distributed by a power company that has embedded identification data in the above embodiment, as shown in FIG. 2 , the Methods include:

步骤S201:获取待测结构化数据集以及原始数据集,该待测结构化数 据是已泄露的结构化数据集,根据已泄露的结构化数据集从数据库中获取 到原始数据集。Step S201: Obtain the structured data set to be tested and the original data set, the structured data to be tested is a leaked structured data set, and the original data set is obtained from the database according to the leaked structured data set.

步骤S202:使用上述实施例中的结构化数据水印的标注方法根据所述 原始数据集生成第一结构化数据集;将该原始数据集的传输属性信息生成 的标识数据按照上述实施例中步骤S101至步骤S104中的结构化数据水印 的标注方法嵌入到原始数据集中,得到第一结构化数据集。Step S202: Use the structured data watermarking method in the above embodiment to generate the first structured data set according to the original data set; the identification data generated from the transmission attribute information of the original data set is according to step S101 in the above embodiment Embed the structured data watermarking method in step S104 into the original data set to obtain the first structured data set.

步骤S203:判断待测结构化数据集与第一结构化数据集是否匹配,通 过判断嵌入标识数据的第一结构化数据集与待测结构化数据集的内容否是 相同,判定泄露待测结构化数据集的来源;Step S203: Judging whether the structured data set to be tested matches the first structured data set, and judging whether the content of the first structured data set embedded with identification data is the same as that of the structured data set to be tested to determine whether the structure to be tested is leaked the source of the data set;

步骤S204:当待测结构化数据集第一结构化数据集匹配时,对原始数 据集的标识数据进行还原,得到待测结构化数据集的传输属性信息。当待 测结构化数据集与第一结构化数据集匹配时,原始数据集的标识数据相当 于待测结构化数据集的标识数据,通过还原原始数据集的标识数据,就可 以得到待测结构化数据集的传输属性信息,根据待测结构化数据集的传输 属性信息就可以找到待测结构化数据集的负责人。Step S204: When the first structured data set matches the structured data set to be tested, restore the identification data of the original data set to obtain the transmission attribute information of the structured data set to be tested. When the structured data set to be tested matches the first structured data set, the identification data of the original data set is equivalent to the identification data of the structured data set to be tested. By restoring the identification data of the original data set, the structure to be tested can be obtained According to the transmission attribute information of the structured data set to be tested, the person in charge of the structured data set to be tested can be found.

通过实施本发明实施例中的结构化数据水印的检测方法,通过待测结 构化数据集获取原始数据集,根据本发明的结构化数据水印的标注方法将 对应的标识数据嵌入原始数据中,再判断与待检测结构化数据集是否匹配, 找到泄露数据的负责人,该方法适用于检测采用盲水印算法的结构化数据 集的标识数据。By implementing the structured data watermark detection method in the embodiment of the present invention, the original data set is obtained through the structured data set to be tested, and the corresponding identification data is embedded in the original data according to the structured data watermark labeling method of the present invention, and then Judging whether it matches the structured data set to be detected, and finding the person in charge of the leaked data, this method is suitable for detecting the identification data of the structured data set using the blind watermark algorithm.

在本发明实施例还提供一种结构化数据水印的检测方法,该检测方法 应用于检测结构化数据集中的标识数据,例如,可以检测上述实施例中已 嵌入标识数据的某电力公司的客户用电结构化数据集,如图3所示,该方 法包括:The embodiment of the present invention also provides a method for detecting structured data watermark, which is applied to detect the identification data in the structured data set, for example, it can detect the watermark used by a customer of a power company that has embedded the identification data in the above embodiment Electrically structured data sets, as shown in Figure 3, the method includes:

步骤S301:获取待测结构化数据集,在实际应用中,该待测结构化数 据集为表7所示的已嵌入标识数据的结构化数集;Step S301: Obtain the structured data set to be tested. In practical applications, the structured data set to be tested is the structured data set embedded with identification data shown in Table 7;

表7Table 7

CONS_NOCONS_NONAMENAMEPOWER1POWER1POWER2POWER2POWER3POWER3POWER4POWER4POWER5POWER5POWER6POWER6POWER7POWER7MOBI_NOMOBI_NO00001859140000185914武罢strike0.17680.17680.18920.18920.17630.17630.17570.17570.01660.01660.01560.01560.01580.0158+8613085162499+861308516249900001874830000187483毛封边wool edge0.19900.19900.20060.20060.20010.20010.20150.20150.06000.06000.49000.49000.37000.3700+8613106920978+861310692097800001847460000184746白忌bogey0.02320.02320.02700.02700.00090.00090.00070.00070.00060.00060.00060.00060.00050.0005151392587621513925876200001847480000184748贾貌微Jia Maungwei0.00200.00200.00220.00220.02510.02510.00250.00250.00260.00260.00260.00260.00410.0041+8618833641150+861883364115000001868210000186821长孙鸯丰Grandson Yangfeng0.10720.10720.11940.11940.10630.10630.10570.10570.13120.13120.14330.14330.11730.1173138523886491385238864900001874820000187482邓鹰Deng Ying0.01000.01000.01000.01000.01010.01010.00990.00990.45600.45600.84500.84500.58560.5856177689051121776890511200001859160000185916吴宗Wu Zong0.03200.03200.03180.03180.03210.03210.03250.03250.00990.00990.07000.07000.02000.0200147615382761476153827600001859230000185923廖标独Liao Biaodu0.03760.03760.03760.03760.03770.03770.03750.03750.14580.14580.15040.15040.13120.1312176967469061769674690600001859150000185915樊天Fan Tian0.09000.09000.09000.09000.08990.08990.09010.09010.42500.42500.43620.43620.36480.3648+8618403142616+861840314261600000276100000027610邵天表Shao Tianbiao0.03220.03220.03600.03600.03330.03330.01030.01030.13600.13600.13170.13170.13180.13181893133726618931337266

步骤S302:对所述待测结构化数据集进行排序,在实际应用中,为了 避免传输过程中,结构化数据集的顺序发生改变影响标识数据的识别,对 结构化数据集的内容升序对齐排序,排序后如表8所示。Step S302: Sorting the structured data sets to be tested. In practical applications, in order to avoid changes in the order of the structured data sets during transmission and affect the identification of identification data, align and sort the contents of the structured data sets in ascending order , as shown in Table 8 after sorting.

表8Table 8

CONS_NOCONS_NONAMENAMEPOWER1POWER1POWER2POWER2POWER3POWER3POWER4POWER4POWER5POWER5POWER6POWER6POWER7POWER7MOBI_NOMOBI_NO00000276100000027610邵天表Shao Tianbiao0.03220.03220.03600.03600.03330.03330.01030.01030.13600.13600.13170.13170.13180.1318189313372661893133726600001847460000184746白忌bogey0.02320.02320.02700.02700.00090.00090.00070.00070.00060.00060.00060.00060.00050.0005151392587621513925876200001847480000184748贾貌微Jia Maungwei0.00200.00200.00220.00220.02510.02510.00250.00250.00260.00260.00260.00260.00410.0041+8618833641150+861883364115000001859140000185914武罢strike0.17680.17680.18920.18920.17630.17630.17570.17570.01660.01660.01560.01560.01580.0158+8613085162499+861308516249900001859150000185915樊天Fan Tian0.09000.09000.09000.09000.08990.08990.09010.09010.42500.42500.43620.43620.36480.3648+8618403142616+861840314261600001859160000185916吴宗Wu Zong0.03200.03200.03180.03180.03210.03210.03250.03250.00990.00990.07000.07000.02000.0200147615382761476153827600001859230000185923廖标独Liao Biaodu0.03760.03760.03760.03760.03770.03770.03750.03750.14580.14580.15040.15040.13120.1312176967469061769674690600001868210000186821长孙鸯丰Grandson Yangfeng0.10720.10720.11940.11940.10630.10630.10570.10570.13120.13120.14330.14330.11730.1173138523886491385238864900001874820000187482邓鹰Deng Ying0.01000.01000.01000.01000.01010.01010.00990.00990.45600.45600.84500.84500.58560.5856177689051121776890511200001874830000187483毛封边wool edge0.19900.19900.20060.20060.20010.20010.20150.20150.06000.06000.49000.49000.37000.3700+8613106920978+8613106920978

步骤S303:通过预设的水印算法从排序后的待测结构化数据集提取第 一标识数,在上述实施例中,设置的水印算法为手机号前缀水印算法和时 空域LSB(±1)算法,两种算法的具体内容参考上述实施例所述的内容。 根据这两种算法对表8中的数据进行提取:对于字符型MOBI_NO列,有 “+86”前缀提取比特为1,否则为0;对于数值型POWER1、POWER2、POWER3、POWER4、POWER5、POWER6、POWER7字段,其万分位为 奇数是提取比特为1,否则为0。得到表9所示的标识数据:Step S303: Extract the first identification number from the sorted structured data set to be tested through the preset watermark algorithm. In the above embodiment, the set watermark algorithm is the mobile phone number prefix watermark algorithm and the time-space domain LSB (±1) algorithm For the specific content of the two algorithms, refer to the content described in the above-mentioned embodiments. Extract the data in Table 8 according to these two algorithms: for the character type MOBI_NO column, the extraction bit with "+86" prefix is 1, otherwise it is 0; for the numeric type POWER1, POWER2, POWER3, POWER4, POWER5, POWER6, In the POWER7 field, if the ten-thousandth place is an odd number, the extracted bit is 1, otherwise it is 0. Obtain the identification data shown in Table 9:

表9Table 9

步骤S304:判断所述第一标识数据与数据库中的预设标识数据是否匹 配,根据上述步骤S303中的标识数据“4231180621”与数据库中的预设的 标识数据判断匹配;Step S304: judging whether the first identification data matches the preset identification data in the database, according to the identification data "4231180621" in the above-mentioned step S303 and the default identification data in the database to determine the match;

步骤305:当标识数据与数据库中的标识数据匹配时,对第一标识数据 进行还原,得到待测结构化数据集的传输属性信息,根据标识数据 “4231180621”就可以得到该待测结构化数据集的接收者的ID为4231,分 发时间为2018年6月21日。Step 305: When the identification data matches the identification data in the database, restore the first identification data to obtain the transmission attribute information of the structured data set to be tested, and obtain the structured data to be tested according to the identification data "4231180621" The recipient of the episode has an ID of 4231 and a distribution time of June 21, 2018.

通过实施本发明实施例中的结构化数据水印的检测方法,根据预设的 水印算法对待测结构化数据提取标识数据,判断标识数据是否与数据库中 预设的标识数据是否匹配,然后就可以根据标识数据找到待测结化数据集 的负责人及相关信息。By implementing the method for detecting structured data watermark in the embodiment of the present invention, the identification data is extracted from the structured data to be tested according to the preset watermark algorithm, and it is judged whether the identification data matches the identification data preset in the database, and then it can be based on Identify the data Find the person in charge and related information of the data set to be tested.

本发明实施例还提供一种结构化数据水印的标注装置,如图4所示, 包括:第一获取模块11,用于获取待发送的结构化数据集及所述结构化数 据集的传输属性信息,详细内容参见上述实施例中步骤S101的内容;识别 模块12,用于提取结构化数据集中预设范围内的原始数据,识别原始数据 的数据类型,详细内容参见上述实施例中步骤S102的内容;第一生成模块 13,用于根据传输属性信息生成标识数据,详细内容参见上述实施例中步 骤S103的内容;第一处理模块14,用于根据预设的水印算法及原始数据的 数据类型、标识数据对原始数据进行转换,得到嵌入标识数据的结构化数 据集,详细内容参见上述实施例中步骤S104的内容。The embodiment of the present invention also provides a watermarking device for structured data, as shown in FIG. 4 , including: a first acquisition module 11, configured to acquire the structured data set to be sent and the transmission attributes of the structured data set For details, refer to the content of step S101 in the above-mentioned embodiment; the identification module 12 is used to extract the original data within the preset range in the structured data set, and identify the data type of the original data. For details, refer to the step S102 in the above-mentioned embodiment Content; the first generation module 13 is used to generate identification data according to the transmission attribute information, and for details, refer to the content of step S103 in the above-mentioned embodiment; the first processing module 14 is used to generate the identification data according to the preset watermark algorithm and the data type of the original data 1. The identification data converts the original data to obtain a structured data set embedded with the identification data. For details, refer to the content of step S104 in the above-mentioned embodiment.

通过实施本发明实施例中的一种结构化数据水印的标注装置,将结构 化数据集的传输属性信息作为标识数据嵌入结构化数据集中,后续根据泄 露的结构化数据集提取标识数据,就能根据传输属性信息找到泄露结构化 数据的负责人By implementing a structured data watermarking device in the embodiment of the present invention, the transmission attribute information of the structured data set is embedded as identification data in the structured data set, and the identification data is subsequently extracted according to the leaked structured data set. Locate the person responsible for leaking structured data based on transport attribute information

本发明实施例还提供一种结构化数据水印的检测装置,如图5所示, 包括:第二获取模块21,用于获取待测结构化数据集及原始数据集,详细 内容参见上述实施例中步骤S201的内容;第二生成模块22,用于通过上述 实施例中步骤S101至步骤S104中的结构化数据的水印标注方法根据所述 原始数据集生成第一结构化数据集,详细内容参见上述实施例中步骤S202 的内容;第一匹配模块23,用于判断待测结构化数据集与第一结构化数据集是否匹配,详细内容参见上述实施例中步骤S203的内容;第二处理模块 24,用于当待测结构化数据集与第一结构化数据集匹配时,对待测原始数 据集的标识数据进行还原,得到待测结构化数据集的传输属性信息,详细 内容参见上述实施例中步骤S204的内容。The embodiment of the present invention also provides a structured data watermark detection device, as shown in FIG. 5 , including: a second acquisition module 21, which is used to acquire the structured data set and the original data set to be tested. For details, refer to the above-mentioned embodiment The content of step S201; the second generation module 22 is used to generate the first structured data set according to the original data set through the watermarking method of structured data in steps S101 to S104 in the above embodiment. For details, see The content of step S202 in the above-mentioned embodiment; the first matching module 23 is used to judge whether the structured data set to be tested matches the first structured data set. For details, refer to the content of step S203 in the above-mentioned embodiment; the second processing module 24. When the structured data set to be tested matches the first structured data set, the identification data of the original data set to be tested is restored to obtain the transmission attribute information of the structured data set to be tested. For details, refer to the above-mentioned embodiment The content of step S204.

通过实施本发明实施例中的结构化数据水印的检测装置,通过待测结 构化数据集获取原始数据集,根据本发明的结构化数据水印的标注方法将 对应的标识数据嵌入原始数据中,再判断与待检测结构化数据集是否匹配, 找到泄露数据的负责人,该方法适用于检测采用盲水印算法的结构化数据 集的标识数据。By implementing the structured data watermark detection device in the embodiment of the present invention, the original data set is obtained through the structured data set to be tested, and the corresponding identification data is embedded in the original data according to the structured data watermark labeling method of the present invention, and then Judging whether it matches the structured data set to be detected, and finding the person in charge of the leaked data, this method is suitable for detecting the identification data of the structured data set using the blind watermark algorithm.

本发明实施例提供一种结构化数据水印的检测装置,如图6所示,包 括:第三获取模块31,用于获取待测结构化数据集,详细内容参见上述实 施例中步骤S301的内容;排序模块32,用于对待测结构化数据集进行排序, 详细内容参见上述实施例中步骤S302的内容;提取模块33,用于通过水印 提取算法从排序后的待测结构化数据集提取第一标识数,详细内容参见上 述实施例中步骤S303的内容;第二匹配模块34,用于判断所述第一标识数据与数据库中的预设标识数据是否匹配,详细内容参见上述实施例中步骤 S304的内容;第三处理模块35,用于当标识数据与所述数据库中的标识数 据匹配时,对第一标识数据进行还原,得到待测结构化数据集的传输属性 信息,详细内容参见上述实施例中步骤S305的内容。An embodiment of the present invention provides a structured data watermark detection device, as shown in FIG. 6 , including: a third acquisition module 31, configured to acquire a structured data set to be tested. For details, refer to the content of step S301 in the above-mentioned embodiment The sorting module 32 is used to sort the structured data set to be tested. For details, refer to the content of step S302 in the above-mentioned embodiment; An identification number, refer to the content of step S303 in the above-mentioned embodiment for details; the second matching module 34 is used to judge whether the first identification data matches the preset identification data in the database, refer to the steps in the above-mentioned embodiment for details The content of S304; the third processing module 35 is used to restore the first identification data when the identification data matches the identification data in the database, and obtain the transmission attribute information of the structured data set to be tested. For details, refer to the above-mentioned The content of step S305 in the embodiment.

通过实施本发明实施例中的结构化数据水印的检测装置,根据预设的 水印算法对待测结构化数据提取标识数据,判断标识数据是否与数据库中 预设的标识数据是否匹配,然后就可以根据标识数据找到待测结化数据集 的负责人及相关信息。By implementing the structured data watermark detection device in the embodiment of the present invention, the identification data is extracted from the structured data to be tested according to the preset watermark algorithm, and it is judged whether the identification data matches the identification data preset in the database, and then it can be based on Identify the data Find the person in charge and related information of the data set to be tested.

本发明实施例还提供了一种电子设备,该电子设备可以包括处理器和 存储器,其中处理器和存储器可以通过总线或者其他方式连接。An embodiment of the present invention also provides an electronic device, which may include a processor and a memory, where the processor and the memory may be connected through a bus or in other ways.

处理器可以为中央处理器(Central Processing Unit,CPU)。处理器51 还可以为其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、 专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程 门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、 分立门或者晶体管逻辑器件、分立硬件组件等芯片,或者上述各类芯片的 组合。The processor may be a central processing unit (Central Processing Unit, CPU). Processor 51 can also be other general processors, digital signal processor (Digital Signal Processor, DSP), application specific integrated circuit (Application Specific Integrated Circuit, ASIC), field programmable gate array (Field-Programmable Gate Array, FPGA) or Chips such as other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, or combinations of the above-mentioned types of chips.

存储器作为一种非暂态计算机可读存储介质,可用于存储非暂态软件 程序、非暂态计算机可执行程序以及模块,如本发明实施例中的结构化数 据水印的标注方法以及检测方法对应的程序指令/模块(例如,图4所示的 第一获取模块11、识别模块12、第一生成模块13和第一处理模块14)。处 理器通过运行存储在存储器中的非暂态软件程序、指令以及模块,从而执 行处理器的各种功能应用以及数据处理,即实现上述方法实施例中的结构化数据水印的标注方法以及结构化数据水印的检测方法。As a non-transitory computer-readable storage medium, the memory can be used to store non-transitory software programs, non-transitory computer-executable programs and modules, such as the structured data watermark marking method and detection method in the embodiment of the present invention. program instructions/modules (for example, the first acquiring module 11, the identifying module 12, the first generating module 13 and the first processing module 14 shown in FIG. 4). The processor executes various functional applications and data processing of the processor by running the non-transitory software programs, instructions and modules stored in the memory, that is, realizes the marking method of structured data watermarking and the structured data watermarking in the above method embodiments. Data watermark detection method.

存储器可以包括存储程序区和存储数据区,其中,存储程序区可存储 操作系统、至少一个功能所需要的应用程序;存储数据区可存储处理器所 创建的数据等。此外,存储器可以包括高速随机存取存储器,还可以包括 非暂态存储器,例如至少一个磁盘存储器件、闪存器件、或其他非暂态固 态存储器件。在一些实施例中,存储器可选包括相对于处理器远程设置的 存储器,这些远程存储器可以通过网络连接至处理器。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory may include a program storage area and a data storage area, wherein the program storage area may store an operating system and at least one application required by a function; the data storage area may store data created by the processor, etc. In addition, the memory may include high-speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid-state storage device. In some embodiments, the memory optionally includes memory located remotely from the processor, which remote memory may be connected to the processor via a network. Examples of the aforementioned networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.

所述一个或者多个模块存储在所述存储器中,当被所述处理器执行时, 执行如图1-3所示实施例中的结构化数据水印的标注方法以及结构化数据 水印的检测方法。The one or more modules are stored in the memory, and when executed by the processor, execute the structured data watermark labeling method and the structured data watermark detection method in the embodiment shown in Figures 1-3 .

上述电子设备具体细节可以对应参阅图1至图3所示的实施例中对应 的相关描述和效果进行理解,此处不再赘述。The specific details of the above-mentioned electronic device can be understood by correspondingly referring to the corresponding relevant descriptions and effects in the embodiments shown in FIG. 1 to FIG. 3 , and will not be repeated here.

本领域技术人员可以理解,实现上述实施例方法中的全部或部分流程, 是可以通过计算机程序来指令相关的硬件来完成,所述的程序可存储于一 计算机可读取存储介质中,该程序在执行时,可包括如上述各方法的实施 例的流程。其中,所述存储介质可为磁碟、光盘、只读存储记忆体(Read-Only Memory,ROM)、随机存储记忆体(Random AccessMemory,RAM)、快 闪存储器(Flash Memory)、硬盘(Hard Disk Drive,缩写:HDD)或固态 硬盘(Solid-State Drive,SSD)等;所述存储介质还可以包括上述种类的存 储器的组合。Those skilled in the art can understand that all or part of the process in the method of the above-mentioned embodiments can be completed by instructing related hardware through a computer program. The program can be stored in a computer-readable storage medium. The program During execution, it may include the processes of the embodiments of the above-mentioned methods. Wherein, the storage medium can be a magnetic disk, an optical disk, a read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), a flash memory (Flash Memory), a hard disk (Hard Disk Drive, abbreviation: HDD) or a solid-state drive (Solid-State Drive, SSD), etc.; the storage medium may also include a combination of the above-mentioned types of memory.

虽然结合附图描述了本发明的实施例,但是本领域技术人员可以在不 脱离本发明的精神和范围的情况下作出各种修改和变型,这样的修改和变 型均落入由所附权利要求所限定的范围之内。Although the embodiments of the present invention have been described in conjunction with the accompanying drawings, those skilled in the art can make various modifications and variations without departing from the spirit and scope of the present invention, and such modifications and variations all fall into the scope of the appended claims. within the limited range.

Claims (10)

Translated fromChinese
1.一种结构化数据水印的标注方法,其特征在于,包括:1. A method for labeling structured data watermarks, comprising:获取待发送的结构化数据集及所述结构化数据集的传输属性信息;Obtain the structured data set to be sent and the transmission attribute information of the structured data set;提取所述结构化数据集中预设范围内的原始数据,识别所述原始数据的数据类型;extracting raw data within a preset range in the structured data set, and identifying the data type of the raw data;根据所述传输属性信息生成标识数据;generating identification data according to the transmission attribute information;根据预设的水印算法及所述原始数据的数据类型、所述标识数据对原始数据进行转换,得到嵌入标识数据的结构化数据集。The original data is converted according to the preset watermark algorithm, the data type of the original data, and the identification data to obtain a structured data set embedded with the identification data.2.根据权利要求1所述的结构化数据水印的标注方法,其特征在于,所述传输属性信息至少包括:接收方的ID及传输日期信息,2. The method for marking structured data watermarks according to claim 1, wherein the transmission attribute information at least includes: the receiver's ID and transmission date information,根据所述传输属性信息生成标识数据,包括:Generating identification data according to the transmission attribute information, including:提取所述接收方的ID及传输日期信息中的数字信息;Extracting the ID of the recipient and the digital information in the transmission date information;根据预设排序方式对所述数字信息进行排序,生成所述标识数据。sorting the digital information according to a preset sorting manner to generate the identification data.3.根据权利要求1所述的结构化数据水印的标注方法,其特征在于,根据预设的水印算法及所述原始数据的数据类型、所述标识数据将所述标识数据对原始数据进行转换,包括:3. The labeling method of structured data watermark according to claim 1, characterized in that, according to the preset watermark algorithm, the data type of the original data, and the identification data, the identification data is converted to the original data ,include:将所述标识数据进行格式转换,生成结构化标识数据;Converting the format of the identification data to generate structured identification data;根据所述结构化标识数据及原始数据对应位的数据及所述数据类型确定数据嵌入方式;Determine the data embedding method according to the data corresponding to the structured identification data and the original data and the data type;根据所述预设的水印算法及数据嵌入方式将所述原始数据转换为所述嵌入标识数据的结构化数据集。Converting the original data into the structured data set embedded with identification data according to the preset watermark algorithm and data embedding method.4.一种结构化数据水印的检测方法,其特征在于,包括:4. A method for detecting structured data watermarks, comprising:获取待测结构化数据集及原始数据集;Obtain the structured data set and the original data set to be tested;使用如权利要求1-3中任一项所述的结构化数据的水印标注方法根据所述原始数据集生成第一结构化数据集;Using the watermarking method for structured data according to any one of claims 1-3 to generate a first structured data set according to the original data set;判断所述待测结构化数据集与所述第一结构化数据集是否匹配;judging whether the structured data set to be tested matches the first structured data set;当所述待测结构化数据集与所述第一结构化数据集匹配时,对所述原始数据集的标识数据进行还原,得到所述待测结构化数据集的传输属性信息。When the structured data set to be tested matches the first structured data set, the identification data of the original data set is restored to obtain the transmission attribute information of the structured data set to be tested.5.一种结构化数据水印的检测方法,其特征在于,包括:5. A method for detecting structured data watermarks, comprising:获取待测结构化数据集;Obtain the structured data set to be tested;对所述待测结构化数据集进行排序;Sorting the structured data set to be tested;通过预设的水印算法从排序后的待测结构化数据集提取第一标识数;Extracting the first identification number from the sorted structured data set to be tested through a preset watermarking algorithm;判断所述第一标识数据与数据库中的预设标识数据是否匹配;judging whether the first identification data matches the preset identification data in the database;当所述标识数据与所述数据库中的标识数据匹配时,对所述第一标识数据进行还原,得到所述待测结构化数据集的传输属性信息。When the identification data matches the identification data in the database, restore the first identification data to obtain the transmission attribute information of the structured data set to be tested.6.一种结构化数据水印的标注装置,其特征在于,包括:6. A marking device for structured data watermarking, comprising:第一获取模块,用于获取待发送的结构化数据集及所述结构化数据集的传输属性信息;A first obtaining module, configured to obtain the structured data set to be sent and the transmission attribute information of the structured data set;识别模块,用于提取所述结构化数据集中预设范围内的原始数据,识别所述原始数据的数据类型;An identification module, configured to extract raw data within a preset range in the structured data set, and identify the data type of the raw data;第一生成模块,用于根据所述传输属性信息生成标识数据;A first generating module, configured to generate identification data according to the transmission attribute information;第一处理模块,用于根据预设的水印算法及所述原始数据的数据类型、所述标识数据对原始数据进行转换,得到嵌入标识数据的结构化数据集。The first processing module is configured to convert the original data according to a preset watermark algorithm, the data type of the original data, and the identification data to obtain a structured data set embedded with identification data.7.一种结构化数据水印的检测装置,其特征在于,包括:7. A detection device for a structured data watermark, comprising:第二获取模块,用于获取待测结构化数据集及原始数据集;The second acquisition module is used to acquire the structured data set to be tested and the original data set;第二生成模块,用于使用如权利要求1-3中任一项所述的结构化数据的水印标注方法根据所述原始数据集生成第一结构化数据集;A second generating module, configured to use the watermarking method for structured data according to any one of claims 1-3 to generate a first structured data set according to the original data set;第一匹配模块,用于判断所述待测结构化数据集与所述第一结构化数据集是否匹配;A first matching module, configured to determine whether the structured data set to be tested matches the first structured data set;第二处理模块,用于当所述待测结构化数据集与所述第一结构化数据集匹配时,对所述待测原始数据集的标识数据进行还原,得到所述待测结构化数据集的传输属性信息。A second processing module, configured to restore the identification data of the original data set to be tested to obtain the structured data to be tested when the structured data set to be tested matches the first structured data set The transport attribute information of the set.8.一种结构化数据水印的检测装置,其特征在于,包括:8. A detection device for a structured data watermark, comprising:第三获取模块,用于获取待测结构化数据集;The third acquisition module is used to acquire the structured data set to be tested;排序模块,用于对所述待测结构化数据集进行排序;A sorting module, configured to sort the structured data set to be tested;提取模块,用于通过水印提取算法从排序后的待测结构化数据集提取第一标识数;An extraction module, configured to extract the first identification number from the sorted structured data set to be tested through a watermark extraction algorithm;第二匹配模块,用于判断所述第一标识数据与数据库中的预设标识数据是否匹配;A second matching module, configured to determine whether the first identification data matches the preset identification data in the database;第三处理模块,用于当所述标识数据与所述数据库中的标识数据匹配时,对所述第一标识数据进行还原,得到所述待测结构化数据集的传输属性信息。A third processing module, configured to restore the first identification data when the identification data matches the identification data in the database, to obtain transmission attribute information of the structured data set to be tested.9.一种电子设备,其特征在于,包括:9. An electronic device, characterized in that, comprising:存储器和处理器,所述存储器和所述处理器之间互相通信连接,所述存储器中存储有计算机指令,所述处理器通过执行所述计算机指令,从而执行如权利要求1-3中任一项所述的结构化数据水印的标注方法;或,执行如权利要求4或5所述的结构化数据水印的检测方法。A memory and a processor, the memory and the processor are connected in communication with each other, computer instructions are stored in the memory, and the processor executes the computer instructions to perform any one of claims 1-3. The marking method of structured data watermark described in item; or, execute the detection method of structured data watermark as described in claim 4 or 5.10.一种计算机可读存储介质,其特征在于,所述计算机可读存储介质存储有计算机指令,所述计算机指令用于使所述计算机执行如权利要求1-3中任一项所述的结构化数据水印的标注方法;或,执行如权利要求4或5所述的结构化数据水印的检测方法。10. A computer-readable storage medium, characterized in that the computer-readable storage medium stores computer instructions, and the computer instructions are used to make the computer execute the method according to any one of claims 1-3. A method for marking structured data watermarks; or, performing a method for detecting structured data watermarks as claimed in claim 4 or 5.
CN201910199779.7A2019-03-152019-03-15Mask method, detection method and device, the electronic equipment of structural data watermarkPendingCN110084046A (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN201910199779.7ACN110084046A (en)2019-03-152019-03-15Mask method, detection method and device, the electronic equipment of structural data watermark

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN201910199779.7ACN110084046A (en)2019-03-152019-03-15Mask method, detection method and device, the electronic equipment of structural data watermark

Publications (1)

Publication NumberPublication Date
CN110084046Atrue CN110084046A (en)2019-08-02

Family

ID=67413189

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN201910199779.7APendingCN110084046A (en)2019-03-152019-03-15Mask method, detection method and device, the electronic equipment of structural data watermark

Country Status (1)

CountryLink
CN (1)CN110084046A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN111832043A (en)*2020-06-232020-10-27平安科技(深圳)有限公司Data encryption processing method and device, computer equipment and storage medium
CN113285801A (en)*2021-05-252021-08-20中国联合网络通信集团有限公司Data transmission method and device based on block chain
CN114356919A (en)*2021-12-232022-04-15绿盟科技集团股份有限公司Watermark embedding method, tracing method and device for structured database
CN115987682A (en)*2023-02-022023-04-18浙江网商银行股份有限公司Data processing method

Citations (5)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN102184243A (en)*2011-05-172011-09-14沈阳化工大学Text-type attribute-based relational database watermark embedding method
CN107992727A (en)*2017-12-112018-05-04北京安华金和科技有限公司A kind of watermark processing realized based on legacy data deformation and data source tracing method
CN107992726A (en)*2017-11-292018-05-04北京安华金和科技有限公司A kind of watermark processing and data source tracing method based on the pseudo- row of dummy lines
CN109033766A (en)*2018-08-032018-12-18合肥工业大学Database water mark method based on local difference privacy
CN109064379A (en)*2018-07-252018-12-21成都亚信网络安全产业技术研究院有限公司The mask method and the method for inspection and device of a kind of digital watermarking

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN102184243A (en)*2011-05-172011-09-14沈阳化工大学Text-type attribute-based relational database watermark embedding method
CN107992726A (en)*2017-11-292018-05-04北京安华金和科技有限公司A kind of watermark processing and data source tracing method based on the pseudo- row of dummy lines
CN107992727A (en)*2017-12-112018-05-04北京安华金和科技有限公司A kind of watermark processing realized based on legacy data deformation and data source tracing method
CN109064379A (en)*2018-07-252018-12-21成都亚信网络安全产业技术研究院有限公司The mask method and the method for inspection and device of a kind of digital watermarking
CN109033766A (en)*2018-08-032018-12-18合肥工业大学Database water mark method based on local difference privacy

Cited By (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN111832043A (en)*2020-06-232020-10-27平安科技(深圳)有限公司Data encryption processing method and device, computer equipment and storage medium
CN111832043B (en)*2020-06-232023-10-03平安科技(深圳)有限公司Data encryption processing method, device, computer equipment and storage medium
CN113285801A (en)*2021-05-252021-08-20中国联合网络通信集团有限公司Data transmission method and device based on block chain
CN114356919A (en)*2021-12-232022-04-15绿盟科技集团股份有限公司Watermark embedding method, tracing method and device for structured database
CN114356919B (en)*2021-12-232025-08-22绿盟科技集团股份有限公司 A watermark embedding method, traceability method and device for structured database
CN115987682A (en)*2023-02-022023-04-18浙江网商银行股份有限公司Data processing method

Similar Documents

PublicationPublication DateTitle
CN110084046A (en)Mask method, detection method and device, the electronic equipment of structural data watermark
US10628577B2 (en)Identifying software components in a software codebase
CN112600834B (en)Content security identification method and device, storage medium and electronic equipment
CN103473346B (en)A kind of Android based on application programming interface beats again bag applying detection method
US12131401B1 (en)Dual watermarking method for trajectory data based on robust watermarking and fragile watermarking
JP7077425B2 (en) Checking the display of third-party content on client devices
JP6970686B2 (en) Salting text in database tables, text files, and data feeds
WO2024066271A1 (en)Database watermark embedding method and apparatus, database watermark tracing method and apparatus, and electronic device
CN113992625B (en)Domain name source station detection method, system, computer and readable storage medium
Zheng et al.A novel video copyright protection scheme based on blockchain and double watermarking
CN114626968A (en) Watermark embedding method, watermark extraction method and device
CN115982675A (en) Document processing method, device, electronic device, and storage medium
CN110874456B (en)Watermark embedding method, watermark extracting method, watermark embedding device, watermark extracting device and data processing method
CN116383189A (en) Business data processing method, device, computer equipment, storage medium
CN114626079A (en)File viewing method, device, equipment and storage medium based on user permission
CN116702103A (en)Database watermark processing method, database watermark tracing method and device
CN106357682A (en)Phishing website detecting method
CN113901434B (en) Display terminal identification method, system and computer-readable storage medium
CN118784314A (en) Method and device for generating unique user identification
Tiwari et al.A novel watermarking scheme for secure relational databases
CN118734271A (en) Model infringement detection method, device and electronic equipment
CN110349072B (en) A Watermark Synchronization Method in the Process of Watermark Embedding and Detection for Vector Geographical Data
CN110866858B (en) Watermark embedding and query data providing method, device and data processing method
CN117689526A (en) Image hidden watermark adding method, image traceability method, device and equipment
US11714847B2 (en)Methods and apparatus to detect unconfined view media

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
RJ01Rejection of invention patent application after publication

Application publication date:20190802

RJ01Rejection of invention patent application after publication

[8]ページ先頭

©2009-2025 Movatter.jp