技术领域technical field
本发明实施例涉及信息处理技术领域,尤其涉及一种数据检查装置及方法。Embodiments of the present invention relate to the technical field of information processing, and in particular, to a data checking apparatus and method.
背景技术Background technique
大数据的发展给企业或单位的数据管理带来巨大的挑战,在实际工作过程中,业务人员的一项重要工作就是数据检查,即对数据的准确性进行核对,以保证数据的准确性。The development of big data brings huge challenges to the data management of enterprises or units. In the actual work process, an important job of business personnel is data inspection, that is, checking the accuracy of data to ensure the accuracy of data.
目前,数据检查的过程为:业务人员确定待检查数据(例如excel 1)之后,手动的在众多数据库中寻找与这个待检查数据相关的数据库。以数据库具体是excel2为例,业务人员寻找到与这个待检查数据相关的excel 2之后,通过肉眼对比excel 2与excel 1,确定待检查的数据是否准确。At present, the process of data inspection is as follows: after business personnel determine the data to be inspected (for example, excel 1), they manually search for a database related to the data to be inspected in numerous databases. Taking the database as excel 2 as an example, after finding the excel 2 related to the data to be checked, the business personnel compares excel 2 and excel 1 with the naked eye to determine whether the data to be checked is accurate.
由此可见,目前的数据检查工作,一方面,对于业务人员来说工作量巨大,另一方面,在数据检查的过程中,由人为的误操作而导致出错的几率较大。It can be seen that the current data inspection work, on the one hand, has a huge workload for business personnel, and on the other hand, in the process of data inspection, the probability of errors caused by human misoperation is relatively large.
发明内容SUMMARY OF THE INVENTION
本发明实施例提供一种数据检查装置和方法,用以减轻业务人员的工作量,并降低数据检查的出错几率。Embodiments of the present invention provide a data checking apparatus and method, so as to reduce the workload of business personnel and reduce the error probability of data checking.
第一方面,本发明实施例提供一种数据检查方法。该方法可应用于财务领域的财务数据处理过程。具体的,该方法包括:数据检查装置确定待检查数据,然后根据待检查数据的属性信息从N个数据库中确定至少一个目标数据库;其中,待检查数据的属性信息与至少一个目标数据库中的每个目标数据库的配置信息相匹配。数据检查装置读取至少一个目标数据库中的数据,并将待检查数据与读取的数据进行对比,获得对比结果,然后输出对比结果。In a first aspect, an embodiment of the present invention provides a data checking method. The method can be applied to the financial data processing in the financial field. Specifically, the method includes: the data inspection device determines the data to be inspected, and then determines at least one target database from N databases according to the attribute information of the data to be inspected; wherein the attribute information of the data to be inspected is related to each of the at least one target database. match the configuration information of the target database. The data checking device reads data in at least one target database, compares the data to be checked with the read data, obtains a comparison result, and then outputs the comparison result.
在本发明实施例中,数据检查装置可以根据待检查数据的属性信息从N个目标数据库中寻找和该待检查数据的属性信息相匹配的至少一个目标数据库,然后将待检查数据与至少一个目标数据库中的数据进行对比,得到对比结果。通过这种方式,将目前的数据检查工作自动化,减轻了业务人员的工作量,相应地也会降低数据检查的出错几率。In this embodiment of the present invention, the data inspection apparatus may search for at least one target database matching the attribute information of the data to be inspected from N target databases according to the attribute information of the data to be inspected, and then compare the data to be inspected with the at least one target database. Compare the data in the database to get the comparison result. In this way, the current data inspection work is automated, the workload of business personnel is reduced, and the error probability of data inspection is correspondingly reduced.
在一个可能的设计中,配置信息包括如下中的一种或多种:每个数据库的类型;每个数据库能够处理的业务环节;每个数据库的数据配置方式;每个数据库的审批状态。In a possible design, the configuration information includes one or more of the following: the type of each database; the business links that each database can handle; the data configuration mode of each database; the approval status of each database.
在本发明实施例中,配置信息还可能是其它信息,以上几种只是举例,本发明实施例对此不作具体的限定。In this embodiment of the present invention, the configuration information may also be other information, and the above types are only examples, which are not specifically limited in this embodiment of the present invention.
在一个可能的设计中,数据检查装置读取至少一个目标数据库中的数据,包括:建立第一目标数据库与至少一个目标数据库中除第一目标数据库之外的其它目标数据库之间的数据同步,其中,第一目标数据库为至少一个目标数据库中的任意一个数据库;在第一目标数据库中执行数据读取程序,以分别读取至少一个目标数据库中每个目标数据库中的数据。In a possible design, the data checking device reads data in at least one target database, including: establishing data synchronization between the first target database and other target databases other than the first target database in the at least one target database, Wherein, the first target database is any one of the at least one target database; a data reading program is executed in the first target database to respectively read data in each target database in the at least one target database.
在本发明实施例中,由于第一目标数据库和其它目标数据库之间数据同步,所以数据检查装置在第一目标数据库中执行数据读取程序时,可以同步读取第一目标数据库和其它目标数据库中的数据,无需在每个目标数据库中分别执行一次数据读取程序,提高数据检查效率。In this embodiment of the present invention, due to the synchronization of data between the first target database and other target databases, when the data checking apparatus executes the data reading program in the first target database, the first target database and other target databases can be read synchronously There is no need to execute a data reading program in each target database separately, which improves the efficiency of data checking.
在一个可能的设计中,数据检查装置输出对比结果,包括:导出结果报告,该结果报告根据对比结果生成;若结果报告中存在异常值,则输出提示信息,以提示用户查看结果报告;其中,异常值为待检查数据和至少一个目标数据库中存储的数据不一致的数据。In a possible design, the data checking device outputs the comparison result, including: exporting a result report, which is generated according to the comparison result; if there is an abnormal value in the result report, outputting prompt information to prompt the user to check the result report; wherein, The abnormal value is the data that is inconsistent between the data to be checked and the data stored in at least one target database.
在本发明实施例中,若结果报告中存在异常值,数据检查装置输出提示信息,以提醒用户及时查看结果报告,进而可以及时的定位产生异常值的原因,有助于提高数据检查的效率。In the embodiment of the present invention, if there is an abnormal value in the result report, the data inspection device outputs prompt information to remind the user to check the result report in time, so as to locate the cause of the abnormal value in time, which helps to improve the efficiency of data inspection.
在一个可能的设计中,数据库为不同应用程序提供商提供的应用程序数据库,或为企业内部的业务数据库。In one possible design, the database is an application database provided by different application providers, or a business database within an enterprise.
在本发明实施例中,数据库不仅可以是企业内部的业务数据库,还可以是其它企业例如不同应用程序提供商提供的应用程序数据库。当然,还可能是其它种类的数据库,以上几种只是举例,本发明实施例对此不作具体的限定。In this embodiment of the present invention, the database may not only be an internal business database of an enterprise, but also an application database provided by other enterprises such as different application providers. Of course, there may also be other types of databases, and the above types are only examples, which are not specifically limited in the embodiment of the present invention.
在一个可能的设计中,数据库包含数据集市,数据仓库、和源数据库中的一种或多种。In one possible design, the database contains one or more of a data mart, a data warehouse, and a source database.
在本发明实施例中,数据库的规模可以不同,例如数据库可以是规模较大的源数据库或数据仓库,也可以是规模较小的数据集市。当然,数据库还可能是其它种类的数据库,以上几种只是举例,本发明实施例对此不作具体的限定。In the embodiment of the present invention, the scale of the database may be different, for example, the database may be a large-scale source database or data warehouse, or a small-scale data mart. Of course, the database may also be other types of databases, and the above types are only examples, which are not specifically limited in the embodiment of the present invention.
第二方面,本发明实施例还提供一种数据检查装置。该数据检查装置具有实现上述方法设计中数据检查装置的功能。这些功能可以通过硬件实现,也可以通过硬件执行相应的软件实现。所述硬件或软件包括一个或多个与上述功能相对应的模块。In a second aspect, an embodiment of the present invention further provides a data checking apparatus. The data inspection device has the function of realizing the data inspection device in the above method design. These functions can be implemented by hardware or by executing corresponding software by hardware. The hardware or software includes one or more modules corresponding to the above functions.
在一个可能的设计中,数据检查装置的具体结构可包括数据导入模块、数据检查模块、数据导出模块。这些模块可执行上述第一方面或第一方面的任意一种可能的设计所提供的方法中的相应功能。In a possible design, the specific structure of the data checking device may include a data importing module, a data checking module, and a data exporting module. These modules can perform the corresponding functions in the method provided by the first aspect or any possible design of the first aspect.
第三方面,本发明实施例还提供一种数据检查装置。该数据检查装置具有实现上述方法设计中数据检查装置的功能。这些功能可以通过硬件实现。该数据检查装置包括:存储器,用于存储计算机可执行程序代码;处理器,处理器与存储器耦合。其中存储器所存储的程序代码包括指令,当处理器执行所述指令时,所述指令使数据检查装置执行上述第一方面或第一方面的任意一种可能的设计中数据检查装置所执行的方法。In a third aspect, an embodiment of the present invention further provides a data checking apparatus. The data inspection device has the function of realizing the data inspection device in the above method design. These functions can be implemented by hardware. The data inspection apparatus includes: a memory for storing computer-executable program code; and a processor coupled to the memory. The program code stored in the memory includes instructions. When the processor executes the instructions, the instructions cause the data inspection apparatus to execute the method performed by the data inspection apparatus in the first aspect or any possible design of the first aspect. .
第四方面,本发明实施例还提供一种计算机可读存储介质,该计算机可读存储介质中存储有指令,当其在计算机上运行时,使得计算机执行上述第一方面所述的数据检查方法。In a fourth aspect, an embodiment of the present invention further provides a computer-readable storage medium, where instructions are stored in the computer-readable storage medium, when the computer-readable storage medium runs on a computer, the computer executes the data checking method described in the first aspect .
第五方面,本发明实施例还提供一种包含指令的计算机程序产品,当其在计算机上运行时,使得计算机执行上述第一方面所述的数据检查方法。In a fifth aspect, an embodiment of the present invention further provides a computer program product including instructions, which, when running on a computer, causes the computer to execute the data checking method described in the first aspect.
附图说明Description of drawings
图1为本发明实施例提供的数据处理过程的示意图;1 is a schematic diagram of a data processing process provided by an embodiment of the present invention;
图2为本发明实施例提供的一种数据检查装置的架构图;FIG. 2 is an architectural diagram of a data inspection apparatus provided by an embodiment of the present invention;
图3为本发明实施例提供的一种数据检查方法的流程图;3 is a flowchart of a data checking method provided by an embodiment of the present invention;
图4为本发明实施例提供的目标数据库的数量为1时的数据检查过程的示意图;4 is a schematic diagram of a data checking process when the number of target databases provided by an embodiment of the present invention is 1;
图5为本发明实施例提供的目标数据库的数量大于等于2时的数据检查过程的示意图;5 is a schematic diagram of a data checking process when the number of target databases provided by an embodiment of the present invention is greater than or equal to 2;
图6为本发明实施例提供的5个目标数据库之间数据同步的示意图;6 is a schematic diagram of data synchronization between five target databases provided by an embodiment of the present invention;
图7为本发明实施例提供的一种数据检查装置的结构示意图。FIG. 7 is a schematic structural diagram of a data checking apparatus according to an embodiment of the present invention.
具体实施方式Detailed ways
下面将结合本发明实施例中的附图,对本发明实施例提供的技术方案进行清楚、完整地描述。The technical solutions provided by the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention.
以下,首先对本发明实施例中的部分用语进行解释说明,以便于本领域技术人员理解。Hereinafter, some terms in the embodiments of the present invention will be explained first, so as to facilitate the understanding of those skilled in the art.
(1)数据库,用于存放数据的仓库。数据库的格式有很多种类型,例如,比较简单的是用来存储各种数据的表格,或者能够存储海量数据的大型数据库系统。在本发明实施例中,也可以将“数据库”称为“数据源”。也就是说,本文对于“数据库”的名称不作限制,只要表达的是如上的概念即可。(1) Database, a warehouse for storing data. There are many types of database formats, such as relatively simple tables used to store various data, or large database systems capable of storing massive amounts of data. In this embodiment of the present invention, a "database" may also be referred to as a "data source". That is to say, this article does not limit the name of "database", as long as the above concept is expressed.
(2)结构化查询语言,也被称为SQL(structured query Language,SQL)语言,是一种数据库查询和程序设计语言,用于存取、查询、更新和管理数据库。(2) Structured Query Language, also known as SQL (structured query Language, SQL) language, is a database query and programming language used to access, query, update and manage databases.
(3)本文中术语“和/或”,仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。另外,本文中字符“/”,如无特殊说明,一般表示前后关联对象是一种“或”的关系。且在本申请的描述中,“第一”、“第二”等词汇,仅用于区分描述的目的,而不能理解为指示或暗示相对重要性,也不能理解为指示或暗示顺序。(3) The term "and/or" in this article is only an association relationship to describe related objects, indicating that there can be three kinds of relationships, for example, A and/or B, which can mean that A exists alone, and A and B exist at the same time , there are three cases of B alone. In addition, the character "/" in this text, unless otherwise specified, generally indicates that the related objects before and after are an "or" relationship. In the description of this application, words such as "first" and "second" are only used for the purpose of distinguishing and describing, and cannot be understood as indicating or implying relative importance, nor can they be understood as indicating or implying order.
以交易数据为例,图1示出了目前的企业内部的数据处理过程,如图1所示,目前,企业中业务人员的数据处理过程为手工分录、数据检查和数据分析三个过程。手工分录即业务人员记录企业的所有的收入或支出的交易数据。为了保证交易数据的准确性,业务人员需要对这些交易数据进行数据检查,该数据检查过程为:业务人员确定待检查数据(例如excel 1)之后,手动的在众多数据库中寻找与这个待检查数据相关的目标数据库。以目标数据库具体是excel 2为例,业务人员寻找到与这个待检查数据相关的excel 2之后,通过肉眼对比excel 2与excel 1来确定待检查数据是否准确。当确定交易数据准确之后,业务人员可以对交易数据进行数据分析,例如分析企业的盈利情况等。由此可见,目前的数据检查工作,一方面,对于业务人员来说工作量巨大,另一方面,在数据检查的过程中,由人为的误操作而导致出错的几率也较大。Taking transaction data as an example, Figure 1 shows the current internal data processing process of an enterprise. As shown in Figure 1, at present, the data processing process of business personnel in an enterprise includes three processes: manual entry, data inspection and data analysis. Manual entry means that business personnel record the transaction data of all the income or expenditure of the enterprise. In order to ensure the accuracy of transaction data, business personnel need to perform data inspection on these transaction data. The data inspection process is as follows: after business personnel determine the data to be inspected (for example, excel 1), they manually search for the data to be inspected in many databases. The relevant target database. Taking the target database as excel 2 as an example, after the business personnel find the excel 2 related to the data to be checked, they can compare excel 2 and excel 1 with the naked eye to determine whether the data to be checked is accurate. After confirming that the transaction data is accurate, the business personnel can perform data analysis on the transaction data, such as analyzing the profitability of the enterprise. It can be seen that the current data inspection work, on the one hand, has a huge workload for business personnel, and on the other hand, in the process of data inspection, the probability of errors caused by human misoperation is also high.
为了解决这个技术问题,本发明实施例提供一种数据检查装置。该数据检查装置200可以适用于终端设备,例如台式电脑、平板电脑(PAD)、或手机等。图2示出了一种可能的数据检查装置的结构图。请参见图2,数据检查装置200包括:数据库管理模块210、数据导入模块220、数据检查模块230和数据导出模块240。In order to solve this technical problem, an embodiment of the present invention provides a data checking apparatus. The data checking apparatus 200 can be applied to terminal equipment, such as a desktop computer, a tablet computer (PAD), or a mobile phone. Figure 2 shows a block diagram of a possible data checking device. Referring to FIG. 2 , the data checking apparatus 200 includes: a database management module 210 , a data importing module 220 , a data checking module 230 and a data exporting module 240 .
其中,数据库管理模块210可以对N个数据库进行管理,每个数据库对应一个配置信息,N为大于等于1的整数。数据导入模块220可以导入待检查数据。数据检查模块230可以根据待检查数据的属性信息从N个数据库中确定至少一个目标数据库;其中,待检查数据的属性信息与至少一个目标数据库中的每个目标数据库的配置信息相匹配;数据检查模块230读取至少一个目标数据库中的数据,并将待检查数据与读取的数据进行对比,获得对比结果。数据导出模块240用于导出对比结果。The database management module 210 can manage N databases, each database corresponds to one piece of configuration information, and N is an integer greater than or equal to 1. The data import module 220 can import the data to be checked. The data checking module 230 can determine at least one target database from the N databases according to the attribute information of the data to be checked; wherein, the attribute information of the data to be checked matches the configuration information of each target database in the at least one target database; the data check The module 230 reads data in at least one target database, and compares the data to be checked with the read data to obtain a comparison result. The data export module 240 is used to export the comparison results.
在实际操作过程中,数据库管理模块210和数据检查模块230可以是两个相互独立的模块,也可以是同一个模块,即由数据检查模块230对N个数据库进行管理等操作。In actual operation, the database management module 210 and the data checking module 230 may be two independent modules, or may be the same module, that is, the data checking module 230 manages N databases and other operations.
需要说明的是,在本发明实施例中,上述数据检查装置以功能模块的形式展示。在不受限制的情况下,本文所使用的术语“模块”可指执行一个或多个软件或固件程序的专用集成电路(application-specific integrated circuit,ASIC)、电子电路、(共享、专用或组)处理器以及存储器,组合逻辑电路,和/或提供所述功能的其它合适的部件。It should be noted that, in the embodiment of the present invention, the above-mentioned data checking apparatus is presented in the form of functional modules. Without limitation, the term "module" as used herein may refer to an application-specific integrated circuit (ASIC), electronic circuit, (shared, dedicated, or group) that executes one or more software or firmware programs ) processors and memories, combinational logic circuits, and/or other suitable components that provide the described functions.
本发明实施例提供的数据检查装置200可以根据待检查数据的属性信息从N个数据库中确定与该待检查数据的属性信息相匹配的至少一个目标数据库,然后将待检查数据与至少一个目标数据库中的数据进行对比,得到对比结果。通过该数据检查装置200,可以将目前的数据检查工作自动化,减轻了业务人员的工作量,相应的也会降低数据检查的出错几率。The data inspection apparatus 200 provided in this embodiment of the present invention may determine at least one target database matching the attribute information of the data to be inspected from N databases according to the attribute information of the data to be inspected, and then compare the data to be inspected with the at least one target database Compare the data in and get the comparison result. With the data inspection device 200, the current data inspection work can be automated, the workload of the business personnel is reduced, and the error probability of the data inspection is correspondingly reduced.
本发明实施例中,数据检查装置200的处理对象可以是交易数据,也可以是其它数据,例如业务数据或财经数据等。In this embodiment of the present invention, the processing object of the data checking apparatus 200 may be transaction data, or may be other data, such as business data or financial data.
在本发明实施例中,数据检查装置200的应用场景可以有多种,例如整个企业均采用该数据检查装置200即总公司和分公司统一采用该数据检查装置200,或者企业中的个别部分或者子公司采用该数据检查装置200。如果整个企业均采用该数据检查装置200,那么该数据检查装置200便可管理该企业的所有业务数据库,如果企业中的个别部分或者子公司采用该数据检查装置200,那么该数据检查装置200便管理个别部门或者子公司的所有业务数据库。In this embodiment of the present invention, the data inspection apparatus 200 may have various application scenarios. For example, the entire enterprise adopts the data inspection apparatus 200, that is, the head office and the branch company adopt the data inspection apparatus 200 uniformly, or individual parts of the enterprise or Subsidiaries use this data checking device 200 . If the data checking apparatus 200 is used by the entire enterprise, the data checking apparatus 200 can manage all business databases of the enterprise. If the data checking apparatus 200 is used by individual parts or subsidiaries of the enterprise, the data checking apparatus 200 can Manage all business databases of individual departments or subsidiaries.
下面介绍数据检查装置200进行数据检查的过程。请参见图3,图3示出了本发明实施例提供的一种数据检查方法的流程图。在下文的介绍中,以将该方法应用在图2所示的数据检查装置200为例。该方法的流程描述如下:The following describes the process of data inspection performed by the data inspection apparatus 200 . Referring to FIG. 3, FIG. 3 shows a flowchart of a data checking method provided by an embodiment of the present invention. In the following description, the method is applied to the data checking apparatus 200 shown in FIG. 2 as an example. The flow of this method is described as follows:
S301:确定待检查数据。S301: Determine the data to be checked.
在本发明实施例中,待检查数据可以是各种形式,例如简单的来说,一个Excel便可作为待检查数据,或者一张发票也可以作为待检查数据。In this embodiment of the present invention, the data to be checked can be in various forms, for example, simply speaking, an Excel can be used as the data to be checked, or an invoice can also be used as the data to be checked.
在本发明实施例中,S301的实现方式可以是,业务人员在数据检查装置200的操作界面(规则配置界面)导入待检查数据,该操作界面可以由业务人员进行操作。业务人员导入待检查数据之后,数据检查装置200便可对该待检查数据进行检查。In the embodiment of the present invention, the implementation of S301 may be that the business personnel import the data to be checked on the operation interface (rule configuration interface) of the data inspection apparatus 200, and the operation interface can be operated by the business personnel. After the business personnel imports the data to be inspected, the data inspection device 200 can inspect the data to be inspected.
S302:根据待检查数据的属性信息从N个数据库中确定至少一个目标数据库。S302: Determine at least one target database from the N databases according to the attribute information of the data to be checked.
在本发明实施例中,数据检查装置200可以对N个数据库进行管理。一种可能的实现方式为,数据检查装置200可以对每个数据库进行配置化,即每个数据库对应一个配置信息。这里的配置信息可以包括每个数据库的业务类型、所处的业务环节、数据配置方式、所处的审批状态中的一种或多种。以配置信息是业务类型为例,可以有“采购支付业务”数据库、“出差费报销”数据库、“流动资金记账”数据库等等。以配置信息是所处的业务环节为例,可以有“核算”数据库、“汇总”数据库等等。以配置信息是审批状态为例,可以有“审批通过的”数据库、“审批未通过的”数据库等等。由于每个数据库对应一个配置信息,所以数据检查装置200便可通过该配置信息找到对应的数据库。In this embodiment of the present invention, the data checking apparatus 200 may manage N databases. A possible implementation manner is that the data checking apparatus 200 may configure each database, that is, each database corresponds to one configuration information. The configuration information here may include one or more of the business type of each database, the business link in which it is located, the data configuration method, and the approval state it is in. Taking the configuration information as an example of a business type, there may be a "purchasing payment business" database, a "travel expense reimbursement" database, a "current capital accounting" database, and so on. Taking the business link where the configuration information is located as an example, there may be an "accounting" database, a "summary" database, and so on. Taking the configuration information as the approval status as an example, there may be a "approved" database, a "approved failed" database, and so on. Since each database corresponds to one configuration information, the data checking apparatus 200 can find the corresponding database through the configuration information.
当然,在本发明实施例中,数据库不仅可以是企业内部的业务数据库,还可以是不同应用程序提供商提供的应用程序数据库,如果数据库是不同应用程序提供商提供的应用程序数据库,那么数据检查装置200可以将每个数据库按照应用程序所能处理的业务类型对每个数据库进行配置。当然,数据库还可以是其它的数据库,本发明实施例对此不作具体的限定。Of course, in this embodiment of the present invention, the database may not only be an internal business database, but also an application database provided by different application providers. If the database is an application database provided by different application providers, then the data check The apparatus 200 may configure each database for each database according to the business type that the application program can handle. Certainly, the database may also be other databases, which are not specifically limited in this embodiment of the present invention.
在本发明实施例中,每个数据库的规模可以相同或者不同。例如数据库可以是数据集市,数据仓库、和源数据库等。其中,数据集市的规模小于数据仓库的规模,而数据仓库的规模小于源数据库的规模。数据集市也被称为数据市场,主要面向企业内部的部门级业务,并且只面向某个特定的主题,数据集市是数据仓库的子集。数据仓库是用来为企业管理者提供管理决策的数据库,主要面向企业级业务。源数据库,即数据的源头,能直接为用户提供原始数据,可以是数值数据库,文本-数值数据库,全文数据库,术语数据库,图像数据库,音视频数据库等等。In this embodiment of the present invention, the scale of each database may be the same or different. For example, the database can be a data mart, a data warehouse, a source database, and the like. Among them, the scale of the data mart is smaller than the scale of the data warehouse, and the scale of the data warehouse is smaller than the scale of the source database. Data marts, also known as data markets, are mainly for department-level businesses within an enterprise and only for a specific topic. Data marts are a subset of data warehouses. Data warehouse is a database used to provide management decision-making for enterprise managers, mainly for enterprise-level business. Source database, that is, the source of data, can directly provide users with original data, which can be numerical database, text-numerical database, full-text database, term database, image database, audio and video database and so on.
在本发明实施例中,数据库可以是关系型数据库,也可以是非关系型数据库。In this embodiment of the present invention, the database may be a relational database or a non-relational database.
在执行S302之前,数据检查装置200可以先确定待检查数据的属性信息,下面介绍数据检查装置200确定待检查数据的属性信息的三种可能的实现方式。Before executing S302, the data inspection apparatus 200 may first determine the attribute information of the data to be inspected. Three possible implementation manners for the data inspection apparatus 200 to determine the attribute information of the data to be inspected are described below.
第一种可能的实现方式,业务人员可以在规则配置界面输入待检查数据的属性信息。例如业务人员可以在规则配置界面填写待检查数据的基本的属性信息,例如待检查数据的业务类型、所处的业务环节等。然后数据检查装置200自动确定待检查数据的属性信息。In the first possible implementation, business personnel can input attribute information of the data to be checked on the rule configuration interface. For example, a business person can fill in basic attribute information of the data to be checked on the rule configuration interface, such as the business type of the data to be checked, the business link it is in, and so on. Then the data checking apparatus 200 automatically determines the attribute information of the data to be checked.
第二种可能的实现方式,业务人员在导入待检测数据之后,数据检查装置200可以自动识别该待检查数据的属性信息。例如数据检查装置200可以自动提取待检查数据的文件命名中的关键字,这些关键字便可作为待检查数据的属性信息。In the second possible implementation manner, after the business personnel imports the data to be inspected, the data inspection apparatus 200 can automatically identify the attribute information of the data to be inspected. For example, the data checking apparatus 200 can automatically extract keywords in the file names of the data to be checked, and these keywords can be used as attribute information of the data to be checked.
第三种可能的实现方式,数据检查装置200可以事先设置好待检查数据的所有可能的属性信息,业务人员无需手动输入待检查数据的属性信息,而是从事先设置好的多种属性信息中选择符合待检查数据的属性信息即可。In the third possible implementation manner, the data inspection apparatus 200 can set all possible attribute information of the data to be inspected in advance, and the business personnel do not need to manually input the attribute information of the data to be inspected, but can select the attribute information from the pre-set various attribute information. Just select the attribute information that matches the data to be checked.
以上只是列举了数据检查装置200确定待检查数据的属性信息的三种方式,当然还可以有其它的实现方式,本发明实施例对此不作具体的限定。The above only enumerates three manners in which the data checking apparatus 200 determines the attribute information of the data to be checked. Of course, there may also be other implementation manners, which are not specifically limited in this embodiment of the present invention.
数据检查装置200确定待检查数据的属性信息之后,便可执行S302,即根据待检查数据的属性信息从N个数据库中确定至少一个目标数据库。下面介绍S302的一种可能的实现方式。After the data inspection apparatus 200 determines the attribute information of the data to be inspected, S302 may be executed, that is, at least one target database is determined from the N databases according to the attribute information of the data to be inspected. A possible implementation manner of S302 is described below.
在本发明实施例中,数据检查装置200中可以事先存储数据库的配置信息与待检查数据的属性信息之间的映射关系。当数据检查装置200确定待检查数据的属性信息之后,便可通过该映射关系确定符合待检查数据的属性信息的至少一个目标数据库。数据库的配置信息与待检查数据的属性信息之间的映射关系有多种,表1为一种示例。并且,表1只是对数据库的配置信息与待检查数据的属性信息之间的映射关系的举例,并不是对数据库的配置信息与待检查数据的属性信息之间的映射关系的限定。In this embodiment of the present invention, the data checking apparatus 200 may store the mapping relationship between the configuration information of the database and the attribute information of the data to be checked in advance. After the data inspection apparatus 200 determines the attribute information of the data to be inspected, at least one target database that conforms to the attribute information of the data to be inspected can be determined through the mapping relationship. There are various mapping relationships between the configuration information of the database and the attribute information of the data to be checked, and Table 1 is an example. Moreover, Table 1 is only an example of the mapping relationship between the configuration information of the database and the attribute information of the data to be checked, and does not limit the mapping relationship between the configuration information of the database and the attribute information of the data to be checked.
表1Table 1
举例来说,如果数据检查装置200确定待检查数据的属性信息是“飞机票”“住宿发票”或“审批单”中某一种或者多种时,数据检查装置200根据表1便能确定与该待检查数据的属性信息相匹配的是配置信息为“出差费报销”的数据库。如果数据检查装置200确定待检查数据的属性信息是“采购支票”“采购订单”或“采购发票”中的某一种或多种时,数据检查装置200根据表1便能确定与该待检查数据的属性信息相匹配的是配置信息为“采购业务”的数据库。For example, if the data inspection device 200 determines that the attribute information of the data to be inspected is one or more of "air ticket", "accommodation invoice" or "approval form", the data inspection device 200 can determine and match according to Table 1. The attribute information of the data to be checked is matched with the database whose configuration information is "reimbursement of business trip expenses". If the data inspection device 200 determines that the attribute information of the data to be inspected is one or more of “purchase check”, “purchase order” or “purchase invoice”, the data inspection device 200 can determine, according to Table 1, that the data to be inspected is related to the data to be inspected. The attribute information of the data matches the database whose configuration information is "purchasing business".
S303:读取至少一个目标数据库中的数据,并将待检查数据与读取的数据进行对比,获得对比结果。S303: Read data in at least one target database, and compare the data to be checked with the read data to obtain a comparison result.
当数据检查装置200根据待检查数据的属性信息确定至少一个目标数据库之后,便可将待检查数据与至少一个目标数据库中的数据进行对比。为了便于理解,下面分两种情况来介绍S303的实现过程。第一种情况,数据检查装置200确定的目标数据库的数量为1。第二种情况,数据检查装置200确定的目标数据库的数量大于等于2。After the data checking apparatus 200 determines at least one target database according to the attribute information of the data to be checked, the data to be checked can be compared with the data in the at least one target database. For ease of understanding, the implementation process of S303 is described below in two cases. In the first case, the number of target databases determined by the data checking apparatus 200 is one. In the second case, the number of target databases determined by the data checking apparatus 200 is greater than or equal to two.
对于第一种情况,由于只有一个目标数据库,即数据检查装置200只需要将待检查数据和这个唯一的目标数据库中的数据进行对比即可。图4示出了目标数据库数量为1时的数据检查过程的示意图。如图4所示,数据检查装置200的数据检查过程可以为三步进行。第一步,数据检查装置200在目标数据库中执行SQL数据检查程序。第二步,数据检查装置200从目标数据库中读取数据。第三步,数据检查装置将读取的数据和待检查数据进行对比,得到对比结果。第四步,数据检查装置200将对比结果存储。For the first case, since there is only one target database, that is, the data checking apparatus 200 only needs to compare the data to be checked with the data in this unique target database. FIG. 4 shows a schematic diagram of the data checking process when the number of target databases is one. As shown in FIG. 4 , the data checking process of the data checking apparatus 200 may be performed in three steps. In the first step, the data checking apparatus 200 executes the SQL data checking program in the target database. In the second step, the data checking apparatus 200 reads data from the target database. In the third step, the data checking device compares the read data with the data to be checked to obtain a comparison result. In the fourth step, the data checking apparatus 200 stores the comparison result.
在第一步之前,业务人员可以在数据检查装置200的规则配置界面输入脚本参数,然后数据检查装置200调用这些脚本参数生成SQL数据检查程序。Before the first step, the business personnel can input script parameters in the rule configuration interface of the data inspection apparatus 200, and then the data inspection apparatus 200 invokes these script parameters to generate the SQL data inspection program.
在本发明实施例中,脚本参数可以包括条件和条件范围。以目标数据库的配置信息是“采购业务”为例,条件可以是办公用品,条件范围可以金额的范围。例如业务人员可以在规则配置界面输入脚本参数为:条件“办公用品”、条件范围“金额在10000-50000”。那么数据检查装置便可以调用该脚本参数生成SQL数据检查程序,然后在目标数据库中执行SQL数据检查程序。In this embodiment of the present invention, the script parameters may include conditions and condition ranges. Taking the configuration information of the target database as "purchasing business" as an example, the condition can be office supplies, and the condition range can be the range of amount. For example, business personnel can input script parameters in the rule configuration interface as: condition "office supplies", condition range "amount in 10000-50000". Then the data checking device can call the script parameter to generate the SQL data checking program, and then execute the SQL data checking program in the target database.
需要说明的是,传统的数据检查装置仅仅作为IT(information technology,IT)人员的使用工具,业务人员不能够自己设置需求,即脚本参数。业务人员可以把需求告诉IT人员,然后由IT人员设置需求,但是往往IT人员对业务的理解有偏差,所以通过IT人员设置的规则提供的数据和业务人员需要的数据有所差别。在本发明实施例中,业务人员可以自己从业务角度录入需求,然后由数据检查装置200自动生成SQL数据检查程序,自动执行检查任务,方便业务人员的需求。It should be noted that the traditional data checking device is only used as a tool for IT (information technology, IT) personnel, and business personnel cannot set requirements, ie, script parameters, by themselves. Business personnel can tell the requirements to the IT personnel, and then the IT personnel set the requirements, but often the IT personnel have a biased understanding of the business, so the data provided by the rules set by the IT personnel and the data required by the business personnel are different. In this embodiment of the present invention, business personnel can input requirements from a business perspective, and then the data checking device 200 automatically generates an SQL data checking program to automatically perform checking tasks to facilitate business personnel's needs.
当数据检查装置200生成SQL数据检查程序后,便可执行第一步,即在目标数据库中执行该SQL数据检查程序。After the data checking apparatus 200 generates the SQL data checking program, the first step may be executed, that is, executing the SQL data checking program in the target database.
在第二步中,数据检查装置200读取满足条件和条件范围的数据。In the second step, the data checking apparatus 200 reads data satisfying the condition and the condition range.
在第三步中,数据检查装置200将读取出的数据与待检查数据进行对比,得到对比结果。In the third step, the data checking device 200 compares the read data with the data to be checked to obtain a comparison result.
在本发明实施例中,数据检查装置200将读取出的数据与待检查数据的对比过程可以为,数据检查装置200提取目标数据库中的数据元素,并提取待检查数据中的数据元素,然后将数据元素进行单独对比。In this embodiment of the present invention, the process of comparing the read data with the data to be inspected by the data inspection device 200 may be as follows: the data inspection device 200 extracts data elements in the target database, extracts data elements in the data to be inspected, and then Compare data elements individually.
举例来说,A部门在进行季度核算时,业务人员可以将季度采购支出明细表作为待检查数据,数据检查装置200根据待检查数据的属性信息便可确定目标数据库,即配置信息为“采购业务”的数据库。数据检查装置200提取待检查数据中包含的数据元素,例如采购物品名称、采购金额,采购数量等,并提取目标数据库中的数据元素例如采购物品名称、采购金额,采购数量等,将每个数据元素与待检查数据中的数据元素进行一一对比。For example, when department A performs quarterly accounting, the business personnel can use the quarterly purchase expenditure list as the data to be inspected, and the data inspection device 200 can determine the target database according to the attribute information of the data to be inspected, that is, the configuration information is "purchasing business. " database. The data inspection apparatus 200 extracts data elements contained in the data to be inspected, such as the name of the purchased item, purchase amount, purchase quantity, etc., and extracts data elements in the target database, such as the purchase item name, purchase amount, purchase quantity, etc. Elements are compared one-to-one with the data elements in the data to be checked.
在第三步中,数据检查装置200可以将对比结果存储,以便后续使用。关于对比结果的使用,将在后续介绍。In the third step, the data checking apparatus 200 may store the comparison result for subsequent use. The use of comparison results will be introduced later.
对于第二种情况,由于目标数据库的数量大于等于2,即数据检查装置200需要将待检查数据和至少两个目标数据库中的数据进行对比。图5示出了目标数据库的数据大于等于2时数据检查过程的示意图。如图5所示,数据检查装置200的数据检查装置可以分五步进行。第一步,数据检查装置200在至少两个目标数据库中确定一个第一目标数据库。第二步,数据检查装置200在第一目标数据库中执行SQL数据检查程序。第三步,数据检查装置200从至少两个目标数据库中读取数据。第四步,数据检查装置200将读取的数据与待检查数据进行对比,得到对比结果。第五步,数据检查装置200将对比结果存储。For the second case, since the number of target databases is greater than or equal to 2, that is, the data checking apparatus 200 needs to compare the data to be checked with the data in at least two target databases. Fig. 5 shows a schematic diagram of the data checking process when the data of the target database is greater than or equal to 2. As shown in FIG. 5, the data checking apparatus of the data checking apparatus 200 may be performed in five steps. In the first step, the data checking apparatus 200 determines a first target database among at least two target databases. In the second step, the data checking apparatus 200 executes the SQL data checking program in the first target database. In the third step, the data checking apparatus 200 reads data from at least two target databases. In the fourth step, the data checking device 200 compares the read data with the data to be checked to obtain a comparison result. In the fifth step, the data checking apparatus 200 stores the comparison result.
在第一步中,数据检查装置200在至少两个目标数据库中确定第一目标数据库的方式可以有多种。第一种方式,业务人员可以凭借工作经验,事先设置包含数据较多的数据库作为第一目标数据库的备选数据库。这样,如果数据检查装置200确定的至少两个目标数据库中有备选数据库,则默认备选数据库为第一目标数据库。第二种方式,数据检查装置200可以确定至少两个目标数据库中任意一个数据库作为第一目标数据库。第三种方式,数据检查装置200可以创建一个临时的空的数据库作为第一目标数据库,然后将其它目标数据库的数据同步到该第一目标数据库中。以上只是列举了三种确定第一目标数据的方式,当然还有其它的实现方式,本发明实施例对此不作具体的限定。In the first step, the data checking apparatus 200 may determine the first target database among the at least two target databases in various ways. In the first way, business personnel can set a database containing more data as a candidate database of the first target database in advance by virtue of their work experience. In this way, if there are candidate databases in the at least two target databases determined by the data checking apparatus 200, the default candidate database is the first target database. In the second manner, the data checking apparatus 200 may determine any one of the at least two target databases as the first target database. In a third manner, the data checking apparatus 200 may create a temporary empty database as the first target database, and then synchronize the data of other target databases to the first target database. The above only enumerates three manners for determining the first target data. Of course, there are other implementation manners, which are not specifically limited in this embodiment of the present invention.
这里确定第一目标数据库的原因为:前面已经提到过,数据检查装置200进行数据检查时,需要在目标数据库中执行SQL数据检查程序来读取数据,而该SQL数据检查程序只能在一个目标数据库中执行。所以在第一步中,数据检查装置200可以确定一个第一目标数据库。The reason for determining the first target database here is: as mentioned above, when the data checking device 200 performs data checking, it is necessary to execute an SQL data checking program in the target database to read data, and the SQL data checking program can only be used in one Executed in the target database. Therefore, in the first step, the data checking apparatus 200 can determine a first target database.
需要说明的是,由于各个数据库之间是相互独立的,所以数据检查装置200在确定第一目标数据库之后,可以建立至少两个目标数据库中除第一目标数据之外的其它目标数据库与该第一目标数据库之间的数据同步。这样,在第二步中,数据检查装置200可以在第一目标数据库中执行SQL数据检查程序时,来读取到每个目标数据库中的数据。It should be noted that, since each database is independent of each other, after determining the first target database, the data checking apparatus 200 can establish other target databases except the first target data in the at least two target databases and the first target database. Data synchronization between a target database. In this way, in the second step, the data checking apparatus 200 can read the data in each target database when executing the SQL data checking program in the first target database.
下面介绍数据检查装置200建立第一目标数据库和其它目标数据库之间的数据同步的两种方式。The following introduces two ways for the data checking apparatus 200 to establish data synchronization between the first target database and other target databases.
第一种方式,数据检查装置200将至少两个目标数据库中除第一目标数据库之外的其它目标数据库中的数据复制到第一目标数据库中。复制方式可以有多种,例如采用数据备份(oracle golden gate,OGG)技术或者数据同步(datastage)技术将其它目标数据库中的数据复制到第一目标数据库中,当然,还可以有其它的复制方式,本发明实施例对此不作具体的限定。In a first manner, the data checking apparatus 200 replicates data in other target databases except the first target database in the at least two target databases to the first target database. There are various replication methods, for example, data in other target databases are replicated to the first target database by using data backup (oracle golden gate, OGG) technology or data synchronization (datastage) technology. Of course, there are other replication methods. , which is not specifically limited in this embodiment of the present invention.
OGG技术或者DataStage技术是数据复制备份技术,它可以获知其它目标数据库中数据,再将这些数据复制到第一目标数据库,从而实现其它目标数据库与第一目标数据库之间的数据同步。OGG technology or DataStage technology is a data replication backup technology, which can learn data in other target databases, and then replicate the data to the first target database, thereby realizing data synchronization between other target databases and the first target database.
第二种方式,数据检查装置200将至少两个目标数据库中除第一目标数据库之外的其它目标数据库与第一目标数据库通过数据库链接(database link,Dblingk)技术建立数据链接。这种方式下,数据检查装置200不需要将其它目标数据库中的数据复制到第一目标数据库,也可以读取其它目标数据库中的数据。In a second manner, the data checking apparatus 200 establishes a data link between other target databases except the first target database and the first target database in the at least two target databases through a database link (database link, Dblingk) technology. In this manner, the data checking apparatus 200 does not need to copy data in other target databases to the first target database, and can also read data in other target databases.
Dblingk技术是将两个数据库之间建立链接的技术,如同电话线一般,当访问两个数据库中的其中一个数据库时,可以同时访问到另外一个数据库中的数据。这种技术适用于两个数据库之间的距离较远的情况,例如总公司和分总司在不同的地区,那么对于子公司来说,可以将本地数据库与总公司的远程数据库之间通过Dblingk技术建立数据链接,这样,分公司的数据检查装置200在访问本地数据库时,便可同时访问远程数据库。Dblingk technology is a technology that establishes a link between two databases, just like a telephone line. When accessing one of the two databases, the data in the other database can be accessed at the same time. This technology is suitable for the situation where the distance between the two databases is relatively far, for example, the head office and the branch office are in different regions, then for the subsidiary company, the local database and the remote database of the head office can be connected through Dblingk The technology establishes a data link, so that when the data checking device 200 of the branch office accesses the local database, it can simultaneously access the remote database.
在实际应用中,以上的两种方式可以单独使用,也可以结合使用。以目标数据库的数量是5为例,图6示出了5个目标数据库之间数据同步的示意图。如图6所示,目标数据库A-B与第一目标数据库之间通过OGG技术建立数据同步,目标数据库C与第一目标数据库之间通过Dblingk技术建立数据同步,目标数据库D与第一目标数据库之间通过DataStage技术建立数据同步。In practical applications, the above two methods can be used alone or in combination. Taking the number of target databases as 5 as an example, FIG. 6 shows a schematic diagram of data synchronization among the 5 target databases. As shown in Figure 6, data synchronization is established between target databases A-B and the first target database through OGG technology, data synchronization between target database C and the first target database is established through Dblingk technology, and between target database D and the first target database Data synchronization is established through DataStage technology.
数据检查装置200在建立其它目标数据库与第一目标数据库之间的数据同步之后,便可执行第二步到第五步,关于第二步到第五步的执行过程请参见前述的关于第一种情况中的第一步到第四步的描述,为了说明书的简洁,在此不做赘述。After establishing the data synchronization between the other target database and the first target database, the data checking apparatus 200 can execute the second to fifth steps. For the execution process of the second to fifth steps, please refer to the aforementioned about the first step. The description of the first step to the fourth step in this case is not repeated here for the sake of brevity of the specification.
举例来说,A部门的工作人员出差时的飞机票是在财务处通过机票订单领取支票来付账的,那么财务处工作人员可以将该工作人员采购机票的“采购订单”和“采购支票”录入到配置信息为“采购业务”的数据库中。在该工作人员去报销此次出差费用时,财务处工作人员便将关于飞机票的报销信息录入到配置信息为“出差费报销”的数据库中。即,两个不同的数据库中包含同一件事情的相关数据。在A部分进行财务核算时,例如要对季度财务收支进行核算,那么待检查数据即财务收支明细表。数据检查装置200便确定配置信息为“采购业务”的数据库和配置信息为“出差费报销”的数据库为两个目标数据库。假设数据检查装置200确定配置信息为“出差费报销”的数据库为第一目标数据库,那么配置信息为“采购业务”的数据库和配置信息为“出差费报销”的数据库之间数据同步,数据检查装置200便可读取关于A部门财务收支的所有数据,与待检查的数据进行对比,得到对比结果。For example, when a staff member of Department A travels on a business trip, he pays the bill by picking up a check through the ticket order at the Finance Office, then the staff member of the Finance Office can enter the "purchase order" and "purchase check" for the staff member's purchase of the ticket. into the database whose configuration information is "purchasing business". When the staff member goes to reimburse the travel expenses, the staff of the finance department will enter the reimbursement information about the air ticket into the database whose configuration information is "business travel expense reimbursement". That is, two different databases contain data about the same thing. When performing financial accounting in Part A, for example, quarterly financial revenue and expenditure are to be accounted for, the data to be checked is the financial revenue and expenditure schedule. The data checking apparatus 200 determines that the database whose configuration information is "purchasing business" and the database whose configuration information is "business travel expense reimbursement" are two target databases. Assuming that the data checking apparatus 200 determines that the database whose configuration information is "travel expense reimbursement" is the first target database, then the data is synchronized between the database whose configuration information is "purchasing business" and the database whose configuration information is "business trip expense reimbursement". The device 200 can read all the data about the financial income and expenditure of the A department, compare it with the data to be checked, and obtain the comparison result.
通过以上的描述可知,在本发明实施例中,数据检查装置200不仅可以自动对待检查数据进行数据检查,减低了由于人为的误操作而导致的出错的几率,还可以跨多个数据库对待检查数据进行数据检查,进一步提高了数据检查的准确性。As can be seen from the above description, in the embodiment of the present invention, the data checking apparatus 200 can not only automatically perform data checking on the data to be checked, reducing the probability of errors caused by human misoperation, but also can cross multiple databases for the data to be checked. Data checking is performed to further improve the accuracy of data checking.
S304:输出对比结果。S304: Output the comparison result.
在本发明实施例中,为了方便业务人员查阅对比结果,数据检查装置200可以将对比结果生成结果报告。在该结果报告中可以统计有关键信息,例如结果报告中可以包含数据检查人员、数据检查时间、数据检查设备、数据检查时采用的目标数据库的配置信息、待检查数据的属性信息等。因此,业务人员在查阅结果报告时,便可以清楚的知道此次数据检查过程的明细。如果结果报告中有异常值(待检查数据和至少一个目标数据库中存储的数据不一致的数据),那么业务人员也可以快速的定位产生异常值的原因。In this embodiment of the present invention, in order to facilitate business personnel to check the comparison result, the data checking apparatus 200 may generate a result report from the comparison result. Key information can be counted in the result report, for example, the result report can include data inspector, data inspection time, data inspection equipment, configuration information of the target database used in data inspection, attribute information of the data to be inspected, etc. Therefore, business personnel can clearly know the details of the data inspection process when reviewing the result report. If there are abnormal values in the result report (data that is inconsistent between the data to be checked and the data stored in at least one target database), the business personnel can also quickly locate the cause of the abnormal values.
在本发明实施例中,如果结果报告中有异常值,为了及时提醒业务人员,数据检查装置200可以输出提示信息,以提示业务人员及时查看结果报告,以便及时定位产生异常值的原因。数据检查装置200输出提示信息的方式可以有多种,例如数据检查装置200可以绑定业务人员的邮箱,当结果报告中有异常值时,数据检查装置200便向绑定的邮箱发送提醒邮件。当然,还有其它的输出提示信息的方式,本发明实施例对比不作具体的限定。In this embodiment of the present invention, if there is an abnormal value in the result report, in order to remind the business personnel in time, the data checking apparatus 200 may output prompt information to prompt the business personnel to check the result report in time, so as to locate the cause of the abnormal value in time. The data inspection apparatus 200 can output prompt information in various ways. For example, the data inspection apparatus 200 can bind the mailboxes of business personnel. When there are abnormal values in the result report, the data inspection apparatus 200 sends a reminder email to the bound mailboxes. Of course, there are other ways of outputting prompt information, and the comparison of the embodiments of the present invention is not specifically limited.
数据检查装置200可以将结果报告归类存储,对于存在异常值的结果报告记录归类为问题报告。业务人员可以参阅问题报告对数据检查装置200进行及时的修复和优化。The data checking apparatus 200 may classify and store the result reports, and classify the result report records with abnormal values as problem reports. The business personnel can refer to the problem report to repair and optimize the data checking device 200 in time.
总结来说,本发明实施例提供一种数据检查装置,该数据检查装置可以根据待检查数据的属性信息从N个目标数据库中寻找和该待检查数据的属性信息相匹配的至少一个目标数据库,然后将待检查数据与至少一个目标数据库中的数据进行对比,得到对比结果。通过这种方式,将目前的数据检查工作自动化,减轻了业务人员的工作量,同时降低数据检查的出错几率。To sum up, an embodiment of the present invention provides a data inspection device, which can search for at least one target database matching the attribute information of the data to be inspected from N target databases according to the attribute information of the data to be inspected, Then, the data to be checked is compared with the data in at least one target database to obtain a comparison result. In this way, the current data inspection work is automated, the workload of business personnel is reduced, and the error probability of data inspection is reduced.
下面结合附图介绍本发明实施例提供的装置。The following describes the device provided by the embodiments of the present invention with reference to the accompanying drawings.
在一个简单的实施例中,本领域的技术人员可以想到,还可以将数据检查装置200通过如图7所示的结构实现。数据检查装置200可以是一台或多台计算机。In a simple embodiment, those skilled in the art can imagine that the data checking apparatus 200 can also be implemented by the structure shown in FIG. 7 . The data checking apparatus 200 may be one or more computers.
如图7所示,数据检查装置700可以包括:存储器701、处理器702和总线703。处理器701和处理器702可以通过总线703连接。其中,存储器701用于存储计算机执行指令,当数据检查装置700运行时,处理器702执行存储器701存储的计算机执行指令,以使数据检查装置700执行图3所示的实施例提供的数据检查方法。具体的数据检查方法可参考上文及附图中的相关描述,此处不再赘述。As shown in FIG. 7 , the data checking apparatus 700 may include: a memory 701 , a processor 702 and a bus 703 . The processor 701 and the processor 702 may be connected by a bus 703 . The memory 701 is used to store computer-executed instructions. When the data inspection apparatus 700 is running, the processor 702 executes the computer-executed instructions stored in the memory 701, so that the data inspection apparatus 700 executes the data inspection method provided by the embodiment shown in FIG. 3 . . For a specific data checking method, reference may be made to the relevant descriptions above and in the accompanying drawings, and details are not repeated here.
在本发明实施例中,处理器702可以是现场可编程门阵列(field-programmablegate array,FPGA),专用集成芯片(application specific integrated circuit,ASIC),系统芯片(system on chip,SoC),中央处理器(central processor unit,CPU),网络处理器(network processor,NP),数字信号处理电路(digital signal processor,DSP),微控制器(micro controller unit,MCU),还可以采用可编程控制器(programmable logicdevice,PLD)或其他集成芯片。总线703可以是外设部件互连标准(peripheral componentinterconnect,PCI)总线或扩展工业标准结构(extended industry standardarchitecture,EISA)总线等。总线703可以分为地址总线、数据总线、控制总线等。为便于表示,图7中仅用一条粗线表示,但并不表示仅有一根总线或一种类型的总线。In this embodiment of the present invention, the processor 702 may be a field-programmable gate array (FPGA), an application specific integrated circuit (ASIC), a system on chip (SoC), or a central processing unit. A central processor unit (CPU), a network processor (NP), a digital signal processor (DSP), a microcontroller (micro controller unit, MCU), and a programmable controller ( programmable logic device, PLD) or other integrated chips. The bus 703 may be a peripheral component interconnect (PCI) bus or an extended industry standard architecture (EISA) bus or the like. The bus 703 can be divided into an address bus, a data bus, a control bus, and the like. For ease of presentation, only one thick line is used in FIG. 7, but it does not mean that there is only one bus or one type of bus.
综上所述,本发明实施例提供了一种数据检查装置以及方法。在该方法中,数据检查装置可以根据待检查数据的属性信息从N个目标数据库中寻找和该待检查数据的属性信息相匹配的至少一个目标数据库,然后将待检查数据与至少一个目标数据库中的数据进行对比,得到对比结果。通过这种方式,将目前的数据检查工作自动化,减轻了业务人员的工作量,相应地也会降低数据检查的出错几率。To sum up, the embodiments of the present invention provide a data checking apparatus and method. In this method, the data inspection device can search for at least one target database matching the attribute information of the data to be inspected from the N target databases according to the attribute information of the data to be inspected, and then match the data to be inspected with the at least one target database. The data are compared to obtain the comparative results. In this way, the current data inspection work is automated, the workload of business personnel is reduced, and the error probability of data inspection is correspondingly reduced.
需要说明的是,前述实施例中提出单元或模块的划分仅作为一种示例性的示出,所描述的各个单元的功能仅是举例说明,本申请并不以此为限。程序设计人员可以根据需求合并其中两个或更多单元的功能,或者将一个单元的功能拆分从而获得更多更细粒度的单元,以及其他变形方式。It should be noted that, the division of units or modules proposed in the foregoing embodiments is only an exemplary illustration, and the functions of the described units are only illustrative, and the present application is not limited thereto. Programmers can combine the functions of two or more units according to their needs, or split the functions of one unit to obtain more fine-grained units, as well as other variants.
以上描述的各个实施例之间相同或相似的部分可相互参考。The same or similar parts between the various embodiments described above may refer to each other.
以上所描述的装置实施例仅仅是示意性的,其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。另外,本发明提供的装置实施例附图中,模块之间的连接关系表示它们之间具有通信连接,具体可以实现为一条或多条通信总线或信号线。本领域普通技术人员在不付出创造性劳动的情况下,即可以理解并实施。The device embodiments described above are only illustrative, wherein the units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in One place, or it can be distributed over multiple network elements. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution in this embodiment. In addition, in the drawings of the apparatus embodiments provided by the present invention, the connection relationship between the modules indicates that there is a communication connection between them, which may be specifically implemented as one or more communication buses or signal lines. Those of ordinary skill in the art can understand and implement it without creative effort.
在上述发明实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。所述计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行所述计算机程序指令时,全部或部分地产生按照本发明实施例所述的流程或功能。所述计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。所述计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个可读存储介质传输,例如,所述计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线(DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。所述计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。所述可用介质可以是磁性介质,(例如,软盘、硬盘、磁带)、光介质(例如,DVD)、或者半导体介质(例如,固态硬盘Solid State Disk(SSD))等。The above embodiments of the invention may be implemented in whole or in part by software, hardware, firmware, or any combination thereof. When implemented in software, it can be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, all or part of the processes or functions described in the embodiments of the present invention are generated. The computer may be a general purpose computer, special purpose computer, computer network, or other programmable device. The computer instructions may be stored in or transmitted from one computer-readable storage medium to another, for example, from a website site, computer, server or data center via Transmission to another website site, computer, server, or data center by wired (eg, coaxial cable, fiber optic, digital subscriber line (DSL)) or wireless (eg, infrared, wireless, microwave, etc.) means. The computer-readable storage medium may be any available medium that can be accessed by a computer or a data storage device such as a server, data center, etc. that includes an integration of one or more available media. The usable media may be magnetic media (eg, floppy disks, hard disks, magnetic tapes), optical media (eg, DVD), or semiconductor media (eg, Solid State Disk (SSD)), and the like.
以上实施例仅用以对本发明实施例的技术方案进行详细介绍,但以上实施例的说明只是用于帮助理解本发明实施例的方法及其核心思想,不应该理解为对本申请的限制。本领域技术人员在本发明实施例揭露的技术范围内,可轻易想到的变化或替换,都应涵盖在本发明实施例的保护范围之内。The above embodiments are only used to describe the technical solutions of the embodiments of the present invention in detail, but the descriptions of the above embodiments are only used to help understand the methods and core ideas of the embodiments of the present invention, and should not be construed as limitations on the application. Changes or substitutions that can be easily conceived by those skilled in the art within the technical scope disclosed in the embodiments of the present invention should be covered by the protection scope of the embodiments of the present invention.
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201710644061.5ACN109947797B (en) | 2017-07-31 | 2017-07-31 | Data inspection device and method |
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201710644061.5ACN109947797B (en) | 2017-07-31 | 2017-07-31 | Data inspection device and method |
| Publication Number | Publication Date |
|---|---|
| CN109947797Atrue CN109947797A (en) | 2019-06-28 |
| CN109947797B CN109947797B (en) | 2022-04-05 |
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201710644061.5AActiveCN109947797B (en) | 2017-07-31 | 2017-07-31 | Data inspection device and method |
| Country | Link |
|---|---|
| CN (1) | CN109947797B (en) |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110532023A (en)* | 2019-08-21 | 2019-12-03 | 北京达佳互联信息技术有限公司 | Information processing method and device for Data Detection |
| CN110717078A (en)* | 2019-09-16 | 2020-01-21 | 武汉安诠加信息技术有限公司 | Beauty shop business data monitoring method, device, equipment and medium |
| CN111639100A (en)* | 2020-04-02 | 2020-09-08 | 刘沁 | Method and device for comparing capital construction and capital supply material design inventory and declaration information |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7143107B1 (en)* | 2003-06-26 | 2006-11-28 | Microsoft Corporation | Reporting engine for data warehouse |
| CN101038683A (en)* | 2006-03-15 | 2007-09-19 | 富士通株式会社 | System and method for personal identificatioin using biometrics data, and computer-readable recording medium in which personal identification program is stored |
| CN101882159A (en)* | 2010-06-18 | 2010-11-10 | 中兴通讯股份有限公司 | Database detecting method of and device thereof |
| CN104200397A (en)* | 2014-08-27 | 2014-12-10 | 北京广利核系统工程有限公司 | Method for verifying consistence of designed input file and variable information database of DCS (Distributed Control System) of nuclear power station |
| CN104361001A (en)* | 2014-09-29 | 2015-02-18 | 株洲南车时代电气股份有限公司 | LKJ basic data comparison method and system |
| US20160036824A1 (en)* | 2009-05-06 | 2016-02-04 | Grigory Levit | Control and verification of permissions |
| CN106682033A (en)* | 2015-11-10 | 2017-05-17 | 北京国双科技有限公司 | Knowledge base check method and device |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7143107B1 (en)* | 2003-06-26 | 2006-11-28 | Microsoft Corporation | Reporting engine for data warehouse |
| CN101038683A (en)* | 2006-03-15 | 2007-09-19 | 富士通株式会社 | System and method for personal identificatioin using biometrics data, and computer-readable recording medium in which personal identification program is stored |
| US20160036824A1 (en)* | 2009-05-06 | 2016-02-04 | Grigory Levit | Control and verification of permissions |
| CN101882159A (en)* | 2010-06-18 | 2010-11-10 | 中兴通讯股份有限公司 | Database detecting method of and device thereof |
| CN104200397A (en)* | 2014-08-27 | 2014-12-10 | 北京广利核系统工程有限公司 | Method for verifying consistence of designed input file and variable information database of DCS (Distributed Control System) of nuclear power station |
| CN104361001A (en)* | 2014-09-29 | 2015-02-18 | 株洲南车时代电气股份有限公司 | LKJ basic data comparison method and system |
| CN106682033A (en)* | 2015-11-10 | 2017-05-17 | 北京国双科技有限公司 | Knowledge base check method and device |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110532023A (en)* | 2019-08-21 | 2019-12-03 | 北京达佳互联信息技术有限公司 | Information processing method and device for Data Detection |
| CN110717078A (en)* | 2019-09-16 | 2020-01-21 | 武汉安诠加信息技术有限公司 | Beauty shop business data monitoring method, device, equipment and medium |
| CN111639100A (en)* | 2020-04-02 | 2020-09-08 | 刘沁 | Method and device for comparing capital construction and capital supply material design inventory and declaration information |
| Publication number | Publication date |
|---|---|
| CN109947797B (en) | 2022-04-05 |
| Publication | Publication Date | Title |
|---|---|---|
| CN109522746B (en) | A data processing method, electronic device and computer storage medium | |
| US9152662B2 (en) | Data quality analysis | |
| CN112488652B (en) | Work order auditing method, system, terminal and storage medium | |
| US12045225B2 (en) | Multi-table data validation tool | |
| CN111177134B (en) | Data quality analysis method, device, terminal and medium applicable to massive data | |
| US20120072464A1 (en) | Systems and methods for master data management using record and field based rules | |
| US11907184B1 (en) | Collaborative data mapping system | |
| Xie et al. | Big data validation case study | |
| CN113268567A (en) | Multi-attribute text matching method, device, equipment and storage medium | |
| WO2019179025A1 (en) | Related transaction identifying method, program, device and storage medium | |
| US9864789B2 (en) | Method and system for implementing an on-demand data warehouse | |
| CN109947797A (en) | A data checking device and method | |
| CN117235040A (en) | Big data center frame system based on data management | |
| CN112990741B (en) | A workload assessment method, device, equipment and storage medium | |
| CN111400299A (en) | Method and system for testing fusion quality of multiple data | |
| CN109597828B (en) | A kind of off-line data check method, device and server | |
| CN117555520A (en) | Demand processing method and device, electronic equipment and storage medium | |
| TWM650648U (en) | Systems for verifying insurance application information | |
| CN116628518A (en) | Data consistency verification method, device, equipment and medium | |
| CN116755975A (en) | Data management capability rating method and device based on DCMM, electronic equipment and medium | |
| CN114610809A (en) | Power grid data structured processing method and device | |
| CN114840668A (en) | Network text review method, electronic device and storage medium | |
| US20120036110A1 (en) | Automatically Reviewing Information Mappings Across Different Information Models | |
| CN117635371A (en) | An enterprise data quality management method, device, electronic equipment and storage medium | |
| CN112395850B (en) | A method, device and electronic device for checking data definition consistency |
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |