Disclosure of Invention
Based on this, in order to solve the above technical problems, a method, a system, a computer device and a storage medium for medical data management based on a metadata model are provided, which can improve the non-orderliness and the low efficiency of data management in the current data management implementation and break through the pure human experience dependence mode.
A method of medical data governance based on a metadata model, the method comprising:
constructing a metadata model according to the medical data governance scene based on the collected and recovered metadata; the metadata model comprises a TCG (target, classsify, grade) three-dimensional management model, a dynamic blood relationship model, a dynamic incidence relationship model and a dynamic cold-hot relationship model;
monitoring medical data in real time, and performing classified management on the medical data through the TCG three-dimensional management model; when the medical data has a medical data governance scene problem, determining a data governance type of the medical data; the data management types comprise data quality management, data safety management and data asset management;
when the data management type of the medical data is data quality management, checking the reason for locating the data quality and calling the dynamic blood relationship model; inputting the medical data into the dynamic blood relationship model to obtain a data direct blood relationship; obtaining quality problem classification information according to the data direct blood relationship, displaying the data direct blood relationship and the quality problem classification information, and performing quality problem intervention;
when the data management type of the medical data is data security management, identifying data security risk conditions and calling the dynamic incidence relation model; inputting the medical data into the dynamic incidence relation model to obtain a data incidence relation; acquiring a safety protection scheme according to the data association relation, displaying the data association relation and the safety protection scheme, and performing data safety protection intervention;
when the data governance type of the medical data is data asset governance, identifying the data asset value and calling the dynamic cold-hot relationship model; inputting the medical data into the dynamic cold-hot relationship model to obtain cold-hot relationship data; and acquiring an asset value grade according to the cold and hot relationship data, displaying the cold and hot relationship data and the asset value grade, and marking the data asset value.
In one embodiment, the classifying and managing the medical data through the TCG three-dimensional management model includes:
inputting the medical data into the TCG three-dimensional management model, and carrying out classification and grading processing on the medical data through data object dimensions by the TCG three-dimensional management model; the TCG three-dimensional management model grades the medical data from sensitivity and influence degrees through data grading dimensionality; the TCG three-dimensional management model carries out type division on the medical data from the aspect of data type or data scene application requirements through data classification dimensions to obtain a classification management result.
In one embodiment, the medical data is input into the dynamic blood relationship model, and a data direct blood relationship is obtained; obtaining quality problem classification information according to the data direct blood relationship, displaying the data direct blood relationship and the quality problem classification information, and performing quality problem intervention, wherein the quality problem intervention method comprises the following steps:
inputting the medical data into the dynamic blood relationship model, and dynamically extracting a data model and a data operation joba for a data warehouse through the dynamic blood relationship model to form a data model and an operation set;
the dynamic blood relationship model carries out database analysis on the data model and the operation set, identifies different set data dependency relationships and forms a data dependency set;
the dynamic blood relationship model forms a data direct blood relationship corresponding to the medical data by using the TCG three-dimensional management model according to the data dependency set and an association algorithm based on an association rule, wherein the data direct blood relationship is marked with the quality problem classification information;
and displaying the data direct relationship and the quality problem classification information, and performing quality problem intervention.
In one embodiment, before performing the quality issue intervention, the method further comprises:
positioning medical data with quality problems according to the direct blood relationship and the quality problem classification information;
and acquiring a quality evaluation strategy, determining the reason of the quality problem according to the quality evaluation strategy and the medical data with the quality problem, and intervening the quality problem according to the reason of the quality problem.
In one embodiment, the medical data is input into the dynamic association relation model to obtain a data association relation; acquiring a safety protection scheme according to the data association relation, displaying the data association relation and the safety protection scheme, and performing data safety protection intervention, wherein the method comprises the following steps:
inputting the medical data into the dynamic incidence relation model, and forming a data dependence set through the dynamic incidence relation model;
the dynamic incidence relation model filters transverse incidence type data in the medical data according to the data dependence set, and intelligently classifies the transverse incidence type data through a statistical clustering algorithm and the TCG three-dimensional management model to form an initial data incidence relation;
and the dynamic incidence relation model integrates and removes the duplication of the initial data incidence relation to obtain the data incidence relation, acquires the safety protection scheme corresponding to the data incidence relation, displays the data incidence relation and the safety protection scheme and performs data safety protection intervention.
In one embodiment, the medical data is input into the dynamic cold-hot relationship model, so as to obtain cold-hot relationship data; acquiring an asset value grade according to the cold and hot relationship data, displaying the cold and hot relationship data and the asset value grade, and carrying out data asset value marking, wherein the method comprises the following steps:
inputting the medical data into the dynamic cold-hot relationship model, and extracting and analyzing a data processing log through the dynamic cold-hot relationship model to obtain the use frequency of the medical data;
the dynamic cold-hot relationship model is used for calculating a cold-hot label of the medical data by combining a clustering algorithm according to the use frequency;
the dynamic cold-hot relationship model combines the TCG three-dimensional management model to cluster and classify the medical data according to the cold-hot degree label to obtain the cold-hot relationship data; and acquiring an asset value grade according to the cold and hot relationship data, displaying the cold and hot relationship data and the asset value grade, and marking the data asset value.
A metadata model-based medical data governance system, the system comprising:
the model building module is used for building a metadata model according to the medical data governance scene based on the collected and recovered metadata; the metadata model comprises a TCG (target, classsify, grade) three-dimensional management model, a dynamic blood relationship model, a dynamic incidence relationship model and a dynamic cold-hot relationship model;
the data management type determining module is used for monitoring medical data in real time and performing classified management on the medical data through the TCG three-dimensional management model; when the medical data has a medical data governance scene problem, determining a data governance type of the medical data; the data management types comprise data quality management, data safety management and data asset management;
the data quality management module is used for checking the data quality reason and calling the dynamic blood relationship model when the data management type of the medical data is data quality management; inputting the medical data into the dynamic blood relationship model to obtain a data direct blood relationship; obtaining quality problem classification information according to the data direct blood relationship, displaying the data direct blood relationship and the quality problem classification information, and performing quality problem intervention;
the data security management module is used for identifying the data security risk condition and calling the dynamic incidence relation model when the data management type of the medical data is data security management; inputting the medical data into the dynamic incidence relation model to obtain a data incidence relation; acquiring a safety protection scheme according to the data association relation, displaying the data association relation and the safety protection scheme, and performing data safety protection intervention;
the data asset management module is used for identifying the value of the data asset and calling the dynamic cold-hot relationship model when the data management type of the medical data is data asset management; inputting the medical data into the dynamic cold-hot relationship model to obtain cold-hot relationship data; and acquiring an asset value grade according to the cold and hot relationship data, displaying the cold and hot relationship data and the asset value grade, and marking the data asset value.
A computer device comprising a memory and a processor, the memory storing a computer program, the processor implementing the following steps when executing the computer program:
constructing a metadata model according to the medical data governance scene based on the collected and recovered metadata; the metadata model comprises a TCG (target, classsify, grade) three-dimensional management model, a dynamic blood relationship model, a dynamic incidence relationship model and a dynamic cold-hot relationship model;
monitoring medical data in real time, and performing classified management on the medical data through the TCG three-dimensional management model; when the medical data has a medical data governance scene problem, determining a data governance type of the medical data; the data management types comprise data quality management, data safety management and data asset management;
when the data management type of the medical data is data quality management, checking the reason for locating the data quality and calling the dynamic blood relationship model; inputting the medical data into the dynamic blood relationship model to obtain a data direct blood relationship; obtaining quality problem classification information according to the data direct blood relationship, displaying the data direct blood relationship and the quality problem classification information, and performing quality problem intervention;
when the data management type of the medical data is data security management, identifying data security risk conditions and calling the dynamic incidence relation model; inputting the medical data into the dynamic incidence relation model to obtain a data incidence relation; acquiring a safety protection scheme according to the data association relation, displaying the data association relation and the safety protection scheme, and performing data safety protection intervention;
when the data governance type of the medical data is data asset governance, identifying the data asset value and calling the dynamic cold-hot relationship model; inputting the medical data into the dynamic cold-hot relationship model to obtain cold-hot relationship data; and acquiring an asset value grade according to the cold and hot relationship data, displaying the cold and hot relationship data and the asset value grade, and marking the data asset value.
A computer-readable storage medium, on which a computer program is stored which, when executed by a processor, carries out the steps of:
constructing a metadata model according to the medical data governance scene based on the collected and recovered metadata; the metadata model comprises a TCG (target, classsify, grade) three-dimensional management model, a dynamic blood relationship model, a dynamic incidence relationship model and a dynamic cold-hot relationship model;
monitoring medical data in real time, and performing classified management on the medical data through the TCG three-dimensional management model; when the medical data has a medical data governance scene problem, determining a data governance type of the medical data; the data management types comprise data quality management, data safety management and data asset management;
when the data management type of the medical data is data quality management, checking the reason for locating the data quality and calling the dynamic blood relationship model; inputting the medical data into the dynamic blood relationship model to obtain a data direct blood relationship; obtaining quality problem classification information according to the data direct blood relationship, displaying the data direct blood relationship and the quality problem classification information, and performing quality problem intervention;
when the data management type of the medical data is data security management, identifying data security risk conditions and calling the dynamic incidence relation model; inputting the medical data into the dynamic incidence relation model to obtain a data incidence relation; acquiring a safety protection scheme according to the data association relation, displaying the data association relation and the safety protection scheme, and performing data safety protection intervention;
when the data governance type of the medical data is data asset governance, identifying the data asset value and calling the dynamic cold-hot relationship model; inputting the medical data into the dynamic cold-hot relationship model to obtain cold-hot relationship data; and acquiring an asset value grade according to the cold and hot relationship data, displaying the cold and hot relationship data and the asset value grade, and marking the data asset value.
According to the medical data management method, the system, the computer equipment and the storage medium based on the metadata model, the metadata model is constructed according to the medical data management scene based on the collected and recovered metadata; the metadata model comprises a TCG (target, classsify, grade) three-dimensional management model, a dynamic blood relationship model, a dynamic incidence relationship model and a dynamic cold-hot relationship model; monitoring medical data in real time, and performing classified management on the medical data through the TCG three-dimensional management model; when the medical data has a medical data governance scene problem, determining a data governance type of the medical data; the data management types comprise data quality management, data safety management and data asset management; when the data management type of the medical data is data quality management, checking the reason for locating the data quality and calling the dynamic blood relationship model; inputting the medical data into the dynamic blood relationship model to obtain a data direct blood relationship; obtaining quality problem classification information according to the data direct blood relationship, displaying the data direct blood relationship and the quality problem classification information, and performing quality problem intervention; when the data management type of the medical data is data security management, identifying data security risk conditions and calling the dynamic incidence relation model; inputting the medical data into the dynamic incidence relation model to obtain a data incidence relation; acquiring a safety protection scheme according to the data association relation, displaying the data association relation and the safety protection scheme, and performing data safety protection intervention; when the data governance type of the medical data is data asset governance, identifying the data asset value and calling the dynamic cold-hot relationship model; inputting the medical data into the dynamic cold-hot relationship model to obtain cold-hot relationship data; and acquiring an asset value grade according to the cold and hot relationship data, displaying the cold and hot relationship data and the asset value grade, and marking the data asset value. The metadata model is constructed, and the medical data are input into different metadata models according to the types of the medical data to obtain the data problems of all the medical data, so that different intervention measures are taken aiming at different data problems, manual data management is not needed, and the effectiveness of the data management is improved.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.
The metadata model-based medical data governance method provided by the embodiment of the application can be applied to the application environment shown in fig. 1. As shown in FIG. 1, the application environment includes acomputer device 110. The computer device 110 may construct a metadata model from the medical data governance scenario based on the collected and recovered metadata; the metadata model comprises a TCG (target, classify and grade) three-dimensional management model, a dynamic blood relationship model, a dynamic incidence relationship model and a dynamic cold-hot relationship model; the computer device 110 can monitor the medical data in real time and perform classified management on the medical data through the TCG three-dimensional management model; when the medical data has a medical data governance scenario problem, the computer device 110 may determine a data governance type of the medical data; the data management types comprise data quality management, data safety management and data asset management; when the data governance type of the medical data is data quality governance, the computer device 110 may investigate the data quality reason and invoke a dynamic blood relationship model; the computer device 110 may input the medical data into the dynamic blood relationship model to obtain a data direct blood relationship; the computer device 110 may obtain quality problem classification information according to the data direct blood relationship, display the data direct blood relationship, the quality problem classification information, and perform quality problem intervention; when the data governance type of the medical data is data security governance, the computer device 110 may identify a data security risk condition and invoke a dynamic association relationship model; the computer device 110 may input the medical data into the dynamic association relationship model to obtain a data association relationship; the computer device 110 may obtain the security protection scheme according to the data association relationship, display the data association relationship, the security protection scheme, and perform data security protection intervention; when the data management type of the medical data is data asset management, identifying the data asset value and calling a dynamic cold-hot relationship model; the computer device 110 may input the medical data into the dynamic cold-hot relationship model to obtain cold-hot relationship data; the computer device 110 may obtain asset value ratings from the cold-hot relationship data, present the cold-hot relationship data, asset value ratings, and perform data asset value tagging. Thecomputer device 110 may be, but is not limited to, various personal computers, notebook computers, smart phones, robots, tablet computers, and other devices.
In one embodiment, as shown in fig. 2, there is provided a method for medical data governance based on a metadata model, comprising the steps of:
step 202, constructing a metadata model according to a medical data governance scene based on the collected and recovered metadata; the metadata model comprises a TCG (target, classify and grade) three-dimensional management model, a dynamic blood relationship model, a dynamic association relationship model and a dynamic cold and hot relationship model.
The computer device may build a model from both the metadata taxonomy management and the metadata analysis based on the metadata of the data warehouse. Specifically, the computer device may construct four core models of metadata from two dimensions of metadata management and metadata analysis based on the collected and recovered source data according to actual scene needs and technical features. That is, the computer device may construct the metadata model from the medical data governance scenario.
The constructed metadata model may include a TCG (target, class, grade) three-dimensional management model, a dynamic blood relationship model, a dynamic association relationship model, and a dynamic cold-hot relationship model. The TCG three-dimensional management model is a process for carrying out detailed classification on specific metadata from the perspective of data management; the dynamic blood relationship model is designed from the perspective of the blood relationship of data dependence, and the upper-level membership and the lower-level dependency relationship of the data and the data are determined, so that the data are convenient to trace to the source and trace back; the dynamic association relation model is used for constructing and designing a relation from the association angle between data, is different from the longitudinal direct relationship blood relationship of the data, and emphasizes the transverse relation among the data, such as left and right collateral relations of 'brother and sister' and 'partner'; the dynamic cold-hot relationship model is designed from the data value perspective, and the data value which is more active is better relatively by combining the thinking of 'number to use'.
Step 204, monitoring medical data in real time, and performing classified management on the medical data through a TCG three-dimensional management model; when the medical data has a medical data treatment scene problem, determining the data treatment type of the medical data; the data governance types comprise data quality governance, data security governance and data asset governance.
The computer equipment can monitor the medical data in real time and carry out classified management through the TCG three-dimensional management model. The computer device may determine a data governance type of the medical data when the medical data has a medical data governance scenario problem or is triggered to generate a governance scenario problem. The data management types can be divided into data quality management, data safety management and data asset management.
Step 206, when the data management type of the medical data is data quality management, checking the data quality reason and calling a dynamic blood relationship model; inputting medical data into a dynamic blood relationship model to obtain a data direct blood relationship; and acquiring quality problem classification information according to the data direct blood relationship, displaying the data direct blood relationship and the quality problem classification information, and performing quality problem intervention.
The computer device can analyze and confirm the data management type of the medical data, the computer device can firstly judge whether the data management type is data quality management or not, if not, the computer device can further judge whether the data management type is data safety management or not, and if not, the computer device can further judge whether the data management type is data asset management or not.
When the data governance type of the medical data is data quality governance, the computer device can investigate and locate the data quality reasons of the medical data and call the dynamic blood relationship model. The computer equipment can input the medical data into the dynamic blood relationship model to obtain a data direct blood relationship corresponding to the medical data, and further obtain corresponding quality problem classification information. The computer equipment can display the data direct blood relationship and the quality problem classification information in a display interface, and a user can intervene in the quality problem according to the data direct blood relationship and the quality problem classification information.
Step 208, when the data management type of the medical data is data security management, identifying the data security risk condition and calling a dynamic incidence relation model; inputting medical data into the dynamic incidence relation model to obtain a data incidence relation; and acquiring a safety protection scheme according to the data association relation, displaying the data association relation and the safety protection scheme, and performing data safety protection intervention.
In this embodiment, when the data administration type of the medical data is data security administration, the computer device may identify a security risk condition of the medical data, and call the dynamic association relationship model, so as to input the medical data into the dynamic association relationship model, thereby obtaining the data association relationship. The computer device can obtain the safety protection scheme corresponding to the data association relation and display the data association relation and the safety protection scheme in the display interface. And the user can perform data security protection intervention according to the data association relation and the security protection scheme.
Step 210, when the data governance type of the medical data is data asset governance, identifying the data asset value and calling a dynamic cold-hot relationship model; inputting the medical data into a dynamic cold-hot relationship model to obtain cold-hot relationship data; and acquiring the asset value grade according to the cold and hot relationship data, displaying the cold and hot relationship data and the asset value grade, and marking the data asset value.
The display interface of the computer device can display the cold and hot relationship data and the asset value registration, and a user can mark the data asset value according to the cold and hot relationship data and the asset value registration.
In this embodiment, as shown in fig. 3, the computer device implements a dynamic blood relationship model, a G dynamic association relationship model, and a dynamic cold-hot relationship model based on a constructed TCG three-dimensional management model of the medical metadata, and performs model materialization processing on a data warehouse level; the method comprises the following steps of (1) checking scenes around data quality reasons such as data inaccuracy, data operation abnormity and data modification influence, calling a dynamic blood relationship model, positioning related data, and determining reasons caused by specific quality by combining specific strategies such as integrity, consistency, normalization, timeliness and accuracy in a quality evaluation strategy; around data security protection scenes such as sensitive data protection, data leakage, medical data compliance and the like, calling a dynamic incidence relation model, dividing a data class set, and combining data security protection strategies such as desensitization, encryption, watermarking and the like to form security evaluation and strategy protection of data with different incidence relations; and calling a dynamic cold-hot relationship model according to asset evaluation scenes such as data cost control, data asset management and the like to form data assets with different activities, and forming final asset grade division and marking by combining an asset evaluation strategy.
In the embodiment, the computer device constructs a metadata model according to the medical data governance scene based on the collected and recovered metadata; the metadata model comprises a TCG (target, classify and grade) three-dimensional management model, a dynamic blood relationship model, a dynamic incidence relationship model and a dynamic cold-hot relationship model; monitoring medical data in real time, and performing classified management on the medical data through a TCG three-dimensional management model; when the medical data has a medical data treatment scene problem, determining the data treatment type of the medical data; the data management types comprise data quality management, data safety management and data asset management; when the data management type of the medical data is data quality management, checking the data quality reason and calling a dynamic blood relationship model; inputting medical data into a dynamic blood relationship model to obtain a data direct blood relationship; acquiring quality problem classification information according to the data direct blood relationship, displaying the data direct blood relationship and the quality problem classification information, and performing quality problem intervention; when the data management type of the medical data is data security management, identifying data security risk conditions and calling a dynamic incidence relation model; inputting medical data into the dynamic incidence relation model to obtain a data incidence relation; acquiring a safety protection scheme according to the data association relation, displaying the data association relation and the safety protection scheme, and performing data safety protection intervention; when the data management type of the medical data is data asset management, identifying the data asset value and calling a dynamic cold-hot relationship model; inputting the medical data into a dynamic cold-hot relationship model to obtain cold-hot relationship data; and acquiring the asset value grade according to the cold and hot relationship data, displaying the cold and hot relationship data and the asset value grade, and marking the data asset value. The metadata model is constructed, and the medical data are input into different metadata models according to the types of the medical data to obtain the data problems of all the medical data, so that different intervention measures are taken aiming at different data problems, manual data management is not needed, and the effectiveness of the data management is improved.
In one embodiment, the provided method for administering medical data based on a metadata model may further include a process of performing classification management on medical data by a TCG three-dimensional management model, where the specific process includes: inputting medical data into a TCG three-dimensional management model, and carrying out classification and grading processing on the medical data through data object dimensions by the TCG three-dimensional management model; the TCG three-dimensional management model carries out grading division on the sensitivity and the influence of the medical data through data grading dimensionality; the TCG three-dimensional management model carries out type division on the medical data from the aspect of data type or data scene application requirement through data classification dimensionality to obtain a classification management result.
As shown in fig. 4, three dimensions in the TCG three-dimensional management model specifically include: data object dimension (T, target), data classification dimension (C), data classification dimension (G, grade).
The data object dimension mainly refers to a specific content type which needs data classification and grading processing, and specifically includes technical metadata, service metadata and management metadata. Further subdivision is possible as required, such as: technical metadata: database, data table, field, etc., service metadata: analyzing models, indexes, etc., managing metadata: personnel, organizations, etc.
The data grading dimension mainly refers to grading the medical data from sensitivity level and influence level, and specifically includes 5 grades as shown in fig. 3: very sensitive L5, general sensitive L4, controlled access L3, partial disclosure L2, full disclosure L1.
The data classification dimension mainly refers to type division of data from the perspective of data type or data scene application needs, and at least comprises six types: personal attributes, health status, medical applications, medical payments, health resources, public health.
For example, three fields of metadata related to the patient in the technical metadata are: the results of the identification number, the patient chief complaints and the diagnosis names after classification management is carried out through the TCG three-dimensional model are as follows:
in one embodiment, the provided metadata model-based medical data governance method may further include a process of data processing by the dynamic blood relationship model, where the process includes: inputting medical data into a dynamic blood relationship model, and dynamically extracting a data model and a data operation joba for a data warehouse through the dynamic blood relationship model to form a data model and an operation set; the dynamic blood relationship model carries out database analysis on the data model and the operation set, identifies data dependency relationships of different sets and forms a data dependency set; the dynamic blood relationship model forms a data direct system blood relationship corresponding to the medical data by utilizing the TCG three-dimensional management model according to the data dependence set and an association algorithm based on an association rule, and quality problem classification information is marked in the data direct system blood relationship; and displaying the data direct relationship and the quality problem classification information, and performing quality problem intervention.
When the data governance type of the medical data is data quality governance, the computer device may invoke the dynamic blood relationship model and input the medical data into the dynamic blood relationship model. As shown in fig. 5, the dynamic blood relationship model dynamically extracts the data model and the data job joba daily for the data warehouse, forming a data model and a data set; the dynamic blood relationship model identifies the data dependency relationship of different data sets by analyzing serial SQL (structured query language) relationships such as job logs, modeling SQL (structured query language) and the like to form a data dependency set; and the dynamic blood relationship model further performs classification combination and relationship series connection by utilizing a TCG three-dimensional management model and an association algorithm based on an association rule according to the analyzed result to form a data direct blood relationship of a 'family tree' type, wherein the data direct blood relationship comprises quality problem classification information.
The computer equipment can form a clear data direct system blood relationship graph by utilizing a visual technical means for the clear data direct system blood relationship and display the graph in a display interface, so that a user can conveniently display and call scenes. The user can intervene in the quality problem according to the displayed data direct relationship.
In one embodiment, the provided medical data governance method based on the metadata model may further include a process of determining a cause of the quality problem, where the specific process includes: positioning medical data with quality problems according to the direct blood relationship and the quality problem classification information; and acquiring a quality evaluation strategy, determining the reason of the quality problem according to the quality evaluation strategy and the medical data with the quality problem, and intervening the quality problem according to the reason of the quality problem.
The medical data with different quality problems can correspond to different quality evaluation strategies, the computer equipment can acquire the corresponding quality evaluation strategies after positioning the medical data with the quality problems, the reasons of the quality problems are further determined, and a user can perform manual intervention on the quality problems of the medical data according to the reasons of the quality problems.
In one embodiment, the provided metadata model-based medical data governance method may further include a process of performing data processing by using a dynamic association relation model, where the specific process includes: inputting medical data into a dynamic incidence relation model, and forming a data dependence set through the dynamic incidence relation model; the dynamic incidence relation model filters transverse incidence type data in the medical data according to the data dependence set, and the transverse incidence type data are intelligently classified through a statistical clustering algorithm and a TCG three-dimensional management model to form an initial data incidence relation; and integrating and removing duplication of the initial data incidence relation by the dynamic incidence relation model to obtain a data incidence relation, acquiring a safety protection scheme corresponding to the data incidence relation, displaying the data incidence relation and the safety protection scheme, and performing data safety protection intervention.
When the data governance type of the medical data is data security governance, the computer device can call the dynamic association relation model and input the medical data into the dynamic association relation model. As shown in fig. 6, the computer device may extract the data model from the data warehouse by daily dynamics to form a set of data models, and then perform SQL analysis by using an analysis tool; the dynamic incidence relation model filters transverse incidence content according to the specific situation of the analyzed data relation, intelligent classification of different types such as derivation relation, homologous relation and the like is carried out through a statistical clustering algorithm and a TCG three-dimensional management model, the classified data form a data incidence relation, and the dynamic incidence relation model can further integrate and duplicate the similar incidence relation; for the processed data association relationship, the computer equipment can form a clear association relationship diagram by using a visual technical means, and performs scene display in a display interface. The different data association relations can correspond to different safety protection schemes, the computer equipment can acquire the safety protection schemes corresponding to the data association relations and display the safety protection schemes in the display interface, and a user can call the data association relations and perform manual intervention on data safety protection.
In one embodiment, the provided medical data governance method based on the metadata model may further include a process of data processing by the dynamic cold-hot relationship model, where the process includes: inputting medical data into a dynamic cold-hot relationship model, and extracting and analyzing a data processing log through the dynamic cold-hot relationship model to obtain the use frequency of the medical data; the dynamic cold-hot relation model is combined with a clustering algorithm to calculate and obtain a cold-hot label of the medical data according to the use frequency; the dynamic cold-hot relationship model combines the TCG three-dimensional management model to cluster and classify the medical data according to the cold-hot degree label to obtain cold-hot relationship data; and acquiring the asset value grade according to the cold and hot relationship data, displaying the cold and hot relationship data and the asset value grade, and marking the data asset value.
When the data governance type of the medical data is data asset governance, the computer device may invoke the dynamic cold-hot relationship model and input the medical data into the dynamic cold-hot relationship model. As shown in fig. 7, the computer device may extract and analyze the contents of the interface log, the data modeling log, and the like, which are called by the computer device, with respect to the data processing log system through the dynamic hot and cold relationship model; the dynamic cold-hot relationship model can be divided into four different grades of ice data, cold data, temperature data and hot data according to the frequency and frequency of data calling or modeling application by combining and utilizing a k-means clustering algorithm, and a cold-hot degree label is marked; the dynamic cold-hot relationship model can be combined with the TCG three-dimensional management model to further classify and collect the medical services of the data of different grades, and the cold-hot relationship data of different service classifications is formed after multiple clustering and classification; the different cold and hot relationship data can correspond to different asset value grades, the computer equipment can acquire the asset value grade corresponding to the cold and hot relationship data, the cold and hot relationship data and the asset value grade are displayed in the display interface, and a user can further mark the data asset value.
As shown in fig. 8, in an embodiment, in the provided metadata model-based medical data management method, in combination with actual services, a computer device finds the requirements of a data management scenario and management problems existing in the specific services, performs classification analysis on the requirements and problems of data management, and finds a suitable metadata model and management scheme. The method specifically comprises the following steps:
1. aiming at the data quality problem, identifying the problem, positioning the problem according to a data blood relationship model and a quality evaluation strategy, and forming an accurate and intelligent intervention scheme;
2. aiming at the data security problem, identifying the current situation, calling a data association relation model and a security protection scheme, determining the security scheme of each type of data, and then carrying out security implementation intervention;
3. aiming at the problem of data assets, the current status of asset value is identified, a data cold-hot relation model and an asset evaluation strategy are called to form assets of different grades, and then the classified marking of the asset value is completed.
It should be understood that, although the steps in the respective flowcharts described above are shown in sequence as indicated by the arrows, the steps are not necessarily performed in sequence as indicated by the arrows. The steps are not performed in the exact order shown and described, and may be performed in other orders, unless explicitly stated otherwise. Moreover, at least a portion of the steps in each of the flowcharts described above may include multiple sub-steps or multiple stages, which are not necessarily performed at the same time, but may be performed at different times, and the order of performing the sub-steps or the stages is not necessarily sequential, but may be performed alternately or alternatingly with other steps or at least a portion of the sub-steps or stages of other steps.
In one embodiment, as shown in fig. 9, there is provided a metadata model-based medical data administration system comprising: amodel construction module 910, a data governancetype determination module 920, a dataquality governance module 930, a datasecurity governance module 940, and a dataasset governance module 950, wherein:
amodel construction module 910, configured to construct a metadata model according to a medical data governance scenario based on the collected and recovered metadata; the metadata model comprises a TCG (target, classify and grade) three-dimensional management model, a dynamic blood relationship model, a dynamic incidence relationship model and a dynamic cold-hot relationship model;
the data managementtype determining module 920 is configured to monitor the medical data in real time and perform classified management on the medical data through the TCG three-dimensional management model; when the medical data has a medical data treatment scene problem, determining the data treatment type of the medical data; the data management types comprise data quality management, data safety management and data asset management;
a dataquality management module 930, configured to, when the data management type of the medical data is data quality management, find out a data quality reason and invoke a dynamic blood relationship model; inputting medical data into a dynamic blood relationship model to obtain a data direct blood relationship; acquiring quality problem classification information according to the data direct blood relationship, displaying the data direct blood relationship and the quality problem classification information, and performing quality problem intervention;
the datasecurity management module 940 is configured to identify a data security risk condition and call a dynamic association relationship model when the data management type of the medical data is data security management; inputting medical data into the dynamic incidence relation model to obtain a data incidence relation; acquiring a safety protection scheme according to the data association relation, displaying the data association relation and the safety protection scheme, and performing data safety protection intervention;
a dataasset management module 950 for identifying the value of the data asset and calling the dynamic cold-hot relationship model when the data management type of the medical data is data asset management; inputting the medical data into a dynamic cold-hot relationship model to obtain cold-hot relationship data; and acquiring the asset value grade according to the cold and hot relationship data, displaying the cold and hot relationship data and the asset value grade, and marking the data asset value.
In one embodiment, the data governancetype determination module 920 is further configured to input the medical data into a TCG three-dimensional management model, where the TCG three-dimensional management model performs classification and hierarchical processing on the medical data through data object dimensions; the TCG three-dimensional management model carries out grading division on the sensitivity and the influence of the medical data through data grading dimensionality; the TCG three-dimensional management model carries out type division on the medical data from the aspect of data type or data scene application requirement through data classification dimensionality to obtain a classification management result.
In one embodiment, the dataquality governance module 930 is further configured to input the medical data into the dynamic blood relationship model, and dynamically extract the data model and the data job jobs for the data warehouse through the dynamic blood relationship model to form a data model and job set; the dynamic blood relationship model carries out database analysis on the data model and the operation set, identifies data dependency relationships of different sets and forms a data dependency set; the dynamic blood relationship model forms a data direct system blood relationship corresponding to the medical data by utilizing the TCG three-dimensional management model according to the data dependence set and an association algorithm based on an association rule, and quality problem classification information is marked in the data direct system blood relationship; and displaying the data direct relationship and the quality problem classification information, and performing quality problem intervention.
In one embodiment, the dataquality improvement module 930 is further configured to locate medical data with quality problems according to the direct blood relationship and the quality problem classification information; and acquiring a quality evaluation strategy, determining the reason of the quality problem according to the quality evaluation strategy and the medical data with the quality problem, and intervening the quality problem according to the reason of the quality problem.
In one embodiment, the datasecurity management module 940 is further configured to input the medical data into the dynamic association relationship model, and form a data dependency set through the dynamic association relationship model; the dynamic incidence relation model filters transverse incidence type data in the medical data according to the data dependence set, and the transverse incidence type data are intelligently classified through a statistical clustering algorithm and a TCG three-dimensional management model to form an initial data incidence relation; and integrating and removing duplication of the initial data incidence relation by the dynamic incidence relation model to obtain a data incidence relation, acquiring a safety protection scheme corresponding to the data incidence relation, displaying the data incidence relation and the safety protection scheme, and performing data safety protection intervention.
In one embodiment, the dataasset management module 950 is further configured to input the medical data into the dynamic cold-hot relationship model, and extract and analyze the data processing log through the dynamic cold-hot relationship model to obtain the usage frequency of the medical data; the dynamic cold-hot relation model is combined with a clustering algorithm to calculate and obtain a cold-hot label of the medical data according to the use frequency; the dynamic cold-hot relationship model combines the TCG three-dimensional management model to cluster and classify the medical data according to the cold-hot degree label to obtain cold-hot relationship data; and acquiring the asset value grade according to the cold and hot relationship data, displaying the cold and hot relationship data and the asset value grade, and marking the data asset value.
In one embodiment, a computer device is provided, which may be a terminal, and its internal structure diagram may be as shown in fig. 10. The computer device includes a processor, a memory, a network interface, a display screen, and an input device connected by a system bus. Wherein the processor of the computer device is configured to provide computing and control capabilities. The memory of the computer device comprises a nonvolatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the operation of an operating system and computer programs in the non-volatile storage medium. The network interface of the computer device is used for communicating with an external terminal through a network connection. The computer program is executed by a processor to implement a method of medical data management based on a metadata model. The display screen of the computer equipment can be a liquid crystal display screen or an electronic ink display screen, and the input device of the computer equipment can be a touch layer covered on the display screen, a key, a track ball or a touch pad arranged on the shell of the computer equipment, an external keyboard, a touch pad or a mouse and the like.
Those skilled in the art will appreciate that the architecture shown in fig. 10 is merely a block diagram of some of the structures associated with the disclosed aspects and is not intended to limit the computing devices to which the disclosed aspects apply, as particular computing devices may include more or less components than those shown, or may combine certain components, or have a different arrangement of components.
In one embodiment, a computer device is provided, comprising a memory and a processor, the memory having a computer program stored therein, the processor implementing the following steps when executing the computer program:
constructing a metadata model according to the medical data governance scene based on the collected and recovered metadata; the metadata model comprises a TCG (target, classify and grade) three-dimensional management model, a dynamic blood relationship model, a dynamic incidence relationship model and a dynamic cold-hot relationship model;
monitoring medical data in real time, and performing classified management on the medical data through a TCG three-dimensional management model; when the medical data has a medical data treatment scene problem, determining the data treatment type of the medical data; the data management types comprise data quality management, data safety management and data asset management;
when the data management type of the medical data is data quality management, checking the data quality reason and calling a dynamic blood relationship model; inputting medical data into a dynamic blood relationship model to obtain a data direct blood relationship; acquiring quality problem classification information according to the data direct blood relationship, displaying the data direct blood relationship and the quality problem classification information, and performing quality problem intervention;
when the data management type of the medical data is data security management, identifying data security risk conditions and calling a dynamic incidence relation model; inputting medical data into the dynamic incidence relation model to obtain a data incidence relation; acquiring a safety protection scheme according to the data association relation, displaying the data association relation and the safety protection scheme, and performing data safety protection intervention;
when the data management type of the medical data is data asset management, identifying the data asset value and calling a dynamic cold-hot relationship model; inputting the medical data into a dynamic cold-hot relationship model to obtain cold-hot relationship data; and acquiring the asset value grade according to the cold and hot relationship data, displaying the cold and hot relationship data and the asset value grade, and marking the data asset value.
In one embodiment, a computer-readable storage medium is provided, having a computer program stored thereon, which when executed by a processor, performs the steps of:
constructing a metadata model according to the medical data governance scene based on the collected and recovered metadata; the metadata model comprises a TCG (target, classify and grade) three-dimensional management model, a dynamic blood relationship model, a dynamic incidence relationship model and a dynamic cold-hot relationship model;
monitoring medical data in real time, and performing classified management on the medical data through a TCG three-dimensional management model; when the medical data has a medical data treatment scene problem, determining the data treatment type of the medical data; the data management types comprise data quality management, data safety management and data asset management;
when the data management type of the medical data is data quality management, checking the data quality reason and calling a dynamic blood relationship model; inputting medical data into a dynamic blood relationship model to obtain a data direct blood relationship; acquiring quality problem classification information according to the data direct blood relationship, displaying the data direct blood relationship and the quality problem classification information, and performing quality problem intervention;
when the data management type of the medical data is data security management, identifying data security risk conditions and calling a dynamic incidence relation model; inputting medical data into the dynamic incidence relation model to obtain a data incidence relation; acquiring a safety protection scheme according to the data association relation, displaying the data association relation and the safety protection scheme, and performing data safety protection intervention;
when the data management type of the medical data is data asset management, identifying the data asset value and calling a dynamic cold-hot relationship model; inputting the medical data into a dynamic cold-hot relationship model to obtain cold-hot relationship data; and acquiring the asset value grade according to the cold and hot relationship data, displaying the cold and hot relationship data and the asset value grade, and marking the data asset value.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by hardware instructions of a computer program, which can be stored in a non-volatile computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. Any reference to memory, storage, database, or other medium used in the embodiments provided herein may include non-volatile and/or volatile memory, among others. Non-volatile memory can include read-only memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), Rambus Direct RAM (RDRAM), direct bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).
The technical features of the above embodiments can be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the above embodiments are not described, but should be considered as the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
The above-mentioned embodiments only express several embodiments of the present application, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the concept of the present application, which falls within the scope of protection of the present application. Therefore, the protection scope of the present patent shall be subject to the appended claims.