CN111506637A


Info

Publication number: CN111506637A
Application number: CN202010551259.0A
Authority: CN
Inventors: 程博; 成逸然; 张文池; 李则言; 隋楷心; 刘大鹏
Original assignee: Beijing Bishi Technology Co ltd
Current assignee: Beijing Bishi Technology Co ltd
Priority date: 2020-06-17
Filing date: 2020-06-17
Publication date: 2020-08-07
Anticipated expiration: 2040-06-17
Also published as: CN111506637B

Abstract

The invention relates to the field of computer technology and discloses a multi-dimensional anomaly detection method, device, and storage medium based on KPIs (Key Performance Indicators). The method comprises the following steps: acquiring transaction data for the P + Q minutes before and after an alarm; filling missing values in the dimension combinations over the P + Q minutes according to the dimension combinations present at the alarm time, and evaluating the data scale; and obtaining the anomaly contributions of all dimension combinations by partial anomaly detection or global anomaly detection, chosen according to the evaluated data scale. Partial anomaly detection detects the anomaly contribution of leaf nodes only, and the anomaly contribution of an upper-layer node is obtained by summing the anomaly contributions of its lower-layer nodes; global anomaly detection detects the anomaly contribution of all dimension combinations. The method is independent of the meaning of the indicators, fully considers the influence of derived measures, can give a unified anomaly score when several indicators are abnormal at the same time, supports more than 10 dimensions, and is practical.

Description

Multi-dimensional anomaly detection method, device, and storage medium based on KPIs (Key Performance Indicators)

Technical Field

The invention relates to the field of computer technology, and in particular to a multi-dimensional anomaly detection method, device, and storage medium based on KPIs (Key Performance Indicators).

Background

KPIs (transaction volume, transaction success rate, web page visits, etc.) and their multi-dimensional attributes (such as source system, transaction type, and transaction channel) are common and important business monitoring indicators in the financial industry. When the overall value of an indicator becomes abnormal, operations engineers want to locate the attribute combination that is the root cause quickly and accurately within a huge multi-dimensional search space, which is a great challenge for traditional operations. Although some algorithms and systems perform this localization with machine learning, they are often neither general nor reliable: they rest on unrealistic assumptions about the root cause and prune too aggressively; or they handle only basic indicators (transaction volume, etc.) and not derived measures (success rate, etc.); and most of them require manual fine-tuning of parameters or are too slow.

At present, the main algorithms (systems) for multi-dimensional analysis of business indicators include Adtributor, iDice, HotSpot, and Squeeze. Most of these methods are derived mainly from theory and remain some distance from practical deployment.

HotSpot and Squeeze assume that the predicted values are accurate before carrying out the subsequent search steps. This is difficult to achieve in practice, and the accuracy of the prediction or anomaly detection directly determines the result of the subsequent root cause analysis.

Adtributor assumes that the root cause is one-dimensional, which does not suit today's complex microservice systems. In addition, Adtributor simply keeps the most succinct result according to the principle of Occam's razor.

iDice targets root cause localization on time series in which the time of the anomaly is not known in advance. This differs from the present scenario and brings extra time cost. iDice also adopts a very aggressive pruning strategy to reduce the search space and uses GLR (Generalized Likelihood Ratio) for anomaly detection. For example, nodes whose support is below a certain threshold are removed directly, so the pruning affects the root cause judgment of upper-layer nodes.

Although Adtributor and Squeeze can perform root cause localization on derived indicators, they cannot rank root causes across indicators.

In practical application scenarios, resource usage is affected by changes in the dimensions, in the number of values, and in the composition of the data. Existing algorithms do not treat data of different orders of magnitude differently, so problems such as memory overflow easily arise when the data volume is too large.

Disclosure of Invention

The invention aims to solve the above problems and provides a multi-dimensional automatic anomaly detection method with appropriate pruning. The technical solution provided by the invention is a multi-dimensional anomaly detection method based on KPI indicators, comprising the following steps: acquiring transaction data for the P + Q minutes before and after an alarm; filling missing values in the dimension combinations over the P + Q minutes according to the dimension combinations present at the alarm time, and evaluating the data scale; and adopting partial anomaly detection or global anomaly detection according to the evaluated data scale. Partial anomaly detection performs anomaly detection on leaf nodes only, and the anomaly contribution of an upper-layer node is obtained by summing the anomaly contributions of its lower-layer nodes; global anomaly detection detects the anomaly contributions of all dimension combinations.

Preferably, the global anomaly detection includes the following steps:

S101, defining the feature types of a single KPI indicator;

S102, extracting the KPI feature values of every point of all dimension combinations of the single indicator over the P + Q minutes as a training set X; each time a feature q is specified, one binary tree is formed by cutting and splitting, and after the specified feature has traversed every feature type, t₁ binary trees are generated, denoted T₁, where P represents a period of time before the alarm and Q represents a period of time after the alarm;

S103, extracting the KPI feature set excluding the current dimension combination, and forming t₂ binary trees by the cutting and splitting of S102, denoted T₂;

S104, calculating the average heights c₁ and c₂ of the child nodes of all dimension combinations under the indicator in T₁ and T₂;

S105, calculating the anomaly contribution of any dimension combination to the single indicator;

S106, when a plurality of indicators are abnormal, repeating S101 to S105 and calculating the anomaly contribution of any dimension combination to the plurality of indicators.

Preferably, the feature types of the single KPI indicator in S101 include at least one of the following: mean, standard deviation, extreme value, occurrence frequency of the current dimension, inverse document frequency of the current dimension, first-order autocorrelation coefficient, linearity, curvature, spectral entropy, standard deviation of residual variation, number of crossing points, difference from the previous point, trend, periodicity, and noise.

Preferably, the cutting and splitting of S102 is carried out as follows:

S1021, extracting the KPI feature values of every point of all dimension combinations of the single indicator over the P + Q minutes to form a training set X;

S1022, randomly drawing k sample points from the training set X to form a subset X^k of X;

S1023, each time randomly specifying a feature q from X^k and randomly generating a cut point p;

S1024, placing sample points whose value of feature q is smaller than p into the left child node, and sample points whose value of feature q is greater than or equal to p into the right child node;

S1025, repeating S1024 at the left and right child nodes, and stopping the splitting when every leaf node holds only one sample point or a specified number of layers is reached, generating one binary tree;

S1026, after the specified feature q has traversed every feature type, generating t₁ binary trees, denoted T₁.
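Steps S1021 to S1026 resemble the construction of an isolation forest, and can be sketched roughly as follows. This is a minimal illustration rather than the patented implementation: the dictionary-based tree layout, the function names, the choice of a random feature at every split, the depth limit of 8, and the synthetic data are all assumptions made here.

```python
import random

def build_tree(points, depth=0, max_depth=8, rng=random):
    """Recursively split sample points with a random feature q and cut point p."""
    if len(points) <= 1 or depth >= max_depth:
        return {"size": len(points)}               # leaf: one point or layer limit
    q = rng.randrange(len(points[0]))               # randomly specified feature q
    lo = min(x[q] for x in points)
    hi = max(x[q] for x in points)
    if lo == hi:                                    # feature q cannot separate the points
        return {"size": len(points)}
    p = rng.uniform(lo, hi)                         # randomly generated cut point p
    left = [x for x in points if x[q] < p]          # feature value smaller than p
    right = [x for x in points if x[q] >= p]        # feature value greater than or equal to p
    return {"q": q, "p": p,
            "left": build_tree(left, depth + 1, max_depth, rng),
            "right": build_tree(right, depth + 1, max_depth, rng)}

def height(tree, x):
    """Number of splits x passes through before it reaches a leaf."""
    h = 0
    while "q" in tree:
        tree = tree["left"] if x[tree["q"]] < tree["p"] else tree["right"]
        h += 1
    return h

def build_forest(X, k, t, seed=0):
    """Build t trees, each from a random subset of k sample points of X."""
    rng = random.Random(seed)
    return [build_tree(rng.sample(X, min(k, len(X))), rng=rng) for _ in range(t)]

def average_height(forest, x):
    """Average height of x over all trees of the forest."""
    return sum(height(tree, x) for tree in forest) / len(forest)

# Synthetic feature vectors: 255 ordinary points and one far-away point.
data_rng = random.Random(42)
X = [(data_rng.gauss(0, 1), data_rng.gauss(0, 1)) for _ in range(255)] + [(10.0, 10.0)]
forest = build_forest(X, k=64, t=100, seed=0)
outlier_h = average_height(forest, (10.0, 10.0))
inlier_h = average_height(forest, (0.0, 0.0))
```

In this sketch a point that lies far from the bulk of the data is separated after fewer splits, so its average height is smaller than that of a typical point. The average heights h₁, h₂, c₁, and c₂ used in the later steps are quantities of this kind.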

Preferably, in S105, the feature vectors of the child nodes of any dimension combination are substituted into T₁ and T₂ respectively, the average heights h₁ and h₂ of the child nodes in T₁ and T₂ are calculated, and, together with the average heights c₁ and c₂ from S104, the anomaly contribution I_a of the dimension combination to the single indicator is defined as:

[Formula not preserved in the source text.]

Preferably, the partial anomaly detection methods include LightGBM and extreme value theory.

Preferably, the input source of the transaction data comprises Elasticsearch, Kafka, or a CSV file in a specified format.

Based on the same inventive concept, the invention further provides a multi-dimensional anomaly detection device based on KPI, comprising:

the warning module is used for warning that the current KPI index is abnormal;

the data reading module is used for reading the transaction data of P + Q minutes before and after warning;

the data preprocessing module is used for filling missing values of the dimension combination of P + Q minutes according to the dimension combination of the alarm occurrence time;

the data evaluation module is used for evaluating the current data scale;

the anomaly detection module comprises a partial anomaly detection module and a global anomaly detection module; wherein, the partial anomaly detection module only carries out partial anomaly detection on the leaf nodes, and the anomaly contribution of the upper-layer node is obtained by adding the anomaly contributions of the lower-layer nodes; the global anomaly detection module detects anomaly contributions of all dimension combinations.

The present invention further provides a computer-readable storage medium for storing a computer program for executing any one of the above methods for multi-dimensional abnormality detection based on KPI indicators.

The invention has the beneficial effects that:

(1) The method supports more than 10 dimensions, and typical analysis results involve more than 3 dimensions. It is a practical method that requires no manual parameter tuning and runs fast.

(2) The invention is an anomaly detection method independent of the meaning of the indicators. It can give a unified anomaly score when several indicators, such as transaction volume, success rate, and response time, are abnormal at the same time.

(3) The invention fully considers the influence of derived measures such as success rate, so the result is more accurate.

Drawings

FIG. 1 is a schematic diagram of the partial anomaly detection method of the present invention;

FIG. 2 and FIG. 3 are schematic diagrams of partial anomaly detection using extreme value theory according to the present invention;

FIG. 4 is a schematic diagram of one tree generated by the global anomaly detection method of the present invention;

FIG. 5 is a clustering diagram of the root cause localization method provided in embodiment 3 of the present invention;

FIG. 6 is an explanatory diagram of the information entropy search rule provided in embodiment 3 of the present invention;

FIG. 7 is a diagram illustrating surprise in the information entropy search rule provided in embodiment 3 of the present invention;

FIG. 8 is a schematic diagram of MCTS pruning provided in embodiment 3 of the present invention;

FIG. 9 is a flowchart of the multi-dimensional anomaly detection method provided in embodiment 3 of the present invention;

FIG. 10 is a flow chart of a multi-dimensional anomaly detection method provided by the present invention;

FIG. 11 is a flowchart of the global anomaly detection steps of the multi-dimensional anomaly detection method provided by the present invention;

FIG. 12 is a flow chart of the cutting and splitting steps of the multi-dimensional anomaly detection method provided by the present invention.

Detailed Description

Specific embodiments of the present invention will be described in more detail below with reference to the accompanying drawings. While specific embodiments of the invention are shown in the drawings, it should be understood that the invention may be embodied in various forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.

The invention provides a multi-dimensional anomaly detection method based on KPIs (Key Performance Indicators). As shown in FIG. 10, the method comprises the following steps: acquiring transaction data for the P + Q minutes before and after an alarm; filling missing values in the dimension combinations over the P + Q minutes according to the dimension combinations present at the alarm time, and evaluating the data scale; and adopting partial anomaly detection or global anomaly detection according to the evaluated data scale. Partial anomaly detection performs anomaly detection on leaf nodes only, and the anomaly contribution of an upper-layer node is obtained by summing the anomaly contributions of its lower-layer nodes; global anomaly detection detects the anomaly contributions of all dimension combinations.
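The summation used by partial anomaly detection can be illustrated with a short sketch. It assumes, for illustration only, that a leaf node is a tuple giving a value for every dimension, that an upper-layer node is the same tuple with None in each unspecified dimension, and that the leaf scores are already available; the function name and the sample values are hypothetical.

```python
from collections import defaultdict
from itertools import combinations

def roll_up(leaf_scores, n_dims):
    """Sum leaf anomaly contributions into every coarser dimension combination.

    leaf_scores maps a full tuple of dimension values to its anomaly contribution.
    A coarser combination carries None in each dimension that is left unspecified.
    """
    totals = defaultdict(float)
    for leaf, score in leaf_scores.items():
        for r in range(n_dims + 1):
            for kept in combinations(range(n_dims), r):
                key = tuple(leaf[i] if i in kept else None for i in range(n_dims))
                totals[key] += score
    return dict(totals)

# Dimensions: (transaction channel, transaction type); the scores are made up.
leaf_scores = {
    ("app", "pay"): 0.5,
    ("app", "refund"): 0.0,
    ("web", "pay"): 0.25,
}
scores = roll_up(leaf_scores, n_dims=2)
```

Here the combination ("app", None) receives the sum of the two "app" leaves, and (None, "pay") receives the sum of the two "pay" leaves, so detection runs only once per leaf.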

In some alternative embodiments, the input source of the transaction data may include, but is not limited to, Elasticsearch, Kafka, or a CSV file in a specified format.

As shown in fig. 11, the global anomaly detection can be implemented by, but not limited to, the following steps:

S101, defining the feature types of a single KPI indicator; the feature types include, but are not limited to, mean, standard deviation, extreme value, occurrence frequency of the current dimension, inverse document frequency of the current dimension, first-order autocorrelation coefficient, linearity, curvature, spectral entropy, standard deviation of residual variation, number of crossing points, difference from the previous point, trend, periodicity, and noise.

S102, extracting the KPI feature value set X of every point of all dimension combinations of the single indicator over the P + Q minutes; each time a feature q is specified, one binary tree is formed by cutting and splitting, and after the specified feature has traversed every feature type, t₁ binary trees are generated, denoted T₁;

As shown in FIG. 12, the cutting and splitting can be implemented as follows:

S1021, extracting the KPI feature values of every point of all dimension combinations of the single indicator over the P + Q minutes to form a training set X;

S1022, randomly drawing k sample points from the training set X to form a subset X^k of X;

S1023, each time randomly specifying a feature q from X^k and randomly generating a cut point p;

S1024, placing sample points whose feature value is smaller than p into the left child node, and sample points whose feature value is greater than or equal to p into the right child node;

S1026, after the specified feature q has traversed every feature type, generating t₁ binary trees, denoted T₁;

S104, calculating the average heights c₁ and c₂ of all dimension combinations under the indicator in T₁ and T₂;

[Formula not preserved in the source text.]

the warning module is used for warning that the current KPI index is abnormal;

the data evaluation module is used for evaluating the current data scale;

Embodiment 1: This embodiment provides a LightGBM anomaly detection method based on KPI indicators.

The method comprises the following steps: acquiring transaction data for the P + Q minutes before and after an alarm; filling missing values in the dimension combinations over the P + Q minutes according to the dimension combinations present at the alarm time, and evaluating the data scale; and adopting partial anomaly detection according to the evaluated data scale. Partial anomaly detection performs anomaly detection on leaf nodes only, and the anomaly contribution of an upper-layer node is obtained by summing the anomaly contributions of its lower-layer nodes; global anomaly detection detects the anomaly contributions of all dimension combinations.

The anomaly detection strategy differs with the scale of the data. When the number of dimensions and features is small, as shown in FIG. 1, partial anomaly detection is used for acceleration: anomaly detection is performed only on the leaf nodes (outer-layer nodes), and the anomaly contribution score of an upper-layer node (inner-layer node) is obtained by summing the anomaly contribution scores of its lower-layer nodes. When the training data is sufficient and relatively stable, machine resources are plentiful, a longer window of historical time series can be referenced, and the algorithm is not required to be fully interpretable, the partial anomaly detection method is LightGBM (Light Gradient Boosting Machine). LightGBM uses a histogram algorithm: continuous floating-point feature values are discretized into m discrete values, and a histogram with m bins is constructed. The training data is traversed once to accumulate statistics for each discrete value in the histogram. During feature selection, the optimal split point is then found by traversing only the m discrete values of the histogram, which is faster, uses less memory, and is better suited to distributed training.
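The histogram idea can be sketched in a few lines. The sketch below is a simplified illustration and not LightGBM's actual implementation: it assumes equal-width bins, accumulates a count and a target sum per bin instead of gradient statistics, and scores a split by a variance-reduction gain; all names and data are hypothetical.

```python
def build_histogram(values, targets, m):
    """Discretize continuous feature values into m bins and accumulate statistics."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / m or 1.0            # fall back to 1.0 if all values are equal
    counts = [0] * m
    sums = [0.0] * m
    for v, y in zip(values, targets):
        b = min(int((v - lo) / width), m - 1)   # the maximum value goes into the last bin
        counts[b] += 1
        sums[b] += y
    return counts, sums

def best_split(counts, sums):
    """Scan the m - 1 bin boundaries instead of every distinct feature value."""
    total_n, total_s = sum(counts), sum(sums)
    best_bin, best_gain = None, 0.0
    left_n, left_s = 0, 0.0
    for b in range(len(counts) - 1):
        left_n += counts[b]
        left_s += sums[b]
        right_n, right_s = total_n - left_n, total_s - left_s
        if left_n == 0 or right_n == 0:
            continue
        gain = left_s ** 2 / left_n + right_s ** 2 / right_n - total_s ** 2 / total_n
        if gain > best_gain:
            best_bin, best_gain = b, gain
    return best_bin, best_gain

values = [1, 2, 3, 4, 11, 12, 13, 14]
targets = [1.0, 1.0, 1.0, 1.0, 5.0, 5.0, 5.0, 5.0]
counts, sums = build_histogram(values, targets, m=4)
split_bin, gain = best_split(counts, sums)
```

With this data the four small values fall into the first bin and the four large values into the last, and the scan places the split after the first bin. The cost of the scan depends on m rather than on the number of training rows, which is where the speed and memory savings come from.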

Embodiment 2: This embodiment provides an extreme value theory anomaly detection method based on KPI indicators.

The anomaly detection strategy differs with the scale of the data. When the number of dimensions and features is small, as shown in FIG. 1, partial anomaly detection is used for acceleration: anomaly detection is performed only on the leaf nodes (outer-layer nodes), and the anomaly contribution score of an upper-layer node (inner-layer node) is obtained by summing the anomaly contribution scores of its lower-layer nodes. For time series without visible regularity, as shown in FIG. 2 and FIG. 3, a threshold-based method works better, and the threshold is computed dynamically using extreme value theory.

Embodiment 3: This embodiment provides a global anomaly detection method based on KPI indicators.

When an alarm occurs in the financial system, the transaction detail data for the P + Q minutes before and after the alarm is read as input data. The data input source may be Elasticsearch, Kafka, or a CSV file in a specified format. Missing values at the other time points are then filled according to the dimension combinations present at the alarm time, and the current data scale is evaluated.
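The filling step can be sketched as follows, under assumptions that the text does not specify: the records are keyed by (minute, dimension combination), the window runs from P minutes before the alarm to Q minutes after it, and a missing point is filled with 0. The function name and the sample records are hypothetical.

```python
def fill_missing(records, alarm_time, start, end, fill_value=0.0):
    """Give every dimension combination seen at the alarm time a value at every minute.

    records maps (minute, dimension_combination) to a KPI value.
    """
    combos = {combo for (minute, combo) in records if minute == alarm_time}
    filled = dict(records)
    for minute in range(start, end + 1):
        for combo in combos:
            filled.setdefault((minute, combo), fill_value)   # keep existing values
    return filled

# Alarm at minute 10, with P = 2 minutes before and Q = 1 minute after (assumed).
records = {
    (10, ("app", "pay")): 5.0,
    (10, ("web", "pay")): 3.0,
    (9, ("app", "pay")): 4.0,
}
filled = fill_missing(records, alarm_time=10, start=10 - 2, end=10 + 1)
```

After filling, both combinations have a value at each of the minutes 8 to 11, so every time series has the same length before features are extracted.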

For multi-indicator data, as the number of dimensions and dimension values grows, the leaf nodes hold less and less data, in extreme cases only 0 or 1, and anomaly detection in such cases is extremely inaccurate. An in-house algorithm based on "influence" is therefore adopted, which allows different KPIs to be compared against one another. The global anomaly detection algorithm is described in detail below:

As shown in FIG. 9, the following features are extracted from the history of the single-indicator data. Only some common features are listed below; for different single indicators, further features such as trend, periodicity, and noise are added to the common KPI features.

TABLE 1 KPI common characteristics

Using a sliding window, the features of every point on the time series of all dimension combinations over the P + Q minutes are extracted from all current detailed data of an indicator and recorded as the training set X. For a given training set X, k sample points are drawn at random to form a subset X^k of X. Each time, a feature q is randomly specified from X^k and a cut point p is randomly generated. The cut point p defines a hyperplane that divides the current data space into two subspaces: sample points whose value in the specified dimension is smaller than p are placed in the left child node, and sample points whose value is greater than or equal to p are placed in the right child node. Splitting stops when every leaf node holds only one sample point or a specified number of layers is reached. This generates t₁ binary trees, denoted T₁. FIG. 4 shows an example of a generated tree.

Then the feature set of all detailed data other than the current dimension combination Y, denoted X − Y, is extracted, and the training steps are repeated to obtain T₂. For a dimension combination that requires anomaly detection, the feature vectors of its child nodes are substituted into T₁ and T₂ respectively, and the average heights h₁ and h₂ of each child node x_i in T₁ and T₂ are calculated. The height is the depth of the node in the tree and may also be called the shortest path. The average heights of all child nodes in T₁ and T₂ are denoted c₁ and c₂; they are obtained as the weighted average of the average heights of each child node or leaf node in T₁ and T₂.

The global influence, or anomaly contribution score, I_a of an anomaly under indicator a on indicator a is defined as:

[Formula not preserved in the source text.]

When an incident occurs and several related indicators are abnormal, the influences on the related indicators are averaged to give the final anomaly contribution score of each dimension combination.

As shown in FIG. 5, the probability density function (PDF) of the anomaly contribution scores is clustered to determine the order of the subsequent search and the selection of root causes. Dimension combinations with different anomaly contributions fall into different clusters, and each solid line represents a cluster center. The clustering method finds all maxima and minima of the anomaly score PDF; the range bounded by the two minima adjacent to each maximum forms one cluster.
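The extremum-based clustering can be sketched as below. The sketch assumes that the PDF has already been estimated and sampled on an evenly spaced grid of scores; how the density is estimated is not stated in the text, and the function name and sample values are hypothetical.

```python
def cluster_by_extrema(density):
    """Split a sampled PDF into clusters bounded by the minima next to each maximum.

    density holds PDF values on an evenly spaced grid of anomaly scores.
    Returns one (start_index, peak_index, end_index) triple per local maximum.
    """
    n = len(density)
    maxima = [i for i in range(n)
              if (i == 0 or density[i] > density[i - 1])
              and (i == n - 1 or density[i] >= density[i + 1])]
    minima = [i for i in range(1, n - 1)
              if density[i] <= density[i - 1] and density[i] < density[i + 1]]
    clusters = []
    for peak in maxima:
        left = max((m for m in minima if m < peak), default=0)        # nearest minimum on the left
        right = min((m for m in minima if m > peak), default=n - 1)   # nearest minimum on the right
        clusters.append((left, peak, right))
    return clusters

density = [0.1, 0.4, 0.2, 0.05, 0.3, 0.6, 0.2]
clusters = cluster_by_extrema(density)
```

For this density the peaks at grid indices 1 and 5 are separated by the minimum at index 3, which gives two clusters. The cluster whose peak lies at the higher score is the one searched first.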

The algorithm searches for root causes in the cluster with the largest cluster center and defines candidate root causes using a calculation modeled on information entropy. When a dimension combination is a root cause, it behaves as follows: its information entropy is significantly larger than that of the other dimension combinations in the same layer, and larger than that of its node in the layer above and of all of its child nodes. This is also part of the pruning: once a dimension combination satisfies the above conditions, the algorithm no longer adds any of its child nodes to the candidate root cause set. The algorithm also considers both explanatory power and surprise, that is, whether the dimension combination can explain the change in the overall KPI and whether that change is "surprising". As shown in FIG. 6, combination 1 has higher explanatory power than combination 2, so combination 1 is more likely to be the root cause; as shown in FIG. 7, combination 2 has higher surprise than combination 1, so combination 2 is more likely to be the root cause. The above process is repeated to find all candidate root cause sets.

Volcano's pruning uses an improved MCTS (Monte Carlo Tree Search) as its main framework and runs several pruning strategies in parallel.

Pre-pruning: the anomaly scores calculated by the anomaly detection algorithm built into Volcano are all additive, so a node whose anomaly score equals 0 cannot be the root cause. Pre-pruning the search tree in this way generally removes more than 50% of the nodes.

Cluster pruning: the clustering algorithm clusters by the maxima and minima of the PDF of the node anomaly scores, and each cluster is searched independently. Volcano can be configured with the number of clusters to search and an upper limit on the number of root causes per cluster, according to user requirements, to achieve pruning.

MCTS pruning: the search is simulated by sampling, the "reward" of each node is then updated by backpropagation, and the node with the largest reward is selected for further search until the root cause is found. As shown in FIG. 8, the dark dots represent nodes that have already been searched, and the light dots are the candidate nodes for the next search.

Two parameters, N and Q, are defined for each node. N is the number of times the node has been visited in simulation, and Q is the sum of the simulated rewards of the node, where the calculated anomaly detection score is used as the simulated reward. Finally, the UCT(v_i, v) value (UCT: Upper Confidence Bound applied to Trees) of each candidate node is calculated, the node with the largest UCT(v_i, v) value is selected as the next search path, and the other nodes are pruned. UCT(v_i, v) is calculated as follows:

[Formula not preserved in the source text.]
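For reference, a UCT score is commonly written in the UCB1 form Q(v_i)/N(v_i) + c·sqrt(ln N(v) / N(v_i)), where v is the parent of v_i. The sketch below assumes this standard form; the formula used by the improved MCTS described here is not reproduced in this text and may differ. The exploration constant c and the sample statistics are hypothetical.

```python
import math

def uct(q_child, n_child, n_parent, c=math.sqrt(2)):
    """Standard UCB1-style score: average reward plus an exploration bonus."""
    if n_child == 0:
        return math.inf                      # unvisited nodes are tried first
    return q_child / n_child + c * math.sqrt(math.log(n_parent) / n_child)

def select_child(children, n_parent, c=math.sqrt(2)):
    """children maps a node name to (Q, N); return the name with the largest UCT."""
    return max(children,
               key=lambda name: uct(children[name][0], children[name][1], n_parent, c))

# Q is the sum of simulated rewards and N the visit count of each candidate node.
children = {"a": (9.0, 10), "b": (1.0, 2)}
chosen = select_child(children, n_parent=12)
greedy = select_child(children, n_parent=12, c=0.0)
```

With the exploration term enabled, the rarely visited node "b" is selected even though "a" has the higher average reward; with c = 0 the choice falls back to the highest average reward, "a".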

Post-pruning: after a candidate root cause has been found, its child nodes are pruned and are no longer treated as root causes. To handle situations that arise in practice, some special optimizations are also applied to post-pruning. For example, if the value of the current node is null, the search continues downward; if the current node has only one direct child (a 1-to-1 relationship), the search also continues downward.

After all candidate root cause sets are found, the similarity of the distributions of different dimension combinations is measured, with the measure chosen according to the KPI type. JS divergence is mainly used for additive KPIs (transaction volume, failure count, response time, etc.); for non-additive KPIs (success rate, response rate, etc.), similarity is measured with the Wasserstein distance. The purpose is to merge similar dimension combinations and simplify the result.
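Both measures are standard and can be sketched in plain Python. The sketch assumes discrete distributions over the same bins for the JS divergence and two equally sized one-dimensional samples for the Wasserstein distance; the text does not say how the distributions are built, and the sample values are hypothetical.

```python
import math

def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two discrete distributions."""
    def kl(a, b):
        return sum(x * math.log2(x / y) for x, y in zip(a, b) if x > 0)
    m = [(x + y) / 2 for x, y in zip(p, q)]      # mixture distribution
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

def wasserstein_1d(u, v):
    """1-Wasserstein distance between two equally sized empirical samples."""
    if len(u) != len(v):
        raise ValueError("this sketch assumes equal sample sizes")
    return sum(abs(a - b) for a, b in zip(sorted(u), sorted(v))) / len(u)

same = js_divergence([0.5, 0.5], [0.5, 0.5])
disjoint = js_divergence([1.0, 0.0], [0.0, 1.0])
rate_gap = wasserstein_1d([0.90, 0.95, 0.99], [0.80, 0.85, 0.89])
```

Identical distributions give a JS divergence of 0 and fully disjoint ones give 1 (in bits), so a small value suggests that two dimension combinations behave alike and can be merged. The Wasserstein distance is expressed in the units of the KPI itself, here a gap of about 0.1 in success rate.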

In a more preferred embodiment, a computer-readable storage medium is provided for storing a computer program for performing any of the above-described anomaly detection methods.

Based on the analysis of a large amount of financial data, this embodiment adopts a global anomaly detection strategy, which differs from most existing algorithms and devices. In the financial industry, multiple indicators usually become abnormal at the same time, so this embodiment is an anomaly detection method independent of the meaning of the indicators. For the search, a scalable set of search schemes is used that switches flexibly between time efficiency and space efficiency to adapt to data of different sizes, and MCTS is introduced to prune and speed up the search. Unlike earlier "top-down" search modes, Volcano performs "bottom-up" clustering before searching, which makes the root cause search more effective and also serves as a pruning means that reduces the search space. Finally, Volcano tests the similarity of the results, can merge results within an indicator, and can resolve cases where the results of multiple indicators contain one another.

In a preferred embodiment, there is provided a KPI indicator-based multi-dimensional anomaly detection apparatus, comprising:

the warning module is used for warning that the current KPI index is abnormal;

the data reading module is used for reading the transaction data for the P + Q minutes before and after the alarm, and the input source of the transaction data may include, but is not limited to, Elasticsearch, Kafka, or a CSV file in a specified format;

the data evaluation module is used for evaluating the current data scale;

Wherein, the global anomaly detection module comprises:

a definition submodule, which defines the feature types of a single KPI indicator;

a first extraction submodule, which extracts the KPI feature value set of every point of all dimension combinations of the single indicator over the P + Q minutes; each time a feature q is specified, one binary tree is formed by cutting and splitting, and after the specified feature has traversed every feature type, t₁ binary trees are generated, denoted T₁;

a second extraction submodule, which extracts the KPI feature set excluding the current dimension combination and forms t₂ binary trees by the cutting and splitting of the first extraction submodule, denoted T₂;

a first calculation submodule, which calculates the average heights c₁ and c₂ of all dimension combinations under the indicator in T₁ and T₂;

a second calculation submodule, which calculates the anomaly contribution of any dimension combination to the single indicator;

and a submodule which, when a plurality of indicators are abnormal, repeats the above submodules and calculates the anomaly contribution of any dimension combination to the plurality of indicators.

In some optional embodiments, the feature types of the single KPI indicator defined in the definition submodule include at least one of the following: mean, standard deviation, extreme value, occurrence frequency of the current dimension, inverse document frequency of the current dimension, first-order autocorrelation coefficient, linearity, curvature, spectral entropy, standard deviation of residual variation, number of crossing points, difference from the previous point, trend, periodicity, and noise.

In some optional embodiments, the first extraction sub-module comprises:

a unit that extracts the KPI feature values of every point of all dimension combinations of the single indicator over the P + Q minutes to form a training set X;

a unit that randomly draws k sample points from the training set X to form a subset X^k of X;

a cut point generating unit, which each time randomly specifies a feature q from X^k and randomly generates a cut point p;

a feature processing unit, which places sample points whose value of the specified feature is smaller than p into the left child node, and sample points whose value is greater than or equal to p into the right child node;

a unit that repeats the feature processing unit at the left and right child nodes and stops the splitting when every leaf node holds only one sample point or a specified number of layers is reached, generating one binary tree;

a unit that generates t₁ binary trees, denoted T₁, after the specified feature has traversed every feature type.

In some optional embodiments, the second calculation submodule substitutes the feature vectors of the child nodes of any dimension combination into T₁ and T₂ respectively, calculates the average heights h₁ and h₂ of the child nodes in T₁ and T₂, and, together with the average heights c₁ and c₂ from S104, defines the anomaly contribution I_a of the dimension combination to the single indicator as:

[Formula not preserved in the source text.]

The embodiment of the invention provides a multi-dimensional anomaly detection method and device based on KPIs (Key Performance Indicators). It supports more than 10 dimensions, typical analysis results involve more than 3 dimensions, and it is a fully practical, production-verified method. It is independent of the meaning of the indicators and can give a unified anomaly score when several indicators, such as transaction volume, success rate, and response time, are abnormal at the same time. It fully considers the influence of derived measures such as success rate, so the result is more accurate.

Based on the analysis of a large amount of financial data, this embodiment adopts a global anomaly detection strategy, which differs from most existing algorithms and devices. In the financial industry, multiple indicators usually become abnormal at the same time, so this embodiment is an anomaly detection method independent of the meaning of the indicators.

The above-mentioned embodiments are intended to illustrate the objects, technical solutions and advantages of the present invention in further detail, and it should be understood that although the present specification describes the embodiments, the above-mentioned embodiments are exemplary and not intended to limit the scope of the present invention, and any changes, modifications, substitutions and alterations made by those skilled in the art without departing from the principle and spirit of the present invention shall be included in the scope of the present invention.

Claims

1. A multi-dimensional anomaly detection method based on KPI indexes comprises the following steps:

acquiring transaction data for the P + Q minutes before and after the alarm;

filling missing values in the dimension combination of P + Q minutes according to the dimension combination of the alarm occurrence time and evaluating the data scale;

obtaining abnormal contributions of all dimension combinations by adopting partial abnormal detection or global abnormal detection according to the data scale;

wherein the partial anomaly detection detects only the abnormal contributions of the leaf nodes, and the abnormal contribution of an upper node is obtained by adding the abnormal contributions of its lower nodes; and the global anomaly detection detects the abnormal contributions of all dimension combinations.
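The roll-up used by partial anomaly detection, where an upper node's contribution is the sum of its lower nodes' contributions, can be illustrated with a short sketch. The function name `roll_up`, the use of `"*"` to mean "any value of this dimension", and the example dimensions (channel, province) are hypothetical; how the leaf contributions themselves are scored is outside this sketch.

```python
from collections import defaultdict
from itertools import combinations


def roll_up(leaf_scores):
    """Sum leaf contributions into every upper-level dimension combination.

    `leaf_scores` maps a full dimension combination (a tuple with one value
    per dimension) to its abnormal contribution. In the result, "*" marks a
    dimension that has been aggregated away.
    """
    totals = defaultdict(float)
    for leaf, score in leaf_scores.items():
        n = len(leaf)
        # Each leaf contributes to every combination that keeps a subset of its dimensions.
        for r in range(n + 1):
            for kept in combinations(range(n), r):
                key = tuple(leaf[i] if i in kept else "*" for i in range(n))
                totals[key] += score
    return dict(totals)


# Hypothetical leaf contributions for (channel, province).
scores = roll_up({("app", "BJ"): 0.5, ("app", "SH"): 0.25, ("web", "BJ"): 0.25})
```

Only the leaves need to be scored by the detector here; every upper node follows from addition, which is what keeps the partial strategy cheap compared with scoring all dimension combinations directly.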

2. A KPI indicator-based multi-dimensional anomaly detection method according to claim 1, characterized in that: the global anomaly detection comprises the following steps:

S101, defining the feature types of the KPI single index;

S102, extracting a training set X of KPI feature values for each point of all dimension combinations of the single index over the P + Q minutes; after each feature value q is specified, forming one binary tree by cutting and splitting; and, after the specified feature value has traversed each feature type, generating t₁ binary trees, recorded as T₁;

S104, calculating the average heights c₁ and c₂, in T₁ and T₂, of the child nodes of all dimension combinations X under the index;

3. A KPI indicator-based multi-dimensional anomaly detection method according to claim 2, characterized in that: the feature type of the KPI single index in S101 comprises at least one of the following features: mean, standard deviation, limit value, current dimension occurrence frequency, current dimension inverse text frequency index, first-order autocorrelation coefficient, linear intensity, curvature intensity, spectral entropy, residual variation standard deviation, number of intersection points, difference value with a front point, trend, periodicity and disorder.
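A few of the listed feature types can be computed from a single KPI time series as in the sketch below. It is illustrative only: the function name `basic_features`, the dictionary keys, the choice of population standard deviation, and the reading of "limit value" as the series minimum and maximum are assumptions, and the remaining feature types (spectral entropy, trend, periodicity and so on) are not covered.

```python
from statistics import fmean, pstdev


def basic_features(series):
    """Compute a handful of per-series features; needs at least two points."""
    mean = fmean(series)
    # First-order autocorrelation: lag-1 covariance over total variance.
    num = sum((a - mean) * (b - mean) for a, b in zip(series, series[1:]))
    den = sum((x - mean) ** 2 for x in series)
    return {
        "mean": mean,
        "std": pstdev(series),                # population standard deviation (assumed)
        "min": min(series),                   # "limit value" read as min/max (assumed)
        "max": max(series),
        "acf1": num / den if den else 0.0,    # constant series: defined as 0.0 here
        "diff_prev": series[-1] - series[-2], # difference from the previous point
    }


features = basic_features([1.0, 2.0, 3.0, 4.0])
```

Each dictionary of this kind would form one feature vector in the training set X, with one tree-building pass per key.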

4. A KPI indicator-based multi-dimensional anomaly detection method according to claim 2, characterized in that: the specific cutting and splitting in S102 is as follows:

S1024, placing sample points whose feature value is smaller than p into the left child node, and sample points whose feature value is greater than or equal to p into the right child node;

5. A KPI indicator-based multi-dimensional anomaly detection method according to claim 2, characterized in that: in S105, the feature vectors of the child nodes of the dimension combination are respectively substituted into T₁ and T₂, the average heights h₁ and h₂ of the child nodes in T₁ and T₂ are calculated, and, combined with the average heights c₁ and c₂ from S104, the abnormal contribution of any dimension combination to the single index is defined as I_a:

。

6. The method as claimed in claim 1, wherein the partial anomaly detection method comprises LightGBM and extreme value theory.

7. A KPI indicator-based multi-dimensional anomaly detection method according to claim 1, characterized in that: the input source of the transaction data includes Elasticsearch, Kafka, or a CSV file in a specified format.

8. A multi-dimensional abnormality detection apparatus based on KPI indicators, comprising:

the warning module is used for warning that the current KPI index is abnormal;

the data evaluation module is used for evaluating the current data scale;

9. A computer-readable storage medium for storing a computer program for executing the KPI indicator-based multi-dimensional abnormality detection method according to any one of claims 1 to 7.