CN112347473A - Machine learning security aggregation prediction method and system supporting bidirectional privacy protection - Google Patents

Machine learning security aggregation prediction method and system supporting bidirectional privacy protection

Info

Publication number
CN112347473A
CN112347473A (application CN202011230255.9A; granted publication CN112347473B)
Authority
CN
China
Prior art keywords
share
prediction result
server
aggregation
blinded
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202011230255.9A
Other languages
Chinese (zh)
Other versions
CN112347473B (en)
Inventor
赵川
赵埼
荆山
张波
陈贞翔
贾忠田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Liang'an Technology Co ltd
Original Assignee
University of Jinan
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Jinan
Priority to CN202011230255.9A
Publication of CN112347473A
Application granted
Publication of CN112347473B
Status: Active
Anticipated expiration

Abstract

The application discloses a machine learning security aggregation prediction method and system supporting bidirectional privacy protection. The system comprises a client, computation servers, and an aggregation server. The method comprises the following steps: a computation server receives data shares of the data to be predicted sent by the client; the computation server processes the data shares to obtain prediction result shares; the computation server blinds the prediction result shares to obtain blinded prediction result shares; the computation server sends the blinded prediction result shares to the aggregation server; and the aggregation server performs de-blinding and noise addition on the blinded prediction result shares and feeds the result back to the client.

Description

Machine learning security aggregation prediction method and system supporting bidirectional privacy protection
Technical Field
The application relates to the technical field of machine learning, in particular to a machine learning security aggregation prediction method and system supporting bidirectional privacy protection.
Background
The statements in this section merely provide background information related to the present disclosure and may not constitute prior art.
Driven by technologies such as big data and machine learning, artificial intelligence has changed the way people live, for example through face and speech recognition, recommendation systems, and self-driving cars. But with the misuse of private information, leakage incidents have become frequent. The performance of machine learning and deep learning algorithms relies on a large amount of training data collected in advance, which may involve sensitive user information such as medical records and credit records. A large body of research shows that machine learning models are highly vulnerable to malicious attacks: since a model implicitly encodes information about its training data, an attacker can recover private information about that data by analyzing the model. For example, Tramèr et al. attacked online machine learning prediction services (MLaaS) such as Amazon and BigML through the query prediction API and successfully extracted a machine learning model approximating the original. Fredrikson et al. recovered original training data by analyzing the probability information output by the classifier, and the membership inference attack designed by Shokri et al. trains a number of shadow models to determine whether a given record appears in the training set. Once model parameters or training data are leaked, serious security threats and losses can be caused to enterprises and individuals.
With the disclosure of various privacy threats in machine learning, a great deal of research has been devoted to privacy protection in machine learning. For example, Papernot et al. proposed a privacy-preserving machine learning framework, Private Aggregation of Teacher Ensembles (PATE), a "teacher-student" semi-supervised transfer model. PATE is based on the idea that if multiple independent models trained on disjoint datasets agree to a high degree on the output for the same input, no private training data is revealed. The framework partitions the private dataset and trains a number of independent teacher models on the private subsets, then transfers knowledge to a student model through an aggregation mechanism satisfying differential privacy, i.e., the teacher models label public data for the student; the teacher models can be regarded as machine learning as a service. The adversary can only access the student model trained on public data, so the security of the private training data is protected. Intuitively, PATE provides a strong privacy guarantee and flexible scalability, but the framework also has certain limitations.
First, regarding privacy, PATE aggregates the predictions of multiple teachers through a trusted aggregator; however, a fully trusted entity does not exist in reality, and if the aggregator is malicious or semi-honest, the prediction results can be directly leaked. Second, when the student model has no public data, or the data held by the student is itself private, the privacy of the student's data cannot be guaranteed. Consider a hospital that wishes to train a machine learning model to help infer patient conditions and to have other hospitals (teachers) label its (the student's) dataset; since patient data cannot be directly disclosed to the other hospitals (teachers), the PATE framework provides no effective privacy guarantee. Moreover, if the adversary corrupts the student, it can attack the teacher models in reverse through their prediction results (membership inference attack), so the privacy of the teacher models and their training data cannot be guaranteed either. These problems constitute bidirectional privacy leakage. Regarding performance, since the PATE framework provides its privacy guarantee through differential privacy, the amount of data that can be predicted is limited in order to control the privacy cost. Furthermore, the PATE framework can only be deployed locally, i.e., the teacher models can only provide predictions locally, which requires the teachers to remain online at prediction time.
Disclosure of Invention
In order to solve the defects of the prior art, the application provides a machine learning security aggregation prediction method and system supporting bidirectional privacy protection;
in a first aspect, the application provides a machine learning security aggregation prediction method supporting bidirectional privacy protection;
the machine learning security aggregation prediction method supporting bidirectional privacy protection comprises the following steps:
the method comprises the steps that a calculation server receives data share of data to be predicted, wherein the data share is sent by a client;
the calculation server processes the data share to obtain a prediction result share;
the calculation server carries out blind processing on the prediction result share to obtain a blind prediction result share;
the computing server sends the blinded prediction result share to an aggregation server;
and the aggregation server performs blind removing processing and noise adding processing on the blind prediction result share, and feeds back the result to the client.
In a second aspect, the present application provides a machine learning security aggregation prediction system that supports bi-directional privacy protection;
machine learning security aggregation prediction system supporting bi-directional privacy protection, comprising: the system comprises a client, a computing server and an aggregation server;
the method comprises the steps that a calculation server receives data share of data to be predicted, wherein the data share is sent by a client; the calculation server processes the data share to obtain a prediction result share; the calculation server carries out blind processing on the prediction result share to obtain a blind prediction result share; the computing server sends the blinded prediction result share to an aggregation server; and the aggregation server performs blind removing processing and noise adding processing on the blind prediction result share, and feeds back the result to the client.
Compared with the prior art, the beneficial effects of this application are:
1. A security framework is presented that provides bidirectional privacy protection: it protects both the private training models (teacher models) and the private inputs (student inputs). For the model provider, the servers cannot obtain the complete model parameters, and a user cannot attack the model or the original training data through the prediction result; for the user, the private input cannot be obtained by the model holder or the servers.
2. The high privacy cost caused by adding differential privacy for protection in the traditional method is avoided. By computing the information entropy contained in the prediction vector, the framework dynamically adds noise to the prediction vector according to the entropy value; this effectively resists membership inference attacks without limiting the amount of predictable data.
3. By combining the SGX technology, the framework ensures that valuable information cannot be obtained even if a certain server is corrupted by a malicious adversary in the calculation process, and meanwhile, the prediction output (teacher prediction) in the calculation process is protected.
4. The flexibility of the PATE framework is increased: the servers receive and store the model shares in the offline stage, and the model holders (teachers) do not need to participate in the online prediction process.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this application, illustrate embodiments of the application and, together with the description, serve to explain the application and are not intended to limit the application.
FIG. 1 is a flow chart of an off-line phase method of the first embodiment;
FIG. 2 is a flow chart of the online prediction calculation of the first embodiment;
fig. 3 is a diagram of the dependencies among the SecureNN base protocols of the first embodiment;
fig. 4 is a flowchart of the prediction result optimization according to the first embodiment.
Detailed Description
It should be noted that the following detailed description is exemplary and is intended to provide further explanation of the disclosure. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs.
It is noted that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of example embodiments according to the present application. As used herein, the singular forms "a", "an", and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise, and it should be understood that the terms "comprises" and "comprising", and any variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
The embodiments and features of the embodiments in the present application may be combined with each other without conflict.
Interpretation of terms:
SecureNN: SecureNN is a three-party secure computation protocol proposed by Wagh et al. in 2018 that supports the training and prediction of neural networks. The protocol is mainly based on secret sharing; compared with prior protocols that are only secure against semi-honest adversaries, SecureNN guarantees that the input and output of an honest client cannot be learned even if any single server is corrupted by a malicious adversary. The protocol involves three servers: S_0 and S_1 hold 2-out-of-2 secret shares of the inputs at the beginning of protocol execution and 2-out-of-2 secret shares of the outputs at the end of the computation, while S_2 assists the other two servers in protocol execution. For nonlinear activation functions, besides fitting them with linear polynomials, SecureNN computes the ReLU function indirectly by first computing its derivative, which reduces the computation error caused by linear fitting. Fig. 3 shows the dependencies among the SecureNN base protocols. The following are some of the primitives of the secret sharing technique.
Shared value: for a shared value <a> we have <a>_0 + <a>_1 ≡ a (mod F), where <a>_0, <a>_1, a ∈ F and F is a finite field.
Sharing Share_i(a): S_i selects a value r ∈ F, sets <a>_i = a − r, and sends r to S_{1−i}; at S_{1−i}, <a>_{1−i} = r.
Reconstruction Rec_i(a): S_{1−i} sends its share <a>_{1−i} to S_i, and S_i computes a = <a>_0 + <a>_1.
Addition <c> = <a> + <b>: S_i can directly compute <c>_i = <a>_i + <b>_i locally.
Multiplication <c> = <a>·<b>: multiplication relies on a pre-distributed multiplication triple <u>_i, <v>_i, <z>_i with z = u·v mod F. S_i first computes <e>_i = <a>_i − <u>_i and <f>_i = <b>_i − <v>_i; then the two parties reconstruct e = Rec(e) and f = Rec(f), and each locally sets <c>_i = −i·e·f + f·<a>_i + e·<b>_i + <z>_i.
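The primitives above can be sketched in Python. This is a minimal illustrative sketch, not the SecureNN implementation: the multiplication triple is sampled locally by a simulated dealer (in SecureNN it would be supplied by S_2), and all values live in the ring Z_L with L = 2^64 as used elsewhere in this document.

```python
import random

L = 1 << 64  # ring Z_L with L = 2^64

def share(a):
    """Share(a): split a into two additive shares with <a>_0 + <a>_1 = a (mod L)."""
    r = random.randrange(L)
    return ((a - r) % L, r)

def rec(a0, a1):
    """Rec(a): reconstruct the secret from both shares."""
    return (a0 + a1) % L

def add_shares(x, y):
    """<c> = <a> + <b>: each server adds its own shares locally."""
    return ((x[0] + y[0]) % L, (x[1] + y[1]) % L)

def mul_shares(x, y):
    """<c> = <a>.<b> via a Beaver multiplication triple (u, v, z) with z = u*v.

    The triple is generated locally here for illustration only; in SecureNN
    the assisting server S_2 distributes its shares.
    """
    u, v = random.randrange(L), random.randrange(L)
    u_sh, v_sh, z_sh = share(u), share(v), share((u * v) % L)
    # Each server reveals its share of e = a - u and f = b - v; revealing e, f
    # leaks nothing because u and v are uniformly random.
    e = rec((x[0] - u_sh[0]) % L, (x[1] - u_sh[1]) % L)
    f = rec((y[0] - v_sh[0]) % L, (y[1] - v_sh[1]) % L)
    # <c>_i = -i*e*f + f*<a>_i + e*<b>_i + <z>_i
    return tuple((-i * e * f + f * x[i] + e * y[i] + z_sh[i]) % L
                 for i in (0, 1))
```

For example, `rec(*mul_shares(share(3), share(5)))` recovers 15 while neither simulated party ever sees 3 or 5 in the clear.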
Intel SGX: the Intel software protection extension is a set of new instructions and memory access mechanisms added to the Intel architecture. These extensions allow an application to instantiate a safe zone, called Enclave. The operations can be executed in a safe environment, and confidentiality and integrity protection can be provided even if a privileged system or a malicious program exists, so that the codes and data in the operations are prevented from being maliciously tampered and acquired. Enclave code and data are optionally examined and analyzed before creating the Enclave. Once the code and data of the application program are loaded into an Enclave, all external software access to it is protected and any attempt to access and modify the contents of the Enclave is prohibited. The SGX provides two authentication mechanisms, local authentication and remote authentication, to ensure that an authenticated application can safely run in a trusted environment.
Example one
The embodiment provides a machine learning security aggregation prediction method supporting bidirectional privacy protection;
the machine learning security aggregation prediction method supporting bidirectional privacy protection comprises the following steps:
s101: the method comprises the steps that a calculation server receives data share of data to be predicted, wherein the data share is sent by a client;
s102: the calculation server processes the data share to obtain a prediction result share;
s103: the calculation server carries out blind processing on the prediction result share to obtain a blind prediction result share;
s104: the computing server sends the blinded prediction result share to an aggregation server;
s105: and the aggregation server performs blind removing processing and noise adding processing on the blind prediction result share, and feeds back the result to the client.
As one or more embodiments, before the step of the method S101, the method further includes:
s1001: dividing the locally trained machine learning model into a plurality of model shares by a model holder; sending the model share to a corresponding calculation server;
s1002: and the aggregation server randomly generates a blind matrix in the credible region and sends the blind matrix to the corresponding calculation server.
Further, before the step S1001, the method further includes:
s1000: and the aggregation server creates an Enclave trusted zone, and the model holder and the computing server perform remote authentication to ensure that the computing server operates in a safe SGX environment.
Further, the S1001: the model holder divides the locally trained model into two model shares; the method refers to a model holder, and a locally trained model is divided into a plurality of model shares by adopting secret sharing.
Illustratively, the S1001: the model holder divides the locally trained model into two model shares; the specific implementation mode is as follows:
P_i divides its locally trained model W_i into two model shares using secret sharing: P_i randomly selects r ∈ Z_L, where Z_L is the ring of integers modulo L and L = 2^64, computes share_0(W_i) = W_i − r (mod L) and share_1(W_i) = r, and sends the model shares to the two computation servers S_0, S_1. The computation servers never see the original model and can only obtain model shares.
Illustratively, the S1002: the aggregation server randomly generates a blinded matrix in the credible region and sends the blinded matrix to the corresponding calculation server; the specific implementation mode is as follows:
the aggregation server randomly generates blinding matrices mask_0 and mask_1 in the Enclave and sends the blinding matrices to the computation servers through a secure channel.
The blinding matrix protects the share of the prediction result after the calculation server completes the prediction calculation, and avoids being attacked in an untrusted area of the aggregation server.
It should be understood that the blinding matrix is a random matrix used to protect the prediction shares. The method is generated in credible Enclave and then sent to two computing servers through a secure channel, and after model prediction is completed by the computing servers, the predicted result share is blinded.
Consider that without the blinding matrices, the two servers would, after completing the prediction, send share_0(Y_i) and share_1(Y_i) directly to the aggregation server.
The untrusted aggregation server could then directly reconstruct the prediction result Y_i = share_0(Y_i) + share_1(Y_i), directly revealing the privacy of the user's prediction result; on the other hand, an adversary could also use the prediction result to attack the training model indirectly, for example via a membership inference attack.
With the protection of the blinding matrices, the untrusted server only receives blinded prediction shares, and reconstruction yields only the blinded prediction result Y_mask = share_0(Y_i) + mask_0 + share_1(Y_i) + mask_1 = Y_i + mask; removal of the blinding matrix can only be done inside the Enclave, so the prediction result is not revealed.
The generation mode of the blind matrix is as follows: randomly sampling a random matrix from the uniform distribution, wherein the data type of the matrix needs to be consistent with the data type of the predicted secret share.
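The blinding mechanism just described can be sketched as follows. This is an illustration, not the patent's implementation: the masks are sampled uniformly from the same ring Z_L as the prediction shares, each computation server adds its mask locally, and only the Enclave, which holds both masks, can de-blind.

```python
import random

L = 1 << 64  # same ring Z_L as the secret shares

def gen_masks(rows, cols):
    """Enclave side: sample one uniform blinding matrix per computation server."""
    mask0 = [[random.randrange(L) for _ in range(cols)] for _ in range(rows)]
    mask1 = [[random.randrange(L) for _ in range(cols)] for _ in range(rows)]
    return mask0, mask1

def blind(share_y, mask):
    """Computation server: add its blinding matrix to its prediction share."""
    return [[(s + m) % L for s, m in zip(sr, mr)] for sr, mr in zip(share_y, mask)]

def reconstruct_blinded(b0, b1):
    """Untrusted zone: reconstruction yields only Y_mask = Y + mask0 + mask1."""
    return [[(x + y) % L for x, y in zip(r0, r1)] for r0, r1 in zip(b0, b1)]

def unblind(y_mask, mask0, mask1):
    """Trusted Enclave: remove both masks to recover the true prediction Y."""
    return [[(v - m0 - m1) % L for v, m0, m1 in zip(vr, m0r, m1r)]
            for vr, m0r, m1r in zip(y_mask, mask0, mask1)]
```

Anything the untrusted zone reconstructs is uniformly random without the masks; `unblind` only succeeds where both masks are available.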
As will be appreciated, Enclave: an Intel SGX program consists of two parts, an untrusted application and a trusted Enclave. At run time, the Intel SGX instructions create a trusted Enclave in a specific protected memory region to store the data and code to be protected, which effectively prevents data leakage.
Further, the S1000, S1001 and S1002 are all completed in the offline stage, as shown in fig. 1.
As one or more embodiments, the S101: the method comprises the steps that a calculation server receives data share of data to be predicted, wherein the data share is sent by a client; the method comprises the following specific steps:
a first computing server receives a first data share of data to be predicted, which is sent by a client; and the second computing server receives a second data share of the data to be predicted, which is sent by the client.
Illustratively, the S101: the method comprises the steps that a calculation server receives data share of data to be predicted, wherein the data share is sent by a client; the method comprises the following specific steps:
the client and the computation servers perform remote attestation to ensure the security, authenticity, and integrity of the computation server hardware. After the attestation passes, the client C divides the data x to be predicted into two data shares share_0(x) = r and share_1(x) = x − r (mod L), which are sent to the servers S_0, S_1.
Share() denotes the sharing operation: to protect its private input x, the client selects a random value r ∈ Z_L, where Z_L is the ring of integers modulo L and L = 2^64, as the first secret share share_0(x);
it then computes x − r (mod L) as the second secret share share_1(x) and sends the two secret shares to the servers, where mod is the modulo operation.
As one or more embodiments, the S102: the calculation server processes the data share to obtain a prediction result; the method comprises the following specific steps:
the first calculation server calculates a first prediction result based on the first data share; the second computing server computes a second prediction result based on the second data share.
Illustratively, the S102: the calculation server processes the data share to obtain a prediction result share; the method comprises the following specific steps:
the servers S_0, S_1, S_2 carry out a secure three-party prediction computation based on the SecureNN protocol and obtain the prediction result shares share_0(Y_i) and share_1(Y_i).
It should be understood that the computation between servers in this application is essentially a secret share based computation.
Before performing the secure multiparty computation, S_0 holds the model secret share share_0(W_i) and the user's data share share_0(x), and S_1 holds the model secret share share_1(W_i) and the user's data share share_1(x). With the assistance of S_2, the two servers complete the interactive computation under the SecureNN protocol and obtain their respective prediction result shares share_0(Y_i) and share_1(Y_i).
The SecureNN protocol comprises basic protocols such as addition, multiplication, matrix multiplication, activation function, privacy comparison and the like based on secret sharing, and can complete machine learning prediction calculation.
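The matrix-multiplication sub-protocol can be sketched the same way as the scalar primitives. This is an illustrative Beaver-style shared matrix product, not the exact SecureNN protocol: the matrix triple (A, B, C) with C = A·B is generated locally here, whereas in SecureNN the assisting server S_2 would supply its shares.

```python
import random

L = 1 << 64  # ring Z_L, L = 2^64

def mat(f, r, c):
    """Build an r x c matrix from a generator function."""
    return [[f() for _ in range(c)] for _ in range(r)]

def madd(X, Y):
    return [[(a + b) % L for a, b in zip(xr, yr)] for xr, yr in zip(X, Y)]

def msub(X, Y):
    return [[(a - b) % L for a, b in zip(xr, yr)] for xr, yr in zip(X, Y)]

def mmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y))) % L
             for j in range(len(Y[0]))] for i in range(len(X))]

def mshare(X):
    """Split matrix X into two additive shares over Z_L."""
    R = mat(lambda: random.randrange(L), len(X), len(X[0]))
    return msub(X, R), R

def secure_matmul(X_sh, W_sh, triple):
    """Shares of Z = X.W from shares of X and W, via a matrix Beaver
    triple (A, B, C) with C = A.B (in SecureNN, supplied by S_2)."""
    (A0, A1), (B0, B1), (C0, C1) = triple
    # Reveal E = X - A and F = W - B; safe since A, B are uniformly random.
    E = madd(msub(X_sh[0], A0), msub(X_sh[1], A1))
    F = madd(msub(W_sh[0], B0), msub(W_sh[1], B1))
    Z = []
    for j in (0, 1):
        Zj = madd(madd(mmul(X_sh[j], F), mmul(E, W_sh[j])), (C0, C1)[j])
        if j == 1:  # the -E.F correction is applied by exactly one server
            Zj = msub(Zj, mmul(E, F))
        Z.append(Zj)
    return Z[0], Z[1]
```

Summing the two output shares yields X·W (mod L): the cross terms involving the random A and B cancel, so neither server learns X, W, or the product.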
As one or more embodiments, the S103: the calculation server carries out blind processing on the prediction result share to obtain a blind prediction result share; the method comprises the following specific steps:
the first calculation server carries out blind processing on the first prediction result share to obtain a first blind prediction result share; and the second calculation server performs blind processing on the second prediction result share to obtain a second blind prediction result share.
Further, the first calculation server performs blinding processing on the first prediction result share to obtain a first blinded prediction result share; the method comprises the following steps: and the first computing server performs blinding processing on the first prediction result share through the first blinding matrix to obtain a first blinded prediction result share.
Further, the second calculation server performs blinding processing on the second prediction result share to obtain a second blinded prediction result share; the method comprises the following steps: and the second calculation server performs blinding processing on the second prediction result share through a second blinding matrix to obtain a second blinded prediction result share.
Illustratively, the S103: the calculation server carries out blind processing on the prediction result share to obtain a blind prediction result share; the method comprises the following specific steps:
the computation servers S_0, S_1 blind the prediction result shares they respectively hold using the previously obtained blinding matrices, i.e. each computes share_i(Y_i) + mask_i for i ∈ {0, 1}.
It should be understood that if the prediction shares were not blinded, the aggregation server, upon obtaining share_0(Y_i) and share_1(Y_i), could directly reconstruct Y_i = share_0(Y_i) + share_1(Y_i), leaking the prediction result.
Further, the step S103: the calculation server carries out blind processing on the prediction result share to obtain a blind prediction result share; the method comprises the following specific steps:
and the calculation server performs blind processing on the share of the prediction result by adopting a blind matrix to obtain the share of the blind prediction result.
Further, the step of obtaining the blinding matrix includes:
and the aggregation server randomly generates a blinding matrix in the credible region.
As one or more embodiments, the S104: the computing server sends the blinded prediction result share to an aggregation server; the method comprises the following specific steps:
the first computing server sends the first blinded prediction result share to the aggregation server; the second computing server sends the second blinded prediction result share to the aggregation server.
As one or more embodiments, the S105: the aggregation server carries out blind removing processing and noise adding processing on the blind prediction result share, and feeds back the result to the client; the method comprises the following specific steps:
s1051: the aggregation server reconstructs the blind prediction result from the first blind prediction result share and the second blind prediction result share in the untrusted area to obtain a third blind prediction result share;
s1052: the aggregation server carries out de-blinding processing on the third blinded prediction result share in the credible area to obtain an intermediate result; the aggregation server calculates an aggregation prediction result based on the intermediate result;
s1053: and the aggregation server performs noise processing on the aggregation prediction result and sends the aggregation prediction result subjected to the noise processing to the client.
Further, the S1051 aggregation server reconstructs the blind prediction result from the first blind prediction result share and the second blind prediction result share in the untrusted region, to obtain a third blind prediction result share; the method comprises the following specific steps:
the aggregation server obtains the blinded prediction result shares and reconstructs the blinded prediction result in advance in the untrusted region, i.e. Y_mask = share_0(Y_i) + mask_0 + share_1(Y_i) + mask_1 = Y_i + mask.
It should be understood that the prediction result Y_i is not revealed here, since the blinding matrix is not present in the untrusted region.
Further, the S1052: the aggregation server carries out de-blinding processing on the third blinded prediction result share in the credible area to obtain an intermediate result; the method comprises the following specific steps:
the aggregation server removes the blinded matrix from the trusted zone Encalve to obtain a prediction result:
Y_i = Y_mask − mask.
further, the S1052: the aggregation server calculates an aggregation prediction result based on the intermediate result; the method comprises the following specific steps:
the aggregation server computes the aggregated prediction result after voting using soft voting, i.e. the element-wise average of the n prediction vectors: Y_a = (1/n)·Σ_{i=1}^{n} Y_i.
Soft voting has a higher accuracy than hard voting.
Further, the S1053: the aggregation server carries out noise processing on the aggregation prediction result and sends the aggregation prediction result subjected to the noise processing to the client; the method comprises the following specific steps:
the aggregation server first computes the information entropy of the aggregated result: E(Y_a) = −Σ_j y_j·log y_j, where y_j is the predicted probability of class j.
For predictors with higher entropy, less noise is added, whereas for predictors with lower entropy, more noise is added.
According to the entropy, the aggregation server calculates the corresponding noise coefficient N [the formula for N appears only as an image in the original and is not reproduced here],
Wherein d is the distribution of classes of training data;
finally, the aggregation server adds noise to the prediction: Y'_a = Y_a + N·c·(d − Y_a), where c is a control coefficient that controls the magnitude of the added noise.
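The aggregation and noise-addition steps can be sketched as follows. The soft-voting average, the Shannon entropy, and the noise formula Y'_a = Y_a + N·c·(d − Y_a) come from the text above; the specific formula for the coefficient N is given only as an image in the original, so `noise_coef` below is a hypothetical normalized choice that merely matches the stated behaviour (more noise for low-entropy predictions, less for high-entropy ones).

```python
import math

def soft_vote(preds):
    """Soft voting: element-wise average of the teachers' probability vectors."""
    n = len(preds)
    return [sum(p[j] for p in preds) / n for j in range(len(preds[0]))]

def entropy(y):
    """Shannon entropy E(Y) = -sum_j y_j * log(y_j) of a prediction vector."""
    return -sum(p * math.log(p) for p in y if p > 0)

def noise_coef(y):
    """Hypothetical noise coefficient N: 1 at zero entropy, 0 at the maximum
    entropy log(j), so confident (low-entropy) predictions get more noise."""
    return 1.0 - entropy(y) / math.log(len(y))

def add_noise(y, d, c):
    """Y'_a = Y_a + N*c*(d - Y_a): pull the prediction toward the training
    class distribution d, scaled by the control coefficient c."""
    n_coef = noise_coef(y)
    return [p + n_coef * c * (dj - p) for p, dj in zip(y, d)]
```

Note that if d sums to 1, the noised vector still sums to 1, so the output remains a valid probability vector while its confidence is reduced.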
In order to solve the privacy disclosure problem of the PATE framework in knowledge transfer from the teacher model to the student model, and to overcome the performance limitations of the PATE framework, a scheme combining secret sharing and trusted computing (SGX) is provided. In the offline stage, a model holder (teacher) uses secret sharing to divide its model into two model shares, which are uploaded to and stored on two computation servers; moreover, the aggregation server generates blinding matrices in the trusted zone and sends them to the two computation servers to protect the prediction result. In the online prediction stage, as shown in fig. 2, the client (student) likewise uploads the private data to be predicted to the two servers in share form for prediction computation; the computation servers protect the prediction result shares with the blinding matrices; the aggregation server receives the blinded prediction shares and removes the blinding matrices in the trusted zone, aggregates the prediction results from the multiple private models, adds noise to the aggregated result for optimized protection, and returns the aggregated result to the client, as shown in fig. 4.
The method involves three parties, namely the model holders, the servers (comprising two computation servers and an aggregation server), and the client; it specifically comprises the following steps:
1. The model holder P_i divides its locally trained model W_i into two model shares share_0(W_i) and share_1(W_i) and sends them to the computation servers S_0, S_1.
2. The aggregation server S_2 randomly generates blinding matrices mask_0, mask_1 with mask = mask_0 + mask_1 in the trusted zone Enclave and sends them to the computation servers S_0, S_1 through a secure channel.
3. The client C divides the data x to be predicted into two data shares share_0(x) and share_1(x) and sends them to the servers S_0, S_1.
4. The servers S_0, S_1 compute the prediction result shares share_0(Y_i) and share_1(Y_i) on the model shares they hold, where Y = (y_1, y_2, ..., y_j) is the prediction vector, j is the number of prediction classes, and y is the prediction probability.
5. The servers S_0, S_1 blind their prediction result shares, i.e. each computes share_i(Y_i) + mask_i for i ∈ {0, 1}, and send the blinded results to the aggregation server.
6. The aggregation server computes the blinded prediction result in the untrusted zone: Y_mask = share_0(Y_i) + mask_0 + share_1(Y_i) + mask_1 = Y_i + mask.
7. The aggregation server removes the blinding in the trusted zone Enclave: Y_i = Y_mask − mask.
8. The aggregation server computes the aggregated prediction result using soft voting: Y_a = (1/n)·Σ_{i=1}^{n} Y_i.
9. The aggregation server optimizes the aggregated result by adding noise, reducing the information carried by the prediction result, and sends the noise-added prediction result Y'_a to the client C.
Table 1, Algorithm 1: execution of the framework (the algorithm listing is given as an image in the original document).
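Step 4 requires S0 and S1 to evaluate model shares against data shares, and the patent does not spell out the share-level arithmetic for the multiplications this involves. One standard option, shown here purely as a sketch, is Beaver multiplication triples, which the trusted enclave of S2 could also distribute in the offline stage; the field modulus and all variable names below are illustrative assumptions:

```python
import numpy as np

P = 2**31 - 1  # assumed prime modulus; real systems often use Z_{2^k} with fixed-point encodings

def split(v, rng):
    """Additively split an integer v into two shares modulo P."""
    s0 = int(rng.integers(0, P))
    return s0, (v - s0) % P

# Offline: a dealer (e.g. the enclave) distributes shares of a random triple c = a*b mod P.
rng = np.random.default_rng(1)
a, b = int(rng.integers(0, P)), int(rng.integers(0, P))
c = (a * b) % P
a0, a1 = split(a, rng)
b0, b1 = split(b, rng)
c0, c1 = split(c, rng)

# Online: the servers hold shares of x (a model weight) and y (a client input).
x, y = 1234, 5678
x0, x1 = split(x, rng)
y0, y1 = split(y, rng)

# Each server publishes its share of d = x - a and e = y - b; d and e are then opened.
d = (x0 - a0 + x1 - a1) % P
e = (y0 - b0 + y1 - b1) % P

# Local shares of x*y: z_i = c_i + d*b_i + e*a_i, with d*e added by one server only.
z0 = (c0 + d * b0 + e * a0 + d * e) % P
z1 = (c1 + d * b1 + e * a1) % P
assert (z0 + z1) % P == (x * y) % P
```

Opening d and e is safe because a and b are uniformly random one-time pads, so neither value leaks anything about x or y.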
Example two
This embodiment provides a machine learning security aggregation prediction system supporting bidirectional privacy protection;
The machine learning security aggregation prediction system supporting bidirectional privacy protection comprises: a client, computation servers, and an aggregation server.
The computation servers receive the data shares of the data to be predicted sent by the client; the computation servers process the data shares to obtain prediction result shares; the computation servers blind the prediction result shares to obtain blinded prediction result shares; the computation servers send the blinded prediction result shares to the aggregation server; and the aggregation server removes the blinding from, and adds noise to, the blinded prediction result shares, and feeds the result back to the client.
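The aggregation server's final two operations reduce to averaging the recovered probability vectors (soft voting) and perturbing the average before release. A minimal sketch, assuming Laplace noise in the style of PATE-like aggregators (the patent states only that noise is added; the mechanism and scale here are assumptions):

```python
import numpy as np

rng = np.random.default_rng(3)
n, j = 5, 3                               # n teacher models, j prediction classes
Ys = rng.dirichlet(np.ones(j), size=n)    # recovered per-teacher probability vectors Yi

# Soft voting: average the probability vectors.
Y_a = Ys.mean(axis=0)

# Perturb the aggregate before returning it to the client; a larger noise scale
# gives stronger protection but a less informative answer.
epsilon = 1.0  # illustrative privacy parameter
Y_noisy = Y_a + rng.laplace(scale=1.0 / (n * epsilon), size=j)

assert np.isclose(Y_a.sum(), 1.0)  # the un-noised aggregate is still a distribution
assert Y_noisy.shape == (j,)
```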
In the foregoing embodiments, the descriptions of the embodiments have different emphasis, and for parts that are not described in detail in a certain embodiment, reference may be made to related descriptions of other embodiments.
The above description is only a preferred embodiment of the present application and is not intended to limit the present application, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, improvement and the like made within the spirit and principle of the present application shall be included in the protection scope of the present application.

Claims (10)

Translated from Chinese
1. A machine learning security aggregation prediction method supporting bidirectional privacy protection, characterized by comprising: a computing server receiving data shares of data to be predicted sent by a client; the computing server processing the data shares to obtain prediction result shares; the computing server blinding the prediction result shares to obtain blinded prediction result shares; the computing server sending the blinded prediction result shares to an aggregation server; and the aggregation server removing the blinding from, and adding noise to, the blinded prediction result shares, and feeding the result back to the client.
2. The machine learning security aggregation prediction method supporting bidirectional privacy protection according to claim 1, characterized in that, before the step of the computing server receiving the data shares of the data to be predicted sent by the client, the method further comprises: a model holder dividing a locally trained machine learning model into several model shares, and sending the model shares to the corresponding computing servers; and the aggregation server randomly generating a blinding matrix in a trusted zone, and sending the blinding matrix to the corresponding computing servers.
3. The machine learning security aggregation prediction method supporting bidirectional privacy protection according to claim 1, characterized in that the computing server receiving the data shares of the data to be predicted sent by the client specifically comprises: a first computing server receiving a first data share of the data to be predicted sent by the client; and a second computing server receiving a second data share of the data to be predicted sent by the client.
4. The machine learning security aggregation prediction method supporting bidirectional privacy protection according to claim 3, characterized in that the computing server processing the data shares to obtain prediction results specifically comprises: the first computing server computing a first prediction result based on the first data share; and the second computing server computing a second prediction result based on the second data share.
5. The machine learning security aggregation prediction method supporting bidirectional privacy protection according to claim 4, characterized in that the computing server blinding the prediction result shares to obtain blinded prediction result shares specifically comprises: the first computing server blinding a first prediction result share to obtain a first blinded prediction result share; and the second computing server blinding a second prediction result share to obtain a second blinded prediction result share.
6. The machine learning security aggregation prediction method supporting bidirectional privacy protection according to claim 4, characterized in that the computing server blinding the prediction result shares to obtain blinded prediction result shares specifically comprises: the computing server blinding the prediction result shares using a blinding matrix to obtain the blinded prediction result shares.
7. The machine learning security aggregation prediction method supporting bidirectional privacy protection according to claim 6, characterized in that the step of obtaining the blinding matrix comprises: the aggregation server randomly generating the blinding matrix in a trusted zone.
8. The machine learning security aggregation prediction method supporting bidirectional privacy protection according to claim 5, characterized in that the computing server sending the blinded prediction result shares to the aggregation server specifically comprises: the first computing server sending the first blinded prediction result share to the aggregation server; and the second computing server sending the second blinded prediction result share to the aggregation server.
9. The machine learning security aggregation prediction method supporting bidirectional privacy protection according to claim 8, characterized in that the aggregation server removing the blinding from, and adding noise to, the blinded prediction result shares and feeding the result back to the client specifically comprises: the aggregation server reconstructing, in an untrusted zone, a blinded prediction result from the first blinded prediction result share and the second blinded prediction result share to obtain a third blinded prediction result share; the aggregation server removing the blinding from the third blinded prediction result share in the trusted zone to obtain an intermediate result; the aggregation server computing an aggregated prediction result based on the intermediate result; and the aggregation server adding noise to the aggregated prediction result, and sending the noise-added aggregated prediction result to the client.
10. A machine learning security aggregation prediction system supporting bidirectional privacy protection, characterized by comprising: a client, computing servers, and an aggregation server; wherein the computing server receives data shares of data to be predicted sent by the client; the computing server processes the data shares to obtain prediction result shares; the computing server blinds the prediction result shares to obtain blinded prediction result shares; the computing server sends the blinded prediction result shares to the aggregation server; and the aggregation server removes the blinding from, and adds noise to, the blinded prediction result shares, and feeds the result back to the client.
CN202011230255.9A | 2020-11-06 | 2020-11-06 | Method and system for machine learning secure aggregation prediction supporting bidirectional privacy protection | Active | CN112347473B (en)

Priority Applications (1)

Application Number | Priority Date | Filing Date | Title
CN202011230255.9A (CN112347473B) | 2020-11-06 | 2020-11-06 | Method and system for machine learning secure aggregation prediction supporting bidirectional privacy protection

Applications Claiming Priority (1)

Application Number | Priority Date | Filing Date | Title
CN202011230255.9A (CN112347473B) | 2020-11-06 | 2020-11-06 | Method and system for machine learning secure aggregation prediction supporting bidirectional privacy protection

Publications (2)

Publication NumberPublication Date
CN112347473A | 2021-02-09
CN112347473B | 2022-07-26

Family

ID=74428562

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
CN202011230255.9A (Active, CN112347473B) | Method and system for machine learning secure aggregation prediction supporting bidirectional privacy protection | 2020-11-06 | 2020-11-06

Country Status (1)

Country | Link
CN (1) | CN112347473B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN113378191A (en) * | 2021-06-01 | 2021-09-10 | Guizhou University | Safe multi-party computing scheme based on information entropy under semi-honest model
CN114707169A (en) * | 2022-05-06 | 2022-07-05 | Ningbo Artificial Intelligence Research Institute of Shanghai Jiao Tong University | Input information privacy protection system and method based on secure two-party computation
CN115455488A (en) * | 2022-11-15 | 2022-12-09 | Harbin Institute of Technology (Shenzhen) | Encrypted state database query method and device based on replication secret sharing

Citations (9)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN106411533A (en) * | 2016-11-10 | 2017-02-15 | Xidian University | On-line fingerprint authentication system and method based on bidirectional privacy protection
US20170353855A1 (en) * | 2016-06-02 | 2017-12-07 | The Regents Of The University Of California | Privacy-preserving stream analytics
CN107509001A (en) * | 2017-08-15 | 2017-12-22 | Beijing Zhixun Chuangxin Information Technology Co., Ltd. | Method and system for providing privacy-protected numbers for express delivery users
US20180373882A1 (en) * | 2017-06-23 | 2018-12-27 | Thijs Veugen | Privacy preserving computation protocol for data analytics
CN109194523A (en) * | 2018-10-01 | 2019-01-11 | Xidian University | Privacy-protecting multi-party diagnostic model fusion method and system, and cloud server
CN110135847A (en) * | 2019-05-22 | 2019-08-16 | Tongji University | Blockchain-based system and method for improving security of electronic auctions
CN110572253A (en) * | 2019-09-16 | 2019-12-13 | University of Jinan | Method and system for enhancing the privacy of federated learning training data
CN110647765A (en) * | 2019-09-19 | 2020-01-03 | University of Jinan | Privacy protection method and system based on knowledge transfer under a collaborative learning framework
CN111275202A (en) * | 2020-02-20 | 2020-06-12 | University of Jinan | Machine learning prediction method and system for data privacy protection

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US20170353855A1 (en) * | 2016-06-02 | 2017-12-07 | The Regents Of The University Of California | Privacy-preserving stream analytics
CN106411533A (en) * | 2016-11-10 | 2017-02-15 | Xidian University | On-line fingerprint authentication system and method based on bidirectional privacy protection
US20180373882A1 (en) * | 2017-06-23 | 2018-12-27 | Thijs Veugen | Privacy preserving computation protocol for data analytics
CN107509001A (en) * | 2017-08-15 | 2017-12-22 | Beijing Zhixun Chuangxin Information Technology Co., Ltd. | Method and system for providing privacy-protected numbers for express delivery users
CN109194523A (en) * | 2018-10-01 | 2019-01-11 | Xidian University | Privacy-protecting multi-party diagnostic model fusion method and system, and cloud server
CN110135847A (en) * | 2019-05-22 | 2019-08-16 | Tongji University | Blockchain-based system and method for improving security of electronic auctions
CN110572253A (en) * | 2019-09-16 | 2019-12-13 | University of Jinan | Method and system for enhancing the privacy of federated learning training data
CN110647765A (en) * | 2019-09-19 | 2020-01-03 | University of Jinan | Privacy protection method and system based on knowledge transfer under a collaborative learning framework
CN111275202A (en) * | 2020-02-20 | 2020-06-12 | University of Jinan | Machine learning prediction method and system for data privacy protection

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Zhao Chuan et al., "Practical secure two-party computation and its application in genome sequence alignment", Journal of Cryptologic Research, no. 02, 15 April 2019 (2019-04-15), pages 197-198 *
Zou Xuxi et al., "(m+1, t+1) threshold secret sharing scheme based on special difference equations in cloud computing", Computer Engineering, vol. 43, no. 01, 15 January 2017 (2017-01-15), pages 9-11 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN113378191A (en) * | 2021-06-01 | 2021-09-10 | Guizhou University | Safe multi-party computing scheme based on information entropy under semi-honest model
CN114707169A (en) * | 2022-05-06 | 2022-07-05 | Ningbo Artificial Intelligence Research Institute of Shanghai Jiao Tong University | Input information privacy protection system and method based on secure two-party computation
CN114707169B (en) * | 2022-05-06 | 2025-08-22 | Ningbo Artificial Intelligence Research Institute of Shanghai Jiao Tong University | System and method for protecting input information privacy based on secure two-party computation
CN115455488A (en) * | 2022-11-15 | 2022-12-09 | Harbin Institute of Technology (Shenzhen) | Encrypted state database query method and device based on replication secret sharing
CN115455488B (en) * | 2022-11-15 | 2023-03-28 | Harbin Institute of Technology (Shenzhen) | Encrypted state database query method and device based on replication secret sharing

Also Published As

Publication number | Publication date
CN112347473B (en) | 2022-07-26

Similar Documents

Publication | Publication Date | Title
Lou et al. | Hemet: A homomorphic-encryption-friendly privacy-preserving mobile neural network architecture
Liu et al. | Privacy-preserving aggregation in federated learning: A survey
JP7682179B2 | Systems and methods for encrypting data and algorithms
Malekzadeh et al. | Dopamine: Differentially private federated learning on medical data
EP3475868B1 | Privacy-preserving machine learning
Hou et al. | Model protection: Real-time privacy-preserving inference service for model privacy at the edge
Joye et al. | Private yet efficient decision tree evaluation
US10375070B2 | Generating cryptographic function parameters from compact source code
CN110059501B | Safe outsourcing machine learning method based on differential privacy
CN112347473A | Machine learning security aggregation prediction method and system supporting bidirectional privacy protection
CN111241580A | A federated learning method based on trusted execution environment
US11316665B2 | Generating cryptographic function parameters based on an observed astronomical event
Mehnaz et al. | A secure sum protocol and its application to privacy-preserving multi-party analytics
Pawar et al. | Privacy preserving model-based authentication and data security in cloud computing
US10079675B2 | Generating cryptographic function parameters from a puzzle
Ibarrondo et al. | Banners: Binarized neural networks with replicated secret sharing
Zhu et al. | SecureBiNN: 3-party secure computation for binarized neural network inference
Shen et al. | An efficient 3-party framework for privacy-preserving neural network inference
Sedghighadikolaei et al. | Privacy-preserving and trustworthy deep learning for medical imaging
Wang et al. | Protecting data privacy in federated learning combining differential privacy and weak encryption
CN113849828A | Anonymous generation and attestation of processed data
Wu et al. | Confidential and verifiable machine learning delegations on the cloud
Singh et al. | Security enhancement of the cloud paradigm using a novel optimized crypto mechanism
Khalili et al. | Context-aware hybrid encoding for privacy-preserving computation in IoT devices
Sangeetha et al. | Design of a novel privacy preservation based cyber security system framework for secure medical data transactions in cloud storage

Legal Events

Date | Code | Title | Description
PB01 | Publication
SE01 | Entry into force of request for substantive examination
GR01 | Patent grant
TR01 | Transfer of patent right

Effective date of registration:20221129

Address after:311401 Room 1324, 13/F, Building 13, Fuchun Park, Zhigu, China, Yinhu Street, Fuyang District, Hangzhou City, Zhejiang Province

Patentee after:Hangzhou Liang'an Technology Co.,Ltd.

Address before:250022 No. 336, South Xin Zhuang West Road, Shizhong District, Ji'nan, Shandong

Patentee before:University of Jinan

CP02 | Change in the address of a patent holder

Address after:311100 1005-21, Floor 10, Building H, Haichuang Park, CEC Haikang Group Co., Ltd., No. 198, Aicheng Street, Wuchang Street, Yuhang District, Hangzhou City, Zhejiang Province

Patentee after:Hangzhou Liang'an Technology Co.,Ltd.

Address before:311401 Room 1324, 13/F, Building 13, Fuchun Park, Zhigu, China, Yinhu Street, Fuyang District, Hangzhou City, Zhejiang Province

Patentee before:Hangzhou Liang'an Technology Co.,Ltd.

