Background technology
Shown in Fig. 1 is the basic process of esterification, and esterification is as the key link of whole polyester production process, and to stabilized polyester, production plays a decisive role.And in reaction unit the first esterifying kettle outlet Key Quality Indicator---the height of esterification yield directly affects the carrying out of subsequent reactions and the crystal property of polyester product, therefore usually by controlling esterification yield, control whole production run.But different polycondensating process has different requirements to esterification yield, so must reach required esterification yield by adjusting the operating conditionss such as reaction pressure and material quantity compare in production run.The flip-flop of operating conditions can cause the quality fluctuation of esterification yield, is unfavorable for the real-time control of whole production run.On the other hand, esterification process adopts two esterifiers to reach the esterification yield of technological requirement mostly, and the nonlinearity of reactive system, time variation and the uncertain difficulty of esterification yield on-line measurement that strengthened.
The on-the site analysis instrument not only expensive, maintaining is complicated, and while using analysis meter to be measured esterification yield, usually has hysteresis for a long time, the hydraulic performance decline that this will cause controlling quality, be difficult to meet production requirement.The basic skills of soft measurement is that Theory of Automatic Control and production run knowledge are combined, Applied Computer Techniques, for being difficult to, measure or temporary transient immeasurable leading variable, the auxiliary variable of selecting other easily to measure, infer and estimate by forming certain mathematical relation, replacing the function of on-the site analysis instrument with software.Flexible measurement method, because responding rapidly, can provide leading variable information continuously, and invests the advantage such as low, that maintaining is simple and obtain broad research and application in each field.But, in nearest decades, along with scientific and technical progress, modern industry production is more and more higher for the requirement of production run, data volume sharply increases, data type becomes increasingly complex, and operating mode is complicated and changeable, on the other hand, industrial process is all generally dynamic, static flexible measurement method can't reflect multidate information and the global property of industrial process usually, causes the bad adaptability of model, can't use for a long time.So simple, conventional flexible measurement method can not meet the needs of modern production technique in the past, be prone to that process characteristic coupling is not good, precision of prediction is low and the problem such as bad adaptability.
In order to obtain being applicable to more in general sense the flexible measurement method that the esterification yield data are predicted and analyzed, many improving one's methods is suggested, and formed plentiful and substantial achievement in research, mainly contain the following aspects: utilize various modeling methods, as Analysis on Mechanism, artificial neural network, least square method supporting vector machine and Gaussian process, sample set is set up to model and predict leading variable output; Utilize various intelligent optimization methods, as: particle cluster algorithm, genetic algorithm and evolution algorithm etc. carry out preferably the parameter of model; Utilize various clustering methods as the K-mean cluster, Fuzzy C-Means Clustering, stream shape cluster and affine propagation clustering method are polymerized to several subclasses by sample set, build several submodels and improve model prediction performance etc.
Summary of the invention
The present invention is directed to the prior art above shortcomings, a kind of multi-model dynamic soft measuring modeling method is provided, based on Combination Rules of Evidence Theory (D-S rule) and autoregressive moving-average model (ARMA), have better adaptability than prior art, in the soft measurement to esterification yield, precision is higher.
The present invention is achieved through the following technical solutions:
A kind of multi-model dynamic soft measuring modeling method comprises the following steps:
S1, data pre-service: select training sample data collection Xm*n, m is the sample dimension, and n is number of samples, and the rejecting abnormalities data are also carried out normalized to data;
S2, adaptive fuzzy kernel clustering are analyzed: adopt adaptive fuzzy kernel clustering method to training sample data collection Xm*ncarry out cluster, obtain fuzzy class degree of membership and each cluster centre of each sample, and automatically determine best clusters number c;
S3, set up submodel: adopt least square method supporting vector machine to carry out training study to the training sample set of c cluster, select the kernel function of gaussian kernel function as least square method supporting vector machine, set up and determine the parameter of c submodel by cross-validation method: penalty factor and nuclear parameter σ, and obtain the output of each submodel
S4, the output of the submodel based on Combination Rules of Evidence Theory are merged: calculate the evidential probability partition function value of each submodel, the weight using it as submodel, then carry out evidence fusion to the output of each submodel, obtains static multiple mode output
The mobilism of S5, model output: use the multi-model output of autoregressive moving-average model to current time t, right
dynamically adjust, at first judgement
whether be stationary sequence, if not, will
be converted to stationary sequence; Otherwise directly will
with true measurement, y subtracts each other, obtain a time series about output valve error delta y, then utilize autoregressive moving-average model (p, q) this time series is carried out to modeling, obtain the autoregressive moving-average model about predicated error, finally, above two models are combined and carry out model prediction, final dynamic multi-model is output as
Preferably, in step S2, the step of adaptive fuzzy kernel clustering method comprises:
S21: cluster objective function: to training sample set X={xi| i=1,2...n}, the objective function of adaptive fuzzy kernel clustering method is defined as
In formula, μ
ij∈ [0,1];
j=1,2...c, m is the fuzzy control index, μ
ijbe the degree of membership value of i sample corresponding to j cluster, v
jbe j cluster centre, K(x
i, v
j) be gaussian kernel function;
S22: degree of membership is upgraded:
S23: cluster centre upgrades:
S23: cluster result evaluation: cluster is estimated the result of cluster Validity Index after finishing
Preferably, step S4 comprises:
S41: the evidential probability partition function of first submodel: using the framework of identification of all c submodels in evidence theory of cluster gained, and arbitrary submodel is considered as to burnt first Cj(j=1,2...c), for sample x1, calculate it for first submodel, i.e. first burnt first C1fuzzy class degree of membership, and, according to evidence theory, using it as an evidence, the probability assignments function of remembering this evidence is m ({ C1| x1)=μ11;
And for all n test sample book data X={xi| i=1,2...n}, obtain n bar evidence equally, and its probability assignments function is designated as m ({ C1| xi)=μi1(i=1,2...n);
Then, use Combination Rules of Evidence Theory to be merged these probability assignments functions, the probability assignments function using the probability assignments function after fusion as first submodel:
Wherein, the contradiction factorIn order to reflect the conflict spectrum of evidence;
S44: the evidential probability partition function of all submodels: the rest may be inferred, for all c submodel, according to step S43, obtains c evidential probability partition function m ({ C1| X) ... m{Cc| X;
S45: multi-model output: calculate respectively the son output of X for each submodel
weight using the c in S44 probability assignments function as each submodel, be weighted fusion to the submodel Output rusults of gained, and the output of the multi-model of training sample data collection is expressed as:
Preferably, step S5 comprises:
S51: adopt autoregressive moving-average model to static multi-model output
carry out dynamic calibration, the response of autoregressive moving-average model descriptive system current time t
?
not only relevant with the observed reading before it in time, also with present worth and the lagged value of system disturbance, there is certain dependence, autoregressive moving-average model (p, q) can be expressed as
Wherein, p is the autoregression item; Q is the moving average item number;
S52: introduce linear shift operator B, have
therefore the formula in S51 is variable, be changed to
In formula, εtfor meeting N (0, σ2) white noise sequence, the m rank that Φ (B) and θ (B) are Shift operators B and n rank polynomial expression.
According to the basic theories of Hilbert space Linear Operators, to meeting steadily, the random time sequence of normal state, zero-mean
an available autoregressive moving-average model (p, q) approaches with arbitrary accuracy.
At first the method utilizes the evidence composition rule to process the focusing advantage of uncertain information, each submodel obtained for the affine propagation clustering method has been set up a plurality of evidential probability partition functions, weight using it as each submodel, output to each submodel is weighted the multi-model output that fusion obtains test sample book, the concussion of having avoided switching mode to cause, eliminate the sample mistake and divided the impact on the model output accuracy, effectively improved the predictive ability of model; Then in conjunction with autoregressive moving-average model, the static multiple mode output error information obtained is carried out to dynamic calibration, significantly improved the dynamic response characteristic of system.
Embodiment
Below with reference to accompanying drawing of the present invention; technical scheme in the embodiment of the present invention is carried out to clear, complete description and discussion; obviously; as described herein is only a part of example of the present invention; it is not whole examples; embodiment based in the present invention, the every other embodiment that those of ordinary skills obtain under the prerequisite of not making creative work, belong to protection scope of the present invention.
For the ease of the understanding to the embodiment of the present invention, take specific embodiment below in conjunction with accompanying drawing and be further explained as example, and each embodiment does not form the restriction to the embodiment of the present invention.
At first utilize the evidence composition rule to process the focusing advantage of uncertain information, each submodel obtained for the affine propagation clustering method has been set up a plurality of evidential probability partition functions, weight using it as each submodel, output to each submodel is weighted the multi-model output that fusion obtains test sample book, the concussion of having avoided switching mode to cause, eliminate the sample mistake and divided the impact on the model output accuracy, effectively improved the predictive ability of model; Then in conjunction with autoregressive moving-average model (ARMA, Auto-Regressive Moving Average Model), the static multiple mode output error information obtained is carried out to dynamic calibration, significantly improved the dynamic response characteristic of system.
The technical scheme that this method technical solution problem is taked is:
Please refer to Fig. 2, a kind of multi-model dynamic soft measuring modeling method based on Combination Rules of Evidence Theory and autoregressive moving-average model comprises the following steps:
S1: data pre-service: select training sample data collection Xm*n, m is the sample dimension, and n is number of samples, and the rejecting abnormalities data are also carried out normalized to data;
S2: the adaptive fuzzy kernel clustering is analyzed: adopt adaptive fuzzy kernel clustering method to sample set Xm*ncarry out cluster, obtain fuzzy class degree of membership and each cluster centre of each sample, and automatically determine best clusters number c;
S3: set up submodel: to each sub-training sample set, adopt least square method supporting vector machine (LS-SVM, below replace with LS-SVM) to carry out training study to it, and determine the parameter of each submodel.Select the kernel function of gaussian kernel function as LS-SVM, determine the parameter of each submodel by cross-validation method: penalty factor and nuclear parameter σ, as shown in Figure 3;
S4: the submodel output based on D-S is merged: obtain the evidential probability partition function value of each submodel according to the method for formula (6), the weight using it as submodel, then utilize the output of formula (7) to each submodel
carry out evidence fusion, obtain the output of multi-model
S5: the mobilism of model output: the static model on utilize obtain the multi-model output of sample
after, use the multi-model output of arma modeling to current time t
right
dynamically adjust.At first judgement
whether be stationary sequence, if not, will
be converted to stationary sequence; Otherwise directly will
with true measurement, y subtracts each other, and obtains a time series about output valve error delta y, then utilizes arma modeling (p, q) to carry out modeling to this time series, obtains the arma modeling about predicated error.Finally, above two models are combined and carry out model prediction, sample finally is output as
In step S2, the step of " adaptive fuzzy kernel clustering method " is as follows:
S21: cluster objective function: to training sample set X={xi| i=1,2...n}, the objective function of adaptive fuzzy kernel clustering method is defined as
In formula, μ
ij∈ [0,1];
j=1,2...c, m is the fuzzy control index, μ
ijbe the degree of membership value of i sample corresponding to j cluster, v
jbe j cluster centre, K (x, y) is gaussian kernel function.
S22: degree of membership is upgraded:
S23: cluster centre upgrades:
S23: cluster result evaluation: cluster adopts following Validity Index to be estimated the result of cluster after finishing
In step S4, the concrete steps of " the model prediction output based on Combination Rules of Evidence Theory " are as follows:
S41: the evidential probability partition function of first submodel: using the framework of identification of all c submodels in evidence theory of cluster gained, and arbitrary submodel is considered as to burnt first Cj(j=1,2...c).So, for sample x1, at first according to formula (3), obtain it for first submodel, be also first burnt first C1fuzzy class degree of membership.And, according to evidence theory, using it as an evidence, the probability assignments function of remembering this evidence is m ({ C1| xi)=μ11.
And for all n test sample book data X={xi| i=1,2...n}, in like manner, can obtain n bar evidence, and its probability assignments function is designated as m ({ C1| xi)=μi1(i=1,2...n).
Then, use Combination Rules of Evidence Theory to be merged these probability assignments functions, the probability assignments function after merging is as the probability assignments function of first submodel, shown in (6):
Wherein, the contradiction factorIts size has reflected the conflict spectrum of evidence.
S44: the evidential probability partition function of all submodels: the rest may be inferred, for all c submodel, according to step S43, can obtain c evidential probability partition function m ({ C1| X) ... m ({ Cc| X).
S45: multi-model output: calculate respectively X for each submodel LS-SVM1 ... the son output of LS-SVMc
weight using c obtained above probability assignments function as each submodel, be weighted fusion to the submodel Output rusults of gained, and the output of the multi-model of test sample book collection can be expressed as
In step S5, the concrete steps of " mobilism of model output " are:
S51: the static multiple mode output that adopts autoregressive moving-average model (ARMA) to obtain upper joint
carry out dynamic calibration.The response of arma modeling descriptive system current time t
?
not only relevant with the observed reading before it in time, also with present worth and the lagged value of system disturbance, there is certain dependence.Arma modeling (p, q) can be expressed as
Wherein, AR is autoregression, and p is the autoregression item; MA is moving average, and q is the moving average item number.
S52: introduce linear shift operator B, have
therefore variable being changed to of formula (8)
In formula, εtfor meeting N (0, σ2) white noise sequence, the m rank that Φ (B) and θ (B) are Shift operators B and n rank polynomial expression.
According to the basic theories of Hilbert space Linear Operators, to meeting steadily, the random time sequence of normal state, zero-mean
an available arma modeling (p, q) approaches with arbitrary accuracy.
Following according to real data for an embodiment:
The first step: to collection in worksite to data processed, obtain 1000 groups of normal datas.Using 900 groups of data wherein as training dataset X, for the foundation of model; Remaining 100 groups as test data set, for the predictive ability of testing model.
Second step: utilize the affine propagation clustering method to carry out cluster to training dataset, obtaining the optimum cluster number is c=4, corresponding cluster centre v.
The 3rd step: to resulting four the sub-training sample sets of cluster, utilize the LS-SVM method to set up four submodels, and training study, determine the parameter of LS-SVM through cross-validation method, as shown in Figure 3.
The 4th step: calculate the probability assignments function of each submodel of test sample book set pair according to formula (6), the weight using it as each submodel, then calculate test sample book X
testoutput with respect to each submodel
then utilize formula (7) to be merged the output of each submodel, obtain the output of test sample book
The 5th step: by the predicted value of the test sample book of current time t
with the manual analysis value, y subtracts each other, and the time series of output error Δ y is carried out to the ARMA modeling.Show that the estimated performance of algorithm is best when optimal factor p=4.
The estimated performance curve that Fig. 4-7 are three kinds of different measuring methods and measuring method of the present invention.From simulation result, can find out, the multi-model dynamic soft measuring modeling method based on Combination Rules of Evidence Theory and autoregressive moving-average model of the present invention is than single model and traditional multi-model process, and the estimated performance of esterification yield has been had to larger improvement.This is because esterification has than the characteristics of high non-linearity and multi-state, and a model need to be considered whole training samples during the single model modeling, and this has limited the precision of model; And although traditional multi-model process has carried out clustering to training dataset when modeling, set up respectively different submodels, but deeply do not consider the impact of the dynamic change of the difference of test sample book and training sample and dividing condition and process on the multi-model Output rusults at prediction test sample book output stage, so not significantly improvement of estimated performance.Method of the present invention utilizes the affine propagation clustering method by the first clustering of the identical sample similar with characteristic of operating mode, then take into full account the output stage in the prediction test sample book, the impact of each submodel output on the final Output rusults of sample, utilize Combination Rules of Evidence Theory structure weight to carry out the multi-model fusion to the output of submodel, obtain the final output of test sample book, the prediction deviation that the concussion of having avoided switching mode to cause and sample mistake minute cause; In addition, consider the dynamic perfromance of actual industrial process, utilize autoregressive moving-average model to export and carry out dynamic calibration multi-model, improved the dynamic response characteristic of system, thereby there is better adaptability, obtained fitting effect preferably in the soft measurement to esterification yield.
Listed the performance parameter of different flexible measurement methods in Fig. 8.As can be seen from Figure 8, adopt multi-model dynamic soft measuring modeling method provided by the present invention, more existing modeling method, root-mean-square error and maximum relative error all are improved.
The above; be only the present invention's embodiment preferably, but protection scope of the present invention is not limited to this, anyly is familiar with in technical scope that those skilled in the art disclose in the present invention; the variation that can expect easily or replacement, within all should being encompassed in protection scope of the present invention.Therefore, protection scope of the present invention should be as the criterion with the protection domain of claim.