Movatterモバイル変換


[0]ホーム

URL:


CN107301577A - Training method, credit estimation method and the device of credit evaluation model - Google Patents

Training method, credit estimation method and the device of credit evaluation model
Download PDF

Info

Publication number
CN107301577A
CN107301577ACN201610236701.4ACN201610236701ACN107301577ACN 107301577 ACN107301577 ACN 107301577ACN 201610236701 ACN201610236701 ACN 201610236701ACN 107301577 ACN107301577 ACN 107301577A
Authority
CN
China
Prior art keywords
training
feature
tree
primitive character
models
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610236701.4A
Other languages
Chinese (zh)
Inventor
杜玮
李文鹏
余舟华
施兴
王晓光
杨旭
张柯
程孟力
曾海峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding LtdfiledCriticalAlibaba Group Holding Ltd
Priority to CN201610236701.4ApriorityCriticalpatent/CN107301577A/en
Publication of CN107301577ApublicationCriticalpatent/CN107301577A/en
Pendinglegal-statusCriticalCurrent

Links

Classifications

Landscapes

Abstract

This application discloses a kind of training method of credit evaluation model, credit estimation method and device, the wherein training method includes:Obtain training primitive behavior data of the training user in operation system;Extract the training primitive character in training primitive behavior data;Combinations of features is carried out according to iteration decision tree GBDT models to training primitive character to generate corresponding training combined crosswise feature;Logistic regression Logistic Regression models are trained to build credit evaluation model according to training combined crosswise feature.This method is trained to generate corresponding training combined crosswise feature by non-linear GBDT models to training primitive character, and by training combined crosswise feature to be trained to build credit evaluation model linear LR models, so that the credit evaluation model had both possessed the high-performance of nonlinear model, but also with the interpretation of linear model.

Description

Training method, credit estimation method and the device of credit evaluation model
Technical field
The application is related to data assessment field, more particularly to a kind of training method of credit evaluation model, uses the credit evaluationModel carries out the method and device of credit evaluation.
Background technology
At present, the Credit Model of user is built in conventional traditional credit investigation system using scorecard mostly, scorecard is actuallyIt is a kind of two disaggregated model, builds the model and mainly realized using logistic regression Logistic Regression algorithms.CommentSnap gauge type is divided to need to pre-process variable, the need for making scoring, variable first can be segmented by scorecard model,Then variable is encoded based on target again, the coded system commonly used in Credit Model is WOE (Weight OfEvidence), it intuitively can be understood as becoming influence when measuring some value to target, can improve linear by this methodThe performance of model.
But, the problem of presently, there are is:In traditional credit investigation system, using purely linear model although ensure that model canIt is explanatory, but performance comparatively can be weaker, also, the method for traditional linear model combinations of features is less efficient, leadsCause the user credit model performance built relatively low, so as to cause to forbid the credit evaluation result of user by this modelReally.
The content of the invention
The purpose of the application is intended at least solve one of technical problem in correlation technique to a certain extent.
Therefore, first purpose of the application is to propose a kind of training method of credit evaluation model.This method passes through non-threadProperty GBDT models training primitive character is trained to generate corresponding training combined crosswise feature, and by training intersection groupClose feature to be trained to build credit evaluation model linear LR models so that the credit evaluation model both possesses nonlinear modelThe high-performance of type, but also with the interpretation of linear model.
Second purpose of the application is to propose a kind of method of credit evaluation.
The 3rd purpose of the application is to propose a kind of trainer of credit evaluation model.
The 4th purpose of the application is to propose a kind of device of credit evaluation.
For up to above-mentioned purpose, the training method of the credit evaluation model of the application first aspect embodiment, including:Obtain trainingTraining primitive behavior data of the user in operation system;Extract the training primitive character in the training primitive behavior data;Carry out combinations of features to the training primitive character to generate corresponding training combined crosswise according to iteration decision tree GBDT modelsFeature;Logistic regression Logistic Regression models are trained according to the training combined crosswise feature to buildCredit evaluation model.
The training method of the credit evaluation model of the embodiment of the present application, is carried out by non-linear GBDT models to training primitive characterTraining with generate it is corresponding training combined crosswise feature, and by train combined crosswise feature to be trained linear LR models withBuild credit evaluation model so that the credit evaluation model had both possessed the high-performance of nonlinear model, but also with linear modelInterpretation.
For up to above-mentioned purpose, the credit evaluation described in use the application first aspect embodiment of the application second aspect embodimentThe method that model carries out credit evaluation, including:Obtain primitive behavior data of the targeted customer in operation system;Extract describedPrimitive character in primitive behavior data;Carry out combinations of features to the primitive character to generate corresponding friendship according to GBDT modelsPitch assemblage characteristic;The combined crosswise feature is predicted according to the credit evaluation model to obtain the targeted customer'sCredit information.
The method of the credit evaluation of the embodiment of the present application, when carrying out credit evaluation prediction to targeted customer, can first obtain the meshPrimitive behavior data of the user in operation system are marked, and extract the primitive character in primitive behavior data, afterwards, will be originalFeature is trained by GBDT tree-model to obtain corresponding combined crosswise feature, finally, and the combined crosswise feature is putIt is predicted into credit evaluation model, draws the user profile of the targeted customer, i.e., by using both possesses higher solveThe property released carries out credit evaluation but also with high performance credit evaluation model to targeted customer so that assessment result is relatively reliable, effectFruit more preferably, improves the accuracy of assessment result.
For up to above-mentioned purpose, the trainer of the credit evaluation model of the application third aspect embodiment, including:Acquisition module,For obtaining training primitive behavior data of the training user in operation system;Extraction module, it is original for extracting the trainingTraining primitive character in behavioral data;Generation module, for training original spy to described according to iteration decision tree GBDT modelsProgress combinations of features is levied to generate corresponding training combined crosswise feature;Training module, for according to the training combined crosswiseFeature is trained to build credit evaluation model to logistic regression Logistic Regression models.
The trainer of the credit evaluation model of the embodiment of the present application, is entered by non-linear GBDT models to training primitive characterRow is trained to build corresponding training combined crosswise feature, and by training combined crosswise feature to be trained linear LR modelsTo build credit evaluation model so that the credit evaluation model had both possessed the high-performance of nonlinear model, but also with linear modelInterpretation.
For up to above-mentioned purpose, the credit evaluation described in use the application third aspect embodiment of the application fourth aspect embodimentModel carries out the device of credit evaluation, including:Acquisition module, for obtaining primitive behavior of the targeted customer in operation systemData;Extraction module, for extracting the primitive character in the primitive behavior data;Generation module, for according to GBDT mouldsType carries out combinations of features to generate corresponding combined crosswise feature to the primitive character;Prediction module, for according to the letterThe combined crosswise feature is predicted with assessment models to obtain the credit information of the targeted customer.
The device of the credit evaluation of the embodiment of the present application, can be by obtaining mould when carrying out credit evaluation prediction to targeted customerBlock obtains primitive behavior data of the targeted customer in operation system, and extraction module extracts the original spy in primitive behavior dataLevy, primitive character is trained to obtain corresponding combined crosswise feature by generation module by GBDT tree-model, predicts mouldThe combined crosswise feature is put into credit evaluation model and is predicted by block, draws the user profile of the targeted customer, that is, passes throughUsing both possess higher interpretation but also with high performance credit evaluation model to targeted customer carry out credit evaluation so thatAssessment result is relatively reliable, better, improves the accuracy of assessment result.
The aspect and advantage that the application is added will be set forth in part in the description, and partly will become bright from the following descriptionIt is aobvious, or recognized by the practice of the application.
Brief description of the drawings
The above-mentioned and/or additional aspect and advantage of the application will be apparent from description of the accompanying drawings below to embodiment is combinedBe readily appreciated that, wherein:
Fig. 1 is the flow chart of the training method of the credit evaluation model according to the application one embodiment;
Fig. 2 is the flow chart of the implementation process of the generation training combined crosswise feature according to the embodiment of the present application;
Fig. 3 is the schematic diagram that training combined crosswise feature is generated according to the utilization GBDT models of the embodiment of the present application;
Fig. 4 is the structured flowchart of the trainer of the credit evaluation model according to the application one embodiment;
Fig. 5 is the structured flowchart of the trainer of the credit evaluation model according to the application another embodiment;
Fig. 6 is the flow chart of the method for the credit evaluation according to the application one embodiment;
Fig. 7 is the structured flowchart of the device of the credit evaluation according to the application one embodiment;
Fig. 8 is the structured flowchart of the device of the credit evaluation according to the application another embodiment.
Embodiment
Embodiments herein is described below in detail, the example of the embodiment is shown in the drawings, wherein identical from beginning to endOr similar label represents same or similar element or the element with same or like function.Retouched below with reference to accompanying drawingThe embodiment stated is exemplary, it is intended to for explaining the application, and it is not intended that limitation to the application.
, it is necessary to build the credit of the user by the big data of long term accumulation user in the application scenarios of credit productModel, wherein, the sample of use is performance data of the user in credit product, as with purchase function by stagesBehavioral data in product and product with refund function after first use, the variable of use is the phase of each dimension of userClose variable, it is overdue that the target of models fitting is whether user occurs in the preset time period after drawing, and passes through fittingThe data build Credit Model.
Because credit product is the product that region be directly facing user, the interpretation to model requires higher, in practical applicationModel is often built using linear algorithm such as Logistic Regression (abbreviation LR), but linear model is complicatedDegree is relatively low compared to nonlinear model, and the performance of model can be less than nonlinear model, and the complexity of nonlinear model is higher,Model performance is preferable, but explanatory generally poor.
Therefore, present applicant proposes a kind of training method of credit evaluation model, by using nonlinear tree-model structure groupFeature is closed, linear Algorithm for Training final mask is reused, finally show that a performance is higher than purely linear model, while have againThe model of standby interpretation.Specifically, below with reference to the accompanying drawings the training side of the credit evaluation model of the embodiment of the present application is describedMethod, method and device using credit evaluation model progress credit evaluation.
Fig. 1 is the flow chart of the training method of the credit evaluation model according to the application one embodiment.As shown in figure 1, shouldThe training method of credit evaluation model can include:
S110, obtains training primitive behavior data of the training user in operation system.
Specifically, the user profile of a large amount of training users can be first obtained, afterwards, can be according to the user profile for largely training usersObtain training primitive behavior data of the training user in operation system.It should be noted that in embodiments herein,The operation system can be the system with credit product, and can also be has the system of purchase function or with energyEnough embody the system of user credit function.
Wherein, in embodiments herein, the training primitive behavior data may include but be not limited to website or webpage click rowFor (such as click on the time, number of times, frequency), training customer transaction data and behavior (as pay product information, payment,Means of payment etc.), fund incremental data and behavior (flow direction of such as fund, the funds flow amount of money) etc..
In addition, user profile may include but be not limited to user name or ID, account name etc..That is, for example working as businessWhen system carrys out the uniqueness of recognition training user by using user name, can now obtain training user user name, and according toThe user name obtains all training primitive behavior data of the training user in the operation system;When operation system is by using accountWhen name in an account book carrys out the uniqueness of recognition training user, the account name of training user can be now obtained, and instruction is obtained according to account namePractice training primitive behavior data of the user in current business system.
S120, extracts the training primitive character in training primitive behavior data.
Specifically, it can will be extracted from training primitive behavior data with the data that feature is showed, and this is had into featureThe data of performance as the training primitive behavior data training primitive character.For example, the training primitive character may include purchaseThe amount of money, web page/site number of clicks, web page/site click frequency, funds flow etc..
S130, carries out combinations of features to training primitive character according to iteration decision tree GBDT models and is handed over generating corresponding trainingPitch assemblage characteristic.
Specifically, in embodiments herein, GBDT models can be trained according to training primitive character to buildGBDT models with N tree, wherein, N is positive integer, and the N tree in GBDT models is excavated and train originalIncidence relation between feature, finally, carries out combinations of features to training primitive character according to incidence relation and is intersected with generating trainingAssemblage characteristic.
It is appreciated that GBDT is a kind of decision Tree algorithms of iteration, the algorithm is made up of many decision trees, the conclusion of all treesAdd up and do final result, for example, each tree can be gone to the residual error of K tree before being fitted, it is possible to understand that into every one treeThe result of one tree before being dependent on, therefore, between tree needs to ensure certain order.So, by many in GBDT modelsDecision tree carries out Decision Classfication to training primitive character, so as to find out the incidence relation between training primitive character, andFeature with incidence relation is combined, obtains training combined crosswise feature.
In one embodiment of the application, as shown in Fig. 2 the N tree in GBDT models excavates the original spy of trainingIncidence relation between levying, and carry out combinations of features to training primitive character to generate training combined crosswise spy according to incidence relationThe process that implements levied may include following steps:
S131, will train the corresponding sample data of primitive character to pass sequentially through N tree in GBDT models, until each sampleData divide equally the leaf node for being assigned to each tree.
S132, for each tree in GBDT models, by from the root node of each tree to leaf node paths traversedCorresponding training primitive character is combined, to generate training combined crosswise feature.
Specifically, as shown in figure 3, node of each tree in addition to leaf node in GBDT models all corresponds to a division spyDisruptive features of seeking peace value, if the value of the disruptive features of sample is more than the disruptive features value of node, will be trained in primitive characterSample be assigned to the right child node of the node, otherwise assign to left child node, lower level node similarly, until the sample falls on certainLeaf node.For example, by taking one tree as shown in Figure 3 as an example, it is assumed that sample is assigned to No. 2 leaf nodes, then is formedAssemblage characteristic is [G<=g_v&&I>I_v]=1, equivalent to punishing into two sections from g_v this value to this feature of G, I thisFrom i_v, this value punishes into two sections to feature, and two Feature Segmentations of G and I are multiplied, and draw 4 features, wherein G first paragraphsThe characteristic value being multiplied with I second segments is 1, and remaining characteristic value is 0.It is appreciated that in as shown in Figure 3, depth is 3The multiplication cross of as two features drawn is set, similarly, what the tree that depth is 4 drew is the multiplication cross of three features.
It is further appreciated that each node of tree corresponds to a disruptive features, the node one split values of correspondence divide this featureInto two parts, for example, it is assumed that this feature assigns to the left side of the node less than or equal to the sample of split values, more than split valuesSample assign to the right side of the node, then the node can be translated into a pair of 0,1 binary features, e.g., work as G<=G_split is (i.e.Node G split values) when, F_G_L=1, F_G_R=0 work as G>During G_split, F_G_L=0, F_G_R=1, it is corresponding underFeature F_I_L and F_I_R as a pair, and F_H_L and F_H_R can be also formed after node layer division, ifG<=G_split, then sample fall into left then node I, upper layer node is multiplied with the feature of lower level node, obtains two pairs of new features,F_G_L*F_I_L, F_G_L*F_I_R, if sample eventually falls into F_I_R, F_G_L*F_I_R=1, remaining assemblage characteristic is equalFor 0.So, it is combined by finding feature to be multiplied the feature for being 1, so as to obtain the combination with incidence relationFeature.It is appreciated that above-mentioned implementation is intended only as a kind of example, with for the one kind run and provided suitable for computerRepresentation, is restriction as the application without being understood that.
Thus, the association between feature is excavated by using GBDT tree-models, to build training combined crosswise feature, is carriedThe high performance in combinations of features stage, improves combinations of features efficiency.
S140, is trained to build according to training combined crosswise feature to logistic regression Logistic Regression modelsCredit evaluation model.
Specifically, after generation training combined crosswise feature, training combined crosswise feature can be used linear modelLogistic Regression (referred to as LR) model is trained to obtain credit evaluation model.
In order to improve the performance of credit evaluation model, and the accuracy of assessment result when using the model is improved, further,, can be according to training primitive character and training combined crosswise feature to Logistic in one embodiment of the applicationRegression models are trained to build credit evaluation model.Specifically, after training combined crosswise feature is obtained,Training combined crosswise feature can be put into LR models together with training primitive character and be trained, finally give credit evaluation mouldType, the model is explainable linear model.It is appreciated that the model is on the premise of interpretation is ensured, effect is moreBetter than GBDT and LR models.
The training method of the credit evaluation model of the embodiment of the present application, is carried out by non-linear GBDT models to training primitive characterTraining with build it is corresponding training combined crosswise feature, and by train combined crosswise feature to be trained linear LR models withBuild credit evaluation model so that the credit evaluation model had both possessed the high-performance of nonlinear model, but also with linear modelInterpretation.
Training method with the credit evaluation model that above-mentioned several embodiments are provided is corresponding, and a kind of embodiment of the application is also carriedFor a kind of trainer of credit evaluation model, the trainer of the credit evaluation model provided due to the embodiment of the present application with it is upperThe training method for stating the credit evaluation model that several embodiments are provided is corresponding, therefore in the training side of foregoing credit evaluation modelThe embodiment of method is also applied for the trainer of the credit evaluation model of the present embodiment offer, no longer detailed in the present embodimentDescription.Fig. 4 is the structured flowchart of the trainer of the credit evaluation model according to the application one embodiment.As shown in figure 4,The trainer of the credit evaluation model can include:Acquisition module 110, extraction module 120, generation module 130 and trainingModule 140.
Wherein, acquisition module 110 can be used for obtaining training primitive behavior data of the training user in operation system.
Extraction module 120 can be used for extracting the training primitive character in training primitive behavior data.
Generation module 130 can be used for carrying out combinations of features to training primitive character to generate according to iteration decision tree GBDT modelsCorresponding training combined crosswise feature.
Specifically, in one embodiment of the application, as shown in figure 5, on the basis of as shown in Figure 4, the generationModule 130 may include:Training unit 131 and generation unit 132.
Wherein, training unit 131 can be used for being trained iteration decision tree GBDT models according to training primitive character buildingGBDT models with N tree, wherein, N is positive integer.
The N tree that generation unit 132 can be used in GBDT models excavates the incidence relation between training primitive character, andCarry out combinations of features to training primitive character to generate training combined crosswise feature according to incidence relation.
Specifically, in embodiments herein, generation unit 132 will can train the corresponding sample data of primitive character according toN tree in the secondary model by GBDT, until each sample data divides equally the leaf node for being assigned to each tree, and for GBDTEach tree in model, by the training primitive character corresponding from the root node of each tree to leaf node paths traversedIt is combined, to generate training combined crosswise feature.
Training module 140 can be used for entering logistic regression Logistic Regression models according to training combined crosswise featureRow trains to build credit evaluation model.
In order to improve the performance of credit evaluation model, and the accuracy of assessment result when using the model is improved, further,In one embodiment of the application, training module 140 can be additionally used according to training primitive character and training combined crosswise featureLogistic Regression models are trained to build credit evaluation model.
The trainer of the credit evaluation model of the embodiment of the present application, is carried out by non-linear GBDT models to training primitive characterTraining with build it is corresponding training combined crosswise feature, and by train combined crosswise feature to be trained linear LR models withBuild credit evaluation model so that the credit evaluation model had both possessed the high-performance of nonlinear model, but also with linear modelInterpretation.
The application also proposed a kind of method of credit evaluation, and the credit described in any of the above-described embodiment can be used to comment for this methodEstimate model and credit evaluation is carried out to user.
Fig. 6 is the flow chart of the method for the credit evaluation according to the application one embodiment.As shown in fig. 6, the credit evaluationMethod can include:
S610, obtains primitive behavior data of the targeted customer in operation system.
Specifically, the user profile of targeted customer can be first obtained, afterwards, the targeted customer can be obtained according to user profile in industryPrimitive behavior data in business system.It should be noted that in embodiments herein, the operation system can be hadThe system of credit product, can also be system with purchase function or with can embody user credit functionSystem.
Wherein, in embodiments herein, the primitive behavior data may include but be not limited to website or webpage click behavior (such asClick time, number of times, frequency etc.), the transaction data of targeted customer and behavior be (as paid product information, payment, branchPay mode etc.), fund incremental data and behavior (flow direction of such as fund, the funds flow amount of money) etc..
In addition, user profile may include but be not limited to user name or ID, account name etc..That is, for example working as businessWhen system recognizes the uniqueness of targeted customer by using user name, the user name of targeted customer can be now obtained, and according toThe user name obtains all primitive behavior data of the targeted customer in the operation system;When operation system is by using account nameDuring uniqueness to recognize targeted customer, the account name of targeted customer can be now obtained, and target is obtained according to account name and is usedPrimitive behavior data of the family in current business system.
S620, extracts the primitive character in primitive behavior data.
Specifically, it can will be extracted from primitive behavior data with the data that feature is showed, and this is had into feature performanceData as the primitive behavior data primitive character.For example, the primitive character may include the purchase amount of money, web page/site pointHit number of times, web page/site click frequency, funds flow etc..
S630, carries out combinations of features to generate corresponding combined crosswise feature according to GBDT models to primitive character.
Specifically, in embodiments herein, first GBDT models can be trained according to primitive character has to buildThe GBDT models of N tree, wherein, N is positive integer, afterwards, can excavate primitive character according to the tree of N in GBDT modelsBetween incidence relation, and combinations of features is carried out to primitive character to generate combined crosswise feature according to incidence relation.
It is appreciated that GBDT is a kind of decision Tree algorithms of iteration, the algorithm is made up of many decision trees, the conclusion of all treesAdd up and do final result, for example, each tree can be gone to the residual error of K tree before being fitted, it is possible to understand that into every one treeThe result of one tree before being dependent on, therefore, between tree needs to ensure certain order.So, by many in GBDT modelsDecision tree carries out Decision Classfication to primitive character, is closed so as to find out the incidence relation between primitive character, and will haveThe feature of connection relation is combined, and obtains combined crosswise feature.
In one embodiment of the application, the process that implements of generation combined crosswise feature may include:By primitive character pairThe sample data answered passes sequentially through N tree in GBDT models, until each sample data divides equally the leaf section for being assigned to each treePoint, and for each tree in GBDT models, will be corresponding from the root node of each tree to leaf node paths traversedPrimitive character be combined, to generate combined crosswise feature.It is appreciated that the method for the credit evaluation of the embodiment of the present applicationThe side of the mode of middle generation combined crosswise feature and generation training combined crosswise feature in the training method of above-mentioned credit evaluation modelThe realization principle of formula is identical, and the realization that can refer to the above-mentioned generating mode to training combined crosswise feature is described, and is no longer gone to live in the household of one's in-laws on getting married hereinState.
S640, is predicted to obtain the credit information of targeted customer according to credit evaluation model to combined crosswise feature.
Specifically, after generation combined crosswise feature, combined crosswise feature can be trained using LR models to obtainCredit evaluation model.
, further, can be according to credit evaluation in one embodiment of the application in order to improve the accuracy of assessment resultModel is predicted to obtain the credit information of targeted customer to primitive character and combined crosswise feature.Specifically, handed overPitch after assemblage characteristic, combined crosswise feature can be put into credit evaluation model together with primitive character and carry out credit prediction, obtainedGo out it is final predict the outcome, i.e. the credit information of the targeted customer.
It should be noted that the method for the credit evaluation of the embodiment of the present application is based on using C language and across the communication association of languageMPI parallel computation frames are discussed come what is realized, better performance can be so reached.
The method of the credit evaluation of the embodiment of the present application, when carrying out credit evaluation prediction to targeted customer, can first obtain the meshPrimitive behavior data of the user in operation system are marked, and extract the primitive character in primitive behavior data, afterwards, will be originalFeature is trained by GBDT tree-model to obtain corresponding combined crosswise feature, finally, and the combined crosswise feature is putIt is predicted into credit evaluation model, draws the user profile of the targeted customer, i.e., by using both possesses higher solveThe property released carries out credit evaluation but also with high performance credit evaluation model to targeted customer so that assessment result is relatively reliable, effectFruit more preferably, improves the accuracy of assessment result.
Method with the credit evaluation that above-mentioned several embodiments are provided is corresponding, and a kind of embodiment of the application also provides a kind of letterWith the device of assessment, because the credit that the device and above-mentioned several embodiments of the credit evaluation of the embodiment of the present application offer are provided is commentedThe method estimated is corresponding, thus the method in foregoing credit evaluation embodiment be also applied for the present embodiment offer credit commentThe device estimated, is not described in detail in the present embodiment.Fig. 7 is the device of the credit evaluation according to the application one embodimentStructured flowchart.It should be noted that the device of the credit evaluation of the embodiment of the present application is by using any of the above-described embodimentDescribed credit evaluation model carries out credit evaluation to targeted customer.
As shown in fig. 7, the device of the credit evaluation can include:Acquisition module 210, extraction module 220, generation module 230With prediction module 240.
Wherein, acquisition module 210 can be used for obtaining primitive behavior data of the targeted customer in operation system.
Extraction module 220 can be used for extracting the primitive character in primitive behavior data.
Generation module 230 can be used for carrying out combinations of features to primitive character to generate corresponding combined crosswise according to GBDT modelsFeature.
Specifically, in one embodiment of the application, as shown in figure 8, on the basis of as shown in Figure 7, the generationModule 230 may include:Training unit 231 and generation unit 232.
Wherein, training unit 231 can be used for being trained GBDT models according to primitive character what is set with N to buildGBDT models, wherein, N is positive integer.
The incidence relation that generation unit 232 can be used between the N tree excavation primitive character in GBDT models, and according toIncidence relation carries out combinations of features to generate combined crosswise feature to primitive character.
Specifically, in embodiments herein, generation unit 232 can lead to the corresponding sample data of primitive character successivelyThe N tree crossed in GBDT models, until each sample data divides equally the leaf node for being assigned to each tree, and for GBDT mouldsEach tree in type, by the primitive character carry out group corresponding from the root node of each tree to leaf node paths traversedClose, to generate combined crosswise feature.
Prediction module 240 can be used for combined crosswise feature is predicted according to credit evaluation model to obtain the letter of targeted customerUse information.
In order to improve the accuracy of assessment result, further, in one embodiment of the application, prediction module 240 is alsoBelieved available for being predicted according to credit evaluation model to primitive character and combined crosswise feature with the credit for obtaining targeted customerBreath.
The device of the credit evaluation of the embodiment of the present application, can be by obtaining mould when carrying out credit evaluation prediction to targeted customerBlock obtains primitive behavior data of the targeted customer in operation system, and extraction module extracts the original spy in primitive behavior dataLevy, primitive character is trained to obtain corresponding combined crosswise feature by generation module by GBDT tree-model, predicts mouldThe combined crosswise feature is put into credit evaluation model and is predicted by block, draws the user profile of the targeted customer, that is, passes throughUsing both possess higher interpretation but also with high performance credit evaluation model to targeted customer carry out credit evaluation so thatAssessment result is relatively reliable, better, improves the accuracy of assessment result.
In the description of the present application, in the description of the present application, " multiple " are meant that at least two, such as two, threeDeng unless otherwise specifically defined.
In the description of this specification, reference term " one embodiment ", " some embodiments ", " example ", " specific example ",Or the description of " some examples " etc. means to combine specific features, structure, material or the feature that the embodiment or example are describedIt is contained at least one embodiment of the application or example.In this manual, need not to the schematic representation of above-mentioned termIdentical embodiment or example must be directed to.Moreover, specific features, structure, material or the feature of description can be with officeCombined in an appropriate manner in one or more embodiments or example.In addition, in the case of not conflicting, this areaTechnical staff can be tied the not be the same as Example or the feature of example and non-be the same as Example or example described in this specificationClose and combine.
Any process described otherwise above or method description are construed as in flow chart or herein, and expression includes oneOr more be used for executable instruction the step of realize specific logical function or process code module, fragment or part,And the scope of the preferred embodiment of the application includes other realization, wherein order that is shown or discussing can not be pressed,Including according to involved function by it is basic simultaneously in the way of or in the opposite order, carry out perform function, this should be by the application'sEmbodiment person of ordinary skill in the field is understood.
Represent in flow charts or logic and/or step described otherwise above herein, for example, being considered for realThe order list of the executable instruction of existing logic function, may be embodied in any computer-readable medium, for instructionExecution system, device or equipment (such as computer based system including the system of processor or other can be performed from instructionThe system of system, device or equipment instruction fetch and execute instruction) use, or combine these instruction execution systems, device or setIt is standby and use.For the purpose of this specification, " computer-readable medium " can any can be included, store, communicating, propagatingOr transmission procedure uses for instruction execution system, device or equipment or with reference to these instruction execution systems, device or equipmentDevice.The more specifically example (non-exhaustive list) of computer-readable medium includes following:With one or more clothThe electrical connection section (electronic installation) of line, portable computer diskette box (magnetic device), random access memory (RAM) is read-onlyMemory (ROM), erasable edit read-only storage (EPROM or flash memory), fiber device, and it is portableCompact disc read-only memory (CDROM).In addition, computer-readable medium, which can even is that, to print the paper of described program thereonOr other suitable media, because can then enter edlin, solution for example by carrying out optical scanner to paper or other mediaTranslate or handled electronically to obtain described program with other suitable methods if necessary, be then stored in computerIn memory.
It should be appreciated that each several part of the application can be realized with hardware, software, firmware or combinations thereof.In above-mentioned realityApply in mode, software that multiple steps or method can be performed in memory and by suitable instruction execution system with storage orFirmware is realized.If, and in another embodiment, can be with well known in the art for example, realized with hardwareAny one of row technology or their combination are realized:With the logic gates for realizing logic function to data-signalDiscrete logic, the application specific integrated circuit with suitable combinational logic gate circuit, programmable gate array (PGA) is existingField programmable gate array (FPGA) etc..
Those skilled in the art be appreciated that to realize all or part of step that above-described embodiment method is carried is canTo instruct the hardware of correlation to complete by program, described program can be stored in a kind of computer-readable recording medium,The program upon execution, including one or a combination set of the step of embodiment of the method.
In addition, each functional unit in the application each embodiment can be integrated in a processing module or eachIndividual unit is individually physically present, can also two or more units be integrated in a module.Above-mentioned integrated module was bothIt can be realized in the form of hardware, it would however also be possible to employ the form of software function module is realized.If the integrated module withThe form of software function module realize and as independent production marketing or in use, can also be stored in one it is computer-readableTake in storage medium.
Storage medium mentioned above can be read-only storage, disk or CD etc..Although having been shown and described aboveEmbodiments herein, it is to be understood that above-described embodiment is exemplary, it is impossible to be interpreted as the limitation to the application,One of ordinary skill in the art can be changed to above-described embodiment, change, replacing and modification within the scope of application.

Claims (16)

CN201610236701.4A2016-04-152016-04-15Training method, credit estimation method and the device of credit evaluation modelPendingCN107301577A (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN201610236701.4ACN107301577A (en)2016-04-152016-04-15Training method, credit estimation method and the device of credit evaluation model

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN201610236701.4ACN107301577A (en)2016-04-152016-04-15Training method, credit estimation method and the device of credit evaluation model

Publications (1)

Publication NumberPublication Date
CN107301577Atrue CN107301577A (en)2017-10-27

Family

ID=60137958

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN201610236701.4APendingCN107301577A (en)2016-04-152016-04-15Training method, credit estimation method and the device of credit evaluation model

Country Status (1)

CountryLink
CN (1)CN107301577A (en)

Cited By (36)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN108009914A (en)*2017-12-192018-05-08马上消费金融股份有限公司Credit risk assessment method, system, equipment and computer storage medium
CN108022146A (en)*2017-11-142018-05-11深圳市牛鼎丰科技有限公司Characteristic item processing method, device, the computer equipment of collage-credit data
CN108229549A (en)*2017-12-282018-06-29杭州大搜车汽车服务有限公司A kind of intelligent recognition car trader fits up method, electronic equipment and the storage medium of degree
CN108427658A (en)*2018-03-122018-08-21北京奇艺世纪科技有限公司A kind of data predication method, device and electronic equipment
CN108681750A (en)*2018-05-212018-10-19阿里巴巴集团控股有限公司The feature of GBDT models explains method and apparatus
CN109035003A (en)*2018-07-042018-12-18北京玖富普惠信息技术有限公司Anti- fraud model modelling approach and anti-fraud monitoring method based on machine learning
CN109447273A (en)*2018-09-302019-03-08深圳市元征科技股份有限公司Model training method, advertisement recommended method, relevant apparatus, equipment and medium
CN109754157A (en)*2018-11-302019-05-14畅捷通信息技术股份有限公司A kind of methods of marking and system for reflecting enterprise's health management, financing and increasing letter
CN109767312A (en)*2018-12-102019-05-17江西师范大学 A credit evaluation model training and evaluation method and device
WO2019128526A1 (en)*2017-12-272019-07-04阿里巴巴集团控股有限公司Method, apparatus, and device for training risk control model and risk control
CN110008349A (en)*2019-02-012019-07-12阿里巴巴集团控股有限公司The method and device for the event risk assessment that computer executes
CN110322334A (en)*2018-03-292019-10-11上海麦子资产管理集团有限公司Credit rating method and device, computer readable storage medium, terminal
CN110335134A (en)*2019-04-152019-10-15梵界信息技术(上海)股份有限公司A method of it is converted based on WOE and realizes the classification of credit customer qualification
CN110399602A (en)*2018-04-252019-11-01北京京东尚科信息技术有限公司 A method and device for evaluating text reliability
CN110443346A (en)*2019-08-122019-11-12腾讯科技(深圳)有限公司A kind of model explanation method and device based on input feature vector importance
CN110544165A (en)*2019-09-022019-12-06中诚信征信有限公司credit risk score card creating method and device and electronic equipment
CN110634060A (en)*2018-06-212019-12-31马上消费金融股份有限公司User credit risk assessment method, system, device and storage medium
CN110728301A (en)*2019-09-092020-01-24北京镭文科技有限公司Credit scoring method, device, terminal and storage medium for individual user
CN111275062A (en)*2018-12-042020-06-12北京嘀嘀无限科技发展有限公司Model training method, device, server and computer readable storage medium
CN111340121A (en)*2020-02-282020-06-26支付宝(杭州)信息技术有限公司Target feature determination method and device
CN111563810A (en)*2020-04-282020-08-21北京云从科技有限公司Credit wind control model generation method, credit evaluation system, machine-readable medium and device
CN111861750A (en)*2020-07-222020-10-30北京睿知图远科技有限公司Feature derivation system based on decision tree method and readable storage medium
CN111898675A (en)*2020-07-302020-11-06北京云从科技有限公司Credit wind control model generation method and device, scoring card generation method, machine readable medium and equipment
CN112035453A (en)*2020-08-272020-12-04平安科技(深圳)有限公司GBDT high-order feature combination-based recommendation method and device and storage medium
CN112084519A (en)*2020-09-182020-12-15支付宝(杭州)信息技术有限公司GBDT model training method and device based on access mode protection
CN112184485A (en)*2020-09-302021-01-05深圳供电局有限公司Power consumer credit assessment method
CN112270547A (en)*2020-10-272021-01-26上海淇馥信息技术有限公司Financial risk assessment method and device based on feature construction and electronic equipment
CN112363716A (en)*2020-10-132021-02-12杭州城市大数据运营有限公司Method, system and device for dynamically assembling evaluation model
CN112418520A (en)*2020-11-222021-02-26同济大学Credit card transaction risk prediction method based on federal learning
CN112598252A (en)*2020-12-162021-04-02中国建设银行股份有限公司Rating model training method and device
CN112819610A (en)*2021-02-262021-05-18重庆度小满优扬科技有限公司Credit evaluation method, credit evaluation model training method and device
WO2021098385A1 (en)*2019-11-212021-05-27支付宝(杭州)信息技术有限公司Method and apparatus for training gbdt model in trusted execution environment, and device
CN113240355A (en)*2021-07-122021-08-10贝壳找房(北京)科技有限公司Training method and evaluation method of IM service evaluation model
CN113297289A (en)*2021-04-282021-08-24上海淇玥信息技术有限公司Method and device for extracting business data from database and electronic equipment
CN113763092A (en)*2020-11-102021-12-07北京沃东天骏信息技术有限公司Method, device, equipment and computer readable medium for recommending commodities
CN115271928A (en)*2022-08-172022-11-01中国工商银行股份有限公司Credit card user value analysis method and device

Citations (3)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20030187765A1 (en)*2002-03-272003-10-02First Data CorporationSystems and methods for monitoring credit risk
CN101046876A (en)*2006-03-312007-10-03探宇科技股份有限公司Credit scoring system and method of using data mining method
CN101493913A (en)*2008-01-232009-07-29阿里巴巴集团控股有限公司Method and system for assessing user credit in internet

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20030187765A1 (en)*2002-03-272003-10-02First Data CorporationSystems and methods for monitoring credit risk
CN101046876A (en)*2006-03-312007-10-03探宇科技股份有限公司Credit scoring system and method of using data mining method
CN101493913A (en)*2008-01-232009-07-29阿里巴巴集团控股有限公司Method and system for assessing user credit in internet

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
XINRAN HE等: "Practical Lessons from Predicting Clicks on Ads at Facebook", 《KDD》*

Cited By (44)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN108022146A (en)*2017-11-142018-05-11深圳市牛鼎丰科技有限公司Characteristic item processing method, device, the computer equipment of collage-credit data
CN108009914A (en)*2017-12-192018-05-08马上消费金融股份有限公司Credit risk assessment method, system, equipment and computer storage medium
WO2019128526A1 (en)*2017-12-272019-07-04阿里巴巴集团控股有限公司Method, apparatus, and device for training risk control model and risk control
CN108229549A (en)*2017-12-282018-06-29杭州大搜车汽车服务有限公司A kind of intelligent recognition car trader fits up method, electronic equipment and the storage medium of degree
CN108427658A (en)*2018-03-122018-08-21北京奇艺世纪科技有限公司A kind of data predication method, device and electronic equipment
CN110322334A (en)*2018-03-292019-10-11上海麦子资产管理集团有限公司Credit rating method and device, computer readable storage medium, terminal
CN110399602A (en)*2018-04-252019-11-01北京京东尚科信息技术有限公司 A method and device for evaluating text reliability
CN108681750A (en)*2018-05-212018-10-19阿里巴巴集团控股有限公司The feature of GBDT models explains method and apparatus
US11205129B2 (en)2018-05-212021-12-21Advanced New Technologies Co., Ltd.GBDT model feature interpretation method and apparatus
CN110634060A (en)*2018-06-212019-12-31马上消费金融股份有限公司User credit risk assessment method, system, device and storage medium
CN109035003A (en)*2018-07-042018-12-18北京玖富普惠信息技术有限公司Anti- fraud model modelling approach and anti-fraud monitoring method based on machine learning
CN109447273A (en)*2018-09-302019-03-08深圳市元征科技股份有限公司Model training method, advertisement recommended method, relevant apparatus, equipment and medium
CN109754157A (en)*2018-11-302019-05-14畅捷通信息技术股份有限公司A kind of methods of marking and system for reflecting enterprise's health management, financing and increasing letter
CN111275062A (en)*2018-12-042020-06-12北京嘀嘀无限科技发展有限公司Model training method, device, server and computer readable storage medium
CN109767312A (en)*2018-12-102019-05-17江西师范大学 A credit evaluation model training and evaluation method and device
CN109767312B (en)*2018-12-102023-05-09江西师范大学 A credit evaluation model training, evaluation method and device
CN110008349A (en)*2019-02-012019-07-12阿里巴巴集团控股有限公司The method and device for the event risk assessment that computer executes
CN110335134A (en)*2019-04-152019-10-15梵界信息技术(上海)股份有限公司A method of it is converted based on WOE and realizes the classification of credit customer qualification
CN110443346A (en)*2019-08-122019-11-12腾讯科技(深圳)有限公司A kind of model explanation method and device based on input feature vector importance
CN110443346B (en)*2019-08-122023-05-02腾讯科技(深圳)有限公司Model interpretation method and device based on importance of input features
CN110544165A (en)*2019-09-022019-12-06中诚信征信有限公司credit risk score card creating method and device and electronic equipment
CN110544165B (en)*2019-09-022022-06-03中诚信征信有限公司Credit risk score card creating method and device and electronic equipment
CN110728301A (en)*2019-09-092020-01-24北京镭文科技有限公司Credit scoring method, device, terminal and storage medium for individual user
WO2021098385A1 (en)*2019-11-212021-05-27支付宝(杭州)信息技术有限公司Method and apparatus for training gbdt model in trusted execution environment, and device
CN111340121A (en)*2020-02-282020-06-26支付宝(杭州)信息技术有限公司Target feature determination method and device
CN111340121B (en)*2020-02-282022-04-12支付宝(杭州)信息技术有限公司Target feature determination method and device
CN111563810A (en)*2020-04-282020-08-21北京云从科技有限公司Credit wind control model generation method, credit evaluation system, machine-readable medium and device
CN111861750A (en)*2020-07-222020-10-30北京睿知图远科技有限公司Feature derivation system based on decision tree method and readable storage medium
CN111898675A (en)*2020-07-302020-11-06北京云从科技有限公司Credit wind control model generation method and device, scoring card generation method, machine readable medium and equipment
CN112035453B (en)*2020-08-272024-03-12平安科技(深圳)有限公司Recommendation method, device and storage medium based on GBDT high-order feature combination
CN112035453A (en)*2020-08-272020-12-04平安科技(深圳)有限公司GBDT high-order feature combination-based recommendation method and device and storage medium
WO2021147568A1 (en)*2020-08-272021-07-29平安科技(深圳)有限公司Gbdt high-order feature combination-based recommendation method, apparatus, and storage medium
CN112084519A (en)*2020-09-182020-12-15支付宝(杭州)信息技术有限公司GBDT model training method and device based on access mode protection
CN112184485A (en)*2020-09-302021-01-05深圳供电局有限公司Power consumer credit assessment method
CN112363716A (en)*2020-10-132021-02-12杭州城市大数据运营有限公司Method, system and device for dynamically assembling evaluation model
CN112270547A (en)*2020-10-272021-01-26上海淇馥信息技术有限公司Financial risk assessment method and device based on feature construction and electronic equipment
CN113763092A (en)*2020-11-102021-12-07北京沃东天骏信息技术有限公司Method, device, equipment and computer readable medium for recommending commodities
CN112418520B (en)*2020-11-222022-09-20同济大学 A credit card transaction risk prediction method based on federated learning
CN112418520A (en)*2020-11-222021-02-26同济大学Credit card transaction risk prediction method based on federal learning
CN112598252A (en)*2020-12-162021-04-02中国建设银行股份有限公司Rating model training method and device
CN112819610A (en)*2021-02-262021-05-18重庆度小满优扬科技有限公司Credit evaluation method, credit evaluation model training method and device
CN113297289A (en)*2021-04-282021-08-24上海淇玥信息技术有限公司Method and device for extracting business data from database and electronic equipment
CN113240355A (en)*2021-07-122021-08-10贝壳找房(北京)科技有限公司Training method and evaluation method of IM service evaluation model
CN115271928A (en)*2022-08-172022-11-01中国工商银行股份有限公司Credit card user value analysis method and device

Similar Documents

PublicationPublication DateTitle
CN107301577A (en)Training method, credit estimation method and the device of credit evaluation model
CN111080338B (en)User data processing method and device, electronic equipment and storage medium
CN110659744A (en)Training event prediction model, and method and device for evaluating operation event
CN111709244B (en)Deep learning method for identifying cause and effect relationship of contradictory dispute
CN112270547A (en)Financial risk assessment method and device based on feature construction and electronic equipment
CN112037038B (en)Bank credit risk prediction method and device
CN116861258B (en)Model processing method, device, equipment and storage medium
CN109670191A (en)Calibration optimization method, device and the electronic equipment of machine translation
CN109034201B (en)Model training and rule mining method and system
CN105447038A (en)Method and system for acquiring user characteristics
CN111199474A (en)Risk prediction method and device based on network diagram data of two parties and electronic equipment
CN111736940A (en)Business interface display method and device for intelligent counter
CN109829129A (en)A kind of clicking rate predictor method based on FFM deep neural network
CN110413307A (en)Correlating method, device and the electronic equipment of code function
CN116402352A (en)Enterprise risk prediction method and device, electronic equipment and medium
CN113159778B (en)Financial fraud detection method and device
CN115510188A (en) Text keyword association method, device, equipment and storage medium
CN110675250A (en)Credit line management method and device based on user marketing score and electronic equipment
CN114138954B (en) User consultation question recommendation method, system, computer device and storage medium
CN114691981A (en)Session recommendation method, system, device and storage medium
Li et al.Dual convolutional network based on hypergraph and multilevel feature fusion for road extraction from high-resolution remote sensing images
CN114862496A (en) A session recommendation method, device and medium based on user personalized modeling
Sarigiannidis et al.A novel lexicon-based approach in determining sentiment in financial data using learning automata
CN116976992A (en)Target object group determination method and device and computer equipment
CN116977064A (en)Wind control model construction method, system and device based on loss function

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
RJ01Rejection of invention patent application after publication

Application publication date:20171027

RJ01Rejection of invention patent application after publication

[8]ページ先頭

©2009-2025 Movatter.jp