CN107463879A


Info

Publication number: CN107463879A
Application number: CN201710544873.2A
Authority: CN
Inventors: 康青杨; 刘世林; 张学锋
Original assignee: Chengdu Business Big Data Technology Co Ltd
Current assignee: Chengdu Business Big Data Technology Co Ltd
Priority date: 2017-07-05
Filing date: 2017-07-05
Publication date: 2017-12-12

Abstract

The present invention relates to the field of CAPTCHAs, and more particularly to a human behavior recognition method based on deep learning. The method uses a neural network containing an LSTM to judge whether the entity operating the current page is a human or a machine. Machine behavior samples are constructed from human behavior samples using four generation modes: random generation; cutting positive-sample tracks at random into several sub-segments and splicing the sub-segments back together at random; proportional scaling of the track parameters of positive samples; and random perturbation of the track parameters of positive samples. Because these negative samples are derived from positive samples, they closely resemble the positive samples and are harder to tell apart from them, so a neural network trained on them has a higher discrimination ability. The method is the first to identify the operating entity of the current page with an LSTM network, and it is particularly suited to business scenarios that verify and judge the operating entity through drag-image verification.

Description

Human behavior recognition method based on deep learning

Technical field

The present invention relates to the field of CAPTCHAs, and more particularly to a human behavior recognition method based on deep learning.

Background

Network technology is increasingly advanced and network applications increasingly numerous: websites, e-mail, blogs, e-government sites and the like have become necessities of daily life. Alongside the rapid growth of the Internet, however, network security has become an increasingly prominent problem, in particular attacks such as automated registration and login by malicious programs, malicious spam posting, and brute-force cracking of accounts and passwords with dedicated programs. Preventing these attacks requires identifying whether the entity currently registering on or accessing a web page is a person or a program. The most common means is the CAPTCHA (Completely Automated Public Turing Test to Tell Computers and Humans Apart), invented in 1997 by Mark D. Lillibridge et al. Because of its great importance it is now widely used on websites, and many academic institutions and commercial companies are studying it.

Common CAPTCHAs today include those based on computer vision, covering character recognition and object recognition. These recognition problems are usually fairly easy for people but harder for computers, which allows people and programs to be told apart. Others are based on human behavior, including keystrokes and mouse movement. Such behavior follows certain characteristic patterns, and these features can be used to distinguish whether a person or a program is operating the computer. With the development of deep learning, computer vision has advanced by leaps and bounds, and the accuracy of computers at both character recognition and object recognition keeps rising. As a result, the defensive strength of the formerly most common computer-vision-based CAPTCHAs has gradually declined, and CAPTCHAs based on human behavior have begun to appear and to be widely used as a network security defense. Protecting a network by this means first requires the ability to tell human behavior from machine behavior, but traditional machine learning methods, which first extract features from the behavior and then classify it, do not reach a high accuracy. This is because some features of human behavior are deep features that are hard to extract with hand-crafted rules.

Summary of the invention

The object of the present invention is to overcome the above deficiencies of the prior art by providing a human behavior recognition method based on deep learning. A large number of machine behavior training samples are derived from human behavior samples, so that the machine behavior training samples are more similar to human behavior and the training samples cover more cases. A neural network model containing an LSTM (Long Short-Term Memory) network is trained on massive numbers of both human behavior and machine behavior training samples, and the trained model judges whether a newly input behavior is human behavior. The method is particularly suited to judging the operating entity in CAPTCHA systems that verify by dragging an image.

To achieve the above object, the human behavior recognition method based on deep learning trains a neural network model containing an LSTM network on mouse-drag movement track samples produced by humans and by machines, and uses the trained model to judge whether the entity operating the current page is a human.

The method comprises the following steps:

(1) Construct a neural network model containing an LSTM network.

(2) Obtain a massive number of human behavior samples and machine behavior samples, the machine behavior samples being generated on the basis of the human behavior samples; human behavior samples serve as positive samples and machine behavior samples as negative samples.

(3) Train the constructed neural network model with the positive and negative samples.

(4) Use the trained neural network model to judge whether the entity operating the current page is a person or a machine.

Specifically, in step (2), the first-order differences (dx_i, dy_i, dt_i) of the mouse track information are used to describe the operation behavior, where dx_i = x_i - x_{i-1}, dy_i = y_i - y_{i-1} and dt_i = t_i - t_{i-1}; x_i is the abscissa of the mouse position on the screen, y_i is its ordinate, and t_i is the time of the sample.

The positive samples used for training are mouse movement tracks recorded while people browse web pages.
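The first-order difference description above can be sketched in a few lines. This is a minimal illustration, assuming a track is a list of (x, y, t) tuples; the name `first_differences` is hypothetical and does not come from the patent.

```python
from typing import List, Tuple

Point = Tuple[float, float, float]   # (x, y, t): screen position and timestamp


def first_differences(track: List[Point]) -> List[Point]:
    """Return (dx_i, dy_i, dt_i) for each pair of consecutive track points."""
    return [
        (x1 - x0, y1 - y0, t1 - t0)
        for (x0, y0, t0), (x1, y1, t1) in zip(track, track[1:])
    ]


track = [(10, 20, 0), (13, 22, 16), (19, 21, 33)]
print(first_differences(track))  # [(3, 2, 16), (6, -1, 17)]
```

A track of n points yields n - 1 differences, so a single-point track produces an empty description.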

Further, in step (2), the negative samples are generated by machine in the following ways:

a. Within a set maximum range, randomly generate a track length and randomly generate the (dx_i, dy_i, dt_i) values.

b. Draw N tracks from the positive sample set, randomly split each drawn track into n sub-segments, and randomly combine and splice the resulting sub-segments into new tracks.

c. Draw M tracks from the positive sample set. For each track, compute the total horizontal distance sum(dx_i), the total vertical distance sum(dy_i) and the total movement time sum(dt_i). Randomly generate a new total horizontal distance sum(dx_i)', total vertical distance sum(dy_i)' and total movement time sum(dt_i)', and generate the parameters of the new track with the following equations:

$$dx_i' = dx_i \cdot \frac{\mathrm{sum}(dx_i)'}{\mathrm{sum}(dx_i)}$$

$$dy_i' = dy_i \cdot \frac{\mathrm{sum}(dy_i)'}{\mathrm{sum}(dy_i)}$$

$$dt_i' = dt_i \cdot \frac{\mathrm{sum}(dt_i)'}{\mathrm{sum}(dt_i)}$$

where dx_i', dy_i' and dt_i' are the first-order differences of the horizontal coordinate, the vertical coordinate and the time of the new track.

d. Draw K tracks from the positive sample set and apply to each dx_i, dy_i and dt_i a random perturbation of [-0.5, 0.5] times its value to obtain the parameters of a new track.

The samples generated by modes a, b, c and d are mixed to form the negative sample set.
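The four generation modes can be sketched as follows. This is an illustrative sketch only, not the patented implementation: all function names are hypothetical, the value limits passed to mode a are arbitrary, mode b is read as "draw n sub-segments at random from the pool and join them", and mode d is read as multiplying each value by 1 + r with r drawn uniformly from [-0.5, 0.5].

```python
import random
from typing import List, Sequence, Tuple

Step = Tuple[float, float, float]   # one (dx, dy, dt) first-order difference
Track = List[Step]


def random_track(max_len: int, max_shift: float, max_dt: float,
                 rng: random.Random) -> Track:
    """Mode a: random length and random steps within caller-chosen limits."""
    length = rng.randint(1, max_len)
    return [(rng.uniform(-max_shift, max_shift),
             rng.uniform(-max_shift, max_shift),
             rng.uniform(0.0, max_dt)) for _ in range(length)]


def split_track(track: Track, n: int, rng: random.Random) -> List[Track]:
    """Mode b, part 1: cut one track into at most n contiguous sub-segments."""
    n = max(1, min(n, len(track)))
    cuts = sorted(rng.sample(range(1, len(track)), n - 1))
    bounds = [0] + cuts + [len(track)]
    return [track[a:b] for a, b in zip(bounds, bounds[1:])]


def splice_track(pool: Sequence[Track], n: int, rng: random.Random) -> Track:
    """Mode b, part 2: join n sub-segments drawn at random from the pool."""
    return [step for _ in range(n) for step in rng.choice(pool)]


def rescale_track(track: Track, new_sums: Step) -> Track:
    """Mode c: scale each column so its sum becomes the matching new_sums entry."""
    old_sums = [sum(step[j] for step in track) for j in range(3)]
    # A zero column sum cannot be rescaled; leave that column unchanged.
    factors = [new / old if old else 1.0 for new, old in zip(new_sums, old_sums)]
    return [(dx * factors[0], dy * factors[1], dt * factors[2])
            for dx, dy, dt in track]


def perturb_track(track: Track, rng: random.Random) -> Track:
    """Mode d: change every value by a random fraction in [-0.5, 0.5] of itself."""
    return [tuple(v * (1.0 + rng.uniform(-0.5, 0.5)) for v in step)
            for step in track]


rng = random.Random(0)
human = [(3.0, 1.0, 16.0), (5.0, -2.0, 17.0), (4.0, 0.0, 15.0), (2.0, 1.0, 18.0)]
pool = split_track(human, 2, rng)
negatives = [
    random_track(100, 50.0, 200.0, rng),
    splice_track(pool, 2, rng),
    rescale_track(human, (28.0, 0.0, 132.0)),
    perturb_track(human, rng),
]
print([len(t) for t in negatives])
```

Modes b, c and d all start from a human track, which is why their output stays close to the positive samples; only mode a ignores the positive set entirely.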

Further, in step (3), during training of the neural network model the forward propagation of the signal is as follows:

The (dx_i, dy_i, dt_i) signal of a training sample is fed into the embedding layer, which converts dx_i, dy_i and dt_i each into a corresponding m-dimensional vector and concatenates the three into one 3m-dimensional vector. These vectors are input in the time order of the track into an LSTM network whose hidden layer dimension is set to 3m. The LSTM network outputs a vector of dimension 3m*L to the fully connected layer, and the softmax layer outputs the judgment of whether the track is human behavior, where L is the set track length.

Further, before the neural network model is trained, the data are preprocessed as follows:

In step (2), the first 100 points (x_i, y_i, t_i) are used as the parameters describing a mouse movement track; when the mouse moves only a short distance and fewer than 100 points are available, the track is padded with (0, 0, 0) up to 100, that is, L = 100.

In step (2), the values of dx_i and dy_i are converted to integers in [-49, 50] and the value of dt_i to an integer in (0, 200]; a value outside the set range is replaced by the nearest boundary value.

Further, the positive sample set contains no fewer than 5000 samples, of which 80% are selected as training samples and 20% as test samples.

Further, the negative sample set contains no fewer than 5000 samples, of which 80% are selected as training samples and 20% as test samples.
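The 80%/20% division can be sketched as below. The shuffle before splitting and the fixed seed are assumptions made for the illustration (the text only gives the proportions), and `split_samples` is a hypothetical name.

```python
import random


def split_samples(samples, train_fraction=0.8, seed=0):
    """Shuffle a sample set and split it into training and test parts."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


train, test = split_samples(range(5000))
print(len(train), len(test))  # 4000 1000
```

The same function would be applied separately to the positive set and to the negative set, so that both classes keep the same proportion in training and test data.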

Compared with the prior art, the beneficial effects of the present invention are as follows. The invention provides a human behavior recognition method based on deep learning that uses a neural network containing an LSTM to judge the entity operating the current page. Machine behavior samples are constructed from human behavior samples using four generation modes: random generation; cutting positive-sample tracks at random into sub-segments and splicing them back together at random; proportional scaling of the track parameters of positive samples; and perturbation of those track parameters. Negative samples produced this way are derived from positive samples, resemble them more closely and are harder to tell apart from them, so a neural network trained on such samples has a higher discrimination ability. The method is the first to identify the operating entity of the current page with an LSTM network. An LSTM network is a recurrent neural network suited to processing and predicting important events separated by relatively long intervals and delays in a time series. The method is particularly suited to business scenarios that verify and judge the operating entity through drag-image verification.

Brief description of the drawings

Fig. 1 is a schematic diagram of the steps of the human behavior recognition method based on deep learning.

Detailed description of the embodiments

The present invention is described in further detail below with reference to test examples and embodiments. This should not be understood as limiting the scope of the above subject matter of the invention to the following embodiments; all techniques realized on the basis of the content of the present invention fall within its scope.

The object of the present invention is to overcome the above deficiencies of the prior art by providing a human behavior recognition method based on deep learning. The method trains a neural network model containing an LSTM network on mouse-drag movement track samples produced by humans and by machines, and uses the trained model to judge whether the entity operating the current page is a human.

As shown in Fig. 1, the method comprises the following steps:

(1) Construct a neural network model containing an LSTM network. The model comprises an embedding layer, an LSTM, a fully connected layer and a softmax layer. The embedding layer converts the discrete input signals into continuous real-valued vectors, which are fed into the LSTM in time order. The LSTM's output vectors for the time steps describing the operation behavior are concatenated into one high-dimensional vector and fed into the fully connected layer, and the vector reduced in dimension by the fully connected layer is fed into the softmax layer. The LSTM network used by the method is a recurrent neural network suited to processing and predicting important events separated by relatively long intervals and delays in a time series. The LSTM differs from a plain RNN mainly in that it adds a "processor" that judges whether information is useful; the structure that plays this role is called a cell. Each cell contains three gates: the input gate, the forget gate and the output gate. When information enters the LSTM network, it is judged to be useful or not according to rules. Only information that passes this check is retained; the rest is discarded through the forget gate. Processing information selectively in this way is more efficient and addresses the long-sequence dependency problem that plain RNNs handle poorly.

(2) Obtain a massive number of human behavior samples and machine behavior samples, with human behavior samples as positive samples and machine behavior samples as negative samples. The positive sample set contains no fewer than 5000 samples, of which 80% are selected as training samples and 20% as test samples. The negative sample set contains no fewer than 5000 samples, of which 80% are selected as training samples and 20% as test samples.

(3) Train the constructed neural network model with the positive and negative samples. Training uses the forward and backward propagation algorithm. Once the accuracy on the test sample set reaches the set threshold, training of the neural network model can be considered complete.
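The cell with its input, forget and output gates described in step (1) can be illustrated with the standard LSTM update equations. The sketch below is a single-unit cell with scalar input and state, written only to show how the gates interact; the weight layout and the name `lstm_step` are assumptions of this illustration, and a real model would use vector-valued states.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def lstm_step(x, h_prev, c_prev, weights):
    """One time step of a single-unit LSTM cell.

    weights maps a gate name to (w_x, w_h, bias).
    """
    def gate(name, activation):
        w_x, w_h, bias = weights[name]
        return activation(w_x * x + w_h * h_prev + bias)

    i = gate("input", sigmoid)        # how much new information to admit
    f = gate("forget", sigmoid)       # how much of the old cell state to keep
    o = gate("output", sigmoid)       # how much of the cell state to expose
    g = gate("candidate", math.tanh)  # proposed new information
    c = f * c_prev + i * g            # updated cell state
    h = o * math.tanh(c)              # updated hidden state (the step output)
    return h, c


weights = {name: (0.5, 0.1, 0.0)
           for name in ("input", "forget", "output", "candidate")}
h, c = 0.0, 0.0
for x in [0.2, -0.4, 0.9]:            # inputs are consumed in time order
    h, c = lstm_step(x, h, c, weights)
print(h, c)
```

The forget gate multiplies the previous cell state, so a value near 0 discards old information and a value near 1 carries it across many steps, which is what lets the network handle long sequences.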

Specifically, in step (2), the first-order differences (dx_i, dy_i, dt_i) of the mouse movement track information are used to describe the operation behavior, where dx_i = x_i - x_{i-1}, dy_i = y_i - y_{i-1} and dt_i = t_i - t_{i-1}; x_i is the abscissa of the mouse position on the screen, y_i is its ordinate, and t_i is the time of the sample. The positive samples used for training are mouse movement tracks recorded while people browse web pages. Mouse movement track information is easy to collect with front-end functions of the web page: while the mouse is dragged, such a function returns the position of the mouse cursor on the screen together with the time, in the form (x₁, y₁, t₁), (x₂, y₂, t₂), (x₃, y₃, t₃) ... (x_n, y_n, t_n). The invention describes the operation behavior by the first-order differences (dx_i, dy_i, dt_i) of this information, which reflect, for each small time interval of the movement, the horizontal and vertical speed and the horizontal and vertical displacement of the mouse, and thus capture fine-grained characteristics of how the operator acts.

Training a neural network requires a massive number of training samples. Collecting human behavior training samples manually and exclusively for this purpose would consume a great deal of manpower. At the present stage, human behavior is instead collected by placing image verification on web pages: volunteers drag these CAPTCHAs to pass verification when logging in or browsing, which accumulates samples quickly and improves efficiency.

b. Draw N tracks, for example 2000, from the positive samples, randomly split each drawn track into n (for example 3 to 10) sub-segments, and randomly combine and splice the thousands of sub-segments so formed into new tracks.

c. Draw M tracks from the positive samples. For each track, compute the total horizontal distance sum(dx_i), the total vertical distance sum(dy_i) and the total movement time sum(dt_i). Randomly generate a new total horizontal distance sum(dx_i)', total vertical distance sum(dy_i)' and total movement time sum(dt_i)', and generate the parameters of the new track with the following equations:

$$dx_i' = dx_i \cdot \frac{\mathrm{sum}(dx_i)'}{\mathrm{sum}(dx_i)}$$

$$dy_i' = dy_i \cdot \frac{\mathrm{sum}(dy_i)'}{\mathrm{sum}(dy_i)}$$

$$dt_i' = dt_i \cdot \frac{\mathrm{sum}(dt_i)'}{\mathrm{sum}(dt_i)}$$

The negative samples generated by modes a, b, c and d make full use of the features of the positive samples. Compared with samples generated purely at random, they imitate human behavior more closely, so a neural network trained on them has a higher discrimination ability.

2500 samples are selected from those generated by each of modes a, b, c and d to form the negative sample set. Compared with negative samples produced by a single mode, a negative sample set containing samples from all four modes has broader coverage.

In step (2), the first 100 points (x_i, y_i, t_i) are used as the parameters describing a mouse movement track; when the mouse moves only a short distance and fewer than 100 points are available, the track is padded with (0, 0, 0) up to 100. Tracks dragged by hand may differ in length: in the track information (x₁, y₁, t₁), (x₂, y₂, t₂), (x₃, y₃, t₃) ... (x_n, y_n, t_n), n may be greater or less than 100, so tracks used for training and classification are set to a uniform length to meet the requirements of the neural network parameter settings. Each track has the form [(dx₁, dy₁, dt₁), (dx₂, dy₂, dt₂), ..., (dx₁₀₀, dy₁₀₀, dt₁₀₀)]; if the track is shorter than 100, it is filled with zeros, i.e. [(dx₁, dy₁, dt₁), (dx₂, dy₂, dt₂), ..., (0, 0, 0)].
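Bringing every track to the uniform length of 100 amounts to truncating long tracks and zero-padding short ones. A minimal sketch, with the hypothetical helper name `fix_length`:

```python
def fix_length(steps, length=100, pad=(0, 0, 0)):
    """Keep the first `length` steps and pad shorter tracks with `pad`."""
    kept = list(steps)[:length]
    return kept + [pad] * (length - len(kept))


short = fix_length([(3, 2, 16), (6, -1, 17)])
long_ = fix_length([(1, 1, 10)] * 150)
print(len(short), short[-1], len(long_))  # 100 (0, 0, 0) 100
```

Padding at the end keeps the real steps in their original time order, which matters because the LSTM consumes the sequence step by step.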

To simplify analysis, the values of dx_i and dy_i are converted to integers in [-49, 50] and the value of dt_i to an integer in (0, 200]; a value outside the set range is replaced by the nearest boundary value. Converting dx_i, dy_i and dt_i to integers simplifies computation. For example, if the actual value of (dx_i, dy_i, dt_i) is (60.0, -75.3, 300.3), then after integer conversion and range limiting it becomes (50, -49, 200).
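The integer conversion and range limiting can be sketched as below. Rounding to the nearest integer is an assumption (the text does not say how fractional values become integers), the names `clamp` and `quantize_step` are hypothetical, and the bounds follow the stated ranges: [-49, 50] for dx and dy, and 1 to 200 for dt, the integers in (0, 200]. With a lower bound of -49, a value of -75.3 maps to -49.

```python
def clamp(value, low, high):
    """Replace a value outside [low, high] with the nearest boundary."""
    return max(low, min(high, value))


def quantize_step(dx, dy, dt):
    """Convert one (dx, dy, dt) step to bounded integers."""
    return (
        clamp(round(dx), -49, 50),
        clamp(round(dy), -49, 50),
        clamp(round(dt), 1, 200),
    )


print(quantize_step(60.0, -75.3, 300.3))  # (50, -49, 200)
```

Bounding the values this way gives each component a small, fixed set of possible integers, which is what allows them to be used as indices into an embedding table.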

Further, in step (3), during training of the neural network model the forward propagation of the signal is as follows. The (dx_i, dy_i, dt_i) signal of a training sample is fed into the embedding layer, which converts the discrete (dx, dy, dt) into continuous real-valued vectors. The implementation is: set dx = dx + 50 and dy = dy + 50, shifting the range of dx and dy to [1, 100]. The embedding first converts dx, dy and dt each into a 101-dimensional one-hot vector and then multiplies each by a lookup_table matrix of shape [101, 10], embedding dx, dy and dt each as a 10-dimensional vector. The three vectors are then concatenated into a 30-dimensional vector that serves as the input to the LSTM.
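Multiplying a one-hot vector by a lookup table simply selects one row of the table, which the sketch below demonstrates before concatenating the three 10-dimensional rows into a 30-dimensional input. The random table values stand in for learned weights, and the function names are hypothetical. One further assumption is flagged in the code: the text gives 101 rows for all three tables, but dt can reach 200, so the sketch sizes the dt table at 201 rows.

```python
import random

DIM = 10        # embedding width per component, as in the text
ROWS_XY = 101   # indices 0..100 for the shifted dx and dy
ROWS_T = 201    # assumption: dt may reach 200, so 201 rows are used here


def make_table(rows, rng):
    """Random stand-in for a learned lookup table of shape [rows, DIM]."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(DIM)] for _ in range(rows)]


def one_hot(index, size):
    return [1.0 if k == index else 0.0 for k in range(size)]


def one_hot_times_table(index, table):
    """Multiply a one-hot row vector by the lookup table."""
    hot = one_hot(index, len(table))
    return [sum(h * row[c] for h, row in zip(hot, table)) for c in range(DIM)]


def embed_step(dx, dy, dt, tables):
    """Shift dx and dy by 50, look up one row per component, concatenate."""
    table_x, table_y, table_t = tables
    return table_x[dx + 50] + table_y[dy + 50] + table_t[dt]


rng = random.Random(0)
tables = (make_table(ROWS_XY, rng), make_table(ROWS_XY, rng), make_table(ROWS_T, rng))
vector = embed_step(-49, 50, 200, tables)
print(len(vector))  # 30
```

Because the one-hot product equals a row lookup, implementations normally index the table directly instead of building the one-hot vector.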

The 30-dimensional vectors are input in time order into an LSTM network whose hidden layer dimension is set to 30. The LSTM network outputs a 3000-dimensional representation vector of the track (30 dimensions for each of the 100 time steps) to the fully connected layer, and the softmax layer outputs the judgment of whether the track is human behavior.
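The final stage can be sketched as flattening the 100 per-step outputs of 30 dimensions into one 3000-dimensional vector, applying a fully connected layer with two outputs, and normalizing with softmax. The weights below are arbitrary placeholders, treating index 1 as the "human" class is an assumption, and `classify` is a hypothetical name.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify(lstm_outputs, weights, biases):
    """Flatten per-step LSTM outputs, apply a linear layer, then softmax."""
    flat = [v for step in lstm_outputs for v in step]   # 100 x 30 -> 3000
    logits = [sum(w * v for w, v in zip(row, flat)) + b
              for row, b in zip(weights, biases)]
    return softmax(logits)


outputs = [[0.1] * 30 for _ in range(100)]       # placeholder LSTM outputs
weights = [[0.0] * 3000, [0.01] * 3000]          # placeholder layer weights
probs = classify(outputs, weights, [0.0, 0.0])
print(probs)   # two probabilities that sum to 1; index 1 assumed to mean human
```

The fully connected layer is where the 3000-dimensional representation is reduced to one score per class before softmax turns the scores into probabilities.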

The neural network is trained with the classic forward and backward propagation algorithm: when the judgment deviates from the label of the training sample, the signal is propagated backward through the loss function and the weight coefficients are adjusted layer by layer.

The method uses cross-entropy loss as the loss function and optimizes it by stochastic gradient descent. The final model reaches a classification accuracy of 95% on the validation sample set.
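Cross-entropy loss and a gradient descent update can be illustrated on a toy case. In this sketch the two logits themselves are treated as the trainable parameters, which is a simplification made for the illustration; in the actual network the same gradient would be propagated further back into the layer weights. The learning rate and function names are assumptions.

```python
import math


def softmax(logits):
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, label):
    """Negative log-probability assigned to the correct class."""
    return -math.log(probs[label])


def sgd_step(logits, label, learning_rate):
    """One descent step; d(loss)/d(logit_k) = prob_k - 1 if k is the label, else prob_k."""
    probs = softmax(logits)
    grads = [p - (1.0 if k == label else 0.0) for k, p in enumerate(probs)]
    return [z - learning_rate * g for z, g in zip(logits, grads)]


logits, label = [0.0, 0.0], 1
for _ in range(5):
    print(round(cross_entropy(softmax(logits), label), 4))
    logits = sgd_step(logits, label, learning_rate=0.5)
```

Each step lowers the loss on this sample, because the update raises the logit of the labeled class and lowers the other.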

Claims

1. A human behavior recognition method based on deep learning, characterized in that a neural network model containing an LSTM network is trained on mouse movement track training samples produced by human behavior and machine behavior, and the trained neural network model is used to judge whether the entity operating the current page is a human.

2. The method of claim 1, characterized in that it comprises the following steps:

(1) constructing a neural network model containing an LSTM network;

(2) obtaining a massive number of human behavior samples and machine behavior samples, wherein the machine behavior samples are generated on the basis of the human behavior samples, the human behavior samples serving as positive samples and the machine behavior samples as negative samples;

(4) judging, by the trained neural network model, whether the entity operating the current page is a person or a machine.

3. The method of claim 1, characterized in that in step (2), the first-order differences (dx_i, dy_i, dt_i) of the mouse track information are used to describe the operation behavior, where dx_i = x_i - x_{i-1}, dy_i = y_i - y_{i-1} and dt_i = t_i - t_{i-1}; x_i is the abscissa of the mouse position on the screen, y_i is its ordinate, and t_i is the time of the sample.

4. The method of claim 3, characterized in that in step (2), the negative samples are generated by machine in the following ways:

b. drawing N tracks from the positive sample set, randomly splitting each drawn track into n sub-segments, and randomly combining and splicing the resulting sub-segments into new tracks;

c. drawing M tracks from the positive samples; computing for each track the total horizontal distance sum(dx_i), the total vertical distance sum(dy_i) and the total movement time sum(dt_i); randomly generating a new total horizontal distance sum(dx_i)', total vertical distance sum(dy_i)' and total movement time sum(dt_i)'; and generating the parameters (dx_i', dy_i', dt_i') of the new track with the following equations:

$$dx_i' = dx_i \cdot \frac{\mathrm{sum}(dx_i)'}{\mathrm{sum}(dx_i)}$$

$$dy_i' = dy_i \cdot \frac{\mathrm{sum}(dy_i)'}{\mathrm{sum}(dy_i)}$$

$$dt_i' = dt_i \cdot \frac{\mathrm{sum}(dt_i)'}{\mathrm{sum}(dt_i)}$$

d. drawing K tracks from the positive samples and applying to the dx_i, dy_i and dt_i of each track a random perturbation of [-0.5, 0.5] times its value to obtain the parameters of a new track.

5. The method of claim 4, characterized in that the neural network used in step (1) comprises the following structure: an embedding layer, an LSTM, a fully connected layer and a softmax layer.

6. The method of claim 5, characterized in that in step (3), during training or prediction with the neural network model, the forward propagation of the signal is as follows:

the (dx_i, dy_i, dt_i) signal of a training sample is fed into the embedding layer, which converts dx_i, dy_i and dt_i each into a corresponding m-dimensional vector and concatenates the three into one 3m-dimensional vector; the 3m-dimensional vectors are input in time order into the LSTM network; the LSTM network outputs a representation vector of the track of dimension 3m*L to the fully connected layer; and the softmax layer outputs the judgment of whether the track is human behavior.

7. The method of claim 6, characterized in that in step (2), the first 100 points (x_i, y_i, t_i) are used as the parameters describing a mouse movement track; when the mouse moves only a short distance and fewer than 100 points are available, the track is padded with (0, 0, 0) up to 100.

8. The method of claim 7, characterized in that in step (2), the values of dx_i and dy_i are converted to integers in [-49, 50] and the value of dt_i to an integer in (0, 200].

9. The method of claim 8, characterized in that the positive sample set contains no fewer than 5000 samples, of which 80% are selected as training samples and 20% as test samples.

10. The method of claim 9, characterized in that the negative sample set contains no fewer than 5000 samples, of which 80% are selected as training samples and 20% as test samples.