Movatterモバイル変換


[0]ホーム

URL:


CN108268429A - The determining method and apparatus of online literature chapters and sections - Google Patents

The determining method and apparatus of online literature chapters and sections
Download PDF

Info

Publication number
CN108268429A
CN108268429ACN201710452914.5ACN201710452914ACN108268429ACN 108268429 ACN108268429 ACN 108268429ACN 201710452914 ACN201710452914 ACN 201710452914ACN 108268429 ACN108268429 ACN 108268429A
Authority
CN
China
Prior art keywords
chapters
sections
chapter
acyclic
established
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710452914.5A
Other languages
Chinese (zh)
Other versions
CN108268429B (en
Inventor
庞培宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba China Co Ltd
Original Assignee
Guangdong Shenma Search Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Shenma Search Technology Co LtdfiledCriticalGuangdong Shenma Search Technology Co Ltd
Priority to CN201710452914.5ApriorityCriticalpatent/CN108268429B/en
Publication of CN108268429ApublicationCriticalpatent/CN108268429A/en
Application grantedgrantedCritical
Publication of CN108268429BpublicationCriticalpatent/CN108268429B/en
Activelegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Classifications

Landscapes

Abstract

The application provides a kind of determining method and apparatus of online literature chapters and sections, and the sequence of the chapters and sections by obtaining the N number of first candidate website establishes acyclic Markov chain model, according to the acyclic Markov chain model, obtains the subsequence of the maximum weight in chain model;The corresponding chapters and sections sequence of subsequence for determining the maximum weight is credible chapters and sections sequence, the last one node is newest chapters and sections.I.e. by the sequence of the chapters and sections of multiple candidate websites, the acyclic Markov model of Weighted Coefficients is established, the subsequence of maximum weights is obtained, so that it is determined that most believable chapters and sections sequence and newest chapters and sections, improve user experience.

Description

The determining method and apparatus of online literature chapters and sections
Technical field
This application involves computer networking technology more particularly to a kind of determining method and apparatus of online literature chapters and sections.
Background technology
Online literature is the literary works delivered using network as carrier, in general, being updated in a manner of online updating newestChapters and sections.
The quality for the online literature that different websites provides is different, such as:The online literature that certain site provides mayThere are following situations, such as:Situations such as chapters and sections entanglement, network directory do not update or steal chapter, in the prior art, user are wanted to readDuring the certain chapters and sections of online literature, the title of online literature is inputted, website shows all search according to the search term of the input of userHitch fruit.
However, using the method for the prior art, the quality of the search result provided to the user is irregular, and user is difficult to pointDistinguish the with a high credibility of chapters and sections in which search result, user experience is not high.
Invention content
The application provides a kind of determining method and apparatus of online literature chapters and sections, improves the search result that provides to the userQuality improves user experience.
In a first aspect, the application provides a kind of determining method of online literature chapters and sections, including:
The sequence of the chapters and sections of the N number of first candidate website is obtained, the N is the integer more than or equal to 2;
According to the sequence of the chapters and sections of the described N number of first candidate website, acyclic Markov chain model is established, wherein, it is describedThe node of acyclic Markov chain model determines that the directed edge of the acyclic Markov chain model is according to institute according to the chapters and sectionsThe sequencing stated between chapters and sections determines that the weights of the directed edge are according to the secondary of the corresponding chapters and sections sequence appearance of the directed edgeNumber determines;
According to the acyclic Markov chain model, the subsequence of maximum weight is obtained;
The corresponding chapters and sections sequence of subsequence for determining the maximum weight is credible chapters and sections sequence.
Optionally, this method further includes:
The corresponding chapters and sections of end node for determining the subsequence of the maximum weight are the newest chapter of the online literatureSection.
Optionally, it before the sequence of the chapters and sections for obtaining the N number of first candidate website, further includes:
According to the number of the chapters and sections of M second candidate website, the described N number of first candidate website is determined, wherein, the M isInteger more than N.
Optionally, the number of the chapters and sections according to M second candidate website determines the described N number of first candidate website, packetIt includes:
According to pi=| si- u |/δ obtains the score value of each second candidate website;It obtains score value and is more than DmaxIt is second candidateWebsite is the described N number of first candidate website, wherein, i is the integer less than or equal to M, the s more than or equal to 1iIt is i-th secondThe number of the chapters and sections of candidate website, u are the average of the number of the chapters and sections of the candidate websites of the M second, and δ is the M theThe standard deviation of the number of the chapters and sections of two candidate websites, it is describedThe n is equal to M.
Optionally, the sequence of the chapters and sections for obtaining the N number of first candidate website, including:
The sequence of the L chapters and sections reciprocal of the described N number of first candidate website is obtained, the L is the integer more than or equal to 2.
Optionally, the sequence of the chapters and sections according to the described N number of first candidate website, establishes acyclic Markov Chain mouldType, including:
The chapters and sections of each first candidate website of the described N number of first candidate website are merged into successively, acyclic Ma Er has been establishedIn section's husband's chain;
Wherein, the chapters and sections of each first candidate website are merged into and acyclic Markov Chain has been established include:
According to the sequence of the sequence of the chapters and sections in the first candidate website, each chapters and sections are merged into according to preset rules successivelyIt has been established in acyclic Markov Chain, wherein, it is described that markovian initial value has been established as sky;
Wherein, described be merged into each chapters and sections according to preset rules is had been established in acyclic Markov Chain, including:
Determine that first chapters and sections whether there is in acyclic Markov Chain has been established, if first chapters and sections are not depositedIt is then adding first chapters and sections and is being had been established in acyclic Markov Chain to described;Determine that nothing has been established described in second chapterIt whether there is in ring Markov Chain, if the second chapter is not present, adds the second chapter and nothing has been established described inIn ring Markov Chain, and first chapters and sections are established to the directed edge of the second chapter;If the second chapter exists,First chapters and sections are established to the directed edge of the second chapter;
If first chapters and sections exist, it is determined that whether second chapter is deposited in described have been established in acyclic Markov ChainIf the second chapter is not present, is adding the second chapter and had been established in acyclic Markov Chain, and establish to describedFirst chapters and sections are to the directed edge of the second chapter;If the second chapter exists, it is determined that first chapters and sections to instituteThe directed edge for stating second chapter whether there is, if the directed edge of the first chapters and sections to the second chapter is not present, it is determined thatIf establish first chapters and sections to the second chapter directed edge whether with existing directed edge formed loop, if not forming ringFirst chapters and sections are then established to the directed edge of the second chapter in road;If the first chapters and sections having to the second chapterExist to side, then the weighted value of directed edge of first chapters and sections to the second chapter is increased into a unit, wherein, it is describedFirst chapters and sections are chapters and sections to be combined, and the second chapter is next chapters and sections of first chapters and sections.
Second aspect, the application provide a kind of determining device of online literature chapters and sections, including:
Acquisition module, for obtaining the sequence of the chapters and sections of the N number of first candidate website, the N is the integer more than or equal to 2;
Processing module for the sequence of the chapters and sections according to the described N number of first candidate website, establishes acyclic Markov Chain mouldType, wherein, the node of the acyclic Markov chain model is determined according to the chapters and sections, the acyclic Markov chain modelDirected edge determines that the weights of the directed edge are according to the corresponding chapters and sections of the directed edge according to the sequencing between the chapters and sectionsThe number that sequence occurs determines;
The processing module is additionally operable to, according to the acyclic Markov chain model, obtain the subsequence of maximum weight;
Output module, for determining that the corresponding chapters and sections sequence of the subsequence of the maximum weight is credible chapters and sections sequence.
Optionally, the output module is additionally operable to determine the corresponding chapter of end node of the subsequence of the maximum weightSave the newest chapters and sections for the online literature.
Optionally, the processing module is additionally operable to the number of the chapters and sections according to M second candidate website, determines described N number ofFirst candidate website, wherein, the M is the integer more than N.
Optionally, the processing module is specifically used for
According to pi=| si- u |/δ obtains the score value of each second candidate website;It obtains score value and is more than DmaxIt is second candidateWebsite is the described N number of first candidate website, wherein, i is the integer less than or equal to M, the s more than or equal to 1iIt is i-th secondThe number of the chapters and sections of candidate website, u are the average of the number of the chapters and sections of the candidate websites of the M second, and δ is the M theThe standard deviation of the number of the chapters and sections of two candidate websites, it is describedThe n is equal to M.
Optionally, the acquisition module is specifically used for obtaining the sequence of the L chapters and sections reciprocal of the described N number of first candidate websiteRow, the L are the integer more than or equal to 2.
Optionally, the processing module is specifically used for each first candidate stations of the described N number of first candidate website successivelyThe chapters and sections of point, which are merged into, to be had been established in acyclic Markov Chain;Wherein, the chapters and sections of each first candidate website are merged into builtAcyclic Markov Chain is found to include:
According to the sequence of the sequence of the chapters and sections in the first candidate website, each chapters and sections are merged into according to preset rules successivelyIt has been established in acyclic Markov Chain, wherein, it is described that markovian initial value has been established as sky;
Wherein, described be merged into each chapters and sections according to preset rules is had been established in acyclic Markov Chain, including:
Determine that first chapters and sections whether there is in acyclic Markov Chain has been established, if first chapters and sections are not depositedIt is then adding first chapters and sections and is being had been established in acyclic Markov Chain to described;Determine that nothing has been established described in second chapterIt whether there is in ring Markov Chain, if the second chapter is not present, adds the second chapter and nothing has been established described inIn ring Markov Chain, and first chapters and sections are established to the directed edge of the second chapter;If the second chapter exists,First chapters and sections are established to the directed edge of the second chapter;
If first chapters and sections exist, it is determined that whether second chapter is deposited in described have been established in acyclic Markov ChainIf the second chapter is not present, is adding the second chapter and had been established in acyclic Markov Chain, and establish to describedFirst chapters and sections are to the directed edge of the second chapter;If the second chapter exists, it is determined that first chapters and sections to instituteThe directed edge for stating second chapter whether there is, if the directed edge of the first chapters and sections to the second chapter is not present, it is determined thatIf establish first chapters and sections to the second chapter directed edge whether with existing directed edge formed loop, if not forming ringFirst chapters and sections are then established to the directed edge of the second chapter in road;If the first chapters and sections having to the second chapterExist to side, then the weighted value of directed edge of first chapters and sections to the second chapter is increased into a unit, wherein, it is describedFirst chapters and sections are chapters and sections to be combined, and the second chapter is next chapters and sections of first chapters and sections.
The determining method and apparatus for the online literature chapters and sections that the application provides, by the chapter for obtaining the N number of first candidate websiteThe sequence of section according to the sequence of the chapters and sections of the N number of first candidate website, establishes acyclic Markov chain model, according to described acyclicMarkov chain model obtains the subsequence of maximum weight;The corresponding chapters and sections sequence of subsequence for determining the maximum weight isCredible chapters and sections sequence.I.e. by establishing acyclic Markov chain model, the sequence of the chapters and sections of comprehensive multiple first candidate website is led toThe subsequence for determining maximum weight is crossed, is credible chapters and sections sequence by the corresponding chapters and sections sequence of the subsequence of maximum weight, so as to beUser provides believable chapters and sections sequence, improves user experience.
Description of the drawings
In order to illustrate the technical solutions in the embodiments of the present application or in the prior art more clearly, to embodiment or it will show belowThere is attached drawing needed in technology description to be briefly described, it should be apparent that, the accompanying drawings in the following description is only thisSome embodiments of application, for those of ordinary skill in the art, without having to pay creative labor, may be used alsoTo obtain other attached drawings according to these attached drawings.
Fig. 1 is the flow diagram of the determining embodiment of the method one of the application online literature chapters and sections;
Fig. 2-Figure 10 is the schematic diagram that the application establishes acyclic Markov chain model;
Figure 11 is the flow diagram of the determining embodiment of the method two of the application online literature chapters and sections;
Figure 12 is the structure diagram of the determining device embodiment of the application online literature chapters and sections.
Specific embodiment
Below in conjunction with the attached drawing in the embodiment of the present application, the technical solution in the embodiment of the present application is carried out clear, completeSite preparation describes, it is clear that described embodiments are only a part of embodiments of the present application, instead of all the embodiments.It is based onEmbodiment in the application, those of ordinary skill in the art are obtained every other without making creative workEmbodiment shall fall in the protection scope of this application.
Term " first ", " second ", " third " " in the description and claims of this application and above-mentioned attached drawingThe (if present)s such as four " are the objects for distinguishing similar, and specific sequence or precedence are described without being used for.It should manageThe data that solution uses in this way can be interchanged in the appropriate case, so that embodiments herein described herein for example can be to removeSequence other than those for illustrating or describing herein is implemented.In addition, term " comprising " and " having " and theirs is anyDeformation, it is intended that cover it is non-exclusive include, for example, containing the process of series of steps or unit, method, system, productionProduct or equipment are not necessarily limited to those steps or unit clearly listed, but may include not listing clearly or for thisThe intrinsic other steps of processes, method, product or equipment or unit a bit.
Fig. 1 is the flow diagram of the determining embodiment of the method one of the application online literature chapters and sections, as shown in Figure 1:
S101:Obtain the sequence of the chapters and sections of the N number of first candidate website.
Wherein, N is the integer more than or equal to 2.
A kind of possible realization method, obtains the sequence of the L chapters and sections reciprocal of N number of first candidate website, L for more thanInteger equal to 2.
For being 3 as 9, L using N, it is assumed that the sequence of 3 chapters and sections of inverse of 9 first candidate websites is as described in Table 1:
Table 1
First candidate website3rd chapter reciprocal2nd chapter reciprocal1st chapter reciprocal
X1ABE
X2BCD
X3ABC
X4CDE
X5CDE
X6JCE
X7JXZ
X8CDA
X9123
S102:According to the sequence of the chapters and sections of the described N number of first candidate website, acyclic Markov chain model is established.
Wherein, the node of the acyclic Markov chain model is determined according to the chapters and sections, the acyclic Markov ChainThe directed edge of model is determined according to the sequencing between the chapters and sections.
Wherein, a kind of possible realization method is as follows:Successively by each first candidate stations of the N number of first candidate websiteThe chapters and sections of point, which are merged into, to be had been established in acyclic Markov Chain;
Wherein, the chapters and sections of each first candidate website are merged into and a kind of possible reality in acyclic Markov Chain has been establishedExisting mode is as follows:
According to the sequence of the sequence of the chapters and sections in the first candidate website, each chapters and sections are merged into according to preset rules successivelyIt has been established in acyclic Markov Chain, wherein, it is described that markovian initial value has been established as sky;
Each chapters and sections are merged into the possible realization side of one kind having been established in acyclic Markov Chain according to preset rulesFormula is as follows:
Determine that first chapters and sections whether there is in acyclic Markov Chain has been established, if first chapters and sections are not depositedIt is then adding first chapters and sections and is being had been established in acyclic Markov Chain to described;Determine that nothing has been established described in second chapterIt whether there is in ring Markov Chain, if the second chapter is not present, adds the second chapter and nothing has been established described inIn ring Markov Chain, and first chapters and sections are established to the directed edge of the second chapter;If the second chapter exists,First chapters and sections are established to the directed edge of the second chapter;
If first chapters and sections exist, it is determined that whether second chapter is deposited in described have been established in acyclic Markov ChainIf the second chapter is not present, is adding the second chapter and had been established in acyclic Markov Chain, and establish to describedFirst chapters and sections are to the directed edge of the second chapter;If the second chapter exists, it is determined that first chapters and sections to instituteThe directed edge for stating second chapter whether there is, if the directed edge of the first chapters and sections to the second chapter is not present, it is determined thatIf establish first chapters and sections to the second chapter directed edge whether with existing directed edge formed loop, if not forming ringFirst chapters and sections are then established to the directed edge of the second chapter in road;If the first chapters and sections having to the second chapterExist to side, then the weighted value of directed edge of first chapters and sections to the second chapter is increased into a unit, wherein, it is describedFirst chapters and sections are chapters and sections to be combined, and the second chapter is next chapters and sections of first chapters and sections.
With reference to table 1, it is assumed that successively by X1-X9In chapters and sections be merged into and have been established in acyclic Markov Chain;
By X1In chapters and sections be merged into and have been established in acyclic Markov Chain;The results are shown in Figure 2;Acyclic Markov ChainInitial value is sky;
Wherein, by X1In A be merged into acyclic Markov Chain;
According to above-mentioned preset rules, addition A to acyclic Markov Chain, B is to acyclic Markov Chain for addition, establishes A to BDirected edge;
By X1In B be merged into acyclic Markov Chain;
According to above-mentioned preset rules, C is to acyclic Markov Chain for addition, establishes the directed edge of B to C.
On the basis of Fig. 2, by X2In chapters and sections be merged into and have been established in acyclic Markov Chain, such as Fig. 3 institutes after mergingShow.
Wherein, by X2In B be merged into acyclic Markov Chain;
According to above-mentioned preset rules, C is to acyclic Markov Chain for addition, establishes the directed edge of B to C;
By X2In C be merged into acyclic Markov Chain;
According to above-mentioned preset rules, D is to acyclic Markov Chain for addition, establishes the directed edge of C to D.
On the basis of Fig. 3, by X3In chapters and sections be merged into and have been established in acyclic Markov Chain, such as Fig. 4 institutes after mergingShow:
Wherein, by X3In A be merged into acyclic Markov Chain;
According to above-mentioned preset rules, the weighted value of the directed edge of A to B is increased into a unit, it is assumed that a unit is 1,Then the weighted value of the directed edge of A to B is 2.
By X3In B be merged into acyclic Markov Chain;
According to above-mentioned preset rules, the weighted value of the directed edge of B to C is increased into a unit, then the directed edge of B to CWeighted value is 2.
On the basis of Fig. 4, by X4In chapters and sections be merged into and have been established in acyclic Markov Chain, such as Fig. 5 institutes after mergingShow;
Wherein, by X4In C be merged into acyclic Markov Chain;
According to above-mentioned preset rules, the weighted value of the directed edge of C to D is increased into a unit, it is assumed that a unit is 1,Then the weighted value of the directed edge of C to D is 2.
By X4In D be merged into acyclic Markov Chain;
According to above-mentioned preset rules, the directed edge of D to E is established.
On the basis of Fig. 5, by X5In chapters and sections be merged into and have been established in acyclic Markov Chain, such as Fig. 6 institutes after mergingShow;
Wherein, by X5In C be merged into acyclic Markov Chain;
According to above-mentioned preset rules, the weighted value of the directed edge of C to D is increased into a unit, it is assumed that a unit is 1,Then the weighted value of the directed edge of C to D is 3.
By X5In D be merged into acyclic Markov Chain;
According to above-mentioned preset rules, the weighted value of the directed edge of D to E is increased into a unit, it is assumed that a unit is 1,Then the weighted value of the directed edge of D to E is 2.
On the basis of Fig. 6, by X6In chapters and sections be merged into and have been established in acyclic Markov Chain, such as Fig. 7 institutes after mergingShow.
Wherein, by X6In J be merged into acyclic Markov Chain;
According to above-mentioned preset rules, J is added to acyclic Markov Chain, establishes the directed edge of J to C.
By X6In C be merged into acyclic Markov Chain;
According to above-mentioned preset rules, the directed edge of C to E is established.
On the basis of Fig. 7, by X7In chapters and sections be merged into and have been established in acyclic Markov Chain, such as Fig. 8 institutes after mergingShow;
Wherein, by X7In J be merged into acyclic Markov Chain;
According to above-mentioned preset rules, X is added to acyclic Markov Chain, establishes the directed edge of J to X.
By X6In X be merged into acyclic Markov Chain;
According to above-mentioned preset rules, Z is added to acyclic Markov Chain, establishes the directed edge of X to Z.
On the basis of Fig. 8, by X8In chapters and sections be merged into and have been established in acyclic Markov Chain, such as Fig. 9 institutes after mergingShow;
Wherein, by X8In C be merged into acyclic Markov Chain;
According to above-mentioned preset rules, the weighted value of the directed edge of C to D is increased into a unit, it is assumed that a unit is 1,Then the weighted value of the directed edge of C to D is 4.
By X8In D be merged into acyclic Markov Chain;
According to above-mentioned preset rules, without operation.
On the basis of Fig. 9, by X9In chapters and sections be merged into and have been established in acyclic Markov Chain, such as Figure 10 institutes after mergingShow.
By X9In 1 be merged into acyclic Markov Chain;
According to above-mentioned preset rules, 1 is added in acyclic Markov Chain, 2 are added in acyclic Markov Chain,Establish 1 to 2 directed edge;
By X9In 2 be merged into acyclic Markov Chain;
According to above-mentioned preset rules, 3 are added in acyclic Markov Chain, establishes 2 to 3 directed edge.
S103:According to acyclic Markov chain model, the subsequence of maximum weight is obtained.
Wherein it is possible to the subsequence of maximum weight is obtained using Dynamic Programming or Greedy strategy.
The value of the subsequence of maximum weight can basis:
Li,n+1=Li,n+max(Vi,n+1)
With reference to the example in S102, it can be seen that:The value of the subsequence of maximum weight is 10, and it is traversed to obtain maximum valueSubsequence be maximum weight subsequence, therefore, the subsequence of maximum weight is A, B, C, D, E.
S104:The corresponding chapters and sections sequence of subsequence for determining the maximum weight is credible chapters and sections sequence.
User can be according to credible chapters and sections sequence, the accurate chapters and sections for obtaining desired reading, such as:User wants to read newest chapterSection, it is determined that the corresponding chapters and sections of end node of the subsequence of the maximum weight are the newest chapters and sections of the online literature,With reference to aforementioned exemplary, then E chapters and sections are obtained.User with reference to aforementioned exemplary, then obtains C, D, E tri- to newest 3 chapters and sections are readChapters and sections.
The present embodiment, the sequence of the chapters and sections by obtaining the N number of first candidate website, according to the chapter of the N number of first candidate websiteThe sequence of section establishes acyclic Markov chain model, according to the acyclic Markov chain model, obtains the sub- sequence of maximum weightRow;The corresponding chapters and sections sequence of subsequence for determining the maximum weight is credible chapters and sections sequence.I.e. by establishing acyclic Ma ErkeHusband's chain model, the sequence of the chapters and sections of comprehensive multiple first candidate website, by determining the subsequence of maximum weight, by maximum weightThe corresponding chapters and sections sequence of subsequence for credible chapters and sections sequence, so as to provide believable chapters and sections sequence to the user, improve user's bodyIt tests.
Figure 11 is the flow diagram of the determining embodiment of the method two of the application online literature chapters and sections, and Figure 11 is in Fig. 1 institutesOn the basis of showing embodiment, when the quantity of candidate website is larger, such as:More than 6, then according to the number of the chapters and sections of candidate websiteMesh excludes the more website of mistake, to reduce the workload calculated, therefore, before S101, can also include:
S100:According to the number of the chapters and sections of M second candidate website, the described N number of first candidate website is determined.
Wherein, the M is the integer more than N.
Wherein, a kind of possible realization method is to remove noise website by CHauvent:
Specifically, according to pi=| si- u |/δ obtains the score value of each second candidate website;It obtains score value and is more than DmaxTwo candidate websites are the described N number of first candidate website, wherein, i is the integer less than or equal to M, the s more than or equal to 1iIt is i-thThe number of the chapters and sections of a second candidate website, u are the average of the number of the chapters and sections of the M second candidate website, and δ is describedThe standard deviation of the number of the chapters and sections of M second candidate website, it is describedThe n is equal to M.
The present embodiment by the number of the chapters and sections according to M second candidate website, determines the described N number of first candidate website,The more website of debug to reduce the workload calculated, improves the efficiency for determining the newest chapters and sections of online literature.
Structure diagrams of the Figure 12 for the determining device embodiment of the application online literature chapters and sections, the device packet of the present embodimentIt includes:Acquisition module 1201, processing module 1202 and output module 1203, wherein, acquisition module 1201 is waited for obtaining N number of firstThe sequence of the chapters and sections of selective calling point, the N are the integer more than or equal to 2;Processing module 1202 is used for candidate according to described N number of firstThe sequence of the chapters and sections of website establishes acyclic Markov chain model, wherein, the node of the acyclic Markov chain model according toThe chapters and sections determine that the directed edge of the acyclic Markov chain model is determined according to the sequencing between the chapters and sections, instituteThe weights for stating directed edge are determined according to the number that the corresponding chapters and sections sequence of the directed edge occurs;The processing module 1202 is also usedAccording to the acyclic Markov chain model, maximum weight is obtained;Output module 1203 is used to determine the maximum weightThe corresponding chapters and sections sequence of subsequence be credible chapters and sections sequence.
In the above-described embodiments, output module 1203 is additionally operable to determine the end node pair of the subsequence of the maximum weightThe chapters and sections answered are the newest chapters and sections of the online literature.
In the above-described embodiments, the processing module 1202 is specifically used for the every of the described N number of first candidate website successivelyThe chapters and sections of a first candidate website, which are merged into, to be had been established in acyclic Markov Chain;Wherein, by the chapter of each first candidate websiteSection, which is merged into, to be had been established acyclic Markov Chain and includes:
According to the sequence of the sequence of the chapters and sections in the first candidate website, each chapters and sections are merged into according to preset rules successivelyIt has been established in acyclic Markov Chain, wherein, it is described that markovian initial value has been established as sky;
Wherein, described be merged into each chapters and sections according to preset rules is had been established in acyclic Markov Chain, including:
Determine that first chapters and sections whether there is in acyclic Markov Chain has been established, if first chapters and sections are not depositedIt is then adding first chapters and sections and is being had been established in acyclic Markov Chain to described;Determine that nothing has been established described in second chapterIt whether there is in ring Markov Chain, if the second chapter is not present, adds the second chapter and nothing has been established described inIn ring Markov Chain, and first chapters and sections are established to the directed edge of the second chapter;If the second chapter exists,First chapters and sections are established to the directed edge of the second chapter;
If first chapters and sections exist, it is determined that whether second chapter is deposited in described have been established in acyclic Markov ChainIf the second chapter is not present, is adding the second chapter and had been established in acyclic Markov Chain, and establish to describedFirst chapters and sections are to the directed edge of the second chapter;If the second chapter exists, it is determined that first chapters and sections to instituteThe directed edge for stating second chapter whether there is, if the directed edge of the first chapters and sections to the second chapter is not present, it is determined thatIf establish first chapters and sections to the second chapter directed edge whether with existing directed edge formed loop, if not forming ringFirst chapters and sections are then established to the directed edge of the second chapter in road;If the first chapters and sections having to the second chapterExist to side, then the weighted value of directed edge of first chapters and sections to the second chapter is increased into a unit, wherein, it is describedFirst chapters and sections are chapters and sections to be combined, and the second chapter is next chapters and sections of first chapters and sections.
In the above-described embodiments, the acquisition module 1201 is specifically used for obtaining the L reciprocal of the described N number of first candidate websiteThe sequence of a chapters and sections, the L are the integer more than or equal to 2.
The device of the present embodiment accordingly can be used for performing the technical solution of embodiment of the method shown in Fig. 1, realization principleSimilar with technique effect, details are not described herein again.
On the basis of embodiment illustrated in fig. 12, the processing module 1202 is additionally operable to according to a second candidate websites of MThe number of chapters and sections determines the described N number of first candidate website, wherein, the M is the integer more than N.
Wherein, the processing module 1202 is specifically used for according to pi=| si- u |/δ obtains point of each second candidate websiteValue;It obtains score value and is more than DmaxThe second candidate website be the described N number of first candidate website, wherein, i is to be less than more than or equal to 1 etc.In the integer of M, the siThe number of chapters and sections for i-th second candidate websites, u are the chapters and sections of the M second candidate websiteNumber average, δ is the standard deviation of the number of the chapters and sections of the candidate websites of the M second, describedInstituteN is stated equal to M.
The device of the present embodiment accordingly can be used for performing the technical solution of embodiment of the method shown in Figure 11, realize formerReason is similar with technique effect, and details are not described herein again.
One of ordinary skill in the art will appreciate that:Realizing all or part of step of above-mentioned each method embodiment can lead toThe relevant hardware of program instruction is crossed to complete.Aforementioned program can be stored in a computer read/write memory medium.The journeySequence when being executed, performs the step of including above-mentioned each method embodiment;And aforementioned storage medium includes:ROM, RAM, magnetic disc orThe various media that can store program code such as person's CD.
Finally it should be noted that:The above various embodiments is only to illustrate the technical solution of the application, rather than its limitations;To the greatest extentPipe is described in detail the application with reference to foregoing embodiments, it will be understood by those of ordinary skill in the art that:Its according toCan so modify to the technical solution recorded in foregoing embodiments either to which part or all technical features intoRow equivalent replacement;And these modifications or replacement, each embodiment technology of the application that it does not separate the essence of the corresponding technical solutionThe range of scheme.

Claims (12)

  1. If first chapters and sections exist, it is determined that second chapter whether there is in described have been established in acyclic Markov Chain, ifThe second chapter is not present, then adds the second chapter and had been established in acyclic Markov Chain to described, and described in foundationFirst chapters and sections are to the directed edge of the second chapter;If the second chapter exists, it is determined that first chapters and sections to described theThe directed edge of two chapters and sections whether there is, if the directed edge of the first chapters and sections to the second chapter is not present, it is determined that if buildingStand first chapters and sections to the second chapter directed edge whether with existing directed edge formation loop, if not forming loop,First chapters and sections are established to the directed edge of the second chapter;If the directed edge of the first chapters and sections to the second chapter is deposited, then the weighted value of directed edge of first chapters and sections to the second chapter is increased into a unit, wherein, the chapter 1It saves as chapters and sections to be combined, next chapters and sections of the second chapter for first chapters and sections.
  2. If first chapters and sections exist, it is determined that second chapter whether there is in described have been established in acyclic Markov Chain, ifThe second chapter is not present, then adds the second chapter and had been established in acyclic Markov Chain to described, and described in foundationFirst chapters and sections are to the directed edge of the second chapter;If the second chapter exists, it is determined that first chapters and sections to described theThe directed edge of two chapters and sections whether there is, if the directed edge of the first chapters and sections to the second chapter is not present, it is determined that if buildingStand first chapters and sections to the second chapter directed edge whether with existing directed edge formation loop, if not forming loop,First chapters and sections are established to the directed edge of the second chapter;If the directed edge of the first chapters and sections to the second chapter is deposited, then the weighted value of directed edge of first chapters and sections to the second chapter is increased into a unit, wherein, the chapter 1It saves as chapters and sections to be combined, next chapters and sections of the second chapter for first chapters and sections.
CN201710452914.5A2017-06-152017-06-15Method and device for determining network literature chaptersActiveCN108268429B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN201710452914.5ACN108268429B (en)2017-06-152017-06-15Method and device for determining network literature chapters

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN201710452914.5ACN108268429B (en)2017-06-152017-06-15Method and device for determining network literature chapters

Publications (2)

Publication NumberPublication Date
CN108268429Atrue CN108268429A (en)2018-07-10
CN108268429B CN108268429B (en)2021-08-06

Family

ID=62771764

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN201710452914.5AActiveCN108268429B (en)2017-06-152017-06-15Method and device for determining network literature chapters

Country Status (1)

CountryLink
CN (1)CN108268429B (en)

Citations (19)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN1604075A (en)*2004-11-222005-04-06北京北大方正技术研究院有限公司 A Method for Restoring the Reading Order of the Newspaper Layout
CN1604073A (en)*2004-11-222005-04-06北京北大方正技术研究院有限公司 A Method of Logically Associating Headlines and Main Texts of Newspaper Layouts
CN101866418A (en)*2009-04-172010-10-20株式会社理光Method and equipment for determining file reading sequences
CN102270343A (en)*2011-07-272011-12-07宁波大学Image segmentation method based on Ising graph model
CN102831059A (en)*2012-08-232012-12-19北京工业大学Software behavior modeling method based on state layer
CN102937933A (en)*2012-11-142013-02-20中国矿业大学Class test sequence determining method based on testing level
CN103377271A (en)*2012-04-242013-10-30英奇达资讯股份有限公司Method for making knowledge map
CN103544172A (en)*2012-07-132014-01-29深圳市世纪光速信息技术有限公司Method and device for processing chapter catalogs of E-book
CN103577566A (en)*2013-10-252014-02-12北京奇虎科技有限公司Web reading content loading method and device
US20140245233A1 (en)*2013-02-272014-08-28International Business Machines CorporationReadable structural text-based representation of activity flows
CN104268127A (en)*2014-09-222015-01-07同方知网(北京)技术有限公司Method for analyzing reading order of electronic layout file
CN104331438A (en)*2014-10-242015-02-04北京奇虎科技有限公司Method and device for selectively extracting content of novel webpage
CN104391889A (en)*2014-11-112015-03-04西安交通大学 A Community Structure Discovery Method for Directed-Weighted Networks
WO2015099810A1 (en)*2013-12-292015-07-02Hewlett-Packard Development Company, L.P.Learning graph
CN105096240A (en)*2015-07-212015-11-25南京师范大学Method for hiding image sensitive object based texture synthesis
CN105095613A (en)*2014-04-162015-11-25华为技术有限公司Method and device for prediction based on sequential data
CN105513400A (en)*2015-12-032016-04-20四川长虹电器股份有限公司Method for dynamically planning travel route
CN105609107A (en)*2015-12-232016-05-25北京奇虎科技有限公司Text processing method and device based on voice identification
CN106844350A (en)*2017-02-152017-06-13广州索答信息科技有限公司A kind of computational methods of short text semantic similarity

Patent Citations (19)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN1604073A (en)*2004-11-222005-04-06北京北大方正技术研究院有限公司 A Method of Logically Associating Headlines and Main Texts of Newspaper Layouts
CN1604075A (en)*2004-11-222005-04-06北京北大方正技术研究院有限公司 A Method for Restoring the Reading Order of the Newspaper Layout
CN101866418A (en)*2009-04-172010-10-20株式会社理光Method and equipment for determining file reading sequences
CN102270343A (en)*2011-07-272011-12-07宁波大学Image segmentation method based on Ising graph model
CN103377271A (en)*2012-04-242013-10-30英奇达资讯股份有限公司Method for making knowledge map
CN103544172A (en)*2012-07-132014-01-29深圳市世纪光速信息技术有限公司Method and device for processing chapter catalogs of E-book
CN102831059A (en)*2012-08-232012-12-19北京工业大学Software behavior modeling method based on state layer
CN102937933A (en)*2012-11-142013-02-20中国矿业大学Class test sequence determining method based on testing level
US20140245233A1 (en)*2013-02-272014-08-28International Business Machines CorporationReadable structural text-based representation of activity flows
CN103577566A (en)*2013-10-252014-02-12北京奇虎科技有限公司Web reading content loading method and device
WO2015099810A1 (en)*2013-12-292015-07-02Hewlett-Packard Development Company, L.P.Learning graph
CN105095613A (en)*2014-04-162015-11-25华为技术有限公司Method and device for prediction based on sequential data
CN104268127A (en)*2014-09-222015-01-07同方知网(北京)技术有限公司Method for analyzing reading order of electronic layout file
CN104331438A (en)*2014-10-242015-02-04北京奇虎科技有限公司Method and device for selectively extracting content of novel webpage
CN104391889A (en)*2014-11-112015-03-04西安交通大学 A Community Structure Discovery Method for Directed-Weighted Networks
CN105096240A (en)*2015-07-212015-11-25南京师范大学Method for hiding image sensitive object based texture synthesis
CN105513400A (en)*2015-12-032016-04-20四川长虹电器股份有限公司Method for dynamically planning travel route
CN105609107A (en)*2015-12-232016-05-25北京奇虎科技有限公司Text processing method and device based on voice identification
CN106844350A (en)*2017-02-152017-06-13广州索答信息科技有限公司A kind of computational methods of short text semantic similarity

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
MARCO AIELLO 等: "Document understanding for a broad class of documents", 《HTTP://WWW.RUG.NL/RESEARCH/PORTAL》*

Also Published As

Publication numberPublication date
CN108268429B (en)2021-08-06

Similar Documents

PublicationPublication DateTitle
US20190042982A1 (en)Automatic Multi-Threshold Feature Filtering Method and Apparatus
CN111967964B (en)Intelligent recommending method and device for bank client sites
CN105975641A (en)Video recommendation method ad device
CN106649877A (en)Density peak-based big data mining method and apparatus
CN109067654A (en)Network communication optimal route selection method and device
CN111768761B (en)Training method and device for speech recognition model
CN104881415B (en)The method and apparatus of tree graph generation
CN105931046A (en)Suspected transaction node set detection method and device
CN105681868B (en)Barrage content display method and terminal equipment
CN109067427A (en)A kind of frequency hop sequences prediction technique based on Optimization-type wavelet neural network
CN112420079A (en)Voice endpoint detection method and device, storage medium and electronic equipment
CN103200676A (en)Method and device for building fingerprint database
CN105045867B (en)A kind of books writer based on books scoring recommends method
CN107169871A (en)It is a kind of to optimize many relation community discovery methods expanded with seed based on composition of relations
CN111081276A (en)Audio segment matching method, device and equipment and readable storage medium
CN117894306B (en)Voice processing method, device, computer equipment and storage medium
CN108268429A (en)The determining method and apparatus of online literature chapters and sections
CN117528538B (en)Fuzzy learning anti-interference method and system for coping with incomplete channel information
CN112948372B (en) A method and system for index-based fixed-track length seismic data retrieval
CN108055638A (en)Obtain method, apparatus, computer-readable medium and the equipment of target location
CN112348188A (en)Model generation method and device, electronic device and storage medium
CN117008195A (en)Method, system and equipment for predicting transverse waves based on random forest
KR20170062308A (en)Method for mining important pattern of high rank k, apparatus performing the same and storage medium storing the same
Chhabra et al.A method for bounding imprecise probabilistic criteria when using a sequential decision process for the design of structural systems
CN115291793A (en) Attribute data conversion method and device, storage medium and electronic device

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
TA01Transfer of patent application right
TA01Transfer of patent application right

Effective date of registration:20200423

Address after:310052 room 508, floor 5, building 4, No. 699, Wangshang Road, Changhe street, Binjiang District, Hangzhou City, Zhejiang Province

Applicant after:Alibaba (China) Co.,Ltd.

Address before:510627 Guangdong city of Guangzhou province Whampoa Tianhe District Road No. 163 Xiping Yun Lu Yun Ping square B radio tower 13 layer self unit 01

Applicant before:GUANGZHOU SHENMA MOBILE INFORMATION TECHNOLOGY Co.,Ltd.

GR01Patent grant
GR01Patent grant

[8]ページ先頭

©2009-2025 Movatter.jp