Movatterモバイル変換


[0]ホーム

URL:


CN113836434B - Web page data processing method based on database - Google Patents

Web page data processing method based on database
Download PDF

Info

Publication number
CN113836434B
CN113836434BCN202111411396.5ACN202111411396ACN113836434BCN 113836434 BCN113836434 BCN 113836434BCN 202111411396 ACN202111411396 ACN 202111411396ACN 113836434 BCN113836434 BCN 113836434B
Authority
CN
China
Prior art keywords
keywords
web page
database
page
data processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202111411396.5A
Other languages
Chinese (zh)
Other versions
CN113836434A (en
Inventor
朱春华
王涛
曾繁诚
程晓梅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Jerei Digital Technology Co Ltd
Original Assignee
Shandong Jerei Digital Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Jerei Digital Technology Co LtdfiledCriticalShandong Jerei Digital Technology Co Ltd
Priority to CN202111411396.5ApriorityCriticalpatent/CN113836434B/en
Publication of CN113836434ApublicationCriticalpatent/CN113836434A/en
Application grantedgrantedCritical
Publication of CN113836434BpublicationCriticalpatent/CN113836434B/en
Activelegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Images

Classifications

Landscapes

Abstract

The invention relates to the technical field of electric digital data processing, in particular to a database-based web page data processing method, which comprises the following four steps: s1: setting related words of the website industry through a web keyword acquisition tool, and mining keywords; s2: importing the keywords into a cloud database in the same environment as the web page, and filtering, classifying and labeling the keywords; s3: constructing a page template layout and corresponding module elements required by web page data processing; s4: and the web page loads keywords through the page template, processes the module elements and the content data and generates a web page data display effect. Compared with the prior art, the web page uses the dynamic page template technology when processing data, and has the advantages of flexibility and accuracy when processing data compared with the traditional single fixed template form.

Description

Web page data processing method based on database
Technical Field
The invention relates to the technical field of electric digital data processing, in particular to a database-based web page data processing method.
Background
For website construction, data processing of a web page generally mainly depends on code writing of developers and manual processing of operation and maintenance personnel, and data processing and calling are performed in the web page through fixed code rules. The accuracy and effectiveness of the presented data depend on the skill level of developers and the familiarity of operation and maintenance personnel with the business, and the manual processing mode is low in efficiency and the presented effect is uncontrollable.
With the continuous development of internet technology, the types of data information that can be displayed by a website for a user are more and more abundant, and meanwhile, the amount of data is more and more. However, as the data information increases, it becomes more and more difficult to directly and effectively transfer the data that the user wants to acquire to the user. At present, organization data is processed mainly in a mode of manually processing data or standardizing formats by technicians and operators, the technicians write code rules firstly, and then the operators screen the data and finally display the data to users, so that the data is used as an important component element in a web page, and reasonable and effective processing is very important.
Disclosure of Invention
In order to solve the above problems, the present invention provides a method for processing web page data based on a database, which can quickly process data in a web page, preferentially display content with the highest relevance through analysis and processing, and ensure high readability of the web page data for a user.
In order to achieve the above object, the present invention comprises the steps of:
s1: setting related words of the website industry through a web keyword acquisition tool, and mining keywords based on a search engine;
s2: importing the keywords into a cloud database in the same environment as the web page, and filtering, classifying and labeling the keywords;
s3: constructing a page template layout and corresponding module elements required by web page data processing;
s4: and the web page loads keywords through the page template, processes the module elements and the content data and generates a web page data display effect.
Further, in S1, website industry related words are added to the web keyword collection tool, and the industry keywords are mined in the search engine by a simulation search.
Further, S2 further includes:
s21: leading the mined keywords into a temporary data table of a database;
s22: filtering keywords irrelevant to the website, and taking the screened keywords as keywords to be classified; filtering is to filter and delete the key words irrelevant to the industry in the database through character extraction;
s23: matching keyword classification, and determining keyword classification and presentation forms; the classification is to determine the classification and content display form of the keywords by identifying the core semantics in the keywords;
s24: marking key word attributes, marking all levels of attributes of the key words through matching of a pre-constructed database dictionary, and specifically, carrying out step-by-step matching with contents in the database according to the sequence from large word meaning range to small word meaning range of the dictionary;
s25: and moving the keywords which are filtered, classified and labeled from the temporary table of the database to the formal table to be used as core keywords.
Further, in S3, the layout for constructing the page template is to perform information level setting on the page layout area; and sets a content presentation form for the module elements in the page layout area.
Further, in S4, the web page reads the corresponding relationship of the keywords stored in the database table and the characteristics of the categories, forms, semantics, and the like thereof, and simultaneously loads the page template and sends the keywords to the page template through the interface, and the page template identifies the categories of the keywords and dynamically matches the module element forms in the layout area of the loaded page; the content of each module element is classified according to the information of the layout area where the module element is located, and the related content with the highest matching degree is displayed by combining the corresponding keyword attributes.
Compared with the prior art, the invention has the beneficial effects that:
1. the method adopts a keyword mining mode, mines the keywords concerned by the user based on the search engine, and takes the keywords as the core keywords of the web page processing data for processing and displaying the data required to be displayed by the web page, thereby improving the readability of the web page data for the user.
2. When the method is used for processing the keywords of the web page, the keywords are analyzed from three aspects of classification, form and content, the semantics of the keywords are mined to the minimum semantic degree, the identification and labeling effect on the keywords is improved, and the data processing dimensionality and accuracy in the web page are wider.
3. The web page uses the dynamic page template technology when processing data, and has the advantages of flexibility and accuracy when processing data compared with the traditional single fixed template form.
Drawings
FIG. 1 is a flow chart of the present invention;
FIG. 2 is a flow chart of keyword attribute tagging in accordance with the present invention;
FIG. 3 is a schematic diagram of a page template of an embodiment of the invention.
Detailed Description
The following detailed description of embodiments of the present invention is provided in connection with the accompanying drawings and examples. The following examples are intended to illustrate the invention but are not intended to limit the scope of the invention.
As shown in fig. 1, a method for processing web page data based on a database according to an embodiment of the present invention includes the following steps:
s1, setting related words of the website industry through an acquisition tool, and mining keywords based on a search engine;
firstly, according to the industry to which a website belongs, industry related words are set in a collection tool, and the mining of industry keywords is carried out, wherein in order to ensure the accuracy and the relevance of the mined keywords, the set industry related words need to be added with industry long-tail words as initiator words for mining, the industry long-tail words refer to a combined word of a word under different semantic contexts, for example, the automobile price is used as a standard related word of the automobile industry, so that the lowest query of the automobile price can be used as the industry long-tail words under the semantic of a question sentence and under the regional condition. In the mining process, the set industry related words are searched and simulated in the search engine, and the recommended words of the search result of the search engine are extracted to be used as the keywords to be screened. The collection tool is a web keyword collection tool that can be loaded on a web page, and is a very mature technology in the prior art, and is not described herein again.
S2, importing the keywords into a cloud database in the same environment as the web page, and filtering, classifying and labeling the keywords;
s2 is further subdivided into S21-S25:
s21: in order to ensure the purity of database table data before and after key screening, keywords to be screened are firstly led into a temporary table of a database;
s22, filtering keywords irrelevant to the website, and taking the screened keywords as keywords to be classified; the filtering is to filter and delete the key words irrelevant to the industry in the database through character extraction. Due to the industry relevance of the keywords, the readability of the aggregated module elements and content data is directly determined, so that the keywords to be screened need to be compared with an industry database dictionary, the industry database dictionary can be constructed by recording words of industry vertical websites, and the matching coincidence times of the keywords to be screened and the database dictionary are determined by recording. When the number of matching times outside the industry is greater than the number of matching times inside the industry, the keyword can be judged to belong to a non-industry keyword. And when the key words do not belong to the industry key words, filtering and deleting, and when the key words belong to the industry key words, keeping in the database.
S23, matching the keyword classification, and determining the keyword classification and display form; the classification is to determine the keyword classification and content display form by identifying the core semantics in the keywords.
It should be noted that, the keyword classification is general, such as characters, products, buildings, events and brands, and in combination with the characteristics of website building, differentiated keyword classification is added, core semantic recognition of the keyword is performed, a tag set of the keyword is determined, each classification coefficient of the tag set is counted, and the classification with the largest matching number can be determined as the keyword classification; further, the content form tag in the keyword is extracted, and if the content form tag can be directly extracted from the keyword, the content form tag can be directly used, such as characters, videos and pictures. If the content form label cannot be extracted, the content display form can be determined according to the invisible meaning of the keyword, if the playback of a certain event can be determined as a video form, and if a certain player accesses a brief draft, the content display form can be determined as a character form.
And S24, labeling the key word attributes, and labeling all levels of attributes of the key words through matching of a pre-constructed database dictionary, wherein the step-by-step matching is performed with the contents in the database dictionary according to the sequence from large word meaning range to small word meaning range of the dictionary.
Since there may be more than one attribute of a keyword and the range of semantic inclusion of the attribute of the keyword is from large to small, there is a large difference. Therefore, in the process of labeling the keywords, in order to present accurate and effective content, the keywords need to be labeled by deep matching. In the matching process, a semantic dictionary base is needed, data in the semantic dictionary base is obtained, professional terms in each field and each scene in each industry are taken, semantic words in the semantic dictionary words are divided according to the size of a semantic range, and meanwhile the full-network content data quantity of the semantic words is used as the weight of the semantic words. And matching the keywords with the data in the semantic dictionary base according to the sequence of the semantics from large to small, wherein when the matched attributes of the keywords are one, the attributes are the primary attributes of the keywords. When the two attributes matched by the key words are provided, the smaller range is the first-level attribute, and the larger range is the second-level attribute. When the number of the attributes matched by the key words is equal to or more than three, if the number of the attributes is an odd number, the middle value of the range is taken as a first-level attribute, the minimum value of the range is taken as a second-level attribute, the maximum value of the range is taken as a third-level attribute, other residual attributes are four-level attributes, if the number of the attributes is an even number, the weights of the two semantic words in the middle are taken as the first-level attributes, the minimum value of the range is taken as the second-level attribute, the maximum value of the range is taken as the third-level attribute, and other residual attributes are four-level attributes.
And S25, moving the filtered, classified and labeled keywords from the temporary database table to the formal database table to be used as core keywords.
S3, constructing a page template layout and corresponding module elements required by web page data processing;
the layout for constructing the page template is to set the information level of a page layout area; and sets a content presentation form for the module elements in the page layout area. And constructing the layout of the page template, and setting the information types of the page layout area according to the browsing habits of the user, wherein the page layout area is divided into a first type information area, a second type information area, a third type information area and a fourth type information area. The first-class information is used for displaying first-class attribute related information of the keywords, the first-class information area preferentially displays the first-class attribute related information of the keywords, the second-class information area preferentially displays the second-class attribute related information of the keywords, the third-class information area preferentially displays the third-class attribute related information of the keywords, and the fourth-class information area preferentially displays the remaining fourth-class attribute related information of the keywords; it should be noted that when the key word attributes are less than the four types of settings of the page layout, replacement and supplement are performed in sequence according to the order of the key word attributes from large to small; and determining the information category, and selecting module elements according to the category and setting a content display form in each area.
As shown in fig. 3, the first-class information region is set in the central region of the page template, has the largest area and the highest attention, and can be observed without rotating the mouse or dragging the vertical scroll bar; the second-class information area is arranged in the top area of the page template, has the second-highest attention degree and can be observed without rotating a mouse or dragging a vertical scroll bar; the three types of information areas are arranged in the right area of the page template, and can not be displayed completely, the attention degree is not high, and the information can be observed completely only by dragging a horizontal scroll bar; the four types of information areas are arranged in the area at the lower right of the page template, and can not be displayed completely, and the attention can be completely observed only by rotating a mouse or dragging a vertical scroll bar or a horizontal scroll bar.
S4, loading keywords on the web page through the page template, processing the module elements and the content data, and generating the web page data display effect;
specifically, a web page reads the corresponding relation of keywords stored in a database table and the characteristics of the keywords, such as classification, form, semantics and the like, simultaneously loads a page template and sends the keywords to the page template through an interface, and the page template identifies the classification of the keywords and dynamically matches and loads the module element form in a page layout area; the content of each module element is classified according to the information of the layout area where the module element is located, and the related content with the highest matching degree is displayed by combining the corresponding keyword attributes. Taking an example that a certain player of a certain football team participates in a certain event, by the processing method, the keywords are labeled, the result obtained by the matching mode of the semantic range from large to small is football- > event- > team- > player, and at the moment, the football is put into three types of information areas as three-level attributes, and simultaneously, the related content of the football is displayed; the player can be placed in a second-class information area as a second-class attribute, and relevant contents of the player are displayed; for semantic words between the event and the team, as the semantic word weight of the event is higher than that of the team, the event is taken as a first-level attribute and put in a first-level information area to display the related content of the event; and putting the team as a four-level attribute into a four-type information area, and displaying the relevant content of the team.
The above embodiments are merely technical solutions of the present invention and not limitations, it should be noted that, for those skilled in the art, modifications or equivalents may be made to the specific embodiments of the present invention without departing from the technical principles of the present invention, and it should be understood that all modifications or equivalents may fall within the scope of the claims of the present invention.

Claims (3)

s4: the web page loads keywords through a page template, processes module elements and content data and generates a web page data display effect; the web page reads the corresponding relation of the keywords stored in the database table and the classification, form and semantic features of the keywords, loads the page template and sends the keywords to the page template through an interface, and the page template identifies the classification of the keywords and dynamically matches and loads the form of module elements in the page layout area; the content of each module element is classified according to the information of the layout area where the module element is located, and the related content with the highest matching degree is displayed by combining the corresponding keyword attributes.
CN202111411396.5A2021-11-252021-11-25Web page data processing method based on databaseActiveCN113836434B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN202111411396.5ACN113836434B (en)2021-11-252021-11-25Web page data processing method based on database

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN202111411396.5ACN113836434B (en)2021-11-252021-11-25Web page data processing method based on database

Publications (2)

Publication NumberPublication Date
CN113836434A CN113836434A (en)2021-12-24
CN113836434Btrue CN113836434B (en)2022-03-04

Family

ID=78971764

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN202111411396.5AActiveCN113836434B (en)2021-11-252021-11-25Web page data processing method based on database

Country Status (1)

CountryLink
CN (1)CN113836434B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN115098581B (en)*2022-08-262023-02-28金联创网络科技有限公司Method, device and equipment for storing numerical heterogeneous data and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN1932817A (en)*2006-09-152007-03-21陈远Common interconnection network content keyword interactive system
CN103425741A (en)*2013-07-162013-12-04北京中科汇联信息技术有限公司Information exhibiting method and device

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN102402539A (en)*2010-09-152012-04-04倪毅Design technology for object-level personalized vertical search engine
KR20120072041A (en)*2010-12-232012-07-03한국전자통신연구원Internet searching apparatus and internet search result providing method for display appartus
CN103488781B (en)*2013-09-302017-06-23北京奇虎科技有限公司Method, the search engine server of information search are provided
CN104951572B (en)*2015-07-282018-07-17郑州悉知信息科技股份有限公司A kind of method for building website and server
CN111859195A (en)*2020-07-312020-10-30北京字节跳动网络技术有限公司Information display method, information search method and device
CN112559850B (en)*2020-12-092024-01-09苏州闻道网络科技股份有限公司Keyword mining system and mining method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN1932817A (en)*2006-09-152007-03-21陈远Common interconnection network content keyword interactive system
CN103425741A (en)*2013-07-162013-12-04北京中科汇联信息技术有限公司Information exhibiting method and device

Also Published As

Publication numberPublication date
CN113836434A (en)2021-12-24

Similar Documents

PublicationPublication DateTitle
CN109992645B (en)Data management system and method based on text data
US9514216B2 (en)Automatic classification of segmented portions of web pages
CN104361111B (en)A kind of archives are compiled and grind method automatically
US10410224B1 (en)Determining item feature information from user content
CN103678564B (en)Internet product research system based on data mining
CN107122400B (en)Method, computing system and storage medium for refining query results using visual cues
CN107844565B (en)Commodity searching method and device
CN105760439B (en)A kind of personage's cooccurrence relation map construction method based on specific behavior co-occurrence network
US20150113388A1 (en)Method and apparatus for performing topic-relevance highlighting of electronic text
CN104636408B (en)News certification method for early warning and system based on user-generated content
CN106202514A (en)Accident based on Agent is across the search method of media information and system
CN103473369A (en)Semantic-based information acquisition method and semantic-based information acquisition system
CN102207948A (en)Method for generating incident statement sentence material base
WO2012106941A1 (en)Method and device for full-text search
CN102073641A (en)Method, device and program for processing consumer-generated media information
CN110134844A (en) Public opinion monitoring method, device, computer equipment and storage medium in subdivided fields
CN116401343A (en) A data compliance analysis method
KR100876214B1 (en)Apparatus and method for context aware advertising and computer readable medium processing the method
CN104881447A (en)Searching method and device
CN113836434B (en)Web page data processing method based on database
JP2006146802A (en) Text mining apparatus and text mining method
TWI396990B (en)Citation record extraction system and method, and program product
CN105279287A (en)Material catalogue retrieval method
KR101850853B1 (en)Method and apparatus of search using big data
CN109034908A (en)A kind of film ranking prediction technique of combination sequence study

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
GR01Patent grant
GR01Patent grant

[8]ページ先頭

©2009-2025 Movatter.jp