Embodiment
For making above-mentioned purpose of the present invention, feature and advantage more obviously understandable, the present invention is done further detailed explanation below in conjunction with accompanying drawing and embodiment.
Fig. 1 is the functional block diagram of the Information Acquisition System under mobile Internet of the present invention and the CMMB mixed channel; Wherein comprise like lower module:
Information grabbing treatment module 1; It mainly is responsible for automation ground and obtains information from the various websites of content that provide; For example from Baidu periodic search, from RSS, obtain the real-time update source, from Timeline such as microblogging use, obtain real time information stream, from signatory cooperating content merchant such as masses' comment, obtain information, the information of obtaining need be passed through the editor and the processing of index/summary subsequently;
Information index and summary editor module 2; Be used for mainly passing through the index and the summary info of automated method information extraction; For example can therefrom extract according to the structure of information source; The information of for example obtaining in the summary among the RSS, the for example microblogging (through the Internet API) is quoted (microblogging itself is exactly a kind of summary), from search engine, is extracted (the search engine generally automatic meeting of meeting information generates a skimmed summaries), simple information extraction head section); Leaching process comprises the classification of generation information, label, and places etc. (come from the label in the information source; And offer artificial editing environment, supply artificial index/summary info clauses and subclauses that automation was handled to proofread and edit again;
Information index and summary release module 3; Mainly be to extract field information in the summary data table from database and generate the CML/XML specification that issue needs, detailed process is exactly for each bar data-base recording generates a data entries, and data entries is changed into structurized CML/XML form; A series of data entries CML/XML tissues are become a packets of information; Save as file, send, when generating CML/XML issue form; Also need internet link be transformed into an address of pointing to the internet information access portal server, so that can carry out data statistic analysis.
CMMB publisher server 4, it is used for through CMMB channel release news index and summary;
Terminal receiver module 5 is used for receiving and regularly receive data through CMMB.The data that receive are given the terminal browser module and are browsed.
CML can embed browser 6, and it is used to dock the CMMB data of accepting browses, and when the needs access internet links, starts the service of communication module access internet automatically.This module can be embedded in the bubble window of map (clicks the window that ejects behind the terrestrial reference on the map), make can be on map browsing information index/summary.The terminal can be with UI form display message clauses and subclauses such as tabulation or maps (on bubble); And a details button can be provided; When the user clicks this button; Automatically open any browser is browsed the mobile Internet link (Web webpage) of appointment, and browser need start the webpage on the communication module 7 download the Internets and show at this moment.
Internet information access portal server 8; When it was used for browsing at the terminal index that broadcasting issues with summary, any one left the index summary, and the operation of beginning access internet; The capital at first gets into the internet information access portal server, is redirected by portal server.So just reached the purpose of collecting user behavior information.
Backstage statistic analysis server 9; It is responsible for the depth information that statistic of user accessing is crossed, and idiographic flow is following: the example flow process of statistical analysis is following: (every click explains that once the user is interested in making a summary according to the information classification in the clauses and subclauses, information time, information place, information, label substance provider the number of clicks of the internet link of each clauses and subclauses to be segmented statistics; Think further to understand depth information); And forming multiple statistical report (like certain regional potential consumption demand, whether certain zone is interested in certain type of news), this part information can feed back to businessman; To impel businessman to improve marketing planning; Information updating is provided, also can be used as the important evidence that informative abstract is broadcasted adjustment on the other hand, send the more users information of interest.
Information getting method under mobile Internet of the present invention and the CMMB mixed channel comprises the steps:
Step 1, obtain source information respectively from the plurality of kinds of contents source;
In this step, can be with machine and artificial mode, extracting information is also carried out preliminary treatment, forms classified information.
Under the machine pattern, information grasp with processing module through search engine and social networks (, commenting on etc.) extracting information and data are sorted from the Internet according to search engine and social networks like microblogging.Module can be according to the extracting of classifying of predefined thematic classification, such as special topics such as news, luxurious life by eating, drinking and playing, tourism, physical culture.Process result deposits in the database and manages.
Under the artificial mode, then from the specific contents source, directly obtain content (like famous magazine, publications such as newspaper),, and deposit database in then by the manual work typing of classifying.
Step 2, according to the structure of said source information, the index of information extraction and summary info;
In this step, through artificial/automatically/method such as semi-automatic is the index and the summary of the content information information extraction that needs issue, forms one and is suitable for broadcasting packets of information transmission, content intact, send through radio network.
These packets of information are made up of a series of data entries with CML (the HTML standard of CMMB) or XML form tissue, the logical constitution of data entries for example can for:
● information time
● the information place
● information classification is (for example: news, information of discount, luxurious life by eating, drinking and playing, action message.Every kind of classification is thinner subclassification down.Classification is encoded through numeral.)
● the information content
■ information labels (key word index)
■ content supplier
The ■ clip Text
The ■ clauses and subclauses are quoted (in same packets of information, quote other clauses and subclauses, can use the ID indexed links)
■ extend information (according to some information of service needed customization, like advertisement)
■ mobile Internet link (with http://xxx Web link form)
The terminal can be with UI form display message clauses and subclauses such as tabulation or maps (on bubble), and a details button can be provided, and when the user clicked this button, the open any browser mobile Internet of browsing appointment linked (Web webpage) automatically.
Generate information index and summary through information index and summary editor module automation, and confirm and examine by manual work.
The technical characterictic of information index/summary: information is carried out index by the title of information, and carries out taxonomic organization according to Tag (label).The literal number of title has maximum limit (for example can be defined as 64); Each index entry points to an informative abstract; Informative abstract is to an information source breviary in full; Limit literal sum (for example can be defined as 256), if some pictures of information source are important, the breviary that summary can comprise these pictures extracts.
The mode of this informative abstract similar with microblogging (Twitter) to the method that message length limits, impel the structure that forms a kind of information index.
Step 3, index and the summary info internet address with depth information is linked, formation can be by the broadcast channel information releasing;
The internet address of index/summary info with depth information linked, and formation can be by the broadcast channel information releasing, and issues through information index and summary release module.Link to depth information in the summary info is replaced with an access request of pointing to the internet access portal server; And simultaneously these linking sources are sent to the internet information access portal server, shine upon (so that being redirected to source address after receiving request) inner formation of server.
Step 4, issue said information through mobile multimedia broadcasting channel; For the big classification of public character, such as news, luxurious life by eating, drinking and playing etc. adopt wheel to sow the originating party formula.For the segmentation classification,, then can adopt the pattern that regularly issues (regularly receiving) such as the magazine index.
Regularly the step of broadcasts/reception is: in advance column (classification) broadcast time is issued down through ESG (programme) information, and in the content of the time period of programme appointment broadcast appointment.According to the indication of ESG, start shooting in the time period of appointment in the terminal simultaneously, and receive the information of designated frequency band.
Step 5, terminal receive the said information of issue through mobile multimedia broadcasting channel;
Browse information index and summary at step 6, terminal; Through embeddable terminal browser with form web page browsing information index and summary.Can embed the terminal browser and can be embedded into during map and navigation etc. use, information index and summary are superimposed upon on the map show (for example bubble pattern).
When step 7, the internet address link in the user makes a summary through the browser click; Browser will jump to the address of its internet information access portal server that is linked to; The internet sources address at request visit summary depth information place, and then the depth information of access internet.Internet address is all with http: // beginning, and local links is then with file: ///begin perhaps with self-defining xx: // beginning.
The internet information portal server is redirected to the internet sources address that depth information belongs to request.Simultaneously record is carried out in user's click, collect user behavior information.The statistics of user's behavioural information forms the clicking rate form (being similar to the audience rating statistical of radio and television) of information category, and in view of the above the broadcast strategy of information category is adjusted and optimized (needing manual work).
The present invention mainly is to combine to broadcast the advantage with the two kinds of patterns of communicating by letter, and broadcasting is more suitable for sending the high information of multiplicity (frequency), and to-talk internet is more suitable for sending out the high information of randomness.And issue the high information index/summary of public degree (similar microblogging pattern) with broadcast mode through a kind of, and obtain the mode of the information of the personalized degree of depth with the Internet, just can the maximized advantage of bringing into play the CMMB radio network.
Being directed to radio network does not in addition have up feedback, can't accomplish to continue to optimize this problem, jumps to the inlet service between the internet access through design in the broadcasting visit, and user behavior is carried out statistical analysis, thereby has accomplished to continue feedback optimized closed loop flow process.
Should be noted that at last: above embodiment is only in order to technical scheme of the present invention to be described but not to its restriction; Although with reference to preferred embodiment the present invention has been carried out detailed explanation, the those of ordinary skill in affiliated field is to be understood that: still can specific embodiments of the invention make amendment or the part technical characterictic is equal to replacement; And not breaking away from the spirit of technical scheme of the present invention, it all should be encompassed in the middle of the technical scheme scope that the present invention asks for protection.