Background technology
Detecting and storing real-time data streams is the bottleneck of data stream monitoring services. The problem is how to design a new staged mass-data storage device that can periodically classify and organize the data held on the distributed workstations, so that staff can quickly retrieve the real-time data streams they need for query matching. Expired data streams are stored separately in mass storage within the storage mechanism, where they remain available for long-term data mining and analysis.
Monitoring methods for real-time data streams are updated constantly, and the demand for accurate results keeps growing. Matching real-time data streams against national standards quickly and accurately has therefore become a pressing problem in real-time data stream monitoring. Compared with traditional data structures, a data stream is real-time, continuous, and unbounded. These properties bring new opportunities and challenges for research on, and application of, information processing technology, and they determine that the main query matching mode for data streams is continuous query matching. A continuous query is one that, once established, executes continuously over a long period and keeps producing new matching results as new data arrive. Because a data stream is a set of discrete states, the traditional CPU-based database query matching approach has difficulty producing fast and accurate results under many query conditions: traditional storage schemes place the mass data stream on media such as disk, so executing a query matching command requires a large number of I/O operations, and the response is too slow for real-time query matching. In practice, the most recent data matter more than historical data, so data stream matching analysis is usually concerned with the latest stage of data.
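The idea of a continuous query can be illustrated with a minimal sketch. The class name `ContinuousQuery`, the window size, and the mean-above-threshold condition are all hypothetical choices made for illustration; the source does not specify a query condition. The sketch keeps only the most recent readings in memory, which reflects the point that the latest stage of data is what the query cares about.

```python
from collections import deque

class ContinuousQuery:
    """A standing query that is re-evaluated on every new arrival.

    Hypothetical condition: the mean of the latest `window` readings
    exceeds `threshold`.
    """

    def __init__(self, window, threshold):
        # deque(maxlen=...) silently drops the oldest reading when full,
        # so only the most recent stage of data is retained.
        self.recent = deque(maxlen=window)
        self.threshold = threshold

    def on_arrival(self, value):
        """Ingest one reading and return the current query result."""
        self.recent.append(value)
        mean = sum(self.recent) / len(self.recent)
        return mean > self.threshold

cq = ContinuousQuery(window=3, threshold=10.0)
results = [cq.on_arrival(v) for v in [8.0, 9.0, 12.0, 15.0, 7.0]]
print(results)
```

Each arrival yields a fresh result without rereading older data from disk, which is the property that disk-based query matching lacks.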
Summary of the invention
The invention provides a design for a high-speed matching analysis system for real-time data streams, in particular a GPU-based design, which can be used to perform high-speed matching analysis of real-time data streams on a portable terminal.
The system mainly comprises data stream receiving devices, a staged mass-data storage device, a complete real-time data stream high-speed matching analysis system, wireless communication equipment, and a mobile intelligent terminal. It operates as follows. Observation satellites transmit their observations as data streams to ground data stream receiving stations, and observation stations transmit their monitoring data to their own data stream receiving stations. These two kinds of data stream receiving device receive the monitoring data streams and store them on workstations distributed across various locations. The staged mass-data storage device is responsible for classifying and organizing the data on the workstations and storing it in stages. When a matching analysis request is later received from a mobile intelligent terminal, the system first checks whether the user is authorized; once this is confirmed, the real-time data stream high-speed matching analysis system performs the matching analysis. The result is then sent to the mobile intelligent terminal by the wireless communication equipment, under the arrangement with the corresponding agent. The invention can perform matching analysis of real-time data streams effectively on a portable device, with high matching speed and high accuracy, and is particularly suitable for engineering staff working in the field.
Embodiment
The technical problem to be solved by the invention is how to provide a high-speed, high-accuracy data stream matching analysis mechanism for real-time data matching queries, so as to meet the application demands of various data stream services.
As shown in Fig. 1, the real-time data stream receiving apparatus comprises various classes of monitoring devices, monitoring data receivers, workstations, and similar equipment. Monitoring devices include observation satellites, observation stations, and so on, with different monitoring equipment for different fields. Each monitoring device has a corresponding data stream receiving device, such as a satellite monitoring data receiving station or an observation station data receiving station. These devices supply the original measured data, which are stored on the distributed workstations according to instructions. At this point the data have not gone through a unified preprocessing step, and because the data volume is huge, executing a matching analysis command keeps the user waiting for a long time, which is unsuitable for field staff who need to query at any time. The improved method adopts the staged mass-data storage device and the GPU-based real-time data stream high-speed matching analysis system, which greatly improves and optimizes both storage and computation speed.
The staged mass-data storage device stores the data streams sent by each workstation, stage by stage, in a distributed database. This involves preprocessing the raw data, classifying it, and loading it into the database. Preprocessing applies operations such as data cleaning and denoising to the incoming data; the purpose of this step is to reduce data redundancy and inconsistency and to avoid reading invalid dirty data. Classification sorts the raw data by attribute information and format differences. Observation station data and satellite monitoring data, for example, belong to different classes and must be stored in different databases. Once classification is complete, the data are loaded into the databases.
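The preprocess, classify, and load sequence can be sketched as follows. This is a minimal illustration under stated assumptions, not the storage device's actual implementation: the record fields (`source`, `time`, `value`), the function names, and the use of in-memory lists as stand-ins for separate databases are all hypothetical, and duplicate and missing-value removal stand in for the cleaning and denoising that the text names without detailing.

```python
from collections import defaultdict

def preprocess(records):
    """Drop exact duplicates and records with missing fields.

    These two filters are illustrative stand-ins for data cleaning
    and denoising; the source does not define the actual operations.
    """
    seen, clean = set(), []
    for rec in records:
        key = (rec.get("source"), rec.get("time"), rec.get("value"))
        if None in key or key in seen:
            continue  # invalid (dirty) or redundant record
        seen.add(key)
        clean.append(rec)
    return clean

def classify_and_load(records, stores):
    """Route each record to the store for its class (here: its source)."""
    for rec in records:
        stores[rec["source"]].append(rec)
    return stores

raw = [
    {"source": "station", "time": 1, "value": 20.5},
    {"source": "station", "time": 1, "value": 20.5},    # duplicate
    {"source": "satellite", "time": 1, "value": None},  # missing value
    {"source": "satellite", "time": 2, "value": 301.2},
]
# Each key of `stores` plays the role of a separate database.
stores = classify_and_load(preprocess(raw), defaultdict(list))
print({name: len(rows) for name, rows in stores.items()})
```

Running preprocessing before classification means dirty records never reach any store, so later matching queries do not have to filter them again.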
The GPU-based real-time data stream high-speed matching analysis system mainly comprises an intermediary device and a matching processor. The system receives query requests from terminals and obtains data from heterogeneous databases. A query result can be returned to the portable terminal only by way of the wireless communication hardware and the wireless communication agent. The wireless communication hardware passes the query result to the agent, and the agent converts the electromagnetic signal into a digital signal and sends it to the requesting terminal.
As shown in Fig. 2, the GPU-based real-time data stream high-speed matching analysis system of the invention consists mainly of an intermediary device and a matching processor. The intermediary device receives the user's query request and handles interaction with the local heterogeneous data sources. The system presents the user with a global schema and performs matching queries against the data in the staged mass-data storage device. The matching processor executes the query request and returns the result to the intermediary device, which consolidates it and returns it to the user. The intermediary device consists of an integrated manager and a query processor. The integrated manager handles management operations such as adding and deleting distributed local data; through the global mapping it can manage each data source easily and so achieve integrated data management. It is also responsible for integrating local data into the global data. The matching processor accepts the user's query request and returns the query result.
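The division of labor in Fig. 2 can be sketched in a few lines. All class and function names here (`IntegratedManager`, `Mediator`, `exact_matcher`) are hypothetical, the global mapping is modeled as a plain dictionary, and the matcher is a trivial exact-subsequence placeholder rather than the GPU matching described in the source.

```python
class IntegratedManager:
    """Maintains the global mapping from source names to local data sources."""

    def __init__(self):
        self.global_map = {}

    def add_source(self, name, source):
        self.global_map[name] = source

    def remove_source(self, name):
        self.global_map.pop(name, None)

class Mediator:
    """Receives a query, fans it out to every source, and merges the results."""

    def __init__(self, manager, matcher):
        self.manager = manager
        self.matcher = matcher  # callable standing in for the matching processor

    def query(self, pattern):
        return {
            name: self.matcher(source, pattern)
            for name, source in self.manager.global_map.items()
        }

def exact_matcher(series, pattern):
    """Placeholder matcher: offsets at which `pattern` occurs exactly."""
    m = len(pattern)
    return [i for i in range(len(series) - m + 1) if series[i:i + m] == pattern]

manager = IntegratedManager()
manager.add_source("station", [1, 2, 3, 2, 3])
manager.add_source("satellite", [2, 3, 4])
mediator = Mediator(manager, exact_matcher)
hits = mediator.query([2, 3])
print(hits)
```

Because the caller sees only `Mediator.query`, sources can be added or removed through the manager without changing how queries are issued, which is the benefit the global schema is meant to provide.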
Referring next to Fig. 3, which is the flowchart of the system's processing of a continuous data query, the main steps are as follows. First, the user opens the web browser on the terminal (a mobile phone is used as the example below) and opens the corresponding website, then submits a query. The system first checks whether the user is authorized; if not, the user is returned to the login interface. If the user is authorized, the request enters the real-time data stream high-speed matching analysis system, the query is executed, and the system returns the query result to the terminal, which ends the process.
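The control flow of Fig. 3 reduces to an authorization gate in front of the query. The sketch below assumes a hypothetical in-memory set of authorized user names and a hypothetical `run_query` callable; the source does not describe how authorization is stored or checked.

```python
# Hypothetical authorization list; a real system would consult a user store.
AUTHORIZED_USERS = {"engineer01"}

def handle_request(user, query, run_query):
    """Check authorization first, then run the query and return its result."""
    if user not in AUTHORIZED_USERS:
        # Unauthorized users are sent back to the login interface.
        return {"status": "login_required"}
    return {"status": "ok", "result": run_query(query)}

# `str.upper` is only a stand-in for the matching analysis system.
denied = handle_request("guest", "q", lambda q: q.upper())
granted = handle_request("engineer01", "q", lambda q: q.upper())
print(denied, granted)
```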
Referring next to Fig. 4, this figure shows the core processing unit of the GPU-based data stream matching processor of Fig. 2. This module is the core of the system and performs high-performance computation on the GPU in three steps: Uniform Scaling of the data stream, Normalization, and data stream matching. Uniform Scaling and Normalization are applied to the data stream in host memory, and the transformed data stream is transferred to GPU video memory. In video memory the data stream is stored as a Scaling Subsequence Matrix, and high-speed matching analysis is performed using the sliding window model together with the normalized data stream. Finally, the query results are aggregated and returned to the user. Because GPUs are suited to dense computation over large amounts of data of the same type, matrix-style numerical operations make data stream matching faster and more accurate.
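The three steps can be illustrated with a CPU reference sketch in plain Python. It is not the GPU implementation described above, and the source does not define the steps precisely, so the sketch assumes definitions that are common in time-series matching: uniform scaling as nearest-index resampling to a target length, normalization as z-normalization, and matching as the smallest Euclidean distance over a sliding window. All function names are hypothetical.

```python
import math

def uniform_scale(seq, length):
    """Stretch or shrink `seq` to `length` points by nearest-index resampling."""
    n = len(seq)
    return [seq[j * n // length] for j in range(length)]

def z_normalize(seq):
    """Shift to zero mean and scale to unit standard deviation."""
    mean = sum(seq) / len(seq)
    std = math.sqrt(sum((x - mean) ** 2 for x in seq) / len(seq))
    if std == 0:
        return [0.0] * len(seq)  # constant sequence: no shape to compare
    return [(x - mean) / std for x in seq]

def best_match(stream, query, window):
    """Slide a window over `stream`; return (offset, distance) of the closest one."""
    q = z_normalize(uniform_scale(query, window))
    best = (None, math.inf)
    for i in range(len(stream) - window + 1):
        w = z_normalize(stream[i:i + window])
        d = math.sqrt(sum((a - b) ** 2 for a, b in zip(w, q)))
        if d < best[1]:
            best = (i, d)
    return best

stream = [5, 5, 5, 10, 10, 20, 20, 30, 30, 5, 5]
query = [1, 2, 3]  # same rising shape, at half the time scale and a tenth the amplitude
offset, dist = best_match(stream, query, window=6)
print(offset, dist)
```

Each window is normalized and compared independently of the others, so the loop in `best_match` is the part that lends itself to a matrix layout with one window per row and to parallel evaluation on a GPU. How the Scaling Subsequence Matrix is actually laid out is not stated in the source.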
In summary, the invention is a GPU-based design for a real-time data stream high-speed matching analysis system. Monitoring data of the data stream type are collected by various monitoring devices, preprocessed, and stored in the staged mass-data storage device, and a portable terminal with the proper authorization can issue real-time data stream matching requests at any time.
The above is illustrative only and not restrictive. This description should not be construed as limiting the invention, and any modification that does not depart from the spirit and scope of the invention shall fall within the scope of the application.