CN103838867A - Log processing method and device - Google Patents

Log processing method and device
Info

Publication number
CN103838867A
Authority
CN
China
Prior art keywords
cluster server
log
daily record
data
log data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410106430.1A
Other languages
Chinese (zh)
Inventor
洪珂
刘华明
卢荣斌
闵杰
李波
陈燕华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wangsu Science and Technology Co Ltd
Original Assignee
Wangsu Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wangsu Science and Technology Co Ltd
Priority to CN201410106430.1A
Publication of CN103838867A
Legal status: Pending

Abstract

The invention discloses a log processing method and device. The log processing method comprises the following steps: a cluster server receives log files from user terminals; the cluster server stores the log files; the cluster server analyzes the log files to obtain analysis results; and the cluster server outputs the analysis results. The log processing method and device improve log processing efficiency.

Description

Log processing method and device
Technical field
The present invention relates to the field of log processing, and in particular to a log processing method and device.
Background
Existing log processing systems usually use a traditional database as the carrier for big data, storing unstructured or semi-structured data in data tables. This makes reading and writing log data comparatively complex, with low performance and poor scalability, so the system cannot adapt to rapid changes in business. Traditional log processing systems take a long time to store and analyze massive log data, and as log data grows explosively, data processing efficiency and storage capacity can only be raised by simply adding hardware. This is costly, and it does little to improve the efficiency of processing high-dimensional data.
Conventional architectures cannot scale the storage performance of a log processing system linearly. When the load reaches the storage limit, read/write performance cannot be raised quickly and effectively. As log data grows explosively, the inefficiency of existing log processing becomes increasingly serious.
No effective solution has yet been proposed for the inefficiency of log processing in the prior art.
Summary of the invention
The main purpose of the present invention is to provide a log processing method and device that solve the problem of inefficient log processing.
To achieve this purpose, according to one aspect of the present invention, a log processing method is provided. The log processing method according to the present invention comprises: a cluster server receives a log file from a user terminal; the cluster server stores the log file; the cluster server analyzes the log file to obtain an analysis result; and the cluster server outputs the analysis result.
Further, the cluster server storing the log file comprises: the cluster server splits the log file into log data; and the cluster server sends the log data to a distributed message queue, wherein the cluster server reads the log data from the distributed message queue and analyzes the log data.
Further, after the cluster server sends the log data to the distributed message queue, the log processing method also comprises: the cluster server reads the log data from the distributed message queue; the cluster server parses the log data that was read to obtain a parsing result; the cluster server generates a key-value pair corresponding to the log data according to the parsing result; and the cluster server stores the log file by storing the key-value pair in a distributed database.
Further, the cluster server analyzing the log file comprises: the cluster server obtains incremental log data from the distributed database in real time; and the cluster server computes statistics on the incremental log data using stream computing.
Further, the cluster server analyzing the log file comprises: the cluster server obtains incremental log data from the distributed database according to a predetermined period; and the cluster server performs statistical computation on the incremental log data.
To achieve this purpose, according to another aspect of the present invention, a log processing device is provided. The log processing device according to the present invention comprises: a receiving unit, for making the cluster server receive a log file from a user terminal; a storage unit, for making the cluster server store the log file; an analysis unit, for making the cluster server analyze the log file to obtain an analysis result; and an output unit, for making the cluster server output the analysis result.
Further, the storage unit comprises: a splitting module, for making the cluster server split the log file into log data; and a sending module, for making the cluster server send the log data to a distributed message queue, wherein the cluster server reads the log data from the distributed message queue and analyzes the log data.
Further, the storage unit also comprises: a reading module, for making the cluster server read the log data from the distributed message queue after the cluster server has sent the log data to the queue; a parsing module, for making the cluster server parse the log data that was read to obtain a parsing result; a generation module, for making the cluster server generate a key-value pair corresponding to the log data according to the parsing result; and a storage module, for making the cluster server store the log file by storing the key-value pair in a distributed database.
Further, the analysis unit comprises: a first acquisition module, for making the cluster server obtain incremental log data from the distributed database in real time; and a first computing module, for making the cluster server compute statistics on the incremental log data using stream computing.
Further, the analysis unit comprises: a second acquisition module, for making the cluster server obtain incremental log data from the distributed database according to a predetermined period; and a second computing module, for making the cluster server perform statistical computation on the incremental log data.
With the present invention, the classified storage and analysis performed by the cluster server achieves efficient processing of massive logs, makes massive log analysis possible, solves the problem of inefficient log processing in the prior art, and improves log processing efficiency.
Brief description of the drawings
The accompanying drawings, which form a part of this application, provide a further understanding of the present invention. The illustrative embodiments of the present invention and their descriptions serve to explain the invention and do not unduly limit it. In the drawings:
Fig. 1 is a flowchart of the log processing method according to an embodiment of the present invention;
Fig. 2 is a flowchart of a preferred log processing method according to an embodiment of the present invention;
Fig. 3 is a schematic diagram of the log processing device according to an embodiment of the present invention; and
Fig. 4 is a schematic diagram of a preferred log processing device according to an embodiment of the present invention.
Detailed description of the embodiments
Note that, where no conflict arises, the embodiments in this application and the features in those embodiments can be combined with one another. The present invention is described in detail below with reference to the accompanying drawings and in conjunction with the embodiments.
To help those skilled in the art better understand the solution of the present invention, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are evidently only some, not all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art based on these embodiments without creative effort fall within the scope of protection of the present invention.
Note that the terms "first", "second", and so on in the description, claims, and drawings of the present invention distinguish similar objects and do not necessarily describe a specific order or sequence. It should be understood that data so used may be interchanged where appropriate, so that the embodiments of the invention described herein can be implemented. In addition, the terms "comprise" and "have" and any variants of them are intended to cover non-exclusive inclusion. For example, a process, method, system, product, or device that comprises a series of steps or units is not necessarily limited to the steps or units expressly listed, but may include other steps or units that are not expressly listed or that are inherent to the process, method, product, or device.
An embodiment of the present invention provides a log processing method. The method runs on computer equipment.
Fig. 1 is a flowchart of the log processing method according to an embodiment of the present invention. As shown in Fig. 1, the log processing method comprises the following steps:
Step S102: the cluster server receives a log file from a user terminal.
The user terminal may be a server whose logs need to be collected, or a client on the user's side whose logs need to be collected. For example, one server of a user corresponds to multiple clients; each client runs its own business and produces logs. Meanwhile, the server provides back-end services for each client and also produces logs during operation. The cluster server can receive the log files sent by the server or the clients in order to process them. The cluster server can receive log files from multiple user terminals at the same time and process the log files of different user terminals separately.
In this embodiment, an agent module can be set up on, or carried by, the user terminal whose logs need to be collected. The agent collects log files at regular intervals and sends them to the cluster server. The user terminal sends a request together with the corresponding log file over HTTP. After responding to the request, the cluster server receives the log file through the service interface it provides, so that the log file is stored on the cluster server.
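The agent's upload can be pictured as an ordinary HTTP POST. The following is a minimal sketch only: the patent does not specify the request format, so the endpoint URL, the `X-Log-Source` header, and the function name `build_upload_request` are all hypothetical. The request is built but not sent, so the example runs without a server.

```python
import urllib.request


def build_upload_request(url: str, log_bytes: bytes, source: str) -> urllib.request.Request:
    """Build (but do not send) an HTTP POST carrying one collected log file.

    The URL and the X-Log-Source header are illustrative assumptions,
    not part of the patent text.
    """
    return urllib.request.Request(
        url,
        data=log_bytes,
        method="POST",
        headers={"Content-Type": "text/plain", "X-Log-Source": source},
    )


# Hypothetical endpoint exposed by the cluster server's service interface.
req = build_upload_request("http://cluster.example/logs", b"line1\nline2\n", "client-01")
print(req.get_method(), req.full_url)
```

A real agent would pass the request to `urllib.request.urlopen` on a timer and handle network errors and retries.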
Step S104: the cluster server stores the log file.
After the log file of the user terminal has been received, it can be stored on the cluster server.
Specifically, storing a log file may first split the log file into multiple lines of log data and then send those lines one by one to a distributed message queue, for example a Kafka message queue, so that the cluster server can read the log data from the queue for analysis. After the log data has been sent to the distributed message queue, the cluster server can also read the log data from the queue, parse it, and store it in a distributed database as key-value pairs. While storing a log file, the cluster server can obtain descriptive information about it (such as the file's path and creation time) and keep that information in a database on the cluster server.
Step S106: the cluster server analyzes the log file to obtain an analysis result.
After the user terminal has transferred the log file to the cluster server, the user can access the cluster server and query its analysis results for the log file. For example, log analysis can reveal the running status or fault status of the user terminal's business. Analyzing a log file may consist of computing statistics over the information in the file to obtain statistical results.
Because users differ in how promptly they need the analysis results, log analysis can be divided into real-time analysis and offline analysis according to the timeliness the query requires. Real-time analysis usually has to return results over hundreds of millions of lines of log data within a few seconds so that the user's queries are not affected. The log data subject to real-time statistics is generally not too large in volume. It can be analyzed with stream computing, and the results are then stored in a temporary results database, for example a Redis database.
Offline analysis places lower demands on the timeliness of the statistics; results may be presented a day or a month later. The parsed log data is first stored in a distributed database such as HBase, job tasks are written in advance according to the business logic, and the tasks are run on a timer at a predetermined period to compute statistics over the logs.
Step S108: the cluster server outputs the analysis result.
Outputting the analysis result may mean sending it to the corresponding user terminal, where it can be displayed in a web page or an application for staff to review.
In this embodiment, multiple servers in the cluster receive log files, multiple servers store log files, and multiple servers analyze log files. The embodiment distributes complex computation across the individual servers, which gives the whole system high concurrency; processing capacity can reach more than 10 times that of a conventional architecture. The classified storage and analysis performed by the cluster server achieves efficient processing of massive logs, makes massive log analysis possible, solves the problem of inefficient log processing in the prior art, and improves log processing efficiency.
An embodiment of the present invention may apply cloud computing principles to process log files. Cloud computing is a model for adding, using, and delivering Internet-based services, and it usually involves providing dynamically scalable and often virtualized resources over the Internet. "Cloud" is a metaphor for the network or the Internet. In the past, diagrams often used a cloud to represent the telecommunications network, and later also the Internet and its underlying infrastructure. In the narrow sense, cloud computing refers to the delivery and usage model of IT infrastructure, in which the required resources are obtained over the network on demand and in an easily scalable way. In the broad sense, it refers to the delivery and usage model of services, in which the required services are obtained over the network on demand and in an easily scalable way. Such services may relate to IT, software, or the Internet, or may be other services. This means that computing power can also circulate over the Internet as a commodity. As an emerging technical concept, cloud computing provides cloud storage (distributed storage of massive data), cloud computation (Hadoop MapReduce, real-time stream computing), cloud security, and so on, which suit the needs of big data storage, mining, analysis, early warning, and statistics very well; its efficiency helps ensure that data is processed promptly and accurately. Based on the principles of a cloud computing platform, this embodiment chooses the log data storage up front and classifies processing according to data volume and query timeliness requirements. Most importantly, it parallelizes the analysis of a single business task, rather than merely running multiple tasks in parallel, which greatly improves query efficiency and the correctness of the statistics.
The purpose of this embodiment is to provide cloud storage for massive logs and a cloud computing service that can analyze and mine massive logs promptly and in depth, while ensuring the security and accuracy of the log data. It also means that growth in log volume can be handled by adding new compute nodes, rather than only by adding hardware to raise data processing efficiency and storage capacity.
Preferably, the step in which the cluster server stores the log file comprises the following steps:
Step S1: the cluster server splits the log file into log data.
Because log files from different user terminals have different formats, and each log file contains multiple log records, splitting a log file into log data may mean splitting it into multiple lines of log data, each forming one data row, so that log files of differing formats are split into log data and sent to the distributed message queue.
Step S2: the cluster server sends the log data to the distributed message queue. The cluster server then reads the log data from the distributed message queue and analyzes it.
The distributed message queue may be a Kafka message queue. Kafka is well suited to simple message delivery and distribution and can support large data volumes, log data in particular; combined with MapReduce for real-time analysis it can also achieve good results.
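Steps S1 and S2 can be sketched in a few lines. This is an illustration under stated assumptions, not the patented implementation: a standard-library `queue.Queue` stands in for the Kafka queue, and the function names and the sample log lines are hypothetical.

```python
from queue import Queue


def split_log_file(text: str) -> list[str]:
    """Step S1: split a log file into one row of log data per non-empty line."""
    return [line.strip() for line in text.splitlines() if line.strip()]


def enqueue_lines(lines: list[str], queue: Queue) -> int:
    """Step S2: send each row to the message queue in order; return the count sent."""
    for line in lines:
        queue.put(line)
    return len(lines)


# In-memory stand-in for the distributed message queue (an assumption).
log_queue: Queue = Queue()
raw = "aa:bb:cc:00:00:01 1200 video\n\naa:bb:cc:00:00:02 300 web\n"
sent = enqueue_lines(split_log_file(raw), log_queue)
print(sent, log_queue.qsize())
```

With a real Kafka deployment, `queue.put` would be replaced by a producer call to a topic, and a separate consumer process would read the rows.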
Preferably, after the step in which the cluster server sends the log data to the distributed message queue, the log processing method also comprises: the cluster server reads the log data from the distributed message queue; the cluster server parses the log data that was read to obtain a parsing result; the cluster server generates a key-value pair corresponding to the log data according to the parsing result; and the cluster server stores the log file by storing the key-value pair in a distributed database.
Specifically, log data is read from the distributed message queue and each log entry is parsed to extract its key fields, such as MAC address, traffic, and specific application. A key-value pair for the log data is generated from these parsing results, for example with the MAC address as the key and the other parsed fields as the value. The key-value pair is then mapped and stored in a distributed database such as HBase.
This embodiment uses the distributed database HBase to store the parsed log data. Because HBase is based on a key-value storage data model, it scales well, and reading data from HBase for analysis is sufficiently fast. Results can be stored anywhere: back in HBase, in a relational database, or in Redis, without incompatibility.
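The parse-then-store step can be illustrated as follows. The patent does not give a log format, so the whitespace-separated layout `<mac> <traffic> <app>` is an assumption, as are the function names; a plain dictionary stands in for the HBase table.

```python
def parse_line(line: str) -> dict:
    """Parse one log row. The '<mac> <traffic> <app>' layout is hypothetical."""
    mac, traffic, app = line.split()
    return {"mac": mac, "traffic": int(traffic), "app": app}


def to_key_value(parsed: dict) -> tuple[str, dict]:
    """Use the MAC address as the key and the remaining fields as the value."""
    value = {name: field for name, field in parsed.items() if name != "mac"}
    return parsed["mac"], value


# A dict of lists stands in for the distributed database (an assumption).
store: dict[str, list[dict]] = {}
lines = [
    "aa:bb:cc:00:00:01 1200 video",
    "aa:bb:cc:00:00:02 300 web",
    "aa:bb:cc:00:00:01 50 web",
]
for row in lines:
    key, value = to_key_value(parse_line(row))
    store.setdefault(key, []).append(value)

print(store["aa:bb:cc:00:00:01"])
```

Keying by MAC address groups all records of one device together, which suits the per-device traffic statistics described later in the application scenario.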
Preferably, the cluster server analyzing the log file comprises: the cluster server obtains incremental log data from the distributed database in real time; and the cluster server computes statistics on the incremental log data using stream computing.
As log files keep accumulating, the log data stored in the distributed database keeps growing. Real-time analysis in this embodiment may have the cluster server obtain incremental log data from the distributed database in real time and compute statistics on that increment, which avoids recomputing log data that has already been processed. The incremental data is aggregated using stream computing. The stream computing is carried out by Storm bolts, which perform a series of operations such as filtering, aggregation, and database queries. The filtering can be completed during the earlier parse stage, whose output is stored in HBase as DB tables, so the stream computation only performs a map step to organize the needed data and then aggregates it.
Specifically, log data is first taken from the Kafka queue, parsed, and stored in HBase; this process splits the log records and maps them into DB tables in HBase. Next, stream computing performs the real-time statistical analysis. It is carried out by Storm bolts, which handle filtering, aggregation, database queries, and similar operations; because filtering can be completed in the earlier parse stage and the parsed data sits in HBase as DB tables, the stream computation only maps the needed data together and aggregates it. The results of the stream computation are then stored in a database such as Redis. Finally, the results held in Redis are stored in HBase or in a relational database such as MySQL as needed, so that users can query the statistics.
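The idea of counting only the increment can be shown without Storm or HBase. The sketch below is a stand-in under stated assumptions: a Python list plays the role of the parsed records in the distributed database, and the hypothetical `IncrementalTrafficCounter` remembers how far it has read so that records already counted are never recomputed.

```python
from collections import defaultdict


class IncrementalTrafficCounter:
    """Aggregate traffic per MAC address, processing each record exactly once."""

    def __init__(self) -> None:
        self.offset = 0                 # number of records already processed
        self.totals = defaultdict(int)  # running total of traffic per MAC

    def consume(self, records: list[tuple[str, int]]) -> int:
        """Process only the records appended since the last call."""
        increment = records[self.offset:]
        for mac, traffic in increment:
            self.totals[mac] += traffic
        self.offset = len(records)
        return len(increment)


# Parsed (mac, traffic) records; the list stands in for the database table.
table = [("aa:01", 100), ("aa:02", 40)]
counter = IncrementalTrafficCounter()
first = counter.consume(table)    # processes both existing records
table.append(("aa:01", 25))       # new log data arrives
second = counter.consume(table)   # processes only the new record
print(first, second, dict(counter.totals))
```

In a Storm topology the same running totals would live inside a bolt, and the final dictionary would be written to a results store such as Redis.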
The above embodiment describes a real-time analysis flow for log analysis. Processing massive logs according to this flow returns results to the client almost instantly and improves the timeliness of log analysis results.
Preferably, the cluster server analyzing the log file comprises: the cluster server obtains incremental log data from the distributed database according to a predetermined period; and the cluster server performs statistical computation on the incremental log data.
Because users differ in how promptly they need analysis results, log data can also be analyzed offline. The analysis cycle can be set in advance as the predetermined period, which can be chosen as needed, for example one week or one month. Incremental log data is obtained from the distributed database according to the predetermined period, and statistics are then computed on it.
Specifically, this can be implemented through the following steps:
Step 1: take log data from the Kafka queue, parse it, and store it in HBase. This process splits the log records and maps them into DB tables in HBase.
Step 2: create job tasks one by one as required; the task logic is determined by the actual business logic.
Step 3: create a periodic scheduling task, that is, set up a schedule for the job tasks. For example, create task 1 in advance and run it at midnight every day.
Step 4: when the scheduled time arrives, start the task according to the schedule.
Step 5: execute the task logic to compute statistics over the log data.
Step 6: if the task fails, a preset notification module notifies the relevant user by SMS or email; the user investigates the cause and then restarts the job task manually.
Step 7: after the task succeeds, store the result in HBase so that users can query it.
Step 8: once the task has succeeded and the result is in HBase, the notification module can notify the user by SMS or email that the task succeeded.
The above embodiment describes an offline analysis flow for log analysis. Massive logs are processed in parallel according to this flow, and the results are reported to the front end for display to users.
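Steps 4 through 8 of the offline flow amount to "run the job, store the result on success, notify either way". The sketch below illustrates that control flow only. All names are hypothetical, a dictionary stands in for HBase, and a list of messages stands in for the SMS or email notification module.

```python
from typing import Callable


def run_scheduled_job(
    name: str,
    job: Callable[[list[int]], int],
    increment: list[int],
    results: dict[str, int],
    notify: Callable[[str], None],
) -> bool:
    """Run one job over the incremental data and report the outcome."""
    try:
        outcome = job(increment)            # step 5: execute the task logic
    except Exception as exc:
        notify(f"{name} failed: {exc}")     # step 6: notify on failure
        return False
    results[name] = outcome                 # step 7: store the result
    notify(f"{name} succeeded")             # step 8: notify on success
    return True


def failing_job(values: list[int]) -> int:
    """A job that always fails, used to exercise the failure path."""
    raise ValueError("bad input")


messages: list[str] = []
results: dict[str, int] = {}
ok = run_scheduled_job("task 1", sum, [100, 40, 25], results, messages.append)
bad = run_scheduled_job("task 2", failing_job, [1], results, messages.append)
print(ok, bad, results, messages)
```

The periodic trigger itself (step 3) is left out; in practice it would be a cron entry or a scheduler service that calls `run_scheduled_job` at the configured time.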
Fig. 2 is a flowchart of a preferred log processing method according to an embodiment of the present invention. As shown in Fig. 2, the log processing method comprises the following steps:
Step 202: extract the log file of the user terminal. This may mean extracting log files related to preset keywords. A script-based agent module is deployed on the user terminal's server and collects the required logs at regular intervals according to business needs. After the log file has been extracted, it can be pushed to the cluster server.
Step 204: store the pushed log file on the cluster server. Storing a log file on the cluster server involves first storing the log file itself and then storing its descriptor (including the path where the log is kept, its size, its time, and so on) in Redis.
Step 206: the cluster server reads the log data and sends it to the distributed message queue.
Step 208: the cluster server reads the log data from the distributed message queue and parses it. The logs are parsed first to extract the useful data, and the parsed data is stored in the corresponding table fields in HBase.
Step 210: read and analyze the parsed log data to obtain the analysis result. The parsed data can be analyzed in two ways: real-time analysis and offline analysis.
Step 212: present the analysis result on the user terminal. The result may be delivered through Thrift and presented as a web page or a mobile app.
The above embodiment describes the complete flow of a log from collection through analysis to the final display of results. The classified storage and analysis performed by the cluster server achieves efficient processing of massive logs and makes massive log analysis possible.
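The descriptor kept alongside each stored log file in step 204 can be sketched as a small record of path, size, and time. The field names and the function `describe_log_file` are assumptions made for illustration; in the patent the record goes to Redis, whereas here it is simply returned as a dictionary.

```python
import os
import tempfile


def describe_log_file(path: str) -> dict:
    """Collect the descriptor of a stored log file: path, size, and time."""
    info = os.stat(path)
    return {"path": path, "size": info.st_size, "mtime": info.st_mtime}


# Write a small sample log to a temporary directory and describe it.
with tempfile.TemporaryDirectory() as tmp:
    sample_path = os.path.join(tmp, "app.log")
    with open(sample_path, "wb") as handle:
        handle.write(b"GET /index 200\n")
    descriptor = describe_log_file(sample_path)

print(descriptor["size"])
```

Keeping the descriptor separate from the file contents lets later stages locate and read a log file without scanning the storage directory.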
The present invention is described in detail below through an application scenario of the log processing method of the embodiment.
The processing of aggregated video traffic logs proceeds as follows. First, the aggregated video traffic logs are collected. Then the cluster server splits the collected traffic logs into lines of log data and sends them to the Kafka queue.
After the traffic logs have been sent to the Kafka queue, the cluster server reads the log data from the queue in turn and parses each log entry into key fields, such as MAC address, traffic, and specific application.
From the parsing result, the cluster server forms the key-value pair for the log data, for example with the MAC address as the key and the remaining fields as the value, and maps and stores the log data in HBase.
The log files can then be analyzed statistically by real-time or offline analysis as needed. For offline analysis, the scheduling period may be 2 hours: when the scheduled time arrives, the predesigned task starts, incrementally computes the traffic for those 2 hours, and updates the monthly traffic record. The user is also informed of the task's execution status.
Real-time analysis can respond to a query by quickly looking up the traffic between the end of the last task run and the time of the query. The result of this real-time query, combined with the statistics from the last task run, is returned to the user as the actual traffic data.
Finally, the analysis result is shown to the user through an interface.
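The interplay of the 2-hour offline task and the real-time query in this scenario can be sketched as two small functions. This is an illustration under assumptions: records are `(mac, traffic)` tuples, the monthly record is a dictionary, and both function names are hypothetical.

```python
def update_monthly(monthly: dict[str, int], window: list[tuple[str, int]]) -> dict[str, int]:
    """Offline task: add the traffic of one 2-hour window to the monthly record."""
    for mac, traffic in window:
        monthly[mac] = monthly.get(mac, 0) + traffic
    return monthly


def current_traffic(monthly: dict[str, int], since_last_run: list[tuple[str, int]], mac: str) -> int:
    """Real-time query: last task's total plus traffic seen since that task ran."""
    recent = sum(traffic for seen_mac, traffic in since_last_run if seen_mac == mac)
    return monthly.get(mac, 0) + recent


monthly_record: dict[str, int] = {}
update_monthly(monthly_record, [("aa:01", 100), ("aa:02", 50), ("aa:01", 25)])
# Records that arrived after the last scheduled run and are not yet in the monthly record.
pending = [("aa:01", 10)]
print(current_traffic(monthly_record, pending, "aa:01"))
```

Because the offline task has already folded most of the data into the monthly record, the real-time query only has to scan the short tail of records since the last run.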
Based on the principles of a cloud computing platform, this embodiment chooses the data storage up front and classifies processing according to data volume and query timeliness requirements. Most importantly, it parallelizes the analysis of a single business task, rather than merely running multiple tasks in parallel, which greatly improves query efficiency and the correctness of the statistics.
An embodiment of the present invention provides a log processing device, which can implement its functions through a cluster server. Note that the log processing device of the embodiment can be used to carry out the log processing method provided by the embodiment, and the log processing method can likewise be carried out by the log processing device.
Fig. 3 is a schematic diagram of the log processing device according to an embodiment of the present invention. As shown in Fig. 3, the log processing device comprises a receiving unit 10, a storage unit 30, an analysis unit 50, and an output unit 70.
The receiving unit 10 makes the cluster server receive a log file from a user terminal.
The user terminal may be a server whose logs need to be collected, or a client on the user's side whose logs need to be collected. For example, one server of a user corresponds to multiple clients; each client runs its own business and produces logs. Meanwhile, the server provides back-end services for each client and also produces logs during operation. The cluster server can receive the log files sent by the server or the clients in order to process them. The cluster server can receive log files from multiple user terminals at the same time and process the log files of different user terminals separately.
In this embodiment, an agent module can be set up on, or carried by, the user terminal whose logs need to be collected. The agent collects log files at regular intervals and sends them to the cluster server. The user terminal sends a request together with the corresponding log file over HTTP. After responding to the request, the cluster server receives the log file through the service interface it provides, so that the log file is stored on the cluster server.
The storage unit 30 makes the cluster server store the log file.
After the log file of the user terminal has been received, it can be stored on the cluster server.
Specifically, storing a log file may first split the log file into multiple lines of log data and then send those lines one by one to a distributed message queue, for example a Kafka message queue, so that the cluster server can read the log data from the queue for analysis. After the log data has been sent to the distributed message queue, the cluster server can also read the log data from the queue, parse it, and store it in a distributed database as key-value pairs. While storing a log file, the cluster server can obtain descriptive information about it (such as the file's path and creation time) and keep that information in a database on the cluster server.
The analysis unit 50 is configured to cause the cluster server to analyze the log file and obtain an analysis result.
After the user terminal has transferred the log file to the cluster server, the user can access the cluster server and query its analysis results for the log file. For example, log analysis can reveal the running status or fault status of the services on the user terminal. Analyzing the log file may consist of computing statistics over the information in the log file to obtain statistical results.
Because users differ in how promptly they need the analysis results, log analysis can be divided into real-time analysis and offline analysis according to the timeliness required of a query. Real-time analysis usually has to return an analysis of over a hundred million lines of log data within a few seconds, so that users' queries of the analysis results are not affected. The log data subject to real-time statistics is generally not very large in volume and can be analyzed statistically by stream computing; the results are held in a temporary results database, for example Redis, and stored after further processing.
Offline analysis places lower demands on the timeliness of the statistics; the analysis results may be presented a day or a month later. The parsed log data is first stored in a distributed database such as HBase, job tasks are written in advance according to the business logic, and the tasks are run on a timer at a predetermined period to compute statistics over the logs.
The output unit 70 is configured to cause the cluster server to output the analysis result.
Outputting the analysis result may mean sending it to the corresponding user terminal, where it can be displayed in a web page or an application so that staff can view it on the user terminal.
In this embodiment, multiple servers in the cluster server receive log files, multiple servers store log files, and multiple servers analyze log files. The embodiment distributes complex computation across the individual servers, which gives the whole system high concurrency; processing capacity can reach more than 10 times that of a conventional architecture. By classifying storage and analysis, the cluster server processes massive logs efficiently. This makes massive log analysis possible, solves the prior-art problem of inefficient log processing, and improves log processing efficiency.
The embodiment may apply cloud computing principles to process log files. Cloud computing is a model for adding, using, and delivering Internet-based services, and it usually involves providing dynamically scalable and often virtualized resources over the Internet. "Cloud" is a metaphor for the network and the Internet. In the past, a cloud was often used in diagrams to represent a telecommunications network; later it also came to represent an abstraction of the Internet and its underlying infrastructure. In the narrow sense, cloud computing refers to a delivery and usage model for IT infrastructure, in which the required resources are obtained over the network on demand and in an easily scalable way. In the broad sense, it refers to a delivery and usage model for services, in which the required services are obtained over the network on demand and in an easily scalable way. Such services may relate to IT, software, or the Internet, or may be other services. This means that computing power can also circulate over the Internet as a commodity. As an emerging technical concept, cloud computing provides cloud storage (distributed storage of massive data), cloud computation (Hadoop MapReduce and real-time stream computing), cloud security, and so on. These are well suited to the storage, mining, analysis, early warning, and statistics of big data, and their efficiency helps ensure that data is processed promptly and accurately. Based on the principles of a cloud computing platform, the embodiment selects the log data storage up front and classifies processing according to data volume and the timeliness required of queries. Most importantly, it parallelizes the analysis within a single business task, rather than merely running multiple tasks in parallel, which greatly improves query efficiency and the correctness of the statistics.
The purpose of the embodiment is to provide cloud storage for massive logs and a cloud computing service that can analyze and deeply mine massive logs in a timely manner, while ensuring the security and accuracy of the log data. It also means that growth in log volume can be handled simply by adding computing nodes, rather than relying solely on hardware upgrades to improve data processing efficiency and increase storage capacity.
Preferably, the storage unit includes a splitting module and a transfer module.
The splitting module is configured to cause the cluster server to split the log file into log data.
Log files from different user terminals differ in format, and each log file contains multiple log records. Splitting a log file into log data may therefore mean splitting it into multiple lines of log data, each forming one data row, so that log files of differing formats are all split into log data that can be sent to the distributed message queue.
The transfer module is configured to cause the cluster server to send the log data to the distributed message queue. The cluster server then reads the log data from the distributed message queue and analyzes it.
The distributed message queue may be a Kafka message queue. Kafka is well suited to simple message transport and distribution, supports large data volumes, log data in particular, and can also achieve good results when combined with MapReduce for real-time analysis.
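The splitting and transfer modules can be sketched as follows. This is a minimal illustration, not the patent's implementation: an in-memory `queue.Queue` stands in for the Kafka message queue, the function names are hypothetical, and dropping blank lines is an assumption made for the sketch.

```python
import queue
from typing import Callable, List


def split_log_file(text: str) -> List[str]:
    """Split raw log file content into non-empty lines of log data."""
    return [line for line in text.splitlines() if line.strip()]


def send_lines(lines: List[str], send: Callable[[str], None]) -> int:
    """Send each line in order through `send` and return how many were sent."""
    for line in lines:
        send(line)
    return len(lines)


if __name__ == "__main__":
    message_queue: "queue.Queue[str]" = queue.Queue()  # in-memory stand-in for Kafka
    sent = send_lines(split_log_file("a=1\n\nb=2\n"), message_queue.put)
    print(sent, message_queue.qsize())
```

Passing the send function as a parameter means the same code could later be pointed at a real message-queue producer without changing the splitting logic.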
Preferably, the storage unit further includes a reading module, a parsing module, a generating module, and a storing module.
The reading module is configured to cause the cluster server to read the log data from the distributed message queue after the cluster server has sent the log data to the queue. The parsing module is configured to cause the cluster server to parse the log data it has read and obtain a parsing result. The generating module is configured to cause the cluster server to generate, from the parsing result, the key-value pair corresponding to the log data. The storing module is configured to cause the cluster server to store the log file by storing the key-value pair in a distributed database.
Specifically, log data is read from the distributed message queue and each log entry is parsed to obtain its key fields, such as the MAC address, the traffic volume, and the specific application. The key-value pair corresponding to the log data is generated from these parsing results, for example with the MAC address as the key and the other parsing results as the value. Once the key-value pair has been obtained, the log data is mapped and stored in a distributed database such as HBase.
The embodiment uses the distributed database HBase to store the parsed log data. Because HBase is based on a key-value storage model, it scales well and reading data from it for analysis is fast enough. The results can be stored anywhere: back in HBase, in a relational database, or in Redis, without any incompatibility.
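The parsing and key-value generation described above might look like the following sketch. The log line format (space-separated `name=value` fields) and the field names `mac`, `traffic`, and `app` are assumptions chosen for illustration; the source does not specify a format. A plain dictionary stands in for the distributed database.

```python
from typing import Dict, Tuple


def parse_log_line(line: str) -> Dict[str, str]:
    """Parse a line of space-separated `name=value` fields (hypothetical format)."""
    fields = {}
    for token in line.split():
        name, sep, value = token.partition("=")
        if sep:  # skip tokens that are not name=value
            fields[name] = value
    return fields


def to_key_value(fields: Dict[str, str]) -> Tuple[str, Dict[str, str]]:
    """Use the MAC address as the key and the remaining fields as the value."""
    value = {name: v for name, v in fields.items() if name != "mac"}
    return fields["mac"], value


if __name__ == "__main__":
    store: Dict[str, Dict[str, str]] = {}  # in-memory stand-in for the distributed database
    key, value = to_key_value(parse_log_line("mac=00:11:22:33:44:55 traffic=1024 app=video"))
    store[key] = value
    print(store)
```

In this sketch a second entry with the same MAC address would overwrite the first, so a real deployment would presumably need a more specific row key; that choice is outside what the source describes.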
Preferably, the analysis unit includes a first acquiring module and a first computing module.
The first acquiring module is configured to cause the cluster server to acquire incremental log data from the distributed database in real time. The first computing module is configured to cause the cluster server to compute statistics over the incremental log data using stream computing.
Because log files keep accumulating, the log data stored in the distributed database also keeps growing. Real-time analysis in this embodiment may consist of the cluster server acquiring incremental log data from the distributed database in real time and computing statistics over that increment by stream computing, which avoids recomputing log data that has already been processed. The stream computing is carried out by Storm bolts. A bolt can perform a series of operations such as filtering, aggregation, and database queries. Here, the filtering can already be completed in the earlier parse step, whose output is stored in HBase in the form of DB tables, so the stream computation only performs a map step that organizes the required data for aggregation.
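The idea of computing statistics only over the increment can be sketched as follows. This is plain Python rather than a Storm bolt, and the row fields `mac` and `traffic` and the class name are hypothetical; a list of dictionaries stands in for the table in the distributed database.

```python
from collections import Counter
from typing import Dict, List, Tuple

Row = Dict[str, str]


def to_pair(row: Row) -> Tuple[str, int]:
    """Map step: reduce a row to the (key, number) pair needed for aggregation."""
    return row["mac"], int(row["traffic"])


class IncrementalTrafficStats:
    """Keeps running totals and only processes rows it has not seen before."""

    def __init__(self) -> None:
        self.next_index = 0  # position of the first unprocessed row
        self.totals: Counter = Counter()

    def update(self, table: List[Row]) -> int:
        """Aggregate rows appended to `table` since the last call."""
        increment = table[self.next_index:]
        for mac, traffic in map(to_pair, increment):
            self.totals[mac] += traffic  # aggregation step
        self.next_index = len(table)
        return len(increment)
```

Remembering the position of the last processed row is what prevents earlier rows from being counted twice when `update` is called again.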
Specifically, log data is first taken from the Kafka queue, parsed, and stored in HBase; this step splits the log records and maps them into DB-table form in HBase. Next, stream computing is used for real-time statistical analysis. The stream computing is carried out by Storm bolts, which can perform filtering, aggregation, database queries, and similar operations; because the filtering is completed in the earlier parse step, the stream computation only performs a map step that organizes the required data for aggregation. The results of the stream computation are then stored in a database such as Redis. Finally, the results held in Redis are stored, as actually needed, in HBase or in the relational database MySQL, where users can query the statistics.
The above embodiment describes a real-time analysis flow for log analysis. Processing massive logs according to this flow feeds results back to the client almost instantly and improves the timeliness of the log analysis results.
Preferably, the analysis unit includes a second acquiring module and a second computing module.
The second acquiring module is configured to cause the cluster server to acquire incremental log data from the distributed database at a predetermined period. The second computing module is configured to cause the cluster server to perform statistical computation on the incremental log data.
Because users differ in how promptly they need the analysis results, log data can also be analyzed offline. The analysis cycle is set in advance as the predetermined period, which can be chosen as needed, for example one week or one month. Incremental log data is acquired from the distributed database at the predetermined period, and statistical computation is then performed on it.
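One way to picture the periodic acquisition of incremental data is a time-window filter, sketched below. The `time` field on each row and the half-open window are assumptions made for this illustration; the source does not say how the increment is identified.

```python
from datetime import datetime, timedelta
from typing import Dict, List


def rows_for_period(rows: List[Dict], period_end: datetime, period: timedelta) -> List[Dict]:
    """Return rows whose `time` falls in [period_end - period, period_end)."""
    period_start = period_end - period
    return [row for row in rows if period_start <= row["time"] < period_end]
```

Using a half-open window means a row stamped exactly at the boundary belongs to the next run only, so consecutive periods never count the same row twice.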
Specifically, this can be implemented through the following steps:
Step 1: take log data from the Kafka queue, parse it, and store it in HBase. This step splits the log records and maps them into DB-table form in HBase.
Step 2: create job tasks one by one according to specific needs; the task logic is determined by the actual business logic.
Step 3: create a periodic scheduling task, that is, schedule the job tasks to run periodically. For example, with task 1 created in advance, run task 1 at midnight every day.
Step 4: when the scheduled time arrives, start the task according to the schedule.
Step 5: execute the task logic to compute statistics over the log data.
Step 6: if the task fails, a preconfigured notification module notifies the relevant user by SMS or email, and the user manually restarts the job task after investigating the cause.
Step 7: after the task succeeds, store the result in HBase so that users can query it.
Step 8: after the task has succeeded and the result has been stored in HBase, the notification module can notify the user by SMS or email that the task succeeded.
The above embodiment describes an offline analysis flow for log analysis. Massive logs are processed in parallel according to this flow, and the results are reported to the front end for display to users.
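Steps 3 to 8 of the offline flow can be sketched as below. This is a simplified illustration under stated assumptions: the function names are hypothetical, a dictionary stands in for HBase, and `notify` is any callable standing in for the SMS or email notification module.

```python
from datetime import datetime, timedelta
from typing import Callable, Dict


def next_midnight(now: datetime) -> datetime:
    """Return the first midnight strictly after `now` (when the daily task should run)."""
    return datetime(now.year, now.month, now.day) + timedelta(days=1)


def run_job(name: str, task: Callable[[], Dict], results: Dict[str, Dict],
            notify: Callable[[str], None]) -> bool:
    """Run one job task, store its result on success, and notify either way."""
    try:
        result = task()                 # step 5: execute the task logic
    except Exception as error:          # step 6: report the failure
        notify(f"{name} failed: {error}")
        return False
    results[name] = result              # step 7: keep the result for queries
    notify(f"{name} succeeded")         # step 8: report the success
    return True
```

A failed job leaves the results store untouched, which matches the flow above in which the user investigates and restarts the job manually.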
Fig. 4 is a schematic diagram of a preferred log processing device according to an embodiment of the present invention. As shown in Fig. 4, the log processing device of this embodiment includes a log collection module 20, a log storage module 40, a log analysis module 60, and a display module 80.
The log collection module 20 extracts the relevant logs from an external system. The external system may be a server whose logs need to be collected, or a client on the user's side whose logs need to be collected, that is, the user terminal of the embodiment. Specifically, an agent can be designed and mounted on the server whose logs need to be collected; it periodically collects the relevant logs and sends them to the storage module.
The log storage module 40 stores the collected logs on the collector cluster server. It has two functions. First, it stores the log files collected over HTTP on the cluster server and stores the descriptive information of each log file (such as file path and creation time) in Redis. Second, in the processor stage, it reads the descriptive information of the log files from Redis and sends the corresponding log file data to the Kafka message queue, where the log analysis module 60 can pick it up for analysis. The log storage module 40 can be implemented by the storage unit of the embodiment.
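The two functions of the log storage module can be sketched as follows. A dictionary stands in for Redis and the `send` callable stands in for the Kafka producer; the function names and the choice to process files in creation order are assumptions made for this illustration.

```python
from typing import Callable, Dict


def record_descriptor(descriptors: Dict[str, Dict], path: str, created: float) -> None:
    """Store the descriptive information (path, creation time) of a received log file."""
    descriptors[path] = {"path": path, "created": created}


def process_pending(descriptors: Dict[str, Dict], read_file: Callable[[str], str],
                    send: Callable[[str], None]) -> int:
    """Send the lines of every described file, oldest file first; return the line count."""
    sent = 0
    for info in sorted(descriptors.values(), key=lambda d: d["created"]):
        for line in read_file(info["path"]).splitlines():
            send(line)
            sent += 1
    return sent
```

Keeping only descriptors in the fast store and reading file contents on demand separates the receiving step from the processor step, as the paragraph above describes.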
The log analysis module 60 computes statistics over the log data. According to the timeliness required of a query, its work is divided into real-time analysis and offline analysis. The log analysis module 60 can be implemented by the analysis unit of the embodiment.
Real-time analysis usually has to return an analysis of over a hundred million rows of data within a few seconds. Instant log data is dispatched from the log storage module 40 for real-time statistics; this portion of the data is generally not very large and can be analyzed by stream computing. The results are held temporarily in Redis and, after processing, stored in HBase, from which the front end can conveniently fetch them for display.
Offline analysis places lower demands on the timeliness of the statistics; results can be shown a day or a month later. The parsed log data from the log storage module is first stored in HBase, job tasks are written in advance according to the business logic, and the tasks are run on a timer to compute statistics over the logs.
The display module 80 presents the log analysis results to the user through a web page or a mobile app.
The advantages of the embodiment are as follows. First, logs are collected by a pluggable agent, so the log types to collect can be configured easily and the agent can be uninstalled at any time when it is no longer needed; this is quick and convenient and requires no further custom development. Second, cluster storage serves as a log center that accepts all logs sent to it and stores them centrally after key-value processing. In particular, as log volume grows, capacity can be expanded simply by adding hardware such as disks and memory, which is quick, convenient, and saves cost. Third, the log analysis module 60 classifies processing according to log volume and actual demand, so large data volumes are analyzed quickly and accurately, users can be notified of results automatically, and timeliness is well ensured. Fourth, HBase is used to store the parsed log data. Because HBase is based on a key-value storage model, it scales well and reading data from it for analysis is fast enough. The results can be stored anywhere: back in HBase, in a relational database, or in Redis, without any incompatibility.
In summary, the present invention has the following effects:
High computing capacity: complex computation is distributed across the individual servers, which gives the whole device high concurrency, with processing capacity more than 10 times that of a conventional architecture.
High reliability: in users' actual application environments, hardware and software faults of various kinds are fairly likely to occur, and anomalies such as hardware damage, network interruption, and system crashes can all cause service interruption or even data loss. The embodiment is a log processing device for massive logs built on a cloud platform, so it can use the multi-master redundancy of the cloud computing environment to ensure high service reliability.
Large, expandable storage: the embodiment can aggregate the local storage of all user terminals, supports petabyte-scale storage capacity, and makes storage expansion very easy; the whole expansion process does not affect the continuous operation of the service.
Low cost: the software used by the embodiment is open source and the hardware consists of low-end PC servers, so the total cost is low.
It should be noted that, for simplicity of description, each of the foregoing method embodiments is expressed as a series of combined actions. Those skilled in the art should understand, however, that the present invention is not limited by the described order of actions, because according to the present invention some steps may be performed in other orders or simultaneously. Those skilled in the art should also understand that the embodiments described in the specification are all preferred embodiments, and the actions and modules involved are not necessarily required by the present invention.
In the above embodiments, the description of each embodiment has its own emphasis. For parts not described in detail in one embodiment, refer to the related descriptions of the other embodiments.
In the several embodiments provided in this application, it should be understood that the disclosed device may be implemented in other ways. For example, the device embodiments described above are merely illustrative. The division into units is only a division by logical function, and an actual implementation may divide them differently: for example, multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection between devices or units through some interfaces, and may be electrical or take other forms.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention in essence, or the part of it that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a mobile terminal, a server, a network device, or the like) to perform all or part of the steps of the methods described in the embodiments of the present invention. The aforementioned storage medium includes various media capable of storing program code, such as a USB flash drive, read-only memory (ROM), random access memory (RAM), a removable hard disk, a magnetic disk, or an optical disc.
The foregoing describes only preferred embodiments of the present invention and is not intended to limit the present invention. For those skilled in the art, the present invention may have various modifications and variations. Any modification, equivalent replacement, improvement, or the like made within the spirit and principles of the present invention shall fall within the scope of protection of the present invention.

Claims (10)

Translated from Chinese
1. A log processing method, characterized by comprising: a cluster server receiving a log file from a user terminal; the cluster server storing the log file; the cluster server analyzing the log file to obtain an analysis result; and the cluster server outputting the analysis result.
2. The log processing method according to claim 1, characterized in that the cluster server storing the log file comprises: the cluster server splitting the log file into log data; and the cluster server sending the log data to a distributed message queue, wherein the cluster server reads the log data from the distributed message queue and analyzes the log data.
3. The log processing method according to claim 2, characterized in that, after the cluster server sends the log data to the distributed message queue, the log processing method further comprises: the cluster server reading the log data from the distributed message queue; the cluster server parsing the log data that has been read to obtain a parsing result; the cluster server generating, according to the parsing result, a key-value pair corresponding to the log data; and the cluster server storing the log file by storing the key-value pair in a distributed database.
4. The log processing method according to claim 3, characterized in that the cluster server analyzing the log file comprises: the cluster server acquiring incremental log data from the distributed database in real time; and the cluster server computing statistics over the incremental log data using stream computing.
5. The log processing method according to claim 3, characterized in that the cluster server analyzing the log file comprises: the cluster server acquiring incremental log data from the distributed database according to a preset period; and the cluster server performing statistical computation on the incremental log data.
6. A log processing device, characterized by comprising: a receiving unit, configured to cause a cluster server to receive a log file from a user terminal; a storage unit, configured to cause the cluster server to store the log file; an analysis unit, configured to cause the cluster server to analyze the log file to obtain an analysis result; and an output unit, configured to cause the cluster server to output the analysis result.
7. The log processing device according to claim 6, characterized in that the storage unit comprises: a splitting module, configured to cause the cluster server to split the log file into log data; and a transfer module, configured to cause the cluster server to send the log data to a distributed message queue, wherein the cluster server reads the log data from the distributed message queue and analyzes the log data.
8. The log processing device according to claim 7, characterized in that the storage unit further comprises: a reading module, configured to cause the cluster server to read the log data from the distributed message queue after the cluster server sends the log data to the distributed message queue; a parsing module, configured to cause the cluster server to parse the log data that has been read to obtain a parsing result; a generating module, configured to cause the cluster server to generate, according to the parsing result, a key-value pair corresponding to the log data; and a storing module, configured to cause the cluster server to store the log file by storing the key-value pair in a distributed database.
9. The log processing device according to claim 8, characterized in that the analysis unit comprises: a first acquiring module, configured to cause the cluster server to acquire incremental log data from the distributed database in real time; and a first computing module, configured to cause the cluster server to compute statistics over the incremental log data using stream computing.
10. The log processing device according to claim 8, characterized in that the analysis unit comprises: a second acquiring module, configured to cause the cluster server to acquire incremental log data from the distributed database according to a preset period; and a second computing module, configured to cause the cluster server to perform statistical computation on the incremental log data.
Application CN201410106430.1A | Priority date 2014-03-20 | Filing date 2014-03-20 | Log processing method and device | Pending | CN103838867A (en)

Priority Applications (1)

Application Number | Priority Date | Filing Date | Title
CN201410106430.1A, CN103838867A (en) | 2014-03-20 | 2014-03-20 | Log processing method and device

Applications Claiming Priority (1)

Application Number | Priority Date | Filing Date | Title
CN201410106430.1A, CN103838867A (en) | 2014-03-20 | 2014-03-20 | Log processing method and device

Publications (1)

Publication Number | Publication Date
CN103838867A (en) | 2014-06-04

Family

ID=50802363

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
CN201410106430.1A (Pending), CN103838867A (en) | Log processing method and device | 2014-03-20 | 2014-03-20

Country Status (1)

Country | Link
CN (1) | CN103838867A (en)
Cited By (67)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN104113605A (en)*2014-07-302014-10-22浪潮软件股份有限公司Enterprise cloud application development monitoring processing method
CN104486107A (en)*2014-12-052015-04-01曙光信息产业(北京)有限公司Log collection device and method
CN104501848A (en)*2014-12-042015-04-08国家电网公司Data accessing method and system of substation equipment
CN104516970A (en)*2014-12-232015-04-15广州酷狗计算机科技有限公司Method and device both for log analysis
CN104579789A (en)*2015-01-232015-04-29广东能龙教育股份有限公司 A method and system for collecting mass user behavior data based on message queue
CN105205167A (en)*2015-10-102015-12-30国网信息通信产业集团有限公司Log data system
CN105278996A (en)*2015-11-032016-01-27亚信科技(南京)有限公司Log collection method and device and log service system
CN105337748A (en)*2014-06-202016-02-17北京奇虎科技有限公司Log file collection method and system, server, and service cluster controlling apparatus
CN105426292A (en)*2015-10-292016-03-23网易(杭州)网络有限公司Game log real-time processing system and method
CN105512297A (en)*2015-12-102016-04-20中国测绘科学研究院Distributed stream-oriented computation based spatial data processing method and system
CN105589856A (en)*2014-10-212016-05-18阿里巴巴集团控股有限公司Log data processing method and log data processing system
CN105590259A (en)*2015-11-042016-05-18中国银联股份有限公司Device and method for diagnosis of transaction system
CN105608188A (en)*2015-12-232016-05-25北京奇虎科技有限公司Data processing method and data processing device
CN105656706A (en)*2014-11-142016-06-08北京通达无限科技有限公司 Business data processing method and equipment
CN105681397A (en)*2015-12-302016-06-15曙光信息产业(北京)有限公司Network traffic data storage method and system, query method and device
CN105718295A (en)*2016-01-272016-06-29四川长虹电器股份有限公司Data collecting and analyzing method and system
CN105812202A (en)*2014-12-312016-07-27阿里巴巴集团控股有限公司Log real time monitoring and early warning method and device employing same
CN105933736A (en)* 2016-04-18 2016-09-07 天脉聚源(北京)传媒科技有限公司 Log processing method and device
CN106055703A (en)* 2016-06-22 2016-10-26 北京科摩仕捷科技有限公司 Real-time log analysis method and system
CN106126730A (en)* 2016-07-01 2016-11-16 百势软件(北京)有限公司 A kind of method and device of Mass production warning information
CN106156079A (en)* 2015-03-31 2016-11-23 西门子公司 Daily record data treating method and apparatus
CN106201739A (en)* 2016-06-29 2016-12-07 上海浦东发展银行股份有限公司信用卡中心 A kind of remote invocation method of Storm based on Redis
CN106254086A (en)* 2015-06-04 2016-12-21 重庆达特科技有限公司 Cloud daily record is managed concentratedly, analyzes, monitoring and alarm platform
CN106294721A (en)* 2016-08-08 2017-01-04 无锡天脉聚源传媒科技有限公司 A kind of company-data statistics and deriving method and device
CN106354434A (en)* 2016-08-31 2017-01-25 中国人民大学 Log data storing method and system
CN106383917A (en)* 2016-11-11 2017-02-08 苏州天平先进数字科技有限公司 Data processing method based on user logs
CN106407232A (en)* 2015-08-03 2017-02-15 天脉聚源(北京)科技有限公司 A method and a system for statistical analysis for television shopping
CN106406858A (en)* 2016-08-30 2017-02-15 国电南瑞科技股份有限公司 Streaming type statistical definition and operation method based on configuration file
CN106484709A (en)* 2015-08-26 2017-03-08 北京神州泰岳软件股份有限公司 A kind of auditing method of daily record data and audit device
CN106528798A (en)* 2016-11-11 2017-03-22 苏州天平先进数字科技有限公司 Data processing system based on user logs
CN106681846A (en)* 2016-12-29 2017-05-17 北京奇虎科技有限公司 Log data statistical method, device and system
CN106792876A (en)* 2016-12-26 2017-05-31 浙江省公众信息产业有限公司 End to end network perception evaluating method and system
CN106850295A (en)* 2017-02-04 2017-06-13 郑州云海信息技术有限公司 A kind of log collection monitoring method of privatization cloud platform
CN106992886A (en)* 2017-04-05 2017-07-28 国家电网公司 A log analysis method and device based on distributed storage
CN107038162A (en)* 2016-02-03 2017-08-11 滴滴(中国)科技有限公司 Real time data querying method and system based on database journal
CN107315830A (en)* 2017-07-10 2017-11-03 深圳市视维科技股份有限公司 A kind of method and system of intellectual analysis document
CN107395446A (en)* 2017-09-18 2017-11-24 北京奇虎科技有限公司 Daily record real time processing system
CN107463648A (en)* 2017-07-26 2017-12-12 苏州乐麟无线信息科技有限公司 Data analysing method and system based on distributed communication
CN107526808A (en)* 2017-08-22 2017-12-29 中国联合网络通信集团有限公司 Real-time data processing method and device
CN107609129A (en)* 2017-09-18 2018-01-19 北京奇虎科技有限公司 Daily record real time processing system
CN107908748A (en)* 2017-11-17 2018-04-13 南京感度信息技术有限责任公司 Website user's behavioral data acquisition method, system and application based on big data
CN108073716A (en)* 2017-12-27 2018-05-25 北京诸葛找房信息技术有限公司 Online active user portrait generation method
CN108073625A (en)* 2016-11-14 2018-05-25 北京京东尚科信息技术有限公司 For the system and method for metadata information management
CN108133043A (en)* 2018-01-12 2018-06-08 福建星瑞格软件有限公司 A kind of server running log structured storage method based on big data
CN108170538A (en)* 2017-12-08 2018-06-15 北京奇艺世纪科技有限公司 A kind of information processing method, device and electronic equipment
CN108234210A (en)* 2017-12-29 2018-06-29 北京奇虎科技有限公司 The log processing method and device of a kind of content distributing network
CN108563744A (en)* 2018-04-12 2018-09-21 武汉斗鱼网络科技有限公司 Slow querying method, device and terminal device based on Redis databases
CN108616556A (en)* 2016-12-13 2018-10-02 阿里巴巴集团控股有限公司 Data processing method, device and system
CN108804237A (en)* 2017-05-05 2018-11-13 北京京东尚科信息技术有限公司 Data real-time statistical method, device, storage medium and electronic equipment
CN108874524A (en)* 2018-06-21 2018-11-23 山东浪潮商用系统有限公司 Big data distributed task dispatching system
CN109408330A (en)* 2018-10-15 2019-03-01 东软集团股份有限公司 Log analysis method, device, terminal device and readable storage medium storing program for executing
CN109428914A (en)* 2017-08-24 2019-03-05 北京国双科技有限公司 Monitoring method and device, storage medium, processor
CN109508318A (en)* 2018-11-15 2019-03-22 北京金山云网络技术有限公司 A kind of amount of storage statistical method, device, electronic equipment and readable storage medium storing program for executing
CN109522285A (en)* 2018-11-14 2019-03-26 北京首信科技股份有限公司 A kind of daily record data statistical method and system
CN109933505A (en)* 2019-03-14 2019-06-25 深圳市珍爱捷云信息技术有限公司 Log processing method, device, computer equipment and storage medium
CN110032546A (en)* 2019-04-18 2019-07-19 厦门大学嘉庚学院 System and method for rapidly satisfying temporary log analysis
CN110196794A (en)* 2018-02-26 2019-09-03 深圳市丰巢科技有限公司 A kind of operation log processing method and system based on express delivery cabinet
CN110321273A (en)* 2019-07-09 2019-10-11 政采云有限公司 A kind of business statistical method and device
CN110362544A (en)* 2019-05-27 2019-10-22 中国平安人寿保险股份有限公司 Log processing system, log processing method, terminal and storage medium
CN110674211A (en)* 2019-09-29 2020-01-10 南京大学 A kind of automatic parsing method and device of Oracle database AWR report
CN110769290A (en)* 2019-11-13 2020-02-07 北京齐尔布莱特科技有限公司 Play event updating method and system and computing device
CN110968561A (en)* 2018-09-30 2020-04-07 北京国双科技有限公司 Log storage method and distributed system
CN111897704A (en)* 2020-06-28 2020-11-06 杭州涂鸦信息技术有限公司 Session log analysis method, electronic device and storage medium
CN112100148A (en)* 2020-07-31 2020-12-18 紫光云(南京)数字技术有限公司 Increment processing method for packed log
CN112134719A (en)* 2019-06-25 2020-12-25 中兴通讯股份有限公司 A method and system for analyzing base station security logs
CN112905618A (en)* 2021-04-06 2021-06-04 浙江网商银行股份有限公司 Data processing method and device
CN113010480A (en)* 2020-03-26 2021-06-22 腾讯科技(深圳)有限公司 Log processing method and device, electronic equipment and computer readable storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
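魏彬: "Design and Implementation of a Data Cloud Service Platform Based on a Distributed Log System", Wanfang Database, Zhejiang University Master's Thesis*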

Cited By (87)

* Cited by examiner, † Cited by third party
Publication number  Priority date  Publication date  Assignee  Title
CN105337748A (en)* 2014-06-20 2016-02-17 北京奇虎科技有限公司 Log file collection method and system, server, and service cluster controlling apparatus
CN104113605A (en)* 2014-07-30 2014-10-22 浪潮软件股份有限公司 Enterprise cloud application development monitoring processing method
CN105589856A (en)* 2014-10-21 2016-05-18 阿里巴巴集团控股有限公司 Log data processing method and log data processing system
CN105589856B (en)* 2014-10-21 2019-04-26 阿里巴巴集团控股有限公司 Daily record data processing method and system
CN105656706A (en)* 2014-11-14 2016-06-08 北京通达无限科技有限公司 Business data processing method and equipment
CN104501848A (en)* 2014-12-04 2015-04-08 国家电网公司 Data accessing method and system of substation equipment
CN104486107A (en)* 2014-12-05 2015-04-01 曙光信息产业(北京)有限公司 Log collection device and method
CN104516970A (en)* 2014-12-23 2015-04-15 广州酷狗计算机科技有限公司 Method and device both for log analysis
CN104516970B (en)* 2014-12-23 2018-06-22 广州酷狗计算机科技有限公司 A kind of method and apparatus for carrying out log analysis
CN105812202A (en)* 2014-12-31 2016-07-27 阿里巴巴集团控股有限公司 Log real time monitoring and early warning method and device employing same
CN104579789A (en)* 2015-01-23 2015-04-29 广东能龙教育股份有限公司 A method and system for collecting mass user behavior data based on message queue
CN106156079A (en)* 2015-03-31 2016-11-23 西门子公司 Daily record data treating method and apparatus
CN106254086A (en)* 2015-06-04 2016-12-21 重庆达特科技有限公司 Cloud daily record is managed concentratedly, analyzes, monitoring and alarm platform
CN106407232A (en)* 2015-08-03 2017-02-15 天脉聚源(北京)科技有限公司 A method and a system for statistical analysis for television shopping
CN106484709A (en)* 2015-08-26 2017-03-08 北京神州泰岳软件股份有限公司 A kind of auditing method of daily record data and audit device
CN105205167A (en)* 2015-10-10 2015-12-30 国网信息通信产业集团有限公司 Log data system
CN105426292A (en)* 2015-10-29 2016-03-23 网易(杭州)网络有限公司 Game log real-time processing system and method
CN105426292B (en)* 2015-10-29 2018-03-16 网易(杭州)网络有限公司 A kind of games log real time processing system and method
CN105278996A (en)* 2015-11-03 2016-01-27 亚信科技(南京)有限公司 Log collection method and device and log service system
CN105590259A (en)* 2015-11-04 2016-05-18 中国银联股份有限公司 Device and method for diagnosis of transaction system
CN105512297A (en)* 2015-12-10 2016-04-20 中国测绘科学研究院 Distributed stream-oriented computation based spatial data processing method and system
CN105608188A (en)* 2015-12-23 2016-05-25 北京奇虎科技有限公司 Data processing method and data processing device
CN105681397A (en)* 2015-12-30 2016-06-15 曙光信息产业(北京)有限公司 Network traffic data storage method and system, query method and device
CN105718295A (en)* 2016-01-27 2016-06-29 四川长虹电器股份有限公司 Data collecting and analyzing method and system
CN107038162B (en)* 2016-02-03 2021-03-02 北京嘀嘀无限科技发展有限公司 Real-time data query method and system based on database log
CN107038162A (en)* 2016-02-03 2017-08-11 滴滴(中国)科技有限公司 Real time data querying method and system based on database journal
CN105933736A (en)* 2016-04-18 2016-09-07 天脉聚源(北京)传媒科技有限公司 Log processing method and device
CN106055703A (en)* 2016-06-22 2016-10-26 北京科摩仕捷科技有限公司 Real-time log analysis method and system
CN106201739A (en)* 2016-06-29 2016-12-07 上海浦东发展银行股份有限公司信用卡中心 A kind of remote invocation method of Storm based on Redis
CN106126730B (en)* 2016-07-01 2019-10-11 百势软件(北京)有限公司 A kind of method and device of Mass production warning information
CN106126730A (en)* 2016-07-01 2016-11-16 百势软件(北京)有限公司 A kind of method and device of Mass production warning information
CN106294721A (en)* 2016-08-08 2017-01-04 无锡天脉聚源传媒科技有限公司 A kind of company-data statistics and deriving method and device
CN106294721B (en)* 2016-08-08 2020-05-19 无锡天脉聚源传媒科技有限公司 Cluster data counting and exporting methods and devices
CN106406858A (en)* 2016-08-30 2017-02-15 国电南瑞科技股份有限公司 Streaming type statistical definition and operation method based on configuration file
CN106406858B (en)* 2016-08-30 2019-08-16 国电南瑞科技股份有限公司 A kind of streaming statistical definition and operation method based on configuration file
CN106354434A (en)* 2016-08-31 2017-01-25 中国人民大学 Log data storing method and system
CN106354434B (en)* 2016-08-31 2019-07-23 中国人民大学 The storage method and system of daily record data
CN106383917A (en)* 2016-11-11 2017-02-08 苏州天平先进数字科技有限公司 Data processing method based on user logs
CN106528798A (en)* 2016-11-11 2017-03-22 苏州天平先进数字科技有限公司 Data processing system based on user logs
CN108073625A (en)* 2016-11-14 2018-05-25 北京京东尚科信息技术有限公司 For the system and method for metadata information management
CN108073625B (en)* 2016-11-14 2021-03-30 北京京东尚科信息技术有限公司 System and method for metadata information management
CN108616556A (en)* 2016-12-13 2018-10-02 阿里巴巴集团控股有限公司 Data processing method, device and system
CN108616556B (en)* 2016-12-13 2021-01-19 阿里巴巴集团控股有限公司 Data processing method, device and system
CN106792876A (en)* 2016-12-26 2017-05-31 浙江省公众信息产业有限公司 End to end network perception evaluating method and system
CN106681846A (en)* 2016-12-29 2017-05-17 北京奇虎科技有限公司 Log data statistical method, device and system
CN106681846B (en)* 2016-12-29 2020-10-13 北京奇虎科技有限公司 Statistical method, device and system of log data
CN106850295A (en)* 2017-02-04 2017-06-13 郑州云海信息技术有限公司 A kind of log collection monitoring method of privatization cloud platform
CN106992886A (en)* 2017-04-05 2017-07-28 国家电网公司 A log analysis method and device based on distributed storage
CN108804237A (en)* 2017-05-05 2018-11-13 北京京东尚科信息技术有限公司 Data real-time statistical method, device, storage medium and electronic equipment
CN107315830A (en)* 2017-07-10 2017-11-03 深圳市视维科技股份有限公司 A kind of method and system of intellectual analysis document
CN107463648A (en)* 2017-07-26 2017-12-12 苏州乐麟无线信息科技有限公司 Data analysing method and system based on distributed communication
CN107526808B (en)* 2017-08-22 2020-09-01 中国联合网络通信集团有限公司 Real-time data processing method and device
CN107526808A (en)* 2017-08-22 2017-12-29 中国联合网络通信集团有限公司 Real-time data processing method and device
CN109428914B (en)* 2017-08-24 2022-01-25 北京国双科技有限公司 Monitoring method and device, storage medium and processor
CN109428914A (en)* 2017-08-24 2019-03-05 北京国双科技有限公司 Monitoring method and device, storage medium, processor
CN107395446B (en)* 2017-09-18 2021-07-23 北京奇虎科技有限公司 Log real-time processing system
CN107609129A (en)* 2017-09-18 2018-01-19 北京奇虎科技有限公司 Daily record real time processing system
CN107395446A (en)* 2017-09-18 2017-11-24 北京奇虎科技有限公司 Daily record real time processing system
CN107908748A (en)* 2017-11-17 2018-04-13 南京感度信息技术有限责任公司 Website user's behavioral data acquisition method, system and application based on big data
CN108170538A (en)* 2017-12-08 2018-06-15 北京奇艺世纪科技有限公司 A kind of information processing method, device and electronic equipment
CN108170538B (en)* 2017-12-08 2021-05-28 北京奇艺世纪科技有限公司 Information processing method and device and electronic equipment
CN108073716A (en)* 2017-12-27 2018-05-25 北京诸葛找房信息技术有限公司 Online active user portrait generation method
CN108234210A (en)* 2017-12-29 2018-06-29 北京奇虎科技有限公司 The log processing method and device of a kind of content distributing network
CN108133043A (en)* 2018-01-12 2018-06-08 福建星瑞格软件有限公司 A kind of server running log structured storage method based on big data
CN110196794A (en)* 2018-02-26 2019-09-03 深圳市丰巢科技有限公司 A kind of operation log processing method and system based on express delivery cabinet
CN108563744B (en)* 2018-04-12 2021-07-23 武汉斗鱼网络科技有限公司 Slow query method, device and terminal device based on Redis database
CN108563744A (en)* 2018-04-12 2018-09-21 武汉斗鱼网络科技有限公司 Slow querying method, device and terminal device based on Redis databases
CN108874524A (en)* 2018-06-21 2018-11-23 山东浪潮商用系统有限公司 Big data distributed task dispatching system
CN110968561A (en)* 2018-09-30 2020-04-07 北京国双科技有限公司 Log storage method and distributed system
CN109408330A (en)* 2018-10-15 2019-03-01 东软集团股份有限公司 Log analysis method, device, terminal device and readable storage medium storing program for executing
CN109522285A (en)* 2018-11-14 2019-03-26 北京首信科技股份有限公司 A kind of daily record data statistical method and system
CN109508318A (en)* 2018-11-15 2019-03-22 北京金山云网络技术有限公司 A kind of amount of storage statistical method, device, electronic equipment and readable storage medium storing program for executing
CN109933505A (en)* 2019-03-14 2019-06-25 深圳市珍爱捷云信息技术有限公司 Log processing method, device, computer equipment and storage medium
CN110032546A (en)* 2019-04-18 2019-07-19 厦门大学嘉庚学院 System and method for rapidly satisfying temporary log analysis
CN110362544A (en)* 2019-05-27 2019-10-22 中国平安人寿保险股份有限公司 Log processing system, log processing method, terminal and storage medium
CN110362544B (en)* 2019-05-27 2024-04-02 中国平安人寿保险股份有限公司 Log processing system, log processing method, terminal and storage medium
WO2020258982A1 (en)* 2019-06-25 2020-12-30 中兴通讯股份有限公司 Method and system for analyzing security log of base station, and computer-readable storage medium
CN112134719A (en)* 2019-06-25 2020-12-25 中兴通讯股份有限公司 A method and system for analyzing base station security logs
CN110321273B (en)* 2019-07-09 2023-10-03 政采云有限公司 Service statistics method and device
CN110321273A (en)* 2019-07-09 2019-10-11 政采云有限公司 A kind of business statistical method and device
CN110674211A (en)* 2019-09-29 2020-01-10 南京大学 A kind of automatic parsing method and device of Oracle database AWR report
CN110769290A (en)* 2019-11-13 2020-02-07 北京齐尔布莱特科技有限公司 Play event updating method and system and computing device
CN113010480A (en)* 2020-03-26 2021-06-22 腾讯科技(深圳)有限公司 Log processing method and device, electronic equipment and computer readable storage medium
CN113010480B (en)* 2020-03-26 2024-03-19 腾讯科技(深圳)有限公司 Log processing method, device, electronic equipment and computer readable storage medium
CN111897704A (en)* 2020-06-28 2020-11-06 杭州涂鸦信息技术有限公司 Session log analysis method, electronic device and storage medium
CN112100148A (en)* 2020-07-31 2020-12-18 紫光云(南京)数字技术有限公司 Increment processing method for packed log
CN112905618A (en)* 2021-04-06 2021-06-04 浙江网商银行股份有限公司 Data processing method and device

Similar Documents

Publication  Publication Date  Title
CN103838867A (en) Log processing method and device
US8874600B2 (en) System and method for building a cloud aware massive data analytics solution background
CN110362544B (en) Log processing system, log processing method, terminal and storage medium
CN109034993A (en) Account checking method, equipment, system and computer readable storage medium
CN110647512B (en) Data storage and analysis method, device, equipment and readable medium
CN108268565B (en) Method and system for processing user browsing behavior data based on data warehouse
CN113609374A (en) Data processing method, device and equipment based on content push and storage medium
CN110675194A (en) Funnel analysis method, device, equipment and readable medium
US20130185429A1 (en) Processing Store Visiting Data
CN102208991A (en) Blog processing method, device and system
CN103620601A (en) Joining tables in a mapreduce procedure
CN104182506A (en) Log management method
US20160196564A1 (en) Systems and methods for analyzing consumer sentiment with social perspective insight
Chan: Big data customer knowledge management
CN105005585A (en) Log data processing method and device
CN112506887A (en) Vehicle terminal CAN bus data processing method and device
CN114547097A (en) Data processing method, apparatus, device and storage medium
CN113590372A (en) Log-based link tracking method and device, computer equipment and storage medium
CN114971714A (en) Accurate customer operation method based on big data label and computer equipment
CN118394713A (en) Log data processing method, device, equipment, storage medium and program product
CN106557483B (en) Data processing method, data query method, data processing equipment and data query equipment
Gaurav et al.: An outline on big data and big data analytics
CN104063456A (en) We media transmission atlas analysis method and device based on vector query
US20230252011A1 (en) Method and system for data indexing and reporting
CN112506886B (en) Multi-source service operation log acquisition method and system

Legal Events

Date  Code  Title  Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 2014-06-04

