Technical Field
The present invention relates to the technical field of data analysis, and in particular to a data analysis method, a network device, a server, and a system.
Background
With the growing popularity of the Internet, the low cost and high return of Internet advertising have become increasingly evident, and it has been accepted and recognized by more and more advertisers. Promoting and selling goods through online media has become an emerging marketing approach.
Online precision marketing and advertising technology ("precision marketing" and "Behavior Targeting") performs precise analysis of the advertising audience on top of online marketing, identifies the potential target users of a product or campaign, and raises the probability that promotion of the related product succeeds.
The technical key to precision marketing and advertising is to identify, efficiently and in a timely manner, which users on the network have clicked through to or viewed a given product advertisement, so that these users can be targeted for further marketing.
This requires back-end Internet services or devices to record the access records of all users in the network, and to apply business intelligence (BI, Business Intelligence) technology to perform multi-dimensional data analysis and data mining in order to predict each user's interests. The advertisement delivery system then delivers advertisements in a targeted manner according to the analysis and predictions of the data analysis platform, thereby raising the success rate of Internet marketing.
The prior art usually optimizes the recommendation service in the manner of "the front end tracks user clicks on promotion information → the back end compiles statistical reports → the recommendation strategy is adjusted by manual decision". This practice is widely used in the Internet field, for example in Google Analytics and Apache Clickstream. In the telecommunications field, because the data volume is large and user clicks are hard to track, indirect methods such as traffic statistics and advertisement click statistics are often used to estimate the recommendation effect and assist manual decisions.
To optimize precision marketing and improve its accuracy, the prior art usually concentrates most of its effort on the back-end data analysis system, building powerful clusters and parallel distributed computing and analysis systems. With the explosive growth of telecom users and Internet data, the volume of data to be analyzed keeps increasing while the accuracy of the analysis results keeps falling; moreover, as the analysis system grows ever larger, the cost of optimizing the analysis becomes increasingly high.
Summary of the Invention
Embodiments of the present invention provide a data analysis method that can reduce the volume of data to be analyzed and improve the accuracy of the analysis results. Embodiments of the present invention further provide a corresponding apparatus and system.
A data analysis method, comprising:
receiving a service access request, the service access request containing a URL that a user wants to access;
matching the URL that the user wants to access against built-in recommended URLs stored by level;
when an identical recommended URL is matched, determining that the level of the URL that the user wants to access is the level associated with the matched recommended URL, wherein the level of a recommended URL corresponds to the level of the recommendation information displayed on the terminal;
parsing out the identifier of the terminal that sent the service access request; and
if the level of the URL that the user wants to access is greater than or equal to a first threshold, sending the parsed terminal identifier, the URL that the user wants to access, and the level of that URL to a server, so that the server counts, among the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL that users want to access.
A data analysis method, comprising:
receiving a terminal identifier, a URL that a user wants to access, and the level of that URL;
storing the terminal identifier, the URL that the user wants to access, and the level of that URL in association with one another; and
using the data whose level is greater than or equal to a first threshold as the data source, counting, among the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL that users want to access.
A network device, comprising:
a first receiving unit, configured to receive a service access request, the service access request containing a URL that a user wants to access;
a URL matching unit, configured to match the URL that the user wants to access, contained in the service access request received by the first receiving unit, against built-in recommended URLs stored by level;
a level determining unit, configured to determine, when the URL matching unit matches an identical recommended URL, that the level of the URL that the user wants to access is the level associated with the matched recommended URL, wherein the level of a recommended URL corresponds to the level of the recommendation information displayed on the terminal;
a parsing unit, configured to parse out the identifier of the terminal that sent the service access request received by the first receiving unit; and
a sending unit, configured to send, when the level of the URL that the user wants to access is greater than or equal to a first threshold, the terminal identifier parsed out by the parsing unit, the URL that the user wants to access, and the level of that URL to a server, so that the server counts, among the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL that users want to access.
A server, comprising:
a second receiving unit, configured to receive a terminal identifier, a URL that a user wants to access, and the level of that URL;
a storage unit, configured to store, in association with one another, the terminal identifier, the URL that the user wants to access, and the level of that URL received by the second receiving unit; and
an analysis unit, configured to use the data stored by the storage unit whose level is greater than or equal to a first threshold as the data source and to count, among the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL that users want to access.
A data analysis system, comprising a network device and a server, wherein the network device is the network device described in the foregoing technical solution and the server is the server described in the foregoing technical solution.
A data analysis system, comprising a network device, a deep packet inspection device, and a server, wherein the deep packet inspection device has a communication connection with the network device;
the network device receives a service access request and forwards the service access request, or a mirrored packet of the service access request, to the deep packet inspection device, the service access request containing a URL that a user wants to access;
the deep packet inspection device matches the URL that the user wants to access against built-in recommended URLs stored by level; when an identical recommended URL is matched, determines that the level of the URL that the user wants to access is the level associated with the matched recommended URL, wherein the level of a recommended URL corresponds to the level of the recommendation information displayed on the terminal; parses out the identifier of the terminal that sent the service access request; and, if the level of the URL that the user wants to access is greater than or equal to a first threshold, sends the parsed terminal identifier, the URL that the user wants to access, and the level of that URL to the network device;
the network device sends the received terminal identifier, the URL that the user wants to access, and the level of that URL to the server; and
the server uses the data whose level is greater than or equal to the first threshold as the data source and counts, among the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL that users want to access.
In the embodiments of the present invention, a service access request containing a URL that a user wants to access is received; the URL is matched against built-in recommended URLs stored by level; when an identical recommended URL is matched, the level of the URL that the user wants to access is determined to be the level associated with the matched recommended URL, the level of a recommended URL corresponding to the level of the recommendation information displayed on the terminal; the identifier of the terminal that sent the service access request is parsed out; and, if the level of the URL is greater than or equal to a first threshold, the parsed terminal identifier, the URL, and the level of the URL are sent to a server, so that the server counts, among the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL that users want to access. Compared with the prior art, in which the back-end server analyzes every user access, the data analysis method provided by the embodiments of the present invention can reduce the volume of data to be analyzed and improve the accuracy of the analysis results.
Brief Description of the Drawings
FIG. 1 is a schematic diagram of one embodiment of the data analysis method in the embodiments of the present invention;
FIG. 2 is a schematic diagram of another embodiment of the data analysis method in the embodiments of the present invention;
FIG. 3 is a schematic diagram of another embodiment of the data analysis method in the embodiments of the present invention;
FIG. 4 is a schematic diagram of an application scenario embodiment in the embodiments of the present invention;
FIG. 5 is a schematic diagram of one embodiment of the network device in the embodiments of the present invention;
FIG. 6 is a schematic diagram of another embodiment of the network device in the embodiments of the present invention;
FIG. 7 is a schematic diagram of another embodiment of the network device in the embodiments of the present invention;
FIG. 8 is a schematic diagram of one embodiment of the server in the embodiments of the present invention;
FIG. 9 is a schematic diagram of another embodiment of the server in the embodiments of the present invention;
FIG. 10 is a schematic diagram of one embodiment of the system in the embodiments of the present invention;
FIG. 11 is a schematic diagram of another embodiment of the system in the embodiments of the present invention.
Detailed Description
Embodiments of the present invention provide a data analysis method that can reduce the volume of data to be analyzed and improve the accuracy of the analysis results. Embodiments of the present invention further provide a corresponding apparatus and system. Each is described in detail below.
Referring to FIG. 1, from the perspective of the network device, one embodiment of the data analysis method provided by the embodiments of the present invention includes:
101. Receive a service access request, the service access request containing a URL that a user wants to access.
The network device provided in the embodiments of the present invention may be a Gateway GPRS Support Node (GGSN), or may be another network device that has the same data analysis function as provided by the embodiments of the present invention. The embodiments are described using the GGSN as an example.
When browsing advertisement information, a user clicks the advertisement's recommendation link to initiate a service request, which the GGSN receives. The service request actually arrives as data packets: a single service request may be sent to the GGSN in one or more packets. After receiving a packet, the GGSN first determines whether it carries complete information. If the packet forms a complete data stream and carries complete information, the GGSN obtains the URL carried in the packet, that is, the URL that the user wants to access contained in the service access request. If the GGSN determines that the received packet is incomplete, it stores the packet and, once the other packets of the same service request have arrived, assembles all packets of that request into a complete data stream and obtains the URL that the user wants to access.
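The buffering behaviour described above can be sketched as follows. This is only an illustrative sketch under several assumptions that are not in the original text: the `Packet` fields (`request_id`, `index`, `total`), the `RequestAssembler` class, and the `extract_url` helper are hypothetical names, packets of one request are assumed to be numbered from 0 to `total - 1`, and the request is assumed to be plain HTTP with a `Host` header.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class Packet:
    request_id: str  # hypothetical key identifying one service request
    index: int       # position of this packet within the request (0-based)
    total: int       # number of packets that make up the request
    payload: bytes


def extract_url(data: bytes) -> Optional[str]:
    """Build the requested URL from an HTTP request line and Host header."""
    head = data.decode("latin-1").split("\r\n\r\n", 1)[0]
    lines = head.split("\r\n")
    request_line = lines[0].split(" ")
    if len(request_line) < 2:
        return None
    path = request_line[1]
    for line in lines[1:]:
        name, _, value = line.partition(":")
        if name.strip().lower() == "host":
            return "http://" + value.strip() + path
    return None


class RequestAssembler:
    """Stores incomplete requests until every packet has arrived."""

    def __init__(self) -> None:
        self._pending: Dict[str, Dict[int, bytes]] = {}

    def feed(self, pkt: Packet) -> Optional[str]:
        parts = self._pending.setdefault(pkt.request_id, {})
        parts[pkt.index] = pkt.payload
        if len(parts) < pkt.total:
            return None  # incomplete: keep the packet and wait for the rest
        del self._pending[pkt.request_id]
        data = b"".join(parts[i] for i in range(pkt.total))
        return extract_url(data)
```

A request that fits in one packet yields its URL on the first call to `feed`; a request split across packets yields `None` until the last missing packet arrives, regardless of arrival order.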
102. Match the URL that the user wants to access against built-in recommended URLs stored by level.
A recommended-URL list is built into the GGSN and is consulted once the URL that the user wants to access has been obtained from the service access request. The built-in list may be provided by a third party, such as an advertising agency, an Internet company, or an individual advertiser. The list contains the mapping between each recommended URL and its level, and the level of a recommended URL corresponds to the level of the recommendation information displayed on the terminal: if a recommended URL in the list is at the second level, the corresponding recommendation information is also at the second level of the display page. Recommendation information refers to media information or advertising content, such as a product advertisement shown on a page.
The URL matching process may be as follows: the URL that the user wants to access is compared one by one with the recommended URLs in the built-in recommendation list.
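A minimal sketch of the one-by-one comparison, assuming the built-in list is held as a mapping from recommended URL to level. The `RECOMMENDED_URLS` entries and the function name `match_level` are hypothetical and serve only as illustration.

```python
from typing import Dict, Optional

# Hypothetical built-in list: recommended URL -> level of its display page.
RECOMMENDED_URLS: Dict[str, int] = {
    "http://ads.example.com/car-brand": 1,
    "http://ads.example.com/car-brand/specs": 2,
}


def match_level(url: str, recommended: Dict[str, int]) -> Optional[int]:
    """Compare the requested URL with each recommended URL in turn.

    Returns the level associated with the identical recommended URL,
    or None when no recommended URL matches.
    """
    for candidate, level in recommended.items():
        if candidate == url:
            return level
    return None
```

Because the comparison is an exact match, a direct dictionary lookup (`recommended.get(url)`) would give the same result; the loop is kept only to mirror the one-by-one comparison described in the text.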
103. When an identical recommended URL is matched, determine that the level of the URL that the user wants to access is the level associated with the matched recommended URL, the level of a recommended URL corresponding to the level of the recommendation information displayed on the terminal.
Once a recommended URL identical to the URL that the user wants to access has been found in the recommended-URL list, the level of the URL that the user wants to access can be read from the mapping between the recommended URL and its level; that is, the level of the URL that the user wants to access is the level associated with the matched recommended URL.
To provide layered data, the advertising content provider arranges in advance for the advertisement's recommendation information to be displayed in levels on the terminal's display page, for example with the product's summary information at display level one and the product's detailed information at display level two. Take an Internet car advertisement as an example: the car's brand information is shown at level one, and detailed information such as performance parameters or promotional activities is shown at level two. When the user clicks through to level one, the user learns which brand of car the advertisement is for, but knowing the brand alone does not establish that the user is interested in the car. If the user goes on to click through to level two to view details such as performance parameters or activities, this indicates that the user is interested in the product and is a genuine potential buyer.
In the prior art, the server used for data analysis can track which URLs a user clicked but cannot determine the level of those URLs. Every click is therefore stored and analyzed, which makes the data analysis system ever larger while the accuracy of the analysis results remains low, so that users interested in the product cannot be identified precisely.
The embodiments of the present invention therefore provide a level-resolution scheme that corresponds to the levels of the advertising content provider's recommendation information. It can determine the level of the recommendation information that the user accesses and send that level to the server, so that the server analyzes the data by level.
104. Parse out the identifier of the terminal that sent the service access request.
The terminal in the embodiments of the present invention may be a mobile terminal or a fixed terminal. If a mobile phone is used to access the Internet, the terminal identifier is the International Mobile Subscriber Identity (IMSI), for example 13580000000; if a computer is used, the terminal identifier is the terminal's Media Access Control (MAC) address.
105. If the level of the URL that the user wants to access is greater than or equal to a first threshold, send the parsed terminal identifier, the URL that the user wants to access, and the level of that URL to the server, so that the server counts, among the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL that users want to access.
In the prior art, the data for every user click on advertisement recommendation information is sent to the server, so the volume of data that the server must store and analyze is excessive, and the great majority of that data is of no use in identifying users who are interested in the product. The embodiments of the present invention therefore introduce a first threshold. After the level of the URL that the user wants to access has been determined, the parsed terminal identifier, the URL, and the level of the URL are sent to the server only if the level is greater than or equal to the first threshold; if the level is below the first threshold, no data is sent to the server. This reduces the server's data analysis volume. For example, when the first threshold is 2, only data whose level is greater than or equal to 2 is sent to the server.
Because the data that the network device sends to the server is data whose level is greater than or equal to the first threshold, the data that the server analyzes is likewise data whose level is greater than or equal to the first threshold, which reduces the data analysis volume.
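The threshold check on the network-device side can be sketched as below. The `Report` record, the `maybe_report` function, and the `send` callback are hypothetical names introduced for illustration; the value 2 for the first threshold is taken from the example in the text, and how the record is actually transported to the server is left open.

```python
from typing import Callable, NamedTuple, Optional


class Report(NamedTuple):
    terminal_id: str
    url: str
    level: int


FIRST_THRESHOLD = 2  # example value used in the text


def maybe_report(
    terminal_id: str,
    url: str,
    level: Optional[int],
    send: Callable[[Report], None],
    threshold: int = FIRST_THRESHOLD,
) -> bool:
    """Forward the record only when the URL's level reaches the threshold.

    `level` is None when the URL matched no recommended URL. Returns True
    if a report was sent, False if it was suppressed.
    """
    if level is None or level < threshold:
        return False
    send(Report(terminal_id, url, level))
    return True
```

With a threshold of 2, a level-1 click is dropped at the network device and never reaches the server, which is where the reduction in analysis volume comes from.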
After receiving the terminal identifier, the URL that the user wants to access, and the level of that URL, the server uses the data whose level is greater than or equal to the first threshold as the data source and counts, among the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL that users want to access. This makes it easier to determine how many users are interested in the product and improves the accuracy of the data analysis.
For example, after parsing out the terminal identifier, the GGSN sends the parsed terminal identifier, the URL that the user wants to access, and the level of that URL among the recommended URLs to the server. Suppose the user whose terminal identifier is 13580000000 accessed the URL of the FAW Mazda vehicle specification and configuration table, http://www.baidu.com/s?wd=%C2%ED%D7%D4%B4%EF6&rsv_bp=0&rsv_spt=3&oq=%C2%ED%D7%D4%B4%EF&rsp=0&f=3&inputT=9203. Step 103 has already determined that this URL is at the second level of the built-in recommended URLs, so these three items of information are sent to the server.
In this embodiment of the present invention, a service access request containing a URL that a user wants to access is received; the URL is matched against built-in recommended URLs stored by level; when an identical recommended URL is matched, the level of the URL that the user wants to access is determined to be the level associated with the matched recommended URL, the level of a recommended URL corresponding to the level of the recommendation information displayed on the terminal; the identifier of the terminal that sent the service access request is parsed out; and, if the level of the URL is greater than or equal to the first threshold, the parsed terminal identifier, the URL, and the level of the URL are sent to the server, so that the server counts, among the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL that users want to access. Compared with the prior art, in which the back-end server analyzes every user access, the data analysis method provided by this embodiment can reduce the volume of data to be analyzed and improve the accuracy of the analysis results.
Referring to FIG. 2, from the perspective of the server, another embodiment of the data analysis method provided by the embodiments of the present invention includes:
201. Receive a terminal identifier, a URL that a user wants to access, and the level of that URL.
The server in the embodiments of the present invention mainly refers to a server used for data analysis, such as a server used in business intelligence.
The server receives the terminal identifier, the URL that the user wants to access, and the level of that URL. Continuing the example from the foregoing embodiment, the server receives the information that the user whose terminal identifier is 13580000000 accessed the URL of the FAW Mazda vehicle specification and configuration table, http://www.baidu.com/s?wd=%C2%ED%D7%D4%B4%EF6&rsv_bp=0&rsv_spt=3&oq=%C2%ED%D7%D4%B4%EF&rsp=0&f=3&inputT=9203, and that this URL belongs to the second recommendation level.
202. Store the terminal identifier, the URL that the user wants to access, and the level of that URL in association with one another.
The server may create a mapping table for the terminal identifier, the URL that the user wants to access, and the level of that URL, and associate the three with one another in the mapping table.
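One possible shape for such a mapping table is sketched below using Python's built-in `sqlite3` module with an in-memory database. The table name `visits`, its column names, and the choice to keep one row per (terminal identifier, URL) pair are assumptions made for illustration and are not specified in the original text.

```python
import sqlite3


def open_store() -> sqlite3.Connection:
    """Create an in-memory mapping table of terminal id, URL, and level."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE visits ("
        " terminal_id TEXT NOT NULL,"
        " url TEXT NOT NULL,"
        " level INTEGER NOT NULL,"
        " PRIMARY KEY (terminal_id, url))"
    )
    return conn


def store_visit(conn: sqlite3.Connection, terminal_id: str, url: str, level: int) -> None:
    """Associate the three received values in one row.

    INSERT OR IGNORE keeps a single row per (terminal_id, url) pair, so a
    repeated report from the same terminal is not stored twice.
    """
    conn.execute(
        "INSERT OR IGNORE INTO visits (terminal_id, url, level) VALUES (?, ?, ?)",
        (terminal_id, url, level),
    )
    conn.commit()
```

Whether repeated clicks from the same terminal should be collapsed or kept as separate rows depends on how the operator defines the statistic; the single-row choice here is only one option.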
203. Using the data whose level is greater than or equal to the first threshold as the data source, count, among the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL that users want to access.
Because the data that the server receives is data whose level is greater than or equal to the first threshold, the server uses that data as its data source and computes the click-through rate of each URL that users want to access. Here the click-through rate is in effect the number of users: counting the terminal identifiers corresponding to a URL gives the click-through rate of that URL.
For example, to obtain the click-through rate of the second-level URL of the FAW Mazda vehicle specification and configuration table, it suffices to count the terminal identifiers that accessed that URL.
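The counting step can be sketched as follows. The function name `count_terminals` and the tuple layout of the records are hypothetical. Since the text equates the click-through rate with the number of users, the sketch assumes that each terminal identifier is counted at most once per URL.

```python
from collections import defaultdict
from typing import Dict, Iterable, Set, Tuple

# (terminal_id, url, level), as received from the network device
Record = Tuple[str, str, int]


def count_terminals(records: Iterable[Record], threshold: int) -> Dict[str, int]:
    """Count distinct terminal identifiers per URL at or above the threshold."""
    seen: Dict[str, Set[str]] = defaultdict(set)
    for terminal_id, url, level in records:
        if level >= threshold:
            seen[url].add(terminal_id)
    return {url: len(ids) for url, ids in seen.items()}
```

Applying the threshold again on the server is redundant when the network device already filters, but it keeps the function correct if lower-level records are ever stored alongside.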
If the advertisement provider needs further information, such as the home location of users interested in FAW Mazda cars, the home location of each terminal identifier can be looked up in a terminal-identifier home-location list, and the URL that the user wants to access, the terminal identifier corresponding to that URL, and the home location of the terminal identifier are stored in association with one another. The number of terminal identifiers in each home location associated with each URL is then counted. This shows how many users in each region are interested in the advertisement, so that promotion can be better targeted at that region.
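A sketch of the per-region count, assuming the home-location list is available as a mapping from terminal identifier to a location name. The function name `count_by_home`, the `home_of` mapping, and the `"unknown"` fallback for identifiers missing from the list are illustrative assumptions.

```python
from collections import Counter
from typing import Dict, Iterable, Set, Tuple

# (terminal_id, url, level)
Record = Tuple[str, str, int]


def count_by_home(
    records: Iterable[Record],
    home_of: Dict[str, str],
    threshold: int,
) -> Dict[Tuple[str, str], int]:
    """Count distinct terminals per (URL, home location) pair."""
    seen: Set[Tuple[str, str]] = set()
    counts: Counter = Counter()
    for terminal_id, url, level in records:
        if level < threshold:
            continue
        if (url, terminal_id) in seen:
            continue  # this terminal was already counted for this URL
        seen.add((url, terminal_id))
        location = home_of.get(terminal_id, "unknown")
        counts[(url, location)] += 1
    return dict(counts)
```

The result maps each (URL, home location) pair to a user count, which is the per-region figure the advertisement provider would use to target promotion.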
In this embodiment of the present invention, a terminal identifier, a URL that a user wants to access, and the level of that URL are received; the terminal identifier, the URL, and the level of the URL are stored in association with one another; and, using the data whose level is greater than or equal to the first threshold as the data source, the number of terminal identifiers corresponding to each URL that users want to access is counted among the levels greater than or equal to the first threshold. Compared with the prior art, in which the back-end server analyzes every user access, in the data analysis method provided by this embodiment the recommendation information is divided into levels, the network device determines the level of the URL that the user wants to access and sends that level to the server, and the back-end server analyzes only the data whose level is greater than or equal to the first threshold, thereby reducing the volume of data to be analyzed and improving the accuracy of the analysis results.
Referring to FIG. 3, the overall process of the data analysis method provided by the above embodiments is described in detail below:
S1. The terminal sends a service access request to the network device.
The service access request contains the URL to be accessed by the user.
S2. The network device matches the URL to be accessed by the user against built-in recommended URLs stored by level.
After receiving the service access request sent by the terminal, the network device obtains the URL to be accessed by the user from the request and matches it against the recommended URLs in a built-in recommended URL list. The recommended URL list includes a mapping between each recommended URL and the level of that recommended URL.
S3. When an identical recommended URL is matched, the network device determines that the level of the URL to be accessed by the user is the level associated with the matched recommended URL. The level of a recommended URL corresponds to the level of the recommendation information presented on the terminal.
Once a recommended URL identical to the URL to be accessed by the user is found in the recommended URL list, the level of the URL to be accessed can be read from the mapping between the recommended URL and its level. In other words, the level of the URL to be accessed by the user is the level associated with the matched recommended URL.
S4. The network device parses out the identifier of the terminal that sent the service access request.
The terminal may be a mobile terminal or a fixed terminal. If the user accesses the Internet from a mobile phone, the terminal identifier is the International Mobile Subscriber Identity (IMSI), for example 13580000000; if the user accesses the Internet from a computer, the terminal identifier is the Media Access Control (MAC) address of the terminal.
S5. If the level of the URL to be accessed by the user is greater than or equal to the first threshold, the network device sends the parsed terminal identifier, the URL to be accessed by the user, and the level of that URL to the server.
Because the network device sends the server only data whose level is greater than or equal to the first threshold, the amount of data the server has to analyze is reduced.
S6. The server counts, for the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL to be accessed.
From the click count of an advertisement as shown on the server, the advertisement provider can determine how much attention the advertisement receives. The provider can also learn which users are interested in the advertisement and direct promotion at those users.
This embodiment describes the data analysis method provided by the embodiments of the present invention as a whole, and shows more clearly that the method can reduce the amount of data to be analyzed and improve the precision of the analysis.
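Steps S2 to S5 on the network device side can be sketched as follows. The sketch assumes that the recommended URL list is an in-memory mapping and that matching means exact string equality; the example URLs, the threshold value, and the function name `handle_request` are hypothetical. The terminal identifier is passed in directly, since the way it is parsed out in S4 depends on the access network.

```python
from typing import Dict, Optional, Tuple

# Hypothetical recommended URL list: recommended URL -> level.
RECOMMENDED_URL_LEVELS: Dict[str, int] = {
    "http://example.com/": 1,
    "http://example.com/coats": 2,
    "http://example.com/coats/model-a": 3,
}

FIRST_THRESHOLD = 2  # assumed value


def handle_request(
    terminal_id: str,
    url: str,
    levels: Dict[str, int] = RECOMMENDED_URL_LEVELS,
    threshold: int = FIRST_THRESHOLD,
) -> Optional[Tuple[str, str, int]]:
    """Return the record to send to the server, or None if nothing is sent."""
    # S2/S3: match the requested URL against the recommended URL list and
    # take the level associated with the matched recommended URL.
    level = levels.get(url)
    if level is None:
        return None  # no identical recommended URL was matched
    # S5: forward only records whose level reaches the first threshold.
    if level < threshold:
        return None
    return (terminal_id, url, level)


sent = handle_request("13100000000", "http://example.com/coats")
skipped = handle_request("13100000000", "http://example.com/")
unmatched = handle_request("13100000000", "http://example.com/other")
```

Only the first call produces a record for the server; the level 1 page and the unmatched URL are dropped at the network device.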
For ease of understanding, the data analysis method provided in the embodiments of the present invention is described in detail below using a specific application scenario as an example:
Referring to FIG. 4, an end user clicks a product recommendation URL (Uniform Resource Locator) and thereby initiates an Internet service access request. The request is eventually received by the GGSN, which obtains the URL to be accessed by the user from the request. For example, end users 13100000000, 13200000000, 13300000000, 13400000000, and 13500000000 each access the URL of the Bosideng down jacket 2011 new collection, http://huoyan.blogcn.com/88. Take the service request initiated by user 13100000000 as an example of how the GGSN processes such a request. The GGSN matches the URL http://huoyan.blogcn.com/88 against the built-in recommended URLs and finds that this URL is at the second level of the recommended URLs. If the first threshold set in the GGSN is 2, the GGSN sends the terminal identifier 13100000000, the URL http://huoyan.blogcn.com/88, and the level of that URL (the second level) to the server. This scenario uses only user 13100000000 as the example; for the other users who initiate the same request, the GGSN performs the same processing and likewise sends their information to the server. The structure of the data sent is shown in Table 1 below:
Table 1: Structure of the data sent by the GGSN
User 13600000000 initiates a service request for the Xuezhongfei down jacket 2011 new collection at the URL http://abc.wm23.com/aiken/144512.html. The GGSN likewise determines that this URL is at the second level, so it also sends three pieces of information to the server: the terminal identifier 13600000000, the URL http://abc.wm23.com/aiken/144512.html, and the second level.
The server receives the data sent by the GGSN and stores the terminal identifiers and the corresponding URLs by level. Taking the users mentioned in this scenario as an example again, the server stores their information in a single table, as shown in Table 2.
Only the six users above are given in this scenario. In practice there can be many users, and the URLs they access can differ, but the processing is the same. Suppose the server is to analyze the second-level data for the Bosideng down jacket 2011 new collection. It filters the data in Table 2 above, removes the record whose terminal identifier is 13600000000, and counts five users who accessed the Bosideng down jacket 2011 new collection, so that specific recommendations can be made to these five users.
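The filtering and counting in this scenario can be reproduced with a short sketch. The six terminal identifiers, the two URLs, and the level come from the scenario itself; the tuple layout of the stored rows and the function name `terminals_for` are assumptions, since the stored table is not reproduced in this text.

```python
BOSIDENG_URL = "http://huoyan.blogcn.com/88"
XUEZHONGFEI_URL = "http://abc.wm23.com/aiken/144512.html"

# Rows as the server might store them: (terminal identifier, URL, level).
stored = [
    ("13100000000", BOSIDENG_URL, 2),
    ("13200000000", BOSIDENG_URL, 2),
    ("13300000000", BOSIDENG_URL, 2),
    ("13400000000", BOSIDENG_URL, 2),
    ("13500000000", BOSIDENG_URL, 2),
    ("13600000000", XUEZHONGFEI_URL, 2),
]


def terminals_for(url, level, rows):
    """Return the distinct terminal identifiers that accessed `url` at `level`."""
    return sorted({tid for tid, row_url, row_level in rows
                   if row_url == url and row_level == level})


bosideng_users = terminals_for(BOSIDENG_URL, 2, stored)
print(len(bosideng_users))  # prints 5
```

The record for 13600000000 is excluded because its URL differs, leaving the five users to whom the specific recommendation would be directed.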
This application scenario shows only the case in which the recommended URL is at the second level; in practice there can be many levels. Taking the Bosideng down jacket 2011 new collection above as an example, if the user goes on to click a specific down jacket, that page can be assigned to the third level. To identify the users who like that particular down jacket, the analysis can be directed at the users who access that URL.
Referring to FIG. 5, an embodiment of the network device provided by the embodiments of the present invention includes:
a first receiving unit 301, configured to receive a service access request, where the service access request contains the URL to be accessed by the user;
a URL matching unit 302, configured to match the URL to be accessed by the user, contained in the service access request received by the first receiving unit 301, against built-in recommended URLs stored by level;
a level determining unit 303, configured to determine, when the URL matching unit 302 matches an identical recommended URL, that the level of the URL to be accessed by the user is the level associated with the matched recommended URL, where the level of a recommended URL corresponds to the level of the recommendation information presented on the terminal;
a parsing unit 304, configured to parse out the identifier of the terminal that sent the service access request received by the first receiving unit 301; and
a sending unit 305, configured to send, when the level of the URL to be accessed by the user is greater than or equal to the first threshold, the terminal identifier parsed out by the parsing unit 304, the URL to be accessed by the user, and the level of that URL to the server, so that the server counts, for the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL to be accessed.
In this embodiment of the present invention, the first receiving unit 301 receives a service access request that contains the URL to be accessed by the user. The URL matching unit 302 matches that URL against built-in recommended URLs stored by level. When the URL matching unit 302 matches an identical recommended URL, the level determining unit 303 determines that the level of the URL to be accessed by the user is the level associated with the matched recommended URL, where the level of a recommended URL corresponds to the level of the recommendation information presented on the terminal. The parsing unit 304 parses out the identifier of the terminal that sent the service access request. When the level of the URL to be accessed is greater than or equal to the first threshold, the sending unit 305 sends the terminal identifier, the URL, and the level to the server, so that the server counts, for the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL to be accessed. Compared with the prior art, the network device provided by this embodiment can reduce the amount of data to be analyzed and improve the precision of the analysis.
Referring to FIG. 6, on the basis of the embodiment corresponding to FIG. 5, another embodiment of the network device provided by the embodiments of the present invention further includes:
an obtaining unit 306, configured to obtain the URL to be accessed by the user from the service access request received by the first receiving unit 301.
The URL matching unit 302 is configured to match the URL obtained by the obtaining unit 306 against the recommended URLs in a built-in recommended URL list, where the recommended URL list includes a mapping between each recommended URL and the level of that recommended URL.
Referring to FIG. 7, on the basis of the embodiment corresponding to FIG. 6, in another embodiment of the network device provided by the embodiments of the present invention, the obtaining unit 306 includes:
a filtering module 3061, configured to filter out duplicate data packets when the service access request is carried in multiple data packets;
a sorting module 3062, configured to sort the data packets remaining after filtering by the filtering module 3061 to form a complete data stream; and
a parsing module 3063, configured to parse the URL to be accessed by the user out of the complete data stream formed by the sorting module 3062.
When the service access request is carried in multiple data packets, the filtering module 3061 in the obtaining unit 306 of the network device filters out duplicate data packets; the sorting module 3062 sorts the data packets remaining after filtering by the filtering module 3061 to form a complete data stream; and the parsing module 3063 parses the URL to be accessed by the user out of the complete data stream formed by the sorting module 3062.
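The work of the filtering, sorting, and parsing modules can be illustrated with the sketch below. It assumes that each data packet carries a sequence number and a payload, that duplicates share a sequence number, and that the request is a plain HTTP/1.1 request whose URL can be rebuilt from the request line and the Host header. The `Packet` type and the three function names are hypothetical and are not taken from the specification.

```python
from typing import Iterable, List, NamedTuple, Optional


class Packet(NamedTuple):
    seq: int        # assumed sequence number used for ordering
    payload: bytes  # fragment of the service access request


def filter_duplicates(packets: Iterable[Packet]) -> List[Packet]:
    """Filtering module: keep the first packet seen for each sequence number."""
    seen = set()
    unique = []
    for packet in packets:
        if packet.seq not in seen:
            seen.add(packet.seq)
            unique.append(packet)
    return unique


def reassemble(packets: Iterable[Packet]) -> bytes:
    """Sorting module: order packets by sequence number and join the payloads."""
    return b"".join(p.payload for p in sorted(packets, key=lambda p: p.seq))


def parse_url(stream: bytes) -> Optional[str]:
    """Parsing module: rebuild the URL from the request line and Host header."""
    head = stream.split(b"\r\n\r\n", 1)[0].decode("latin-1")
    lines = head.split("\r\n")
    parts = lines[0].split(" ")
    if len(parts) < 2:
        return None
    path = parts[1]
    for line in lines[1:]:
        name, _, value = line.partition(":")
        if name.strip().lower() == "host":
            return "http://" + value.strip() + path
    return None


# Out-of-order packets with one duplicate.
packets = [
    Packet(2, b"Host: huoyan.blogcn.com\r\n\r\n"),
    Packet(1, b"GET /88 HTTP/1.1\r\n"),
    Packet(1, b"GET /88 HTTP/1.1\r\n"),
]
url = parse_url(reassemble(filter_duplicates(packets)))
```

Here the duplicate of packet 1 is dropped, the remaining two packets are put back in order, and the URL from the application scenario is recovered from the reassembled stream.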
Referring to FIG. 8, an embodiment of the server provided in the embodiments of the present invention includes:
a second receiving unit 401, configured to receive a terminal identifier, the URL to be accessed by the user, and the level of that URL;
a storage unit 402, configured to store in association the terminal identifier, the URL to be accessed by the user, and the level of that URL received by the second receiving unit 401; and
an analysis unit 403, configured to use the data stored in the storage unit 402 whose level is greater than or equal to the first threshold as the data source and to count, for the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL to be accessed.
In this embodiment of the present invention, the second receiving unit 401 receives a terminal identifier, the URL to be accessed by the user, and the level of that URL; the storage unit 402 stores them in association; and the analysis unit 403 uses the data stored in the storage unit 402 whose level is greater than or equal to the first threshold as the data source and counts, for the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL to be accessed. Compared with the prior art, the server provided in this embodiment can analyze the corresponding data by level, which reduces the amount of data to be analyzed and improves the precision of the analysis.
Referring to FIG. 9, on the basis of the embodiment corresponding to FIG. 8, another embodiment of the server provided by the embodiments of the present invention further includes:
a query unit 404, configured to look up the home location of each terminal identifier in a terminal identifier home location list.
The storage unit 402 is further configured to store in association the URL to be accessed by the user, the terminal identifier corresponding to that URL, and the home location of the terminal identifier found by the query unit 404.
The analysis unit 403 is further configured to count the number of terminal identifiers in each home location associated with each URL to be accessed, as stored in the storage unit 402.
Referring to FIG. 10, an embodiment of the data analysis system provided by the embodiments of the present invention includes a network device 30 and a server 40.
The network device 30 is configured to: receive a service access request, where the service access request contains the URL to be accessed by the user; match the URL to be accessed by the user against built-in recommended URLs stored by level; when an identical recommended URL is matched, determine that the level of the URL to be accessed by the user is the level associated with the matched recommended URL, where the level of a recommended URL corresponds to the level of the recommendation information presented on the terminal; parse out the identifier of the terminal that sent the service access request; and, if the level of the URL to be accessed by the user is greater than or equal to the first threshold, send the parsed terminal identifier, the URL to be accessed by the user, and the level of that URL to the server, so that the server counts, for the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL to be accessed.
The server 40 is configured to: receive the terminal identifier, the URL to be accessed by the user, and the level of that URL; store the terminal identifier, the URL, and the level in association; and, using the data whose level is greater than or equal to the first threshold as the data source, count, for the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL to be accessed.
The system provided by this embodiment of the present invention can reduce the amount of data to be analyzed and improve the precision of the analysis.
Referring to FIG. 11, another embodiment of the system provided by the embodiments of the present invention includes a network device 30, a deep packet inspection device 50, and a server 40, where the deep packet inspection device 50 has a communication connection with the network device 30.
The network device 30 is configured to receive a service access request and forward the service access request, or a mirrored packet of the service access request, to the deep packet inspection device 50, where the service access request contains the URL to be accessed by the user.
The deep packet inspection device 50 is configured to: match the URL to be accessed by the user against built-in recommended URLs stored by level; when an identical recommended URL is matched, determine that the level of the URL to be accessed by the user is the level associated with the matched recommended URL, where the level of a recommended URL corresponds to the level of the recommendation information presented on the terminal; parse out the identifier of the terminal that sent the service access request; and, if the level of the URL to be accessed by the user is greater than or equal to the first threshold, send the parsed terminal identifier, the URL to be accessed by the user, and the level of that URL to the network device 30.
The network device 30 is further configured to send the received terminal identifier, URL to be accessed by the user, and level of that URL to the server.
The server 40 is configured to use the data whose level is greater than or equal to the first threshold as the data source and to count, for the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL to be accessed.
The system provided by this embodiment of the present invention can reduce the amount of data to be analyzed and improve the precision of the analysis.
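The message flow among the three parts of the system with an external deep packet inspection device can be sketched as below. The class names, method names, example URLs, and threshold value are assumptions chosen for illustration; the sketch models the forwarding of a request (or its mirrored packet) as a direct method call and omits packet handling entirely.

```python
class Server:
    """Hypothetical server that stores the records it receives."""

    def __init__(self):
        self.rows = []

    def receive(self, row):
        self.rows.append(row)


class DpiDevice:
    """Hypothetical deep packet inspection device holding the URL levels."""

    def __init__(self, levels, threshold):
        self.levels = levels        # recommended URL -> level
        self.threshold = threshold  # first threshold

    def inspect(self, terminal_id, url):
        """Return (terminal_id, url, level) if the level reaches the
        threshold, otherwise None."""
        level = self.levels.get(url)
        if level is not None and level >= self.threshold:
            return (terminal_id, url, level)
        return None


class NetworkDevice:
    """Hypothetical network device that relays between DPI device and server."""

    def __init__(self, dpi, server):
        self.dpi = dpi
        self.server = server

    def on_request(self, terminal_id, url):
        # Forward the request (or its mirrored packet) to the DPI device.
        result = self.dpi.inspect(terminal_id, url)
        # Relay whatever the DPI device returns to the server.
        if result is not None:
            self.server.receive(result)


server = Server()
dpi = DpiDevice({"http://example.com/": 1, "http://example.com/coats": 2}, 2)
device = NetworkDevice(dpi, server)
device.on_request("13100000000", "http://example.com/coats")
device.on_request("13200000000", "http://example.com/")
```

In this arrangement the matching and threshold check live in the deep packet inspection device, while the network device only relays requests and results.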
In summary, in the embodiments of the present invention, a service access request containing the URL to be accessed by the user is received; the URL to be accessed by the user is matched against built-in recommended URLs stored by level; when an identical recommended URL is matched, the level of the URL to be accessed by the user is determined to be the level associated with the matched recommended URL, where the level of a recommended URL corresponds to the level of the recommendation information presented on the terminal; the identifier of the terminal that sent the service access request is parsed out; and, if the level of the URL to be accessed by the user is greater than or equal to the first threshold, the parsed terminal identifier, the URL to be accessed by the user, and the level of that URL are sent to the server, so that the server counts, for the levels greater than or equal to the first threshold, the number of terminal identifiers corresponding to each URL to be accessed. In the prior art, the back-end server analyzes every access made by a user; by contrast, the data analysis method provided by the embodiments of the present invention can reduce the amount of data to be analyzed and improve the precision of the analysis.
The embodiments of the present invention are based on presenting content (for example, media or advertising content) in levels in advance. The deeper the level of the recommended media content a user clicks and views, the more precise the resulting statistics. Presenting content in levels also identifies the access information for key URLs, reduces invalid access records, and greatly reduces the volume of data entering the statistics.
In the embodiments of the present invention, the entity that performs the data analysis on the network side may be a network device with integrated deep packet inspection (DPI) functionality, such as a Gateway GPRS Support Node (GGSN). It may also be a deep packet inspection device that has a communication connection with the network device, specifically a deep packet inspection device attached externally to the network device; for example, a device with DPI functionality is attached to the GGSN and establishes a communication connection with the GGSN.
A person of ordinary skill in the art can understand that all or some of the steps in the methods of the above embodiments can be completed by a program instructing the relevant hardware. The program can be stored in a computer-readable storage medium, and the storage medium may include a ROM, a RAM, a magnetic disk, an optical disc, or the like.
The data analysis method, network device, and server provided by the embodiments of the present invention have been described in detail above. Specific examples are used herein to explain the principle and implementation of the present invention, and the description of the above embodiments is intended only to help in understanding the method of the present invention and its core idea. A person of ordinary skill in the art may, in accordance with the idea of the present invention, make changes to the specific implementation and the scope of application. In summary, the content of this specification should not be construed as a limitation on the present invention.
| Application Number | Publication | Priority Date | Filing Date | Title |
|---|---|---|---|---|
| CN201210038235.0A | CN102611572B (en) | 2012-02-20 | 2012-02-20 | Data analysis method, device and system |
| Publication Number | Publication Date |
|---|---|
| CN102611572A | 2012-07-25 |
| CN102611572B | 2014-12-24 |
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant |