CN104123659A - Commodity networked gene based brand intellectual property protection platform - Google Patents

Commodity networked gene based brand intellectual property protection platform

Info

Publication number
CN104123659A
Authority
CN
China
Prior art keywords
data
commodity
module
polarity
protection platform
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410368754.2A
Other languages
Chinese (zh)
Inventor
刘浩
陈贤
刘卫平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
HANGZHOU YEGOON TECHNOLOGY Co Ltd
Original Assignee
HANGZHOU YEGOON TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2014-07-30
Filing date
2014-07-30
Publication date
2014-10-29
Application filed by HANGZHOU YEGOON TECHNOLOGY Co Ltd
Priority to CN201410368754.2A
Publication of CN104123659A
Legal status: Pending

Abstract

The invention discloses a commodity networked gene based brand intellectual property protection platform comprising a data source module, a data collection module, a data integration module, a data storage module, a data analysis module, a target detection module, a visualization module and a data application module. When collecting data from the data sources, the data collection module uses the open-source Hadoop platform to build a distributed, network-wide commodity data crawling system; the data analysis module structures large volumes of unstructured commodity review data; the target detection module uses an established infringing-commodity identification model to analyze and detect suspected infringing commodities; and the visualization module displays the detected suspected infringing commodities through a visual interface. The platform uses less manpower and fewer material resources, handles a larger market effectively, and lowers the cost of protecting an enterprise's intellectual property, thereby increasing economic benefits.

Description

Brand intellectual property protection platform based on commodity network genes
Technical field
The present invention relates to an online rights-protection platform, and in particular to a brand intellectual property protection platform based on commodity network genes.
Background art
The anti-counterfeiting industry currently lags behind technologically. Counterfeit goods are mostly identified through manual inspection or consumer reports, after which the brand owner protects its intellectual property through legal action. This approach is costly and only moderately effective. With the growth of e-commerce, the particular nature of e-commerce platforms lets sellers of counterfeit goods operate more covertly, which makes infringement harder to combat. Constrained by limited technical capability and funding, enterprises find it difficult to extract information about counterfeit sales from massive data through effective data analysis. Although each enterprise spends heavily every year and adds manpower and material resources, it remains hard to guard against ever-growing counterfeit infringement, and the limitations of traditional anti-counterfeiting and rights-protection methods have become apparent.
Summary of the invention
To remedy the defects and deficiencies of the prior art described above, the present invention provides a brand intellectual property protection platform based on commodity network genes that uses less manpower and fewer material resources, handles a larger market effectively, and lowers the cost of protecting an enterprise's intellectual property, thereby improving economic returns.
Technical solution of the present invention: a brand intellectual property protection platform based on commodity network genes, comprising a data source module, a data collection module, a data integration module, a data storage module, a data analysis module, a target detection module, a visualization module and a data application module, wherein:
The data collection module, when collecting data from the data sources, uses the open-source Hadoop platform to build a distributed, network-wide commodity data crawling system;
The data integration module uses the SKU library and SKU feature library built by the system to uniquely identify commodities from different sources in the collected data, and structures and cleans the unstructured data;
The data storage module stores the integrated data in a data warehouse to support data analysis;
The data analysis module structures large volumes of unstructured commodity review data;
The target detection module analyzes the data with an established infringing-commodity identification model to detect suspected infringing commodities;
The visualization module presents the detected suspected infringing commodities to the client through a visual interface.
The distributed, network-wide commodity data crawling system built on the open-source Hadoop platform has the following features: 1) High performance and high stability. The system performs multithreaded, distributed crawling; independent crawl processes do not affect one another; when a crawl task fails, an automatic recovery mechanism takes over, giving crawler job stability above 99.99%; and the crawler fleet can be scaled horizontally quickly as business demand requires. 2) Scheduling algorithm. Crawl work is weighted according to each client's importance and the time of its last monitoring run, so that the system responds quickly to new and important clients. 3) Context memory. The system records the context in which the data for each commodity page from each source was last crawled, which ensures that data is updated incrementally.
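The scheduling feature can be illustrated with a minimal sketch. The patent does not give the weighting formula, so the linear combination below, the `CrawlTask` fields, and the two factor values are all assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class CrawlTask:
    client: str
    importance: float        # hypothetical scale: higher means a more important client
    hours_since_last: float  # hours since the last monitoring run (large for new clients)


def crawl_weight(task, importance_factor=1.0, staleness_factor=0.1):
    """Combine importance and staleness into one weight (assumed linear form)."""
    return importance_factor * task.importance + staleness_factor * task.hours_since_last


def schedule(tasks):
    """Order tasks from highest to lowest crawl weight."""
    return sorted(tasks, key=crawl_weight, reverse=True)


tasks = [
    CrawlTask("regular-client", importance=1.0, hours_since_last=2.0),
    CrawlTask("key-client", importance=5.0, hours_since_last=1.0),
    CrawlTask("new-client", importance=1.0, hours_since_last=72.0),
]
order = [task.client for task in schedule(tasks)]
```

With these illustrative values, the client that has never been monitored is crawled first, then the important client, then the regular one. Treating a new client as one with a very long time since its last run is one simple way to obtain the quick response to new clients that the text describes.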
In the data integration module, metadata definition is the most important preliminary step of data cleaning. The SKU (Stock Keeping Unit) is the smallest unit in which a commodity is represented in the e-commerce sales process. On the internet, however, the same commodity is sold under many different titles and commodity codes, so protecting intellectual property in commodities requires defining SKU metadata for the commodities being protected. The present invention defines, according to anti-counterfeiting needs, the SKU format and identifying features of the commodity metadata on its own platform (see the commodity metadata SKU definition in Fig. 3). Using each platform's open interfaces and its own data collection system, it gathers the structured, semi-structured and unstructured data scattered across the major e-commerce platforms and social media platforms and integrates it into the commodity library of its own data platform, providing a basis for further mining of commodity data.
Preferably, the data collection module collects data from each independent channel. The data comprises the enterprise's own data, that is, all relevant product data that can be collected on the enterprise's own platforms; data on relevant products on e-commerce platforms; data on relevant products on microblog platforms; and commodity-related data of all kinds from other relevant forums.
Preferably, the data analysis module first extracts product-feature and user-opinion keywords using natural language processing; it then builds a Chinese polarity dictionary that defines the polarity of the opinion expressed by each keyword; finally, it uses keyword polarity judgments to convert commodity reviews into a computable data format.
Preferably, feature keywords are extracted mainly by preprocessing the review text and then combining high-frequency word statistics, syntactic dependency analysis for low-frequency words, and manual additions, so that the extracted features cover most of the main review features in the commodity review information.
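A minimal sketch of the high-frequency statistics and manual-addition steps follows. It assumes the reviews have already been preprocessed into word lists (for Chinese text this would require a word segmenter), and the function name, the stopword set, and the `min_share` threshold are hypothetical. The syntactic dependency step for low-frequency words is not shown.

```python
from collections import Counter


def extract_feature_words(comments, stopwords, min_share=0.5, manual=()):
    """Keep words that occur in at least `min_share` of the comments, plus manual additions."""
    doc_freq = Counter()
    for words in comments:
        # Count each word once per comment so a single long review cannot dominate.
        doc_freq.update(set(words) - stopwords)
    threshold = min_share * len(comments)
    frequent = {word for word, count in doc_freq.items() if count >= threshold}
    return frequent | set(manual)


comments = [
    ["the", "sole", "is", "soft"],
    ["sole", "came", "unglued", "and", "the", "stitching", "is", "poor"],
    ["nice", "stitching", "but", "the", "sole", "is", "thin"],
    ["the", "logo", "looks", "wrong"],
]
stopwords = {"the", "is", "and", "but"}
features = extract_feature_words(comments, stopwords, min_share=0.5, manual={"logo"})
```

In this toy data "sole" and "stitching" pass the frequency threshold, while "logo" is too rare and enters only through the manual list, which mirrors the role the text gives to manual additions.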
Preferably, existing product-feature extraction methods are analyzed and studied in order to further improve the product-feature-word extraction method based on statistics and pattern matching.
Preferably, analysis methods for opinion-word extraction that rely on various sentence dependencies, such as those based on maximum entropy, SVM and decision trees, are studied and improved in order to further raise the accuracy of extracting product-feature words and user opinions.
Preferably, the Chinese polarity dictionary is built on HowNet and extended with a network semantic polarity dictionary; a synonym dictionary is also added so that the polarity of synonyms can be judged and analyzed, and a programmatic judgment of polarity intensity is added, improving the accuracy of polarity judgments for synonyms in user reviews.
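The combination of a base polarity dictionary, a synonym dictionary, and an intensity judgment can be sketched as below. The three small dictionaries are toy English stand-ins invented for illustration; they are not HowNet data, and the numeric intensity values are assumptions.

```python
# Hypothetical base lexicon: opinion word -> polarity (+1 positive, -1 negative).
BASE_POLARITY = {"good": 1, "bad": -1}
# Hypothetical synonym dictionary: word -> entry in the base lexicon.
SYNONYMS = {"great": "good", "fine": "good", "poor": "bad", "awful": "bad"}
# Hypothetical intensity modifiers that scale the polarity.
INTENSITY = {"very": 2.0, "slightly": 0.5}


def polarity(word, modifier=None):
    """Return a signed polarity score, resolving synonyms and applying intensity."""
    base = SYNONYMS.get(word, word)
    if base not in BASE_POLARITY:
        return 0.0  # unknown words are treated as neutral
    return BASE_POLARITY[base] * INTENSITY.get(modifier, 1.0)
```

For example, `polarity("poor", "very")` resolves "poor" to "bad" and doubles its strength, while a word absent from both dictionaries scores as neutral.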
Preferably, the Chinese polarity dictionary is used to structure users' review opinions into a computable data format.
Preferably, the target detection module uses data mined from the genuine-product sales pages supplied by the client manufacturer as the training set, and extracts feature vectors consisting of the genuine product's price and the polarity of users' opinions on its product features. Using a commodity authenticity verification model, it then verifies the data from other sales pages across the network that offer the same commodity, as located by its unique SKU, and obtains the probability that the commodity is genuine.
Preferably, for commodities found to be infringing, the data application module contacts and communicates with the client through a connected rights-protection service platform, and provides rights-protection services to the enterprise directly through legal action, complaints and positive guidance, effectively protecting the enterprise's intellectual property from infringement.
The present invention applies big data to intellectual property protection. Using an independently developed review analysis system and proprietary semantic big-data analysis technology, it helps brand manufacturers analyze user reviews and feedback on e-commerce platforms, accurately locate infringing commodities and their sellers, and protect the enterprise's intellectual property from infringement through the follow-up rights-protection solutions the company provides. Compared with traditional anti-counterfeiting methods, the invention handles a larger market with less manpower and fewer material resources and lowers the cost of protecting an enterprise's intellectual property, thereby improving economic returns.
Brief description of the drawings
Fig. 1 is the technical roadmap of the present invention;
Fig. 2 is a schematic diagram of the distributed, network-wide commodity data crawling system built on the Hadoop platform;
Fig. 3 is a schematic diagram of the commodity metadata SKU definition;
Fig. 4 is a schematic diagram of data analysis.
Detailed description
The present invention is described in further detail below with reference to the drawings and an embodiment, which do not limit its scope of protection.
As shown in Fig. 1, the present invention mainly comprises four parts:
1. Data source module
Data is collected from each independent channel. The data comprises the enterprise's own data, that is, all relevant product data that can be collected on the enterprise's own platforms; data on relevant products on e-commerce platforms; data on relevant products on microblog platforms; and commodity-related data of all kinds from other relevant forums.
2. Data integration
The data integration part comprises three modules: data collection, data integration and data storage.
(1) Data collection. When collecting data from the data sources, we use the open-source Hadoop platform to build a distributed, network-wide commodity data crawling system (as shown in Fig. 2). The system has the following features: 1) High performance and high stability. The system performs multithreaded, distributed crawling; independent crawl processes do not affect one another; when a crawl task fails, an automatic recovery mechanism takes over, giving crawler job stability above 99.99%; and the crawler fleet can be scaled horizontally quickly as business demand requires. 2) Scheduling algorithm. Crawl work is weighted according to each client's importance and the time of its last monitoring run, so that the system responds quickly to new and important clients. 3) Context memory. The system records the context in which the data for each commodity page from each source was last crawled, which ensures that data is updated incrementally.
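The context memory that enables incremental updates can be sketched as follows. The patent does not say what the recorded context contains, so storing a content fingerprint per page, the `CrawlContext` class, and the example URLs are assumptions for illustration.

```python
import hashlib


class CrawlContext:
    """Remembers a fingerprint of each page as it was last crawled."""

    def __init__(self):
        self._seen = {}  # url -> SHA-256 hex digest of the last crawled content

    def changed(self, url, content):
        """Record the page and report whether it is new or differs from the last crawl."""
        digest = hashlib.sha256(content.encode("utf-8")).hexdigest()
        is_changed = self._seen.get(url) != digest
        self._seen[url] = digest
        return is_changed


def incremental_update(context, pages):
    """Return only the URLs whose content needs to be reprocessed."""
    return [url for url, content in pages if context.changed(url, content)]


ctx = CrawlContext()
first = incremental_update(ctx, [("shop/a", "price 100"), ("shop/b", "price 80")])
second = incremental_update(ctx, [("shop/a", "price 100"), ("shop/b", "price 75")])
```

On the first pass both pages are new and are returned; on the second pass only `shop/b`, whose content changed, is returned. A production system would persist this state in shared storage so that all crawl workers see it, which this in-memory sketch does not attempt.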
(2) Data integration. Using the SKU library and SKU feature library built by the system (as shown in Fig. 3), the commodities from different sources in the collected data are uniquely identified, and the unstructured data is structured and cleaned. Metadata definition is the most important preliminary step of data cleaning. The SKU (Stock Keeping Unit) is the smallest unit in which a commodity is represented in the e-commerce sales process. On the internet, however, the same commodity is sold under many different titles and commodity codes, so protecting intellectual property in commodities requires defining SKU metadata for the commodities being protected. The present invention defines, according to anti-counterfeiting needs, the SKU format and identifying features of the commodity metadata on its own platform (see the commodity metadata SKU definition in Fig. 3). Using each platform's open interfaces and its own data collection system, it gathers the structured, semi-structured and unstructured data scattered across the major e-commerce platforms and social media platforms and integrates it into the commodity library of its own data platform, providing a basis for further mining of commodity data.
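One way to picture uniqueness identification against an SKU feature library is the sketch below. The SKU identifiers, the brand and model tokens, and the rule that every feature token must appear in the listing title are hypothetical; the actual SKU format is defined in Fig. 3, which is not reproduced here, and Chinese titles would need a word segmenter rather than this simple regular expression.

```python
import re

# Hypothetical SKU feature library: SKU id -> tokens that must all appear in a listing title.
SKU_FEATURES = {
    "SKU-001": {"acme", "runner", "x2"},
    "SKU-002": {"acme", "trail", "pro"},
}


def tokenize(title):
    """Lower-case the title and split it into alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", title.lower()))


def identify_sku(title, sku_features=SKU_FEATURES):
    """Return the single SKU whose features are all present, or None if zero or several match."""
    tokens = tokenize(title)
    matches = [sku for sku, feats in sku_features.items() if feats <= tokens]
    return matches[0] if len(matches) == 1 else None
```

Listings such as "ACME Runner X2 men's shoes, free shipping" and "acme runner-x2 2014 new" both resolve to `SKU-001` despite their different titles, while a title that matches no SKU, or more than one, is left unidentified for later review.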
(3) Data storage. Finally, the data storage module stores the integrated data in a data warehouse to support data analysis.
3. Data analysis
This part comprises three modules: data analysis, target detection and visualization (as shown in Fig. 4).
(1) Data analysis. The data analysis module mainly structures large volumes of unstructured commodity review data. It first extracts product-feature and user-opinion keywords using natural language processing; it then builds a Chinese polarity dictionary that defines the polarity of the opinion expressed by each keyword. Finally, it uses keyword polarity judgments to convert commodity reviews into a computable data format.
1) Feature-word extraction. Feature words are extracted mainly by preprocessing the review text and then combining high-frequency word statistics, syntactic dependency analysis for low-frequency words, and manual additions, so that the extracted features cover most of the main review features in the commodity review information. Opinion mining, a branch of natural language processing, is one of the key technologies of this module. In our research in this area, we mainly analyze and study existing product-feature extraction methods and further improve the product-feature-word extraction method based on statistics and pattern matching. We have also studied and improved analysis methods for opinion-word extraction that rely on various sentence dependencies, such as those based on maximum entropy, SVM and decision trees, further raising the accuracy of extracting product-feature words and user opinions.
2) Chinese polarity dictionary. The Chinese polarity dictionary is built on HowNet and extended with a network semantic polarity dictionary; a synonym dictionary is also added so that the polarity of synonyms can be judged and analyzed, and a programmatic judgment of polarity intensity is added, improving the accuracy of polarity judgments for synonyms in user reviews.
3) Structuring of review opinions. Using the polarity dictionary, users' review opinions are structured into a computable data format.
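The end result of steps 1) to 3) can be pictured as a fixed-length numeric vector per commodity listing. The sketch below assumes that earlier steps have already produced (feature, polarity score) pairs from the reviews; averaging the scores per feature is an assumption, since the patent does not specify the computable format.

```python
def structure_opinions(opinions, features):
    """Average the polarity scores per feature and return them in a fixed feature order."""
    totals = {feature: 0.0 for feature in features}
    counts = {feature: 0 for feature in features}
    for feature, score in opinions:
        if feature in totals:  # opinions about untracked features are ignored
            totals[feature] += score
            counts[feature] += 1
    return [totals[f] / counts[f] if counts[f] else 0.0 for f in features]


FEATURES = ["sole", "stitching", "logo"]
opinions = [
    ("sole", 1.0), ("sole", -1.0), ("stitching", -2.0),
    ("sole", 1.0), ("color", 1.0),
]
vector = structure_opinions(opinions, FEATURES)
```

Here `vector` holds one averaged score each for "sole", "stitching" and "logo"; a feature that no review mentions scores 0.0. Keeping the feature order fixed is what makes vectors from different sales pages directly comparable.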
(2) Target detection. The target detection module analyzes the data with an established infringing-commodity identification model to detect suspected infringing commodities. It uses data mined from the genuine-product sales pages supplied by the client manufacturer as the training set, and extracts feature vectors consisting of the genuine product's price and the polarity of users' opinions on its product features. Using a commodity authenticity verification model, it then verifies the data from other sales pages across the network that offer the same commodity, as located by its unique SKU, and obtains the probability that the commodity is genuine.
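The patent does not disclose the form of the authenticity verification model. As a stand-in, the sketch below fits a per-dimension mean and standard deviation to vectors from genuine sales pages and maps a listing's distance from that profile to a score between 0 and 1. The Gaussian-style scoring, the function names, and all numbers are assumptions, and the score is only a rough proxy for the probability the text mentions.

```python
import math
from statistics import mean, pstdev


def fit_genuine_profile(training_vectors):
    """Per-dimension (mean, standard deviation) of the genuine-product training set."""
    columns = list(zip(*training_vectors))
    # Fall back to 1.0 when a dimension is constant, to avoid dividing by zero.
    return [(mean(col), pstdev(col) or 1.0) for col in columns]


def genuine_score(vector, profile):
    """Score in [0, 1]: close to 1 near the genuine profile, close to 0 far from it."""
    squared_z = [((x - m) / s) ** 2 for x, (m, s) in zip(vector, profile)]
    return math.exp(-0.5 * mean(squared_z))


# Each vector: [price, polarity for "sole", polarity for "stitching"] (illustrative values).
genuine_pages = [[100.0, 0.8, 0.6], [110.0, 0.6, 0.8], [90.0, 0.7, 0.7]]
profile = fit_genuine_profile(genuine_pages)
typical = genuine_score([100.0, 0.7, 0.7], profile)
suspect = genuine_score([35.0, -0.5, -0.8], profile)
```

A listing priced and reviewed like the genuine pages scores near 1, while a listing with a far lower price and negative opinions on the same features scores near 0 and would be flagged as suspected infringement. Training on genuine pages only makes this a one-class approach; a classifier trained on labeled counterfeits would be a different design.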
(3) Visualization. The visualization module presents the detected suspected infringing commodities to the client through a visual interface.
4. Data application
For commodities found to be infringing, we contact and communicate with the client through a connected rights-protection service platform. Through legal action, complaints, positive guidance and similar means, we provide rights-protection services to the enterprise directly and effectively protect its intellectual property from infringement.
The present invention mainly targets e-commerce platforms. Using a world-leading Chinese semantic big-data analysis system, it accurately collects and analyzes all review data from online shops on the major e-commerce platforms, discards invalid reviews, and helps enterprises uncover infringing commodities of all kinds in the e-commerce market. Relying on the relevant laws and on the intellectual property protection rules of the e-commerce platforms, it removes counterfeit commodities and their sellers, which effectively lowers the enterprise's anti-counterfeiting costs and protects its intellectual property from infringement. Drawing on our industry experience and technical expertise, we have built a visual intellectual property protection platform that gives brand owners a reliable channel for intellectual property protection and consumers a reliable channel for protecting their rights. Our company has established partnerships with 15 domestic brands, large and small, including Seven Wolves, Yi Erkang, Unified and Jeanwest. It has helped Seven Wolves remove counterfeit goods with total sales of 25.50 million and Yi Erkang 24.12 million. Before using this product, Yi Erkang spent 4 million on online anti-counterfeiting in 2013 with little effect. After partnering with our company, it used the present invention's Chinese semantic big-data analysis system to accurately collect and analyze all review data from online shops on the major e-commerce platforms, discard invalid reviews, and automatically uncover infringing commodities of all kinds in the e-commerce market. This greatly reduced the manpower and material resources required, cut anti-counterfeiting costs by nearly 90%, and improved the effectiveness of counterfeit control thirtyfold.

Claims (10)

CN201410368754.2A | 2014-07-30 | 2014-07-30 | Commodity networked gene based brand intellectual property protection platform | Pending | CN104123659A (en)

Priority Applications (1)

Application Number | Priority Date | Filing Date | Title
CN201410368754.2A / CN104123659A (en) | 2014-07-30 | 2014-07-30 | Commodity networked gene based brand intellectual property protection platform

Applications Claiming Priority (1)

Application Number | Priority Date | Filing Date | Title
CN201410368754.2A / CN104123659A (en) | 2014-07-30 | 2014-07-30 | Commodity networked gene based brand intellectual property protection platform

Publications (1)

Publication Number | Publication Date
CN104123659A (en) | 2014-10-29

Family

ID=51769061

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
CN201410368754.2A (Pending, CN104123659A (en)) | Commodity networked gene based brand intellectual property protection platform | 2014-07-30 | 2014-07-30

Country Status (1)

Country | Link
CN (1) | CN104123659A (en)

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN101141456A (en) * | 2007-10-09 | 2008-03-12 | 南京财经大学 | Web Data Mining Method Based on Vertical Search
CN101231661A (en) * | 2008-02-19 | 2008-07-30 | 上海估家网络科技有限公司 | Method and system for object-level knowledge mining
CN103810264A (en) * | 2014-01-27 | 2014-05-21 | 西安理工大学 | Webpage text classification method based on feature selection

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
严鸿毅: ""基于聚焦爬虫的网上药品信息监测系统"", 《中国优秀硕士学位论文全文数据库 信息科技辑》*

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN104680328A (en) * | 2015-03-16 | 2015-06-03 | 朗新科技股份有限公司 | Power grid construction quality monitoring method based on client perception values
CN105677768A (en) * | 2015-12-30 | 2016-06-15 | 芜湖乐锐思信息咨询有限公司 | Networked classification analysis system based on complex products
CN105809451A (en) * | 2016-02-29 | 2016-07-27 | 江苏大学 | Big data based e-commerce company evaluating, analyzing and predicting method and system for online shopping
CN106202266A (en) * | 2016-07-01 | 2016-12-07 | 北京华科合创科技发展有限公司 | A kind of enterprise managing integrated data analysing method based on robot control system(RCS)
CN107784507A (en) * | 2017-09-30 | 2018-03-09 | 广东工业大学 | Doubtful infringement commodity method for early warning and device, computer-readable storage medium and equipment
CN107506503B (en) * | 2017-10-20 | 2020-03-17 | 福州顺升科技有限公司 | Intellectual property appearance infringement analysis and management system
CN107506503A (en) * | 2017-10-20 | 2017-12-22 | 福州顺升科技有限公司 | A kind of intellectual property outward appearance infringement analysis and management system
CN108376359A (en) * | 2018-03-16 | 2018-08-07 | 深圳市华慧品牌管理有限公司 | IPR licensing contract sales device and marketing method based on electric business sales data
CN108845942A (en) * | 2018-06-20 | 2018-11-20 | 上海哔哩哔哩科技有限公司 | Product feature management method, device, system and storage medium
CN108845942B (en) * | 2018-06-20 | 2024-03-12 | 上海幻电信息科技有限公司 | Product feature management method, device, system and storage medium
CN109033330A (en) * | 2018-07-19 | 2018-12-18 | 北京车联天下信息技术有限公司 | Big data cleaning method, device and server
CN109345293A (en) * | 2018-09-17 | 2019-02-15 | 上海宝尊电子商务有限公司 | A kind of big data information service method and system towards brand electric business
CN109448793A (en) * | 2018-10-15 | 2019-03-08 | 智慧芽信息科技(苏州)有限公司 | The interest field identification of gene order, retrieval and infringement determination method, system
CN109815394A (en) * | 2018-12-26 | 2019-05-28 | 北京博鳌纵横网络科技有限公司 | An intellectual property custody system
CN109727646A (en) * | 2018-12-29 | 2019-05-07 | 北京优迅医学检验实验室有限公司 | The processing method and processing device of cdna sample, mobile terminal
CN111815486A (en) * | 2020-06-03 | 2020-10-23 | 兰州集智信息技术有限公司 | Service platform and method for searching clues of infringing products

Similar Documents

Publication | Title
CN104123659A (en) | Commodity networked gene based brand intellectual property protection platform
CN107977798B (en) | A risk assessment method for e-commerce product quality
KR101741509B1 (en) | Device and method for analyzing corporate reputation by data mining of news, recording medium for performing the method
Bhakuni et al. | Evolution and evaluation: Sarcasm analysis for twitter data using sentiment analysis
Wang et al. | Using social media mining technology to assist in price prediction of stock market
Nagar et al. | Using text and data mining techniques to extract stock market sentiment from live news streams
Anas et al. | Opinion mining based fake product review monitoring and removal system
CN107292744A (en) | Investment Trend analysis method and its system based on machine learning
US20240256878A1 (en) | Deep learning entity matching system using weak supervision
US20190340517A2 (en) | A method for detection and characterization of technical emergence and associated methods
Bella et al. | ATLaS: A framework for traceability links recovery combining information retrieval and semi-supervised techniques
CN104616180A (en) | Method for predicting hot sellers
Sharonova et al. | Issues of Fact-based Information Analysis.
Sinha | Analysis of anomaly and novelty detection in time series data using machine learning techniques
CN119180266A (en) | Historical data-based audit opinion generation method, device and equipment
Singh et al. | Twitter sentiment analysis for stock prediction
Jishtu et al. | Prediction of the stock market based on machine learning and sentiment analysis
CN108717637B (en) | Automatic mining method and system for E-commerce safety related entities
CN110909050A (en) | Data statistical analysis system
Matti et al. | Financial fraud detection using social media crowdsourcing
Wang et al. | Research on opinion spam detection by time series anomaly detection
Shri et al. | An effective approach to rank reviews based on relevance by weighting method
Prasad et al. | Sentiment analysis of customer product reviews using machine learning
Xing et al. | Social media text sentiment analysis: exploration of machine learning methods
Xiuli et al. | Electronic Commerce Data Mining using Rough Set and Logistic Regression.

Legal Events

Code | Title | Description
C06 | Publication |
PB01 | Publication |
C10 | Entry into substantive examination |
SE01 | Entry into force of request for substantive examination |
RJ01 | Rejection of invention patent application after publication | Application publication date: 20141029
