Big data platform based online comment validity recommendation methodTechnical Field
The invention belongs to the technical field of electronic commerce application, and particularly relates to a recommendation method for online comment validity based on a big data platform.
Background
With the rapid development of society and economy, especially the unprecedented prosperity of internet technology, the life style of people is changed, and more consumers tend to shop online. Electronic commerce websites such as Taobao, Jingdong, Shuduo, Amazon and the like are rapidly expanded, and the number of commodities and the scale of users are in an explosive growth trend. Meanwhile, with the rapid development of the internet and information technology, the original electronic commerce mode taking the enterprise as a center is changed into social electronic commerce taking consumers as the center, the consumers can actively transmit information instead of passively receiving product information released by enterprises, and the data shows an explosive growth trend when entering a big data era globally. Consumers and enterprises enjoy abundant network information to bring convenience, and meanwhile suffer from the trouble caused by the problem of information overload.
In the face of the full-line commodities on the existing e-commerce websites, countless users make countless comments aiming at the feelings of the users, and 'information overload' of comment information is caused. In the face of such a thick information gold mine, the existing e-commerce website expects to efficiently extract valuable comments from the information gold mine and present the comments to the user, so as to help the user to know commodity information and shorten the purchasing decision time of the user. Meanwhile, as the security and convenience of mobile payment are greatly improved, more and more users shop through mobile terminals, and the user wants to acquire more commodity information by browsing as few comments as possible in consideration of the limitations of actual scenes and screen sizes, so that if the optimal comments are recommended to the user, the recommendation of the optimal comments becomes the direction of the current e-commerce website key research.
Disclosure of Invention
Aiming at the technical problems existing in the overload of the comment information, the invention provides the recommendation method based on the online comment validity of the big data platform, which has the advantages of reasonable design and simple method and can effectively recommend the optimal comment.
In order to achieve the purpose, the invention adopts the technical scheme that the invention provides a recommendation method of online comment validity based on a big data platform, which comprises the following effective steps:
A. firstly, extracting corresponding keywords according to questions of consumers in the similar products to the products;
B. formulating a comment set of the commodity according to the keywords and all comments of the extracted keywords on a certain commodity;
C. assigning a time to publication for each comment, wherein publication within three days is 1, publication within one week is 0.9, publication within two weeks is 0.8, publication within three weeks is 0.7, publication within one month is 0.6, publication within three months is 0.5, publication within half a year is 0.3, publication within one year is 0.1;
D. according to the formula:
wherein, CgFor the recommended value of a certain comment of the commodity, T is the time value of the comment, k is all keywords contained in the comment, TC is the total number of the comment of the commodity, and KCiThe number of comments for a certain keyword;
E. and finally, recommending according to the magnitude of the recommended value obtained by calculation in the order from large to small.
Preferably, in the step B, the comment includes an initial comment and an additional comment.
Preferably, in the step C, when one comment has an additional comment, the additional comment is assigned with the posting time of the additional comment.
Preferably, in the step D, k is all keywords except the logistics information in the review.
Preferably, in the step E, comments with less consistent keywords and less scores in the comments are removed in a collaborative filtering manner, and the comments with scores later are referred to.
Preferably, in the step E, comments with the same keyword in the comments are removed in a collaborative filtering manner, and only the first five comments with higher scores are included.
Compared with the prior art, the invention has the advantages and positive effects that,
1. the invention provides a recommendation method for online comment validity based on a big data platform, which analyzes the validity of comments by taking the information of questions asked by a consumer to a merchant as a keyword through consideration from the perspective of the consumer, further solves the problems of the existing comments, and is simple, safe, reliable and suitable for large-scale popularization and use.
Detailed Description
In order that the above objects, features and advantages of the present invention can be more clearly understood, the present invention will be further described with reference to the following examples. It should be noted that the embodiments and features of the embodiments of the present application may be combined with each other without conflict.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, however, the present invention may be practiced in other ways than those specifically described herein, and thus the present invention is not limited to the specific embodiments of the present disclosure.
Embodiment 1, this embodiment provides a recommendation method for online review validity based on a big data platform,
firstly, extracting corresponding keywords according to questions of consumers to the products in the same kind of products, in order to reduce unnecessary disputes, the electronic commerce website increases the question function of the customers to the products, the purpose of reducing the comments watched by the consumers is achieved by customer service or the answers of the consumers to the questions, the problems of brand problems or prices and the like are considered, the problems belong to special problems of the products, the special problems are probably not mainly reflected in the comments of some consumers, the problems are probably of different meanings to other consumers, therefore, the keywords of all the problems of the same kind of products are extracted, the products can be more comprehensively displayed, and the consumers can learn the products from the perspective of the consumers.
And then, establishing a comment set of the commodity according to all comments of the extracted keywords to the certain commodity and the keywords, wherein the main purpose of establishing the comment set is to determine the proportion of the keywords in the commodity, if one keyword appears in a plurality of comments, the keyword belongs to the point which is intended by a consumer for the commodity, and certainly, one comment may appear in different comment sets.
Considering that the updating of some commodities is fast, therefore, the influence of the reviews with the closer publication time on consumers is larger, and therefore, the publication time of each review is assigned, wherein the publication time within three days is 1, the publication time within one week is 0.9, the publication time within two weeks is 0.8, the publication time within three weeks is 0.7, the publication time within one month is 0.6, the publication time within three months is 0.5, the publication time within half year is 0.3, and the publication time within one year is 0.1.
Then, calculating a recommended value of a certain comment of the commodity according to a formula:
wherein, CgFor the recommended value of a certain comment of the commodity, T is the time value of the comment, k is all keywords contained in the comment, TC is the total number of the comment of the commodity, and KCiConsidering that logistics may be the most concerned problem of consumers in electronic commerce transactions, the logistics percentage is the highest among all reviews, and thus some excellent reviews which do not record logistics information may have errors, and k is all keywords excluding logistics information in the reviews. Thus, given the removal of the logistic score, the recommendation for a good will have a time and review proportion.
And finally, recommending according to the order from large to small according to the size of the recommended value obtained by calculation, considering that the comments with the same keyword are possibly many, and all the comments displayed in the ranking recommendation mode are the same type of comments. Collaborative filtering recommendations are rapidly becoming a very popular technology in information filtering and information systems. Different from the traditional recommendation based on content filtering and direct content analysis, the method is characterized in that the interest of the user is analyzed through collaborative filtering, similar (interested) users of the specified user are found in the user group, and the preference degree prediction of the specified user on the information is formed through the evaluation of the similar users on the information. Therefore, similar comments can be selected through the collaborative filtering algorithm, then the ranked similar comments are not ranked any more, and other comments in the later place are forwarded, so that the comprehensiveness of the comments is facilitated.
Considering that the comment of the same user may have additional comments, for this reason, the comment includes the initial comment and the additional comment. When one comment has an additional comment, the additional comment is assigned with the posting time of the additional comment. In the comment recommendation value calculation, the initial comment and the additional comment are determined as the entire comment.
Through the arrangement, the keywords are extracted from the viewpoint of the consumer, the incompleteness of extracting the keywords by the existing merchants is solved, the calculation complexity of extracting the keywords from all the comments is solved, in addition, the scores obtained by adding time assignment to the percentage scores of the keywords in all the comments are determined to be the recommended value of a certain comment, the accuracy and the comprehensiveness of the recommended value are further ensured, and the effectiveness of online comment recommendation is guaranteed.
The above description is only a preferred embodiment of the present invention, and not intended to limit the present invention in other forms, and any person skilled in the art may apply the above modifications or changes to the equivalent embodiments with equivalent changes, without departing from the technical spirit of the present invention, and any simple modification, equivalent change and change made to the above embodiments according to the technical spirit of the present invention still belong to the protection scope of the technical spirit of the present invention.