Movatterモバイル変換


[0]ホーム

URL:


CN114647787B - A user personalized recommendation method based on multimodal data - Google Patents

A user personalized recommendation method based on multimodal data
Download PDF

Info

Publication number
CN114647787B
CN114647787BCN202210322829.8ACN202210322829ACN114647787BCN 114647787 BCN114647787 BCN 114647787BCN 202210322829 ACN202210322829 ACN 202210322829ACN 114647787 BCN114647787 BCN 114647787B
Authority
CN
China
Prior art keywords
user
recommendation
agent
log
mapping
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210322829.8A
Other languages
Chinese (zh)
Other versions
CN114647787A (en
Inventor
郭楠
傅章鹏
高天寒
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Northeastern University China
Original Assignee
Northeastern University China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Northeastern University ChinafiledCriticalNortheastern University China
Priority to CN202210322829.8ApriorityCriticalpatent/CN114647787B/en
Publication of CN114647787ApublicationCriticalpatent/CN114647787A/en
Application grantedgrantedCritical
Publication of CN114647787BpublicationCriticalpatent/CN114647787B/en
Activelegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Classifications

Landscapes

Abstract

Translated fromChinese

本发明提供一种基于多模态数据的用户个性化推荐方法,涉及网络推荐技术领域。该方法在获得用户允许收集的历史行为记录以及映像日志后,提取其中涉及到的所有对象,以及每个待推荐对象的特征,映射至同一多维空间中,以此整合多模态数据,再使用强化学习模型作为推荐系统智能体,通过收集到的用户记录训练智能体,并使用经过训练的推荐智能体进行推荐,从而实现对多模态数据的用户个性化推荐。本发明提出的方法中面向的用户历史行为记录、映像日志以及待推荐对象集等都可以包含多个领域的对象(如文本、图片、视频等),并通过特征提取后整合的方法模糊多模态之间的区别,从而解决一个传统推荐系统只能应用在单一领域进行推荐的问题。

The present invention provides a user personalized recommendation method based on multimodal data, and relates to the field of network recommendation technology. After obtaining the historical behavior records and image logs that the user allows to be collected, the method extracts all the objects involved therein, as well as the features of each object to be recommended, and maps them to the same multidimensional space, so as to integrate the multimodal data, and then uses the reinforcement learning model as the recommendation system agent, trains the agent through the collected user records, and uses the trained recommendation agent to make recommendations, thereby realizing user personalized recommendations for multimodal data. The user historical behavior records, image logs, and the set of objects to be recommended in the method proposed by the present invention can all contain objects in multiple fields (such as text, pictures, videos, etc.), and blur the distinction between multimodalities through the method of post-feature extraction integration, thereby solving the problem that a traditional recommendation system can only be applied to a single field for recommendation.

Description

User personalized recommendation method based on multi-mode data
Technical Field
The invention relates to the technical field of network recommendation, in particular to a user personalized recommendation method based on multi-mode data.
Background
In recent years, with the rapid development of the internet, the modern society has become an informationized and digitalized society, the data fills the whole world, and the information explosion has become normal. However, in the face of large amounts of data, the user's availability of information is rather reduced, i.e. information overload (Information over load) problems arise. In this regard, the recommendation system is one of key technologies for effectively solving the problem of information overload. In fact, with the rapid development of the internet, the internet of things and the cloud computing technology, the personalized recommendation system has become the standard of internet products, and internet information to be faced by internet users in the fields of e-commerce, video, news, music and the like is closely related to the recommendation system.
The recommendation system performs personalized recommendation to the user by acquiring historical behavior data of the user, such as browsing data of a webpage, purchase history or item rating, and the like. The recommendation problems are solved by a recommendation system based on content filtering, a recommendation system based on collaborative filtering and a mixed recommendation system, various recommendation algorithms using neural networks are widely applied to the realization of recommendation systems along with the rising of deep learning algorithms, such as a recurrent neural network, a convolution neural network, a generation type countermeasure network and the like, and in addition, recommendation technologies based on reinforcement learning, recommendation technologies based on heterogeneous networks and the like are also available.
Most of the current recommendation systems only provide a single-mode-based personalized recommendation problem solution for users, namely, only learning the historical data of the users in a single field (such as news, video, music and the like), so that interest preferences of the users in the single field can be captured and recommended, and diversity of recommendation results is greatly limited.
Disclosure of Invention
Aiming at the defects of the prior art, the technical problem to be solved by the invention is to provide the user personalized recommendation method based on the multi-mode data, which learns the user preference and forms the user portrait according to the history data formed by the user in various fields (such as news, video, music and the like), so as to recommend the multi-mode data and expand the diversity of recommendation results.
In order to solve the technical problems, the invention adopts the following technical scheme:
a user personalized recommendation method based on multi-mode data comprises the following steps:
Step 1, acquiring a total object set I and an object set L0 to be recommended, and definitely requiring oriented multi-mode information types.
And 2, aiming at each facing mode, applying a corresponding feature extraction algorithm, extracting features from each object in the total object set and mapping the features into the same mathematical multidimensional space S.
And step 3, acquiring and accumulating a user history behavior record and an image log. The historical behavior record is the click history record of the user before mapping, namely a clicked object list, and the mapping log is the object list displayed for the user and the click behavior of the user on the objects, wherein 1 represents clicking and 0 represents non-clicking.
And 4, initializing a recommendation system, inputting the multi-dimensional space S obtained in the step 2 and the user history behavior record and the mapping log obtained in the step 3 into the recommendation system, and setting intelligent agents of the recommendation system and reinforcement learning environment parameters.
And 5, training the intelligent agent of the recommendation system is executed.
And 6, processing the object set to be recommended L0 by using a trained recommendation system agent, simulating interaction behavior of a user facing the object set to be recommended L0 by using a recommendation system, and generating a corresponding mapping log D.
And 7, extracting an object set interacted by the user in the mapping log D, namely, a personalized interaction object set predicted by the recommendation system agent corresponding to the user, as a total recommendation list L.
And 8, processing the total recommendation list L according to different requirements to generate a multi-mode recommendation list.
And 9, recommending the generated multi-mode recommendation list to obtain a specific user interaction result, thereby generating an image log. The agent is further trained and updated each time the log accumulates to a given n.
The user personalized recommendation method based on the multi-mode data has the advantages that after the historical behavior record and the mapping log which are allowed to be collected by the user are obtained, all the related objects and the characteristics of each object to be recommended are extracted and mapped into the same multi-dimensional space, the multi-mode data are integrated, the reinforcement learning model is used as an intelligent body of a recommendation system, the collected user records the training intelligent body, and the trained recommendation intelligent body is used for recommendation, so that user personalized recommendation of the multi-mode data is achieved. Compared with the prior art, the user history behavior record, the mapping log, the object set to be recommended and the like in the method provided by the invention can contain objects (such as texts, pictures, videos and the like) in a plurality of fields, and the distinction between multiple modes is blurred by the method of integrating the extracted features, so that the problem that a traditional recommendation system can only be applied to recommendation in a single field is solved.
Drawings
FIG. 1 is a flowchart of a recommendation method according to an embodiment of the present invention;
Fig. 2 is a flowchart of a training method of an agent (rainbow model) according to an embodiment of the present invention.
Detailed Description
The following describes in further detail the embodiments of the present invention with reference to the drawings and examples. The following examples are illustrative of the invention and are not intended to limit the scope of the invention.
As shown in fig. 1, the user personalized recommendation method based on the multimodal data in this embodiment is as follows.
Step 1, acquiring a total object set I and an object set L0 to be recommended, and definitely requiring oriented multi-mode information types.
And 2, aiming at each facing mode, applying a corresponding feature extraction algorithm, extracting features from each object in the total object set and mapping the features into the same mathematical multidimensional space S. For example, the image uses CNN or RNN convolutional neural network to extract features, the text uses tf-idf algorithm or CNN convolutional neural network to extract features, the video uses P3D (Pseudo-3D Residual Networks) residual network to extract features, the audio uses Discrete Wavelet Transform (DWT) or Perceptual Linear Prediction (PLP) algorithm to extract features, etc.
And step 3, acquiring and accumulating a user history behavior record and an image log. The historical behavior record is the click history record of the user before mapping, namely a clicked object list, and the mapping log is the object list displayed for the user and the click behavior of the user on the objects, wherein 1 represents clicking and 0 represents non-clicking.
And 4, initializing a recommendation system, inputting the multi-dimensional space S obtained in the step 2 and the user history behavior record and the mapping log obtained in the step 3 into the recommendation system, and setting intelligent agents of the recommendation system and reinforcement learning environment parameters.
Step 5, training of the recommendation system agent is executed, and as shown in fig. 2, the process is as follows:
and 5.1, the agent requests a user related record from the user history behavior record and the image log obtained in the step 3.
And 5.2, returning the record through the memory, and replacing the corresponding object with the characteristic representation through the result of the step 2.
And 5.3, forming a user portrait by the agent according to the historical behavior record of the user, sequentially executing images according to the user portrait, obtaining rewards by the agent if the action of the agent is consistent with the real log, and obtaining no rewards or punishments if the action of the agent is inconsistent with the real log.
And 5.4, after all the entries in the user mapping log are executed, finishing the reinforcement learning Frame once, judging that if the training termination condition in the reinforcement learning environment parameters set in the step 4 is met, finishing the training of the intelligent agent, and turning to the step 6, otherwise, returning to the step 5.1.
And 6, processing the object set to be recommended L0 by using a trained recommendation system agent, simulating interaction behavior of a user facing the object set to be recommended L0 by using a recommendation system, and generating a corresponding mapping log D.
And 7, extracting an object set interacted by the user in the mapping log D, namely, a personalized interaction object set predicted by the recommendation system agent corresponding to the user, as a total recommendation list L.
Step 8, processing the total recommendation list L according to different requirements to generate a multi-mode recommendation list;
When the extensive interests of the user in multiple fields need to be mined or the recommendation result modes need to be diversified, the total recommendation list L can be simply ordered and output according to specific scene requirements, and a multi-mode recommendation list L1 is generated.
When a user needs to make a recommendation in other fields (such as recommending text hyperlinks for news and movies) according to a certain object, other mode objects closest to the current object can be selected from the multidimensional space S according to the nearest neighbor principle, and a multimodal recommendation list L2 is generated.
When a user needs to be recommended more accurately and effectively by combining knowledge expressions of different levels, the features of different modes are fused by using a multi-mode fusion (multimodal fusion) algorithm, and an agent is trained based on the fused features, so that a recommendation list L3 with multi-mode expressions for the same object, such as a news headline is generated.
And 9, recommending the generated multi-mode recommendation list to obtain a specific user interaction result, thereby generating an image log. The agent is further trained and updated each time the log accumulates to a given n.
The method of the present invention is sequentially executed based on MIcrosoft NEWS DATASET (MIND) datasets:
Step 1, since the MIND data set only contains news text data and does not meet the requirement of multi-mode tasks, corresponding picture data in news is crawled through spiderFlow crawler tools (https:// gitub. Com/chenyuansgit/spiderFlow) according to news links provided in the MIND data set, and a picture data set corresponding to the news text is formed. Therefore, the obtained total object set I is all news texts and news pictures, the object set L0 to be recommended is the mapping log news texts and corresponding news picture sets in the test set, and the multi-mode information types which are definitely required to be oriented are texts and images.
And 2, respectively applying a corresponding feature extraction algorithm to two modes of a facing text and an image, extracting basic text features from news text features through Tf-idf item weighting, and then carrying out feature selection through a chi-square test algorithm to reduce the obtained basic text feature dimension, so that the model generalization capability is stronger, the overfitting is reduced, and the understanding between the features and the feature values is enhanced. The news picture features are extracted through a pre-trained VGG16 neural network in keras library, and finally the extracted features of each object in the total object set are mapped into the same mathematical multidimensional space S through a umap algorithm (https:// arxiv. Org/abs/1802.03426).
And 3, reading the accumulated user history behavior record and image log in the data set.
Initializing a recommendation system, inputting the multi-dimensional space S obtained in the step 2 and the image log obtained in the step 3 into the recommendation system, setting an intelligent body of the recommendation system and a reinforcement learning environment parameter (a rainbow model is taken as an example), wherein the action space is an integer of 0-10, the probability of predicting the object to be clicked is divided by 10, the state space is a multi-dimensional matrix for reflecting the characteristics of the user portrait and the current object to be determined, and rewarding is determined by auc indexes obtained by the prediction result output by the image log.
Step 5, training of the recommendation system agent is executed, and as shown in fig. 2, the process is as follows:
step 5.1, the agent requests a user related record from the collection obtained in the step 3;
Step 5.2, the memory returns a record, and the corresponding object is replaced by the characteristic representation according to the result of the step 2;
Step 5.3, the agent clusters and separates user preference characteristics by using a Mean-Shift algorithm according to the history record of the user to form a user portrait, predicts the click rate of the user on the object in the mapping log one by one according to the obtained user portrait, and awards the agent according to auc indexes obtained by calculation of the prediction result and the real record;
and 5.4, after all the entries in the user mapping log are executed, finishing the reinforcement learning Frame once, judging that if the training termination condition in the reinforcement learning environment parameters set in the step 4 is met, finishing the training of the agent, otherwise, returning to the step 5.1.
And 6, processing the mapping log in the test set by using the trained recommendation system agent, predicting the click probability of the user on each object according to the user history behavior record in the test set, calculating auc indexes, and recording the prediction result D.
And 7, using the high click probability object in the prediction result D as a medium recommendation list L.
And 8, processing the total recommendation list L according to different requirements to generate a multi-mode recommendation list, wherein the following requirement processing process comprises the following steps:
When the extensive interests of the user in multiple fields need to be mined or the recommendation result modes need to be diversified, the total recommendation list L can be simply ordered and output according to specific scene requirements, and a multi-mode recommendation list L1 is generated.
When it is required to recommend the user in other fields (such as news map matching, or news hyperlink recommending for pictures, etc.) according to a certain object, other mode objects closest to the current object can be selected from the multidimensional space S according to the nearest neighbor principle, and a multimodal recommendation list L2 is generated.
And 9, recommending the generated multi-mode recommendation list to obtain a specific user interaction result, thereby generating an image log. The agent may be further trained and updated each time the log accumulates to a given n.
Through the steps, the finally obtained experimental result indexes are shown in table 1 (step 5 adopts different recommended models respectively).
TABLE 1 results of various recommended model experiments
By adopting the reinforcement learning model disclosed by the invention as a recommendation system agent for recommendation, a test is carried out on MIcrosoft NEWS DATASET (MIND) data sets, and the result shows that the reinforcement learning agent has similar performance indexes as the traditional recommendation model, namely, can be qualified for recommendation tasks. The data of the MIND data set on the picture mode is expanded by the crawler to be a data set of a text-picture mixed mode, and further test is carried out, so that various indexes of the invention can meet the personalized recommendation requirement of users for multi-mode data from the test result, and the requirement of a recommendation system is met.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will understand that they can make modifications to the technical solutions described in the above-mentioned embodiments or make equivalent substitutions of some or all of the technical features, without departing from the essence of the corresponding technical solutions from the scope of the invention defined by the claims.

Claims (6)

CN202210322829.8A2022-03-302022-03-30 A user personalized recommendation method based on multimodal dataActiveCN114647787B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN202210322829.8ACN114647787B (en)2022-03-302022-03-30 A user personalized recommendation method based on multimodal data

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN202210322829.8ACN114647787B (en)2022-03-302022-03-30 A user personalized recommendation method based on multimodal data

Publications (2)

Publication NumberPublication Date
CN114647787A CN114647787A (en)2022-06-21
CN114647787Btrue CN114647787B (en)2025-02-18

Family

ID=81994734

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN202210322829.8AActiveCN114647787B (en)2022-03-302022-03-30 A user personalized recommendation method based on multimodal data

Country Status (1)

CountryLink
CN (1)CN114647787B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN116702094B (en)*2023-08-012023-12-22国家计算机网络与信息安全管理中心Group application preference feature representation method
CN120408203B (en)*2025-06-302025-09-09上海智融政通科技发展有限公司Personalized digital device training generation method based on multi-mode information fusion

Citations (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN107423442A (en)*2017-08-072017-12-01火烈鸟网络(广州)股份有限公司 Application recommendation method and system based on user portrait behavior analysis, storage medium and computer equipment
CN113010783A (en)*2021-03-172021-06-22华南理工大学Medical recommendation method, system and medium based on multi-modal cardiovascular disease information

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN104462560B (en)*2014-12-252018-01-05广东电子工业研究院有限公司A kind of recommendation method of personalized recommendation system
CN113449183B (en)*2021-06-182022-07-12华中科技大学Interactive recommendation method and system based on offline user environment and dynamic rewards
CN113704615A (en)*2021-08-302021-11-26万汇互联(深圳)科技有限公司Deep interest network recommendation method based on multiple modes

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN107423442A (en)*2017-08-072017-12-01火烈鸟网络(广州)股份有限公司 Application recommendation method and system based on user portrait behavior analysis, storage medium and computer equipment
CN113010783A (en)*2021-03-172021-06-22华南理工大学Medical recommendation method, system and medium based on multi-modal cardiovascular disease information

Also Published As

Publication numberPublication date
CN114647787A (en)2022-06-21

Similar Documents

PublicationPublication DateTitle
CN111581510B (en)Shared content processing method, device, computer equipment and storage medium
US10783361B2 (en)Predictive analysis of target behaviors utilizing RNN-based user embeddings
CN118467851B (en)Artificial intelligent data searching and distributing method and system
US10459975B1 (en)Method and system for creating an automatic video summary
EP4180991A1 (en)Neural network distillation method and apparatus
US20240380822A1 (en)Multi-modal hypergraph-based click prediction
CN104317835B (en)The new user of video terminal recommends method
CN114647787B (en) A user personalized recommendation method based on multimodal data
US20240311421A1 (en)Multiple Dataset Search Based On a Visual Query
Fakhfakh et al.Deep learning-based recommendation: Current issues and challenges
CN118551100A (en)Network user two-stage recommendation method and device based on large language model optimization
CN115860780B (en)Transaction potential user accurate analysis recommendation matching method and system
CN115640449A (en)Media object recommendation method and device, computer equipment and storage medium
Li et al.From edge data to recommendation: A double attention-based deformable convolutional network
CN115964560A (en)Information recommendation method and equipment based on multi-mode pre-training model
Patil et al.Micro video recommendation in multimodality using dual-perception and gated recurrent graph neural network
CN115221352A (en)Big data short video recommendation system based on collaborative filtering algorithm
CN111026910A (en)Video recommendation method and device, electronic equipment and computer-readable storage medium
CN115482021A (en)Multimedia information recommendation method and device, electronic equipment and storage medium
CN117909542A (en)Video recommendation method, device, equipment and storage medium
Leander et al.Optimizing a Personalized Movie Recommendation System with Support Vector Machine and Content-Based Filtering
WO2024061073A1 (en)Multimedia information generation method and apparatus, and computer-readable storage medium
CN117216361A (en)Recommendation method, recommendation device, electronic equipment and computer readable storage medium
CN116932862A (en)Cold start object recommendation method, cold start object recommendation device, computer equipment and storage medium
US12288237B2 (en)Online inference and learning for nonsymmetric determinantal point processes

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
GR01Patent grant
GR01Patent grant

[8]ページ先頭

©2009-2025 Movatter.jp