TY - GEN
T1 - An effective model-free Gaussian Process based online social media recommendation
AU - Xu, Jiawei
AU - Wang, Yufeng
AU - Ma, Jianhua
AU - Jin, Qun
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Nowadays, a variety of social media platforms acts as the major information portals, frequently updates the news and media pool and recommends them to dynamically arriving individuals, which requires real-time performance and immediate response to user feedback. Traditional offline/static recommendation approaches assume there exist large amount of historical interactive data, and build models from these data, which can neither deal with cold-start issue and nor fully incorporate individuals' feedbacks. Reinforcement learning based online recommendation system such as multi-armed bandit (MAB) and contextual MAB, attempts to maximize the long-term learning gains over time through exploring and exploiting the inherently interactive nature of online learning processes. However, most of work adopt the model-based paradigm, in which the feedback/reward function of individual is statically set as a specific format (e.g., linear in the features of the recommended contents and individual), which can only tailor to some specific learning objective and individual model. Without assuming the fixed reward model for each individual, this paper utilizes Gaussian process (GP) to characterize the expected feedback/reward of any recommended social media. Then, targeting at relatively accurate and noisy feedbacks of individual users, two MAB based algorithms are designed to select recommended social media to balance the effect of exploration and exploitation. The experimental results on real social media dataset Movielens demonstrate the effectiveness of our proposed methods compared with other model-based recommendation schemes.
AB - Nowadays, a variety of social media platforms acts as the major information portals, frequently updates the news and media pool and recommends them to dynamically arriving individuals, which requires real-time performance and immediate response to user feedback. Traditional offline/static recommendation approaches assume there exist large amount of historical interactive data, and build models from these data, which can neither deal with cold-start issue and nor fully incorporate individuals' feedbacks. Reinforcement learning based online recommendation system such as multi-armed bandit (MAB) and contextual MAB, attempts to maximize the long-term learning gains over time through exploring and exploiting the inherently interactive nature of online learning processes. However, most of work adopt the model-based paradigm, in which the feedback/reward function of individual is statically set as a specific format (e.g., linear in the features of the recommended contents and individual), which can only tailor to some specific learning objective and individual model. Without assuming the fixed reward model for each individual, this paper utilizes Gaussian process (GP) to characterize the expected feedback/reward of any recommended social media. Then, targeting at relatively accurate and noisy feedbacks of individual users, two MAB based algorithms are designed to select recommended social media to balance the effect of exploration and exploitation. The experimental results on real social media dataset Movielens demonstrate the effectiveness of our proposed methods compared with other model-based recommendation schemes.
KW - Gaussian Process
KW - Multi-Armed Bandit
KW - Online Recommendation
KW - Reinforcement Learning
KW - Social media
UR - http://www.scopus.com/inward/record.url?scp=85142090419&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85142090419&partnerID=8YFLogxK
U2 - 10.1109/iThings-GreenCom-CPSCom-SmartData-Cybermatics55523.2022.00085
DO - 10.1109/iThings-GreenCom-CPSCom-SmartData-Cybermatics55523.2022.00085
M3 - Conference contribution
AN - SCOPUS:85142090419
T3 - Proceedings - IEEE Congress on Cybermatics: 2022 IEEE International Conferences on Internet of Things, iThings 2022, IEEE Green Computing and Communications, GreenCom 2022, IEEE Cyber, Physical and Social Computing, CPSCom 2022 and IEEE Smart Data, SmartData 2022
SP - 374
EP - 378
BT - Proceedings - IEEE Congress on Cybermatics
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE Congress on Cybermatics: 15th IEEE International Conferences on Internet of Things, iThings 2022, 18th IEEE International Conferences on Green Computing and Communications, GreenCom 2022, 2022 IEEE International Conference on Cyber, Physical and Social Computing, CPSCom 2022 and 8th IEEE International Conference on Smart Data, SmartData 2022
Y2 - 22 August 2022 through 25 August 2022
ER -