本文已被:浏览 799次 下载 324次
投稿时间:2021-09-27 修订日期:2022-03-29
投稿时间:2021-09-27 修订日期:2022-03-29
中文摘要: 推荐系统能够有效缓解互联网的迅猛发展带来的信息过载问题,但欺诈攻击的存在制约了推荐系统的健康发展,因此如何准确、高效地检测欺诈攻击是推荐系统安全领域的重要问题。已有检测方法往往依赖专家知识人工提取检测特征或基于深度学习自动获取某一视角下的检测特征,在此基础上通过硬分类找出攻击用户,导致检测性能不佳。因此,本文同时考虑多视角下的特征自动提取,引入模糊决策,提出了一种基于CNN和犹豫模糊集的欺诈攻击检测方法(简称CNN-HFS)。首先,对每个用户分别从评分值、评分偏好和评分时间视角抽取3个行为矩阵,利用双三次插值法对3个矩阵进行缩放得到对应的密集评分矩阵、密集偏好矩阵和密集时间矩阵;然后,将每个用户任意视角下的缩放矩阵视为一个图像,在3个不同视角下分别训练CNN,并计算任意用户在每个视角下属于攻击用户类的隶属度;最后,引入模糊犹豫集对多视角下的检测结果进行综合决策,根据决策结果识别出攻击用户。为了验证CNN-HFS的有效性,选取SVM-TIA、CoDetector、CNN-SAD、SDAEs-PCA、CNN-R、CNN-P和CNN-T作为对比方法,在MovieLens 1M和Amazon数据集上对精确率、召回率和F1-measure值3个评价指标进行实验评估。实验结果表明,本文所提方法在3个评价指标上明显优于其他7种对比方法,可以获得更高的检测性能。
Abstract:Recommender systems can effectively alleviate the problem of information overload caused by the rapid development of the Internet. However, the occurrence of shilling attacks restricts the healthy development of recommender systems. Therefore, how to detect shilling attacks accurately and efficiently is an important problem in the field of recommender systems security. The existing detection methods usually design hand-crafted detection features based on expert knowledge or automatically learn features from a single perspective using deep learning, then the attack users are identified according to the extracted features by hard classification, resulting in the poor detection performance. By automatically learning features from multiple perspectives and introducing a hesitant fuzzy decision, a novel detection method based on CNN and hesitant fuzzy set was proposed and named CNN-HFS. Firstly, for each user, three behavior matrices were extracted from the perspectives of rating, preference and rating time, respectively. To reduce the influence of data sparse, these matrices were scaled by bicubic interpolation to correspondingly obtain a dense rating matrix, a dense preference matrix and a dense time matrix. Next, each scaling matrix of users was regarded as an image, and three different CNN classifiers were trained based on these scaling matrices in three different views respectively. For each user, three membership degrees to the classifier of attack users were calculated. Finally, a fuzzy hesitant set was introduced to make a comprehensive decision, and the attack users were identified according to the decision results. To validate the effectiveness of the proposed CNN-HFS, the extensive experiments were conducted on the MovieLens 1M and Amazon datasets. The evaluation metrics of precision, recall and F1-measure were used to compare the proposed method with SVM-TIA, CoDetector, CNN-SAD, SDAEs-PCA, CNN-R, CNN-P and CNN-T. The experimental results showed that the proposed method is superior to seven baseline methods in terms of three detection metrics and achieves an excellent detection performance under various attacks.
文章编号:202100979 中图分类号:TP393 文献标志码:
基金项目:河北省自然科学基金项目(F2020201023);河北省高等学校科学技术研究项目(ZD2022105);河北大学高层次人才科研启动项目(521100221089)
作者简介:第一作者:蔡红云(1980—),女,副教授,硕士生导师,博士. 研究方向:信任模型;推荐系统安全. E-mail:caihongyun@hbu.cn;通信作者:袁世林, E-mail:yuanshilin_hbu@163.com
引用文本:
蔡红云,袁世林,温玉,任继超,孟洁.基于CNN和犹豫模糊决策的欺诈攻击检测[J].工程科学与技术,2022,54(3):80-90.
CAI Hongyun,YUAN Shilin,WEN Yu,REN Jichao,MENG Jie.Shilling Attacks Detection Based on CNN and Hesitant Fuzzy Sets[J].Advanced Engineering Sciences,2022,54(3):80-90.
引用文本:
蔡红云,袁世林,温玉,任继超,孟洁.基于CNN和犹豫模糊决策的欺诈攻击检测[J].工程科学与技术,2022,54(3):80-90.
CAI Hongyun,YUAN Shilin,WEN Yu,REN Jichao,MENG Jie.Shilling Attacks Detection Based on CNN and Hesitant Fuzzy Sets[J].Advanced Engineering Sciences,2022,54(3):80-90.