Machine Learning Approaches for Fake Reviews Detection: A Systematic Literature Review

Authors

  • Mohammed Ennaouri Software Project Management Team, Mohammed V University in Rabat, High National School for Computer Science and Systems Analysis. Rabat, Morocco
  • Ahmed Zellou Software Project Management Team, Mohammed V University in Rabat, High National School for Computer Science and Systems Analysis. Rabat, Morocco

DOI:

https://doi.org/10.13052/jwe1540-9589.2254

Keywords:

Fake reviews, opinion spam, spam reviews, machine learning

Abstract

These days, most people refer to user reviews to purchase an online product. Unfortunately, spammers exploit this situation by posting deceptive reviews and misleading consumers either to promote a product with poor quality or to demote a brand and damage its reputation. Among the solutions to this problem is human verification. Unfortunately, the real-time nature of fake reviews makes the task more difficult, especially on e-commerce platforms. The purpose of this study is to conduct a systematic literature review to analyze solutions put out by researchers who have worked on setting up an automatic and efficient framework to identify fake reviews, unsolved problems in the domain, and the future research direction. Our findings emphasize the importance of the use of certain features and provide researchers and practitioners with insights on proposed solutions and their limitations. Thus, the findings of the study reveals that most approaches focus on sentiment analysis, opinion mining and, in particular, machine learning (ML), which contributes to the development of more powerful models that can significantly solve the problem and thus enhance further the accuracy and efficiency of detecting fake reviews.

Downloads

Download data is not yet available.

Author Biographies

Mohammed Ennaouri, Software Project Management Team, Mohammed V University in Rabat, High National School for Computer Science and Systems Analysis. Rabat, Morocco

Mohammed Ennaouri is a senior researcher in the High National School for Computer Science and Systems Analysis (ENSIAS), Rabat, Morocco. He received his Master’s degree in Internet of things: software and analytics in 2021. He is currently working as a teacher in secondary school. His research interest includes fake news, machine learning algorithms and recommender Systems.

Ahmed Zellou, Software Project Management Team, Mohammed V University in Rabat, High National School for Computer Science and Systems Analysis. Rabat, Morocco

Ahmed Zellou Received his Ph.D. in Applied Sciences at the Mohammedia School of Engineers, Mohammed V University, Rabat, Morocco 2008, his habilitation to supervise research work in 2014. He becomes full professor in 2020. His research interests include interoperability, mediation systems, distributed computing, data, indexing, recommander systems, data quality, and semantic web where he is the author/coauthor of over 100 research publications.

References

Chen, Cheng, Wu, Kui, Srinivasan, Venkatesh and Zhang, Xudong. (2011). Battling the Internet Water Army: Detection of Hidden Paid Posters. 10.1145/2492517.2492637.

Mukherjee, A., Venkataraman, V., Liu, B., and Glance, N. (2013, June). What yelp fake review filter might be doing?. In Proceedings of the international AAAI conference on web and social media (Vol. 7, No. 1).

Kitchenham, Barbara and Charters, Stuart. (2007). Guidelines for performing Systematic Literature Reviews in Software Engineering. 2.

J. K. Rout, A. K. Dash and N. K. Ray, “A Framework for Fake Review Detection: Issues and Challenges,” 2018 International Conference on Information Technology (ICIT), 2018, pp. 7–10, doi: 10.1109/ICIT.2018.00014.

Martinez-Torres, Rocio and Toral, S.L. (2019). A machine learning approach for the identification of the deceptive reviews in the hospitality sector using unique attributes and sentiment orientation. Tourism Management. 75. 393–403. doi: 10.1016/j.tourman.2019.06.003.

Siagian, Al Hafiz and Aritsugi, Masayoshi. (2020). Robustness of Word and Character N-gram Combinations in Detecting Deceptive and Truthful Opinions. Journal of Data and Information Quality. 12. 1–24. 10.1145/3349536.

J. Wang, H. Kan, F. Meng, Q. Mu, G. Shi and X. Xiao, “Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training,” in IEEE Access, vol. 8, pp. 182625–182639, 2020, doi: 10.1109/ACCESS.2020.3028588.

Aiyar, Shreyas and Shetty, Nisha. (2018). N-Gram Assisted Youtube Spam Comment Detection. Procedia Computer Science. 132. 174–182. doi: 10.1016/j.procs.2018.05.181.

S. Jamshidi Nejad, F. Ahmadi-Abkenari and P. Bayat, “Opinion Spam Detection based on Supervised Sentiment Analysis Approach,” 2020 10th International Conference on Computer and Knowledge Engineering (ICCKE), 2020, pp. 209–214, doi: 10.1109/ICCKE50421.2020.9303677.

X. Wang, X. Zhang, C. Jiang and H. Liu, “Identification of fake reviews using semantic and behavioral features,” 2018 4th International Conference on Information Management (ICIM), 2018, pp. 92–97, doi: 10.1109/INFOMAN.2018.8392816.

Noekhah, Shirin, Salim, Naomie and Zakaria, Nor. (2019). Opinion spam detection: Using multi-iterative graph-based model. Information Processing & Management. 57. 21. doi: 10.1016/j.ipm.2019.102140.

Liu, Yuxin, Wang, Li, Shi, Tengfei and Li, Jinyan. (2021). Detection of spam reviews through a hierarchical attention architecture with N-gram CNN and Bi-LSTM. Information Systems. 103. 101865. doi: 10.1016/j.is.2021.101865.

K. Archchitha and E. Y. A. Charles, “Opinion Spam Detection in Online Reviews Using Neural Networks,” 2019 19th International Conference on Advances in ICT for Emerging Regions (ICTer), 2019, pp. 1–6, doi: 10.1109/ICTer48817.2019.9023695.

G. M. Shahariar, S. Biswas, F. Omar, F. M. Shah and S. Binte Hassan, “Spam Review Detection Using Deep Learning,” 2019 IEEE 10th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON), 2019, pp. 0027–0033, doi: 10.1109/IEMCON.2019.8936148.

Mewada, Arvind and Dewang, Rupesh. (2021). Deceptive reviewer detection by analyzing web data using HMM and similarity measures. Materials Today: Proceedings. doi: 10.1016/j.matpr.2020.12.1126.

Guanxiong Huang, Hai Liang, Uncovering the effects of textual features on trustworthiness of online consumer reviews: A computational-experimental approach, Journal of Business Research, Volume 126, 2021, Pages 1–11, ISSN 0148-2963, https://doi.org/10.1016/j.jbusres.2020.12.052. (https://www.sciencedirect.com/science/article/pii/S014829632030881X).

Y. Gao, M. Gong, Y. Xie and A. K. Qin, “An Attention-Based Unsupervised Adversarial Model for Movie Review Spam Detection,” in IEEE Transactions on Multimedia, vol. 23, pp. 784–796, 2021, doi: 10.1109/TMM.2020.2990085.

Ziyang Wang, Wei Wei, Xian-Ling Mao, Guibing Guo, Pan Zhou, Sheng Jiang, User-based network embedding for opinion spammer detection, Pattern Recognition, Volume 125, 2022, 108512, ISSN 0031-3203, https://doi.org/10.1016/j.patcog.2021.108512. (https://www.sciencedirect.com/science/article/pii/S0031320321006889).

G. Xu, M. Hu, C. Ma and M. Daneshmand, “GSCPM: CPM-Based Group Spamming Detection in Online Product Reviews,” ICC 2019 - 2019 IEEE International Conference on Communications (ICC), 2019, pp. 1–6, doi: 10.1109/ICC.2019.8761650.

C. Yılmaz, “SPR2EP: A Semi-Supervised Spam Review Detection Framework,” 2018, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/54601.

D. A. Navastara, A. A. Zaqiyah and C. Fatichah, “Opinion Spam Detection in Product Reviews Using Self-Training Semi-Supervised Learning Approach,” 2019 International Conference on Advanced Mechatronics, Intelligent Manufacture and Industrial Automation (ICAMIMIA), 2019, pp. 169–173, doi: 10.1109/ICAMIMIA47173.2019.9223407.

Tian, Yingjie, Mirzabagheri, Mahboubeh, Tirandazi, Peyman, Hosseini Bamakan, Seyed Mojtaba. (2020). A non-convex semi-supervised approach to opinion spam detection by ramp-one class SVM. Information Processing & Management. 57. 102381. doi: 10.1016/j.ipm.2020.102381.

Ligthart, Alexander, Catal, Cagatay and Tekinerdogan, Bedir. (2021). Analyzing the effectiveness of semi-supervised learning approaches for opinion spam classification. Applied Soft Computing. 101. 107023. doi: 10.1016/j.asoc.2020.107023.

R. Hassan and M. R. Islam, “Detection of fake online reviews using semi-supervised and supervised learning,” 2019 International Conference on Electrical, Computer and Communication Engineering (ECCE), 2019, pp. 1–5, doi: 10.1109/ECACE.2019.8679186.

L. Zhang, Z. Wu and J. Cao, “Detecting Spammer Groups From Product Reviews: A Partially Supervised Learning Model,” in IEEE Access, vol. 6, pp. 2559–2568, 2018, doi: 10.1109/ACCESS.2017.2784370.

Liu, Yuanchao, Pang, Bo and Wang, Xiaolong. (2019). Opinion Spam Detection by Incorporating Multimodal Embedded Representation into a Probabilistic Review Graph. Neurocomputing. 366. doi: 10.1016/j.neucom.2019.08.013.

Budhi, Gregorius, Chiong, Raymond, Wang, Zuli and Dhakal, Sandeep. (2021). Using a Hybrid Content-Based and Behaviour-Based Featuring Approach in a Parallel Environment to Detect Fake Reviews. Electronic Commerce Research and Applications. 47. 101048. doi: 10.1016/j.elerap.2021.101048.

Luca, Michael and Zervas, Georgios. (2013). Fake It Till You Make It: Reputation, Competition, and Yelp Review Fraud. SSRN Electronic Journal. 62. doi: 10.2139/ssrn.2293164.

Ott, Myle, Choi, Yejin, Cardie, Claire and Hancock, Jeffrey. (2011). Finding Deceptive Opinion Spam by Any Stretch of the Imagination.

Luca, Michael and Zervas, Georgios. (2013). Fake It Till You Make It: Reputation, Competition, and Yelp Review Fraud. SSRN Electronic Journal. 62. doi: 10.2139/ssrn.2293164.

Harsh Taneja, Supreet Kaur, An ensemble classification model for fake feedback detection using proposed labeled CloudArmor dataset, Computers & Electrical Engineering, Volume 93, 2021, 107217, ISSN 0045-7906, https://doi.org/10.1016/j.compeleceng.2021.107217. (https://www.sciencedirect.com/science/article/pii/S004579062100210X).

R. Hassan and M. R. Islam, “A Supervised Machine Learning Approach to Detect Fake Online Reviews,” 2020 23rd International Conference on Computer and Information Technology (ICCIT), 2020, pp. 1–6, doi: 10.1109/ICCIT51783.2020.9392727.

Neisari, Ashraf, Rueda, Luis and Saad, Sherif. (2021). Spam Review Detection Using Self-organizing Maps and Convolutional Neural Networks. Computers & Security. 106. 102274. doi: 10.1016/j.cose.2021.102274.

Ott, Myle, Claire Cardie and Jeffrey T. Hancock. “Negative Deceptive Opinion Spam.” NAACL (2013).

N. Jnoub and W. Klas, “Declarative Programming Approach for Fake Review Detection,” 2020 15th International Workshop on Semantic and Social Media Adaptation and Personalization (SMA, 2020, pp. 1–7, doi: 10.1109/SMAP49528.2020.9248468.

Downloads

Published

2023-12-21

How to Cite

Ennaouri, M. ., & Zellou, A. . (2023). Machine Learning Approaches for Fake Reviews Detection: A Systematic Literature Review. Journal of Web Engineering, 22(05), 821–848. https://doi.org/10.13052/jwe1540-9589.2254

Issue

Section

Articles