Affiliation
1. Faculty of Electronics and Information Technology, Warsaw University of Technology, Nowowiejska 15/19, 00-665 Warsaw, Poland
Abstract
Many customers rely on online reviews to make an informed decision about purchasing products and services. Unfortunately, fake reviews, which can mislead customers, are increasingly common. Therefore, there is a growing need for effective methods of detection. In this article, we present a case study showing research aimed at recognizing fake reviews in Google Maps places in Poland. First, we describe a method of construction and validation of a dataset, named GMR–PL (Google Maps Reviews—Polish), containing a selection of 18 thousand fake and genuine reviews in Polish. Next, we show how we used this dataset to train machine learning models to detect fake reviews and the accounts that published them. We also propose a novel metric for measuring the typicality of an account name and a metric for measuring the geographical dispersion of reviewed places. Initial recognition results were promising: we achieved an F1 score of 0.92 and 0.74 when detecting fake accounts and reviews, respectively. We believe that our experience will help in creating real-life review datasets for other languages and, in turn, will help in research aimed at the detection of fake reviews on the Internet.
Funder
EU POWER Program
Polish Ministry of Education and Science
Subject
Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science