Identifying and preventing fraudulent responses in online public health surveys: Lessons learned during the COVID-19 pandemic

Author:

Wang June, Calderon Gabriela, Hager Erin R., Edwards Lorece V., Berry Andrea A., Liu Yisi, Dinh Janny, Summers August C., Connor Katherine A., Collins Megan E., Prichett Laura, Marshall Beth R., Johnson Sara B.

Abstract

Web-based survey data collection has become increasingly popular, and limitations on in-person data collection during the COVID-19 pandemic have fueled this growth. However, the anonymity of the online environment increases the risk of fraudulent responses provided by bots or those who complete surveys to receive incentives, a major risk to data integrity. As part of a study of COVID-19 and the return to in-person school, we implemented a web-based survey of parents in Maryland between December 2021 and July 2022. Recruitment relied, in part, on social media advertisements. Despite implementing many existing best practices, we found the survey challenged by sophisticated fraudsters. In response, we iteratively improved survey security. In this paper, we describe efforts to identify and prevent fraudulent online survey responses. Informed by this experience, we provide specific, actionable recommendations for identifying and preventing online survey fraud in future research. Some strategies can be deployed within the data collection platform such as careful crafting of survey links, Internet Protocol address logging to identify duplicate responses, and comparison of client-side and server-side time stamps to identify responses that may have been completed by respondents outside of the survey’s target geography. Other strategies can be implemented during the survey design phase. These approaches include the use of a 2-stage design in which respondents must be eligible on a preliminary screener before receiving a personalized link. Other design-based strategies include within-survey and cross-survey validation questions, the addition of “speed bump” questions to thwart careless or computerized responders, and the use of optional open-ended survey questions to identify fraudsters. We describe best practices for ongoing monitoring and post-completion survey data review and verification, including algorithms to expedite some aspects of data review and quality assurance. Such strategies are increasingly critical to safeguarding survey-based public health research.
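
Two of the platform-level checks named in the abstract, Internet Protocol address logging and comparison of client-side and server-side time stamps, can be illustrated with a short sketch. This is not the authors' algorithm: the `Response` fields, the `flag_responses` function, and the set of expected UTC offsets (chosen here to match US Eastern time, which covers Maryland) are all assumptions made for illustration.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass
class Response:
    # Hypothetical record layout; a real export will use different field names.
    response_id: str
    ip_address: str
    server_time_utc: datetime    # timezone-aware, recorded by the server
    client_time_local: datetime  # naive wall-clock time reported by the browser


# Assumption: Maryland observes US Eastern time (UTC-5 standard, UTC-4 daylight).
EXPECTED_OFFSETS_HOURS = {-5, -4}


def implied_utc_offset_hours(r: Response) -> int:
    """Estimate the respondent's UTC offset from the two time stamps."""
    server_naive = r.server_time_utc.astimezone(timezone.utc).replace(tzinfo=None)
    delta = r.client_time_local - server_naive
    return round(delta.total_seconds() / 3600)


def flag_responses(responses: list[Response]) -> dict[str, list[str]]:
    """Return, for each response ID, the list of reasons it was flagged."""
    ip_counts = Counter(r.ip_address for r in responses)
    flags: dict[str, list[str]] = {}
    for r in responses:
        reasons = []
        if ip_counts[r.ip_address] > 1:
            reasons.append("duplicate_ip")
        if implied_utc_offset_hours(r) not in EXPECTED_OFFSETS_HOURS:
            reasons.append("timezone_mismatch")
        flags[r.response_id] = reasons
    return flags
```

A flag from a sketch like this is best read as a prompt for manual review rather than grounds for exclusion, since members of one household can legitimately share an IP address and an eligible respondent may be traveling.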

Funder

National Institutes of Health

Publisher

Public Library of Science (PLoS)
