Threats of Bots and Other Bad Actors to Data Quality Following Research Participant Recruitment Through Social Media: Cross-Sectional Questionnaire-Reference-Cited by-同舟云学术

Threats of Bots and Other Bad Actors to Data Quality Following Research Participant Recruitment Through Social Media: Cross-Sectional Questionnaire

Published:2020-10-07 Issue:10 Volume:22 Page:e23021
ISSN:1438-8871
Container-title:Journal of Medical Internet Research
language:en
Short-container-title:J Med Internet Res

Author:

Pozzar Rachel^ORCID,Hammer Marilyn J^ORCID,Underhill-Blazey Meghan^ORCID,Wright Alexi A^ORCID,Tulsky James A^ORCID,Hong Fangxin^ORCID,Gundersen Daniel A^ORCID,Berry Donna L^ORCID

Abstract

Background Recruitment of health research participants through social media is becoming more common. In the United States, 80% of adults use at least one social media platform. Social media platforms may allow researchers to reach potential participants efficiently. However, online research methods may be associated with unique threats to sample validity and data integrity. Limited research has described issues of data quality and authenticity associated with the recruitment of health research participants through social media, and sources of low-quality and fraudulent data in this context are poorly understood. Objective The goal of the research was to describe and explain threats to sample validity and data integrity following recruitment of health research participants through social media and summarize recommended strategies to mitigate these threats. Our experience designing and implementing a research study using social media recruitment and online data collection serves as a case study. Methods Using published strategies to preserve data integrity, we recruited participants to complete an online survey through the social media platforms Twitter and Facebook. Participants were to receive $15 upon survey completion. Prior to manually issuing remuneration, we reviewed completed surveys for indicators of fraudulent or low-quality data. Indicators attributable to respondent error were labeled suspicious, while those suggesting misrepresentation were labeled fraudulent. We planned to remove cases with 1 fraudulent indicator or at least 3 suspicious indicators. Results Within 7 hours of survey activation, we received 271 completed surveys. We classified 94.5% (256/271) of cases as fraudulent and 5.5% (15/271) as suspicious. In total, 86.7% (235/271) provided inconsistent responses to verifiable items and 16.2% (44/271) exhibited evidence of bot automation. Of the fraudulent cases, 53.9% (138/256) provided a duplicate or unusual response to one or more open-ended items and 52.0% (133/256) exhibited evidence of inattention. Conclusions Research findings from several disciplines suggest studies in which research participants are recruited through social media are susceptible to data quality issues. Opportunistic individuals who use virtual private servers to fraudulently complete research surveys for profit may contribute to low-quality data. Strategies to preserve data integrity following research participant recruitment through social media are limited. Development and testing of novel strategies to prevent and detect fraud is a research priority.

Publisher

JMIR Publications Inc.

Subject

Health Informatics

Reference36 articles.

1. Using e-technologies in clinical trials

2. A Comparison of Three Online Recruitment Strategies for Engaging Parents

3. Comparing Twitter and Online Panels for Survey Recruitment of E-Cigarette Users and Smokers

4. Comparisons of Online Recruitment Strategies for Convenience Samples

5. Integrative Review of Recruitment of Research Participants Through Facebook

Cited by 117 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. What is wrong with individual differences research?;Personality and Individual Differences;2024-04

2. Detecting the corruption of online questionnaires by artificial intelligence;Frontiers in Robotics and AI;2024-02-02

3. Predicting physical activity for people with multiple sclerosis: The role of exercise-related cognitive errors.;Rehabilitation Psychology;2024-02

4. Internet-based Sexual Health Survey: Protocol for Data Verification and Respondent Validity (Preprint);2024-01-26

5. Who can you trust these days?: Dealing with imposter participants during online recruitment and data collection;Qualitative Research;2024-01-22