Predicting Web Survey Breakoffs Using Machine Learning Models-Reference-Cited by-同舟云学术

Predicting Web Survey Breakoffs Using Machine Learning Models

Published:2022-07-11 Issue: Volume: Page:089443932211120
ISSN:0894-4393
Container-title:Social Science Computer Review
language:en
Short-container-title:Social Science Computer Review

Author:

Chen Zeming¹,Cernat Alexandru¹,Shlomo Natalie¹

Affiliation:

1. University of Manchester, UK

Abstract

Web surveys are becoming increasingly popular but tend to have more breakoffs compared to the interviewer-administered surveys. Survey breakoffs occur when respondents quit the survey partway through. The Cox survival model is commonly used to understand patterns of breakoffs. Nevertheless, there is a trend to using more data-driven models when the purpose is prediction, such as classification machine learning models. It is unclear in the breakoff literature what are the best statistical models for predicting question-level breakoffs. Additionally, there is no consensus about the treatment of time-varying question-level predictors, such as question response time and question word count. While some researchers use the current values, others aggregate the value from the beginning of the survey. This study develops and compares both survival models and classification models along with different treatments of time-varying variables. Based on the level of agreement between the predicted and actual breakoff, we find that the Cox model and gradient boosting outperform other survival models and classification models respectively. We also find that using the values of time-varying predictors concurrent to the breakoff status is more predictive of breakoff, compared to aggregating their values from the beginning of the survey, implying that respondents’ breakoff behaviour is more driven by the current response burden.

Publisher

SAGE Publications

Subject

Law,Library and Information Sciences,Computer Science Applications,General Social Sciences

Link

http://journals.sagepub.com/doi/pdf/10.1177/08944393221112000

Reference41 articles.

1. Questionnaire Breakoff and Item Nonresponse in Web-Based Questionnaires: Multilevel Analysis of Person-Level and Item Design Factors in a Birth Cohort

2. Kaggle forecasting competitions: An overlooked learning opportunity

3. Impact of question topics and filter question formats on web survey breakoffs

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The Effects of Order and Text Box Size of Open-ended Questions on Withdrawal Rate and the Length of Response;Proceedings of the 35th Australian Computer-Human Interaction Conference;2023-12-02