New Data Sources in Social Science Research: Things to Know Before Working With Reddit Data-Reference-Cited by-同舟云学术

New Data Sources in Social Science Research: Things to Know Before Working With Reddit Data

Published:2019-12-18 Issue:5 Volume:39 Page:943-960
ISSN:0894-4393
Container-title:Social Science Computer Review
language:en
Short-container-title:Social Science Computer Review

Author:

Amaya Ashley¹,Bach Ruben²,Keusch Florian²,Kreuter Frauke²³

Affiliation:

1. RTI International, Washington, DC, USA

2. University of Mannheim, Germany

3. University of Maryland, College Park, MD, USA

Abstract

Social media are becoming more popular as a source of data for social science researchers. These data are plentiful and offer the potential to answer new research questions at smaller geographies and for rarer subpopulations. When deciding whether to use data from social media, it is useful to learn as much as possible about the data and its source. Social media data have properties quite different from those with which many social scientists are used to working, so the assumptions often used to plan and manage a project may no longer hold. For example, social media data are so large that they may not be able to be processed on a single machine; they are in file formats with which many researchers are unfamiliar, and they require a level of data transformation and processing that has rarely been required when using more traditional data sources (e.g., survey data). Unfortunately, this type of information is often not obvious ahead of time as much of this knowledge is gained through word-of-mouth and experience. In this article, we attempt to document several challenges and opportunities encountered when working with Reddit, the self-proclaimed “front page of the Internet” and popular social media site. Specifically, we provide descriptive information about the Reddit site and its users, tips for using organic data from Reddit for social science research, some ideas for conducting a survey on Reddit, and lessons learned in merging survey responses with Reddit posts. While this article is specific to Reddit, researchers may also view it as a list of the type of information one may seek to acquire prior to conducting a project that uses any type of social media data.

Publisher

SAGE Publications

Subject

Law,Library and Information Sciences,Computer Science Applications,General Social Sciences

Link

http://journals.sagepub.com/doi/pdf/10.1177/0894439319893305

Reference22 articles.

1. Measuring the Strength of Attitudes in Social Media Data

2. Pseudonymous Parents

3. Methodological Considerations in Analyzing Twitter Data

4. Conceptualising the use of Facebook in ethnographic research: as tool, as data and as context

Cited by 80 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Peer Support for Chronic Pain in Online Health Communities: Quantitative Study on the Dynamics of Social Interactions in a Chronic Pain Forum;Journal of Medical Internet Research;2024-09-05

2. Anger, fear, and frozenness: Exploring the emotive aspect of anti-police sentiment;Journal of Criminal Justice;2024-09

3. From Web to RheumaLpack: Creating a Linguistic Corpus for Exploitation and Knowledge Discovery in Rheumatology;Computers in Biology and Medicine;2024-09

4. Voices in the digital crowd: a discursive analysis of Airbnb boycott;Tourism Recreation Research;2024-08-13

5. Does generative artificial intelligence pose a risk to performance validity test security?;The Clinical Neuropsychologist;2024-07-21