The World Wide Web as Complex Data Set: Expanding the Digital Humanities into the Twentieth Century and Beyond through Internet Research-Reference-Cited by-同舟云学术

The World Wide Web as Complex Data Set: Expanding the Digital Humanities into the Twentieth Century and Beyond through Internet Research

Published:2016-03 Issue:1 Volume:10 Page:95-109
ISSN:1753-8548
Container-title:International Journal of Humanities and Arts Computing
language:en
Short-container-title:IJHAC

Author:

Black Michael L.

Abstract

While intellectual property protections effectively frame digital humanities text mining as a field primarily for the study of the nineteenth century, the Internet offers an intriguing object of study for humanists working in later periods. As a complex data source, the World Wide Web presents its own methodological challenges for digital humanists, but lessons learned from projects studying large nineteenth century corpora offer helpful starting points. Complicating matters further, legal and ethical questions surrounding web scraping, or the practice of large scale data retrieval over the Internet, will require humanists to frame their research to distinguish it from commercial and malicious activities. This essay reviews relevant research in the digital humanities and new media studies in order to show how web scraping might contribute to humanities research questions. In addition to recommendations for addressing the complex concerns surrounding web scraping this essay also provides a basic overview of the process and some recommendations for resources.

Publisher

Edinburgh University Press

Subject

Human-Computer Interaction,General Arts and Humanities,General Computer Science

Cited by 16 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Is Taylor Swift leading a new Pop revolution? A cross-generation analysis of Pop/Rock cover songs;F1000Research;2024-02-12

2. Conclusion: A Highly transformative age for web archives;Proceedings e report;2024

3. A New Scientometric Database of Scientific Publications in Brazilian International Relations Journals (1997-2021);Contexto Internacional;2023-04

4. Anwendungskontexte von Web Scraping in der Versorgungsforschung - Nur für Web-Expert:innen? Oder eine Methode für alle Versorgungsforscher:innen!?;Zeitschrift für Evidenz, Fortbildung und Qualität im Gesundheitswesen;2023-02

5. Is Taylor Swift leading a new Pop revolution? A cross-generation analysis of Pop/Rock cover songs;F1000Research;2023-01-27