Affiliation:
1. VU University Amsterdam
Abstract
Abstract
Most common methods for automatic text analysis in communication science ignore syntactic information, focusing on the occurrence and co-occurrence of individual words, and sometimes n-grams. This is remarkably effective for some purposes, but poses a limitation for fine-grained analyses into semantic relations such as who does what to whom and according to what source. One tested, effective method for moving beyond this bag-of-words assumption is to use a rule-based approach for labeling and extracting syntactic patterns in dependency trees. Although this method can be used for a variety of purposes, its application is hindered by the lack of dedicated and accessible tools. In this paper we introduce the rsyntax R package, which is designed to make working with dependency trees easier and more intuitive for R users, and provides a framework for combining multiple rules for reliably extracting useful semantic relations.
Publisher
Amsterdam University Press
Reference33 articles.
1. Methodological challenges in estimating tone: Application to news coverage of the us economy;Meeting of the midwest political science association, chicago, il.,2016
2. spacyr: R wrapper to the spacy nlp library [Computer software manual],2017
3. Taking stock of the toolkit: An overview of relevant automated content analysis approaches and techniques for digital journalism scholars;Digital Journalism,2016
4. A fast and accurate dependency parser using neural networks;Proceedings of the 2014 conference on empirical methods in natural language processing (emnlp),2014
5. Relex—relation extraction using dependency parse trees;Bioinformatics,2007
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献