Author:
Monroe Burt L.,Colaresi Michael P.,Quinn Kevin M.
Abstract
Entries in the burgeoning “text-as-data” movement are often accompanied by lists or visualizations of how word (or other lexical feature) usage differs across some pair or set of documents. These are intended either to establish some target semantic concept (like the content of partisan frames) to estimate word-specific measures that feed forward into another analysis (like locating parties in ideological space) or both. We discuss a variety of techniques for selecting words that capture partisan, or other, differences in political speech and for evaluating the relative importance of those words. We introduce and emphasize several new approaches based on Bayesian shrinkage and regularization. We illustrate the relative utility of these approaches with analyses of partisan, gender, and distributive speech in the U.S. Senate.
Publisher
Cambridge University Press (CUP)
Subject
Political Science and International Relations,Sociology and Political Science
Reference80 articles.
1. Gene selection in cancer classification using sparse logistic regression with Bayesian regularization
2. We can also look at the entire vocabulary over time in an animation. An example for the topic of defense is given here: http://qssi.psu.edu/PartisanDefenseWords.html.
3. Specifically, football is very slightly Republican, whereas great and women are slightly Democratic. The graphic for this is omitted because it is largely blank, as expected, but is available in the web appendix.
Cited by
262 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献