Author:
Hoeber Orland,Hoeber Larena,El Meseery Maha,Odoh Kenneth,Gopi Radhika
Abstract
Purpose
– Due to the size and velocity at which user generated content is created on social media services such as Twitter, analysts are often limited by the need to pre-determine the specific topics and themes they wish to follow. Visual analytics software may be used to support the interactive discovery of emergent themes. The paper aims to discuss these issues.
Design/methodology/approach
– Tweets collected from the live Twitter stream matching a user’s query are stored in a database, and classified based on their sentiment. The temporally changing sentiment is visualized, along with sparklines showing the distribution of the top terms, hashtags, user mentions, and authors in each of the positive, neutral, and negative classes. Interactive tools are provided to support sub-querying and the examination of emergent themes.
Findings
– A case study of using Vista to analyze sport fan engagement within a mega-sport event (2013 Le Tour de France) is provided. The authors illustrate how emergent themes can be identified and isolated from the large collection of data, without the need to identify these a priori.
Originality/value
– Vista provides mechanisms that support the interactive exploration among Twitter data. By combining automatic data processing and machine learning methods with interactive visualization software, researchers are relieved of tedious data processing tasks, and can focus on the analysis of high-level features of the data. In particular, patterns of Twitter use can be identified, emergent themes can be isolated, and purposeful samples of the data can be selected by the researcher for further analysis.
Subject
Library and Information Sciences,Computer Science Applications,Information Systems
Reference33 articles.
1. Agarwal, A.
,
Xie, B.
,
Vovsha, I.
,
Rambow, O.
and
Passonneau, R.
(2011), “Sentiment analysis of twitter data”, Proceedings of the Workshop on Languages in Social Media, pp. 30-38.
2. Alencar, A.B.
,
de Oliveira, M.C.F.
and
Paulovich, F.V.
(2012), “Seeing beyond reading: a survey on visual text analytics”,
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
, Vol. 2 No. 6, pp. 476-492.
3. Archambault, D.
,
Greene, D.
,
Cunningham, P.
and
Hurley, N.
(2011), “ThemeCrowds: multiresolution summaries of twitter usage”, Proceedings of the International Workshop on Search and Mining User-Generated Contents, pp. 77-84.
4. Blaszka, M.
,
Burch, L.M.
,
Frederick, E.L.
,
Clavio, G.
and
Walsh, P.
(2012), “#WorldSeries: an empirical examination of a twitter hashtag during a major sporting event”,
International Journal of Sport Communication
, Vol. 5 No. 4, pp. 435-453.
5. Bostock, M.
,
Ogievetsky, V.
and
Heer, J.
(2011), “D3: data driven documents”,
IEEE Transactions on Visualization and Computer Graphics
, Vol. 17 No. 12, pp. 2301-2309.
Cited by
36 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献