TweetLID: a benchmark for tweet language identification-Reference-Cited by-同舟云学术

TweetLID: a benchmark for tweet language identification

Published:2015-09-26 Issue:4 Volume:50 Page:729-766
ISSN:1574-020X
Container-title:Language Resources and Evaluation
language:en
Short-container-title:Lang Resources & Evaluation

Author:

Zubiaga Arkaitz,Vicente Iñaki San,Gamallo Pablo,Pichel José Ramom,Alegria Iñaki,Aranberri Nora,Ezeiza Aitzol,Fresno Víctor

Publisher

Springer Science and Business Media LLC

Subject

Library and Information Sciences,Linguistics and Language,Education,Language and Linguistics

Link

http://link.springer.com/content/pdf/10.1007/s10579-015-9317-4.pdf

Reference78 articles.

1. Agarwal, A., Xie, B., Vovsha, I., Rambow, O., & Passonneau, R. (2011). Sentiment analysis of twitter data. In Proceedings of the workshop on languages in social media (pp. 30–38). Association for Computational Linguistics.

2. Alegria, I., Aranberri, N., Comas, P. R., Fresno, V., Gamallo, P., Padró, L., San Vicente, I., Turmo, J., & Zubiaga, A. (2014). Tweetnorm\_es corpus: An annotated corpus for spanish microtext normalization. In Proceedings of the language resources and evaluation conference.

3. Baldwin, T., & Lui, M. (2010). Language identification: The long and the short of the matter. In Human language technologies: The 2010 annual conference of the North American Chapter of the Association for Computational Linguistics (pp. 229–237). Association for Computational Linguistics.

4. Baykan, E., Henzinger, M., & Weber, I. (2008). Web page language identification based on urls. Proceedings of the VLDB Endowment, 1(1), 176–187.

5. Beesley, K. R. (1988). Language identifier: A computer program for automatic natural-language identification of on-line text. In Proceedings of the 29th annual conference of the American Translators Association (Vol. 47, p. 54). Citeseer.

Cited by 28 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Sentiment Analysis of Marathi–English Code-Mixed Using Ensemble Model;Data-Intensive Research;2024

2. Evaluation and Measurement;Synthesis Lectures on Human Language Technologies;2024

3. TweetVi: A Tweet Visualisation Dashboard for Automatic Topic Classification and Sentiment Analysis;Leveraging Generative Intelligence in Digital Libraries: Towards Human-Machine Collaboration;2023

4. Closely related Indonesian language identification using deep learning;VII INTERNATIONAL CONFERENCE “SAFETY PROBLEMS OF CIVIL ENGINEERING CRITICAL INFRASTRUCTURES” (SPCECI2021);2023

5. Exploring the geo virtual linguistic landscape of Dublin urban areas: before and during the COVID-19 outbreak;International Journal of Multilingualism;2022-08-01