Selecting and Weighting N-Grams to Identify 1100 Languages-Reference-Cited by-同舟云学术

Selecting and Weighting N-Grams to Identify 1100 Languages

Published:2013 Issue: Volume: Page:475-483
ISSN:0302-9743
Container-title:Text, Speech, and Dialogue
language:
Short-container-title:

Author:

Brown Ralf D.

Publisher

Springer Berlin Heidelberg

Link

http://link.springer.com/content/pdf/10.1007/978-3-642-40585-3_60

Reference13 articles.

1. Brown, R.D.: Finding and Identifying Text in 900+ Languages. Digital Investigation 9, S34–S43 (2012)

2. Cavnar, W.B., Trenkle, J.M.: N-Gram-Based Text Categorization. In: Proceedings of SDAIR 1994, 3rd Annual Symposium on Document Analysis and Information Retrieval, UNLV Publications/Reprographics, pp. 161–175 (April 1994)

3. Ljubešić, N., Mikelić, N., Boras, D.: Language identification: How to distinguish similar languages. In: Lužar-Stifter, V., Hljuz Dobrić, V. (eds.) Proceedings of the 29th International Conference on Information Technology Interfaces, Zagreb, pp. 541–546. SRCE University Computing Centre (2007)

4. Ahmed, B., Cha, S.H., Tappert, C.: Language Identification from Text Using N-gram Based Cumulative Frequency Addition. In: Proceedings of Student/Faculty Research Day, CSIS, Pace University (May 2004)

5. Carter, S., Tsagkias, M., Weerkamp, W.: Semi-Supervised Priors for Microblog Language Identification. In: Proceedings of the Dutch-Belgian Information Retrieval Workshop (DIR 2011), Amsterdam (February 2011)

Cited by 17 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Large Scale, Multi-domain Language Identification;Synthesis Lectures on Human Language Technologies;2024

2. Specific Challenges of Variation and Text Types;Synthesis Lectures on Human Language Technologies;2024

3. Evaluation and Measurement;Synthesis Lectures on Human Language Technologies;2024

4. Features and Methods;Synthesis Lectures on Human Language Technologies;2024

5. Sustainable development goals research in higher education institutions: An interdisciplinarity assessment through an entropy-based indicator;Journal of Business Research;2022-11