Affiliation:
1. Université Lumière Lyon2, France
2. University of Trento, Italy
Abstract
Text mining refers to the discovery of previously unknown knowledge in text collections. In recent years, the field has received great attention due to the abundance of textual data. Researchers in this area must cope with issues arising from the particularities of natural language. This survey discusses such semantic issues along with the approaches and methodologies proposed in the existing literature. It covers syntactic matters and tokenization concerns, and it focuses on the different text representation techniques, categorisation tasks and similarity measures that have been suggested.
Publisher
Association for Computing Machinery (ACM)
Subject
Information Systems, Software
Cited by
74 articles.
1. Artificial Intelligence in Interdisciplinary Linguistics;Bulletin of Kemerovo State University. Series: Humanities and Social Sciences;2023-10-02
2. Automatic software vulnerability assessment by extracting vulnerability elements;Journal of Systems and Software;2023-10
3. The application of text mining in accounting;International Journal of Accounting Information Systems;2023-09
4. Information Extraction From Text Messages Using Natural Language Processing;2023 International Conference on Computer Communication and Informatics (ICCCI);2023-01-23
5. Mobile Health Text Misinformation Identification Using Mobile Data Mining;International Journal of Mobile Devices, Wearable Technology, and Flexible Electronics;2022-10-14