Affiliation:
1. Computer Laboratory and RCEAL, University of Cambridge, 15 J J Thomson Avenue, Cambridge CB3 0FD, UK
Abstract
Natural language processing (NLP)—the automatic analysis, understanding and generation of human language by computers—is vitally dependent on accurate knowledge about
words
. Because words change their behaviour between text types, domains and sub-languages, a fully accurate static lexical resource (e.g. a dictionary, word classification) is unattainable. Researchers are now developing techniques that could be used to automatically acquire or update lexical resources from textual data. If successful, the automatic approach could considerably enhance the accuracy and portability of language technologies, such as machine translation, text mining and summarization. This paper reviews the recent and on-going research in automatic lexical acquisition. Focusing on lexical classification, it discusses the many challenges that still need to be met before the approach can benefit NLP on a large scale.
Subject
General Physics and Astronomy,General Engineering,General Mathematics
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Cluster Analysis;A Practical Handbook of Corpus Linguistics;2020
2. HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment;Computational Linguistics;2017-12
3. Visions of the future for the Royal Society’s 350th anniversary year;Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences;2010-08-13