Affiliation:
1. Xi’an Jiaotong-Liverpool University
Abstract
Corpus data provide evidence of the patterning of language, and one way word usage can be analysed is through the
study of concordance lines. While popular concordancers provide different sorting methods, they are typically only able to display
lines in the order in which they occur in the corpus, randomly, or alphabetically by words in slots to the left or right of the
word of interest. Less sophisticated users may find recognising patterns from these orderings quite challenging. This paper
considers possible needs of language learners in terms of concordance ranking and introduces two methods which have been adopted
and developed for The Prime Machine. The first method uses repeated patterns, measuring the number of matches
made with other lines in the set. The second method incorporates collocation scores, providing examples with strong collocations
from the entire corpus at the top of sampled concordance lines.
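The two ranking ideas described above can be illustrated with a minimal Python sketch. It is not the algorithm used in The Prime Machine: the function names, the window size, the whitespace tokenisation, and the use of pointwise mutual information as the collocation score are all assumptions made for illustration. The first function scores each line by how many other lines share a word in the same slot relative to the node word; the second ranks lines by the strongest collocate found near the node, given a hypothetical table of corpus-wide collocation scores.

```python
import math
from collections import Counter


def context_slots(line, node, window=2):
    """Return {(offset, word)} for words within `window` slots of the first node hit."""
    tokens = line.lower().split()  # naive whitespace tokenisation (assumption)
    if node not in tokens:
        return set()
    i = tokens.index(node)
    return {
        (off, tokens[i + off])
        for off in range(-window, window + 1)
        if off != 0 and 0 <= i + off < len(tokens)
    }


def rank_by_repeated_patterns(lines, node, window=2):
    """Rank lines by the number of slot matches they share with other lines in the set."""
    slot_sets = [context_slots(line, node, window) for line in lines]
    counts = Counter(slot for slots in slot_sets for slot in slots)
    # A slot seen in n lines matches n - 1 *other* lines.
    scores = [sum(counts[slot] - 1 for slot in slots) for slots in slot_sets]
    order = sorted(range(len(lines)), key=lambda k: -scores[k])  # stable for ties
    return [(lines[k], scores[k]) for k in order]


def pmi(pair_freq, freq_x, freq_y, corpus_size):
    """Pointwise mutual information in bits, one common collocation measure."""
    return math.log2(pair_freq * corpus_size / (freq_x * freq_y))


def rank_by_collocation(lines, node, collocation_scores, window=2):
    """Rank lines by the highest-scoring collocate found within the window.

    `collocation_scores` is a hypothetical {word: score} table that would be
    computed in advance from the whole corpus, not from the sampled lines.
    """
    scored = []
    for line in lines:
        words = {word for _, word in context_slots(line, node, window)}
        best = max((collocation_scores.get(w, 0.0) for w in words), default=0.0)
        scored.append((line, best))
    return sorted(scored, key=lambda pair: -pair[1])  # stable for ties
```

In both functions ties keep their original corpus order, so lines without a repeated pattern or a strong collocate simply fall to the bottom of the display rather than being dropped.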
Publisher
John Benjamins Publishing Company
Subject
Linguistics and Language, Language and Linguistics