L2 and L1 semantic context indices as automated measures of lexical sophistication-Reference-Cited by-同舟云学术

L2 and L1 semantic context indices as automated measures of lexical sophistication

Published:2023-02-02 Issue:3 Volume:40 Page:576-606
ISSN:0265-5322
Container-title:Language Testing
language:en
Short-container-title:Language Testing

Author:

Monteiro Kátia¹^ORCID,Crossley Scott¹^ORCID,Botarleanu Robert-Mihai²,Dascălu Mihai²

Affiliation:

1. Georgia State University, USA

2. Politehnica University of Bucharest, Romania

Abstract

Lexical frequency benchmarks have been extensively used to investigate second language (L2) lexical sophistication, especially in language assessment studies. However, indices based on semantic co-occurrence, which may be a better representation of the experience language users have with lexical items, have not been sufficiently tested as benchmarks of lexical sophistication. To address this gap, we developed and tested indices based on semantic co-occurrence from two computational methods, namely, Latent Semantic Analysis and Word2Vec. The indices were developed from one L2 written corpus (i.e., EF Cambridge Open Language Database [EF-CAMDAT]) and one first language (L1) written corpus (i.e., Corpus of Contemporary American English [COCA] Magazine). Available L1 semantic context indices (i.e., Touchstone Applied Sciences Associates [TASA] indices) were also assessed. To validate the indices, they were used to predict L2 essay quality scores as judged by human raters. The models suggested that the semantic context indices developed from EF-CAMDAT and TASA, but not the COCA Magazine indices, explained unique variance in the presence of lexical sophistication measures. This study suggests that semantic context indices based on multi-level corpora, including L2 corpora, may provide a useful representation of the experience L2 writers have with input, which may assist with automatic scoring of L2 writing.

Publisher

SAGE Publications

Subject

Linguistics and Language,Social Sciences (miscellaneous),Language and Linguistics

Link

http://journals.sagepub.com/doi/pdf/10.1177/02655322221147924

Reference75 articles.

1. Contextual Diversity, Not Word Frequency, Determines Word-Naming and Lexical Decision Times

2. Mixed-effects modeling with crossed random effects for subjects and items

3. Writing evaluation: what can analytic versus holistic essay scoring tell us?

4. The locus of word-frequency effects in the pronunciation task: Lexical access and/or production?

5. Visual Word Recognition of Single-Syllable Words.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Features of lexical complexity: insights from L1 and L2 speakers;Frontiers in Artificial Intelligence;2023-11-30