Refining and modifying the EFCAMDAT

Published: 2020-12-10 Issue: 2 Volume: 6 Page: 220-236
ISSN: 2215-1478
Container-title: International Journal of Learner Corpus Research
Language: en
Short-container-title: IJLCR

Author:

Shatz Itamar¹

Affiliation:

1. University of Cambridge

Abstract

This report outlines the development of a new corpus, which was created by refining and modifying the largest open-access L2 English learner database – the EFCAMDAT. The extensive data-curation process, which can inform the development and use of other corpora, included procedures such as converting the database from XML to a tabular format, and removing problematic markup tags and non-English texts. The final dataset contains two corresponding samples, written by similar learners in response to different prompts, which represents a unique research opportunity when it comes to analyzing task effects and conducting replication studies. Overall, the resulting corpus contains ~406,000 texts in the first sample and ~317,000 texts in the second sample, written by learners representing diverse L1s and a large range of L2 proficiency levels.

Publisher

John Benjamins Publishing Company

Link

http://www.jbe-platform.com/deliver/fulltext/ijlcr.20009.sha.pdf

References: 19 articles.

1. Exploring big educational learner corpora for SLA research

2. Task Effects on Linguistic Complexity and Accuracy: A Large-Scale Learner Corpus Analysis Employing Natural Language Processing Techniques

3. Learner corpus methodology

Cited by 9 articles.

1. How do constructions with modal verbs develop in second language learners of English?;Journal of Second Language Studies;2024-08-12

2. The potential influence of cross-linguistic lexical similarity on lexical diversity in L2 English writing;Corpora;2024-08

3. Proficiency-rated learner corpora;International Journal of Learner Corpus Research;2024-06-28

4. Utility of Kolmogorov complexity measures: Analysis of L2 groups and L1 backgrounds;PLOS ONE;2024-04-18

5. Definite and indefinite article accuracy in learner English: A multifactorial analysis;Studies in Second Language Acquisition;2023-10-13