Machine learning for learner English

Published:2020-04-14 Issue:1 Volume:6 Page:72-103
ISSN:2215-1478
Container-title:International Journal of Learner Corpus Research
language:en
Short-container-title:IJLCR

Author:

Ballier Nicolas¹ ², Canu Stéphane³ ⁴, Petitjean Caroline³ ⁴, Gasso Gilles³ ⁴, Balhana Carlos⁵, Alexopoulou Theodora⁵, Gaillat Thomas⁶

Affiliation:

1. Université de Paris

2. CLILLAC-ARP

3. INSA Rouen

4. LITIS

5. University of Cambridge

6. Universités de Rennes 1 & 2, LIDILE

Abstract

This paper discusses machine learning techniques for the prediction of Common European Framework of Reference (CEFR) levels in a learner corpus. We summarise the CAp 2018 Machine Learning (ML) competition, a classification task of the six CEFR levels, which map linguistic competence in a foreign language onto six reference levels. The goal of this competition was to produce a machine learning system to predict learners’ competence levels from written productions comprising between 20 and 300 words and a set of characteristics computed for each text extracted from the French component of the EFCAMDAT data (Geertzen et al., 2013). Together with the description of the competition, we provide an analysis of the results and methods proposed by the participants and discuss the benefits of this kind of competition for the learner corpus research (LCR) community. The main findings address the methods used and lexical bias introduced by the task.

Publisher

John Benjamins Publishing Company

Link

http://www.jbe-platform.com/deliver/fulltext/ijlcr.18012.bal.pdf

Reference: 65 articles.

1. Semisupervised Learning for Computational Linguistics

2. Task Effects on Linguistic Complexity and Accuracy: A Large-Scale Learner Corpus Analysis Employing Natural Language Processing Techniques

3. Classifying intermediate learner English: a data-driven approach to learner corpora;Alexopoulou,2013

4. Automated essay scoring with e-rater® v.2;Attali;The Journal of Technology, Learning and Assessment,2006

5. Lexical bias in essay level prediction;Balikas;ArXiv e-prints,2018

Cited by 11 articles.

1. Automated Scoring of English Essays in CEFR Levels using LSTM and DistilBERT Embeddings;2023 10th International Conference on Advanced Informatics: Concept, Theory and Application (ICAICTA);2023-10-07

2. Robustness Analysis uncovers Language Proficiency Bias in Emotion Recognition Systems;2023 11th International Conference on Affective Computing and Intelligent Interaction (ACII);2023-09-10

3. Visualizing Linguistic Complexity and Proficiency in Learner English Writings;CALICO Journal;2023-05-25

4. Modals as a predictive factor for L2 proficiency level;Models of Modals;2023-04-12