The University of Pittsburgh English Language Institute Corpus (PELIC)

Published:2022-03-08 Issue:1 Volume:8 Page:121-138
ISSN:2215-1478
Container-title:International Journal of Learner Corpus Research
language:en
Short-container-title:IJLCR

Author:

Naismith Ben¹, Han Na-Rae¹, Juffs Alan¹

Affiliation:

1. University of Pittsburgh

Abstract

This report introduces the University of Pittsburgh English Language Institute Corpus (PELIC; Juffs et al., 2020), a publicly available 4.2-million-word learner corpus of written texts. Collected over seven years in the University of Pittsburgh’s Intensive English Program, these texts were produced by more than 1,100 students with diverse linguistic backgrounds and proficiency levels. Unlike most learner corpora, which are cross-sectional, PELIC is longitudinal, offering greater opportunities for tracking development in a natural classroom setting. This potential is illustrated in an overview of the research conducted to date with these data. The report also provides a description of PELIC’s creation and contents, including how the texts have been managed to facilitate natural language processing. Overall, the corpus contributes to the field of learner corpus research by adding to the pool of freely and publicly available learner corpora, supplemented by a useful set of Python tools and tutorials for accessing these data.

Publisher

John Benjamins Publishing Company

Subject

Linguistics and Language, Education, Language and Linguistics

Link

http://www.jbe-platform.com/deliver/fulltext/ijlcr.21002.nai.pdf

Reference 32 articles.

1. Exploring big educational learner corpora for SLA research

2. Exploring the longitudinal development of grammatical complexity in the disciplinary writing of L2-English university students

Cited by 7 articles.

1. Demystifying large language models in second language development research;Computer Speech & Language;2025-01

2. Evaluating NLP models with written and spoken L2 samples;Research Methods in Applied Linguistics;2024-08

3. AI Language Models: An Opportunity to Enhance Language Learning;Informatics;2024-07-19

4. How a Phonics-Based Intervention, L1 Orthography, and Item Characteristics Impact Adult ESL Spelling Knowledge;Education Sciences;2024-04-17

5. Development and Application of Network Data Mining in English Corpus Software Design;2023 IEEE 4th Annual Flagship India Council International Subsections Conference (INDISCON);2023-08-05