Affiliation:
1. University of Cambridge
2. University of Birmingham
Abstract
Abstract
As large-scale learner corpora become increasingly available, it is vital that natural language processing (NLP)
technology is developed to provide rich linguistic annotations necessary for second language (L2) research. We present a system for
automatically analyzing subcategorization frames (SCFs) for learner English. SCFs link lexis with morphosyntax, shedding light on the
interplay between lexical and structural information in learner language. Meanwhile, SCFs are crucial to the study of a wide range of
phenomena including individual verbs, verb classes and varying syntactic structures. To illustrate the usefulness of our system for learner
corpus research and second language acquisition (SLA), we investigate how L2 learners diversify their use of SCFs in text and how this
diversity changes with L2 proficiency.
Publisher
John Benjamins Publishing Company
Subject
Linguistics and Language,Language and Linguistics