From archive to corpus-Reference-Cited by-同舟云学术

From archive to corpus

Published:2010-03-22 Issue:1 Volume:15 Page:106-131
ISSN:1384-6655
Container-title:International Journal of Corpus Linguistics
language:en
Short-container-title:IJCL

Author:

Johnston Trevor¹

Affiliation:

1. Macquarie University

Abstract

Annotations are an important resource in corpus-based linguistic research. In fact, the most important feature of a modern signed language corpus should be that it has been annotated rather than simply transcribed. Digital multi-media annotation software can now transform language recordings into machine-readable texts using gloss-based annotations without it first being necessary to transcribe these utterances, provided that sign tokens are identified and discriminated according to type. Further annotations can subsequently be appended to these units. However, unique identifiers of sign types (or ‘ID-glosses’) can only be used if a comprehensive reference lexical database of the language already exists. In order to create a basic multi-purpose reference signed language corpus, therefore, linguists should prioritize annotation using ID-glosses above transcription. The effort expended in creating a transcription that does not facilitate the unique identification of sign types will not result in a machine-readable corpus in any meaningful sense, contrary to expectations.

Publisher

John Benjamins Publishing Company

Subject

Linguistics and Language,Language and Linguistics

Link

http://www.jbe-platform.com/deliver/fulltext/ijcl.15.1.05joh.pdf

Cited by 90 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The GeSCA repository: Gesture and Sign Corpus of Australia;Australian Journal of Linguistics;2024-09-11

2. From rule-based models to deep learning transformers architectures for natural language processing and sign language translation systems: survey, taxonomy and performance evaluation;Artificial Intelligence Review;2024-08-29

3. Spoken and signed languages hand in hand: parallel and directly comparable corpora of French Belgian Sign Language (lsfb) and French;Corpora;2024-08

4. Metalinguistic Discourse in an Emerging Sign Language;Languages;2024-07-03

5. Emblems: Meaning at the interface of language and gesture;Glossa: a journal of general linguistics;2024-06-18