Morphological Analysis Using a Sequence Decoder-Reference-Cited by-同舟云学术

Morphological Analysis Using a Sequence Decoder

Published:2019-11 Issue: Volume:7 Page:567-579
ISSN:2307-387X
Container-title:Transactions of the Association for Computational Linguistics
language:en
Short-container-title:Transactions of the Association for Computational Linguistics

Author:

Akyürek Ekin¹¹,Dayanık Erenay¹¹,Yuret Deniz¹

Affiliation:

1. Koç University Artificial Intelligence Laboratory, İstanbul, Turkey.

Abstract

We introduce Morse, a recurrent encoder-decoder model that produces morphological analyses of each word in a sentence. The encoder turns the relevant information about the word and its context into a fixed size vector representation and the decoder generates the sequence of characters for the lemma followed by a sequence of individual morphological features. We show that generating morphological features individually rather than as a combined tag allows the model to handle rare or unseen tags and to outperform whole-tag models. In addition, generating morphological features as a sequence rather than, for example, an unordered set allows our model to produce an arbitrary number of features that represent multiple inflectional groups in morphologically complex languages. We obtain state-of-the-art results in nine languages of different morphological complexity under low-resource, high-resource, and transfer learning settings. We also introduce TrMor2018, a new high-accuracy Turkish morphology data set. Our Morse implementation and the TrMor2018 data set are available online to support future research. 1 See https://github.com/ai-ku/Morse.jl for a Morse implementation in Julia/Knet (Yuret, 2016 ) and https://github.com/ai-ku/TrMor2018 for the new Turkish data set.

Publisher

MIT Press - Journals

Subject

Artificial Intelligence,Computer Science Applications,Linguistics and Language,Human-Computer Interaction,Communication

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/tacl_a_00286

Reference33 articles.

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Deep Learning-Based Preprocessing Tools for Turkish Natural Language Processing;Communications in Computer and Information Science;2024

2. Adaptive Keyword Extraction Service for Turkish;Lecture Notes in Networks and Systems;2022-09-01

3. Joint learning of morphology and syntax with cross-level contextual information flow;Natural Language Engineering;2022-01-20

4. Transmorph: a transformer based morphological disambiguator for Turkish;Turkish Journal of Electrical Engineering and Computer Sciences;2022-01-01