Named-entity recognition in Turkish legal texts-Reference-Cited by-同舟云学术

Named-entity recognition in Turkish legal texts

Published:2022-07-11 Issue: Volume: Page:1-28
ISSN:1351-3249
Container-title:Natural Language Engineering
language:en
Short-container-title:Nat. Lang. Eng.

Author:

Çetindağ Can,Yazıcıoğlu Berkay,Koç Aykut

Abstract

Abstract Natural language processing (NLP) technologies and applications in legal text processing are gaining momentum. Being one of the most prominent tasks in NLP, named-entity recognition (NER) can substantiate a great convenience for NLP in law due to the variety of named entities in the legal domain and their accentuated importance in legal documents. However, domain-specific NER models in the legal domain are not well studied. We present a NER model for Turkish legal texts with a custom-made corpus as well as several NER architectures based on conditional random fields and bidirectional long-short-term memories (BiLSTMs) to address the task. We also study several combinations of different word embeddings consisting of GloVe, Morph2Vec, and neural network-based character feature extraction techniques either with BiLSTM or convolutional neural networks. We report 92.27% F1 score with a hybrid word representation of GloVe and Morph2Vec with character-level features extracted with BiLSTM. Being an agglutinative language, the morphological structure of Turkish is also considered. To the best of our knowledge, our work is the first legal domain-specific NER study in Turkish and also the first study for an agglutinative language in the legal domain. Thus, our work can also have implications beyond the Turkish language.

Publisher

Cambridge University Press (CUP)

Subject

Artificial Intelligence,Linguistics and Language,Language and Linguistics,Software

Reference94 articles.

1. Long Short-Term Memory

2. Dalkılıç, F.E. , Gelişli, S. and Diri, B. (2010). Named entity recognition from Turkish texts. In 2010 IEEE 18th Signal Processing and Communications Applications Conference. IEEE, pp. 918–920.

3. Joint parsing and named entity recognition

4. A hybrid named entity recognizer for Turkish

5. Natural language processing (almost) from scratch;Collobert;Journal of Machine Learning Research,2011

Cited by 13 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. HUKUKİ METİNLERİN OTOMATİK İŞLENMESİNDE YAPAY ZEKÂ TEKNOLOJİLERİNİN KULLANIMI;Bilişim Hukuku Dergisi;2024-06-30

2. An entity-centric approach to manage court judgments based on Natural Language Processing;Computer Law & Security Review;2024-04

3. CRF-Named Entity Recognition Model for Ancient Isan Medicine Texts;2024 12th International Electrical Engineering Congress (iEECON);2024-03-06

4. Application of BiLSTM-CRF model with different embeddings for product name extraction in unstructured Turkish text;Neural Computing and Applications;2024-02-21

5. Addressing Annotated Data Scarcity in Legal Information Extraction;Lecture Notes in Computer Science;2024