Unsupervised law article mining based on deep pre-trained language representation models with application to the Italian civil code-Reference-Cited by-同舟云学术

Unsupervised law article mining based on deep pre-trained language representation models with application to the Italian civil code

Published:2021-09-15 Issue: Volume: Page:
ISSN:0924-8463
Container-title:Artificial Intelligence and Law
language:en
Short-container-title:Artif Intell Law

Author:

Tagarelli Andrea^ORCID,Simeri Andrea

Abstract

AbstractModeling law search and retrieval as prediction problems has recently emerged as a predominant approach in law intelligence. Focusing on the law article retrieval task, we present a deep learning framework named LamBERTa, which is designed for civil-law codes, and specifically trained on the Italian civil code. To our knowledge, this is the first study proposing an advanced approach to law article prediction for the Italian legal system based on a BERT (Bidirectional Encoder Representations from Transformers) learning framework, which has recently attracted increased attention among deep learning approaches, showing outstanding effectiveness in several natural language processing and learning tasks. We define LamBERTa models by fine-tuning an Italian pre-trained BERT on the Italian civil code or its portions, for law article retrieval as a classification task. One key aspect of our LamBERTa framework is that we conceived it to address an extreme classification scenario, which is characterized by a high number of classes, the few-shot learning problem, and the lack of test query benchmarks for Italian legal prediction tasks. To solve such issues, we define different methods for the unsupervised labeling of the law articles, which can in principle be applied to any law article code system. We provide insights into the explainability and interpretability of our LamBERTa models, and we present an extensive experimental analysis over query sets of different type, for single-label as well as multi-label evaluation tasks. Empirical evidence has shown the effectiveness of LamBERTa, and also its superiority against widely used deep-learning text classifiers and a few-shot learner conceived for an attribute-aware prediction task.

Funder

Università della Calabria

Publisher

Springer Science and Business Media LLC

Subject

Law,Artificial Intelligence

Link

https://link.springer.com/content/pdf/10.1007/s10506-021-09301-8.pdf

Reference64 articles.

1. Aletras N, Tsarapatsanis D, Preotiuc-Pietro D, Lampos V (2016) Predicting judicial decisions of the European court of human rights: a natural language processing perspective. PeerJ Comput Sci 2:e93

2. Bahdanau D, Cho K, Bengio Y (2015) Neural machine translation by jointly learning to align and translate. In: Proc ICLR

3. Bengio S, Dembczynski K, Joachims T, Kloft M, Varma M (2019) Extreme classification. Tech. rep., Report from Dagstuhl Seminar 18291. https://doi.org/10.4230/DagRep.8.7.62

4. Boella G, Caro LD, Humphreys L (2011) Using classification to support legal knowledge engineers in the Eunomos legal document management system. In: Proceedings of workshop on juris-informatics (JURISIN)

5. Branting LK, Yeh AS, Weiss B, Merkhofer EM, Brown B (2017) Inducing predictive models for decision support in administrative adjudication. In: AI approaches to the complexity of legal systems—AICOL workshops 2015-2017, vol 10791, pp 465–477

Cited by 25 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. LAWSUIT: a LArge expert-Written SUmmarization dataset of ITalian constitutional court verdicts;Artificial Intelligence and Law;2024-09-09

2. HUKUKİ METİNLERİN OTOMATİK İŞLENMESİNDE YAPAY ZEKÂ TEKNOLOJİLERİNİN KULLANIMI;Bilişim Hukuku Dergisi;2024-06-30

3. The Explainability of Transformers: Current Status and Directions;Computers;2024-04-04

4. ITALIAN-LEGAL-BERT models for improving natural language processing tasks in the Italian legal domain;Computer Law & Security Review;2024-04

5. Enforcing legal information extraction through context-aware techniques: The ASKE approach;Computer Law & Security Review;2024-04