Extraction of time-related expressions using text mining with application to Hebrew-Reference-Cited by-同舟云学术

Extraction of time-related expressions using text mining with application to Hebrew

Published:2024-02-23 Issue:2 Volume:19 Page:e0293196
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Mughaz Dror^ORCID,HaCohen-Kerner Yaakov,Gabbay Dov

Abstract

In this research, we extract time-related expressions from a rabbinic text in a semi-automatic manner. These expressions usually appear next to rabbinic references (name / nickname / acronym / book-name). The first step toward our goal is to find all the expressions near references in the corpus. However, not all of the phrases around the references are time-related expressions. Therefore, these phrases are initially considered to be potential time-related expressions. To extract the time-related expressions, we formulate two new statistical functions, and we use screening and heuristic methods. We tested these statistical functions, grammatical screenings, and heuristic methods on a corpus containing responsa documents. In this corpus, many rabbinic citations are known and marked. The statistical functions and the screening methods filtered the potential time-related expressions and reduced 99.88% of the initial expressions (from 484,681 to 575).

Publisher

Public Library of Science (PLoS)

Reference59 articles.

1. Automatic extraction and learning of keyphrases from scientific articles;Y. HaCohen-Kerner;Lecture Notes in Computer Science,2005

2. Words, patterns, and documents: Experiments in machine learning and text analysis.;S. Argamon;Digital Humanities Quarterly,2009

3. Text Mining for Evaluating Authors’ Birth and Death Years.;D. Moghaz;ACM Transactions on Knowledge Discovery from Data (TKDD),2019

4. Computing Education Research Landscape through an Analysis of Keywords.;Z. Papamitsiou;In Proceedings of the 2020 ACM Conference on International Computing Education Research,2020