Learning Domain Ontologies from Document Warehouses and Dedicated Web Sites

Published: 2004-06 Issue: 2 Volume: 30 Page: 151-179
ISSN: 0891-2017
Container-title: Computational Linguistics
Language: en
Short-container-title: Computational Linguistics

Author:

Navigli Roberto¹, Velardi Paola²

Affiliation:

1. Dipartimento di Informatica, Università di Roma “La Sapienza,” Via Salaria, 113-00198 Roma, Italia.

2. Università di Roma “La Sapienza”

Abstract

We present a method and a tool, OntoLearn, aimed at the extraction of domain ontologies from Web sites, and more generally from documents shared among the members of virtual organizations. OntoLearn first extracts a domain terminology from available documents. Then, complex domain terms are semantically interpreted and arranged in a hierarchical fashion. Finally, a general-purpose ontology, WordNet, is trimmed and enriched with the detected domain concepts. The major novel aspect of this approach is semantic interpretation, that is, the association of a complex concept with a complex term. This involves finding the appropriate WordNet concept for each word of a terminological string and the appropriate conceptual relations that hold among the concept components. Semantic interpretation is based on a new word sense disambiguation algorithm, called structural semantic interconnections.

Publisher

MIT Press - Journals

Subject

Artificial Intelligence, Computer Science Applications, Linguistics and Language, Language and Linguistics

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/089120104323093276

References: 6 articles.

1. An empirical symbolic approach to natural language processing

2. Automatic Labeling of Semantic Roles

3. Ontology learning for the Semantic Web

4. Ontology learning and its application to automated terminology translation

5. Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus

Cited by 188 articles.

1. How to classify domain entities into top-level ontology concepts using large language models; Applied Ontology; 2024-07-03

2. A three-dimensional model of semantic search: queries, resources, and results; PROBLEMS IN PROGRAMMING; 2023-12

3. ONTOLOGY DEVELOPMENT FOR GREEN BUILDING BY USING A SEMI-AUTOMATIC METHOD; Journal of Green Building; 2023-12-01

4. An Artificial Intelligence-Based Model for Knowledge Evaluation and Integration in Public Organizations; Applied Sciences; 2023-10-28

5. An AI-Enhanced Process Mining Framework for Software Process Insights; 2023 15th International Conference on Knowledge and Systems Engineering (KSE); 2023-10-18