Parser Extraction of Triples in Unstructured Text-Reference-Cited by-同舟云学术

Parser Extraction of Triples in Unstructured Text

Published:2017-02-13 Issue:4 Volume:5 Page:143
ISSN:2252-8938
Container-title:IAES International Journal of Artificial Intelligence (IJ-AI)
language:
Short-container-title:IJ-AI

Author:

D'Souza Shaun

Abstract

The web contains vast repositories of unstructured text. We investigate the opportunity for building a knowledge graph from these text sources. We generate a set of triples which can be used in knowledge gathering and integration. We define the architecture of a language compiler for processing subject-predicate-object triples using the OpenNLP parser. We implement a depth-first search traversal on the POS tagged syntactic tree appending predicate and object information. A parser enables higher precision and higher recall extractions of syntactic relationships across conjunction boundaries. We are able to extract 2-2.5 times the correct extractions of ReVerb. The extractions are used in a variety of semantic web applications and question answering. We verify extraction of 50,000 triples on the ClueWeb dataset.

Publisher

Institute of Advanced Engineering and Science

Subject

Electrical and Electronic Engineering,Artificial Intelligence,Information Systems and Management,Control and Systems Engineering

Link

http://ijai.iaescore.com/index.php/IJAI/article/viewFile/6102/pdf

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Knowledge mining of unstructured information: application to cyber domain;Scientific Reports;2023-01-31

2. A System for Converting and Recovering Texts Managed as Structured Information;Scientific Reports;2022-12-23

3. Evolving System Bottlenecks in the as a Service Cloud;SSRN Electronic Journal;2018

4. LSTM Neural Network for Textual Ngrams;SSRN Electronic Journal;2018