Affiliation:
1. NUI Galway, Ireland
2. DCC, University of Chile, Chile
Abstract
Access to hundreds of knowledge bases has been made available on the Web through SPARQL endpoints. Unfortunately, few endpoints publish descriptions of their content. It is thus unclear how agents can learn about the content of a given endpoint. This research investigates the feasibility of a system that gathers information about public endpoints by querying directly about their own content. It would thus be feasible to build a centralised catalogue describing the content indexed by individual endpoints by issuing them SPARQL 1.1 queries; this catalogue could be searched and queried by agents looking for endpoints with content they are interested in. However, the coverage of the catalogue is bounded by the limitations of public endpoints themselves: some may not support SPARQL 1.1, some may return partial responses, some may throw exceptions for expensive aggregate queries, etc. The goal is twofold: 1) using VoID as a bar, to empirically investigate the extent to which endpoints can describe their own content, and 2) to build and analyse the capabilities of an online catalogue.
Reference55 articles.
1. Profiling and mining RDF data with ProLOD++
2. ANAPSID: An Adaptive Query Processing Engine for SPARQL Endpoints
3. Akar, Z., Halaç, T. G., Ekinci, E. E., & Dikenelli, O. (2012). Querying the Web of Interlinked Datasets using VOID Descriptions. In Linked Data On the Web (LDOW). CEUR.
4. Alexander, K., Cyganiak, R., Hausenblas, M., & Zhao, J. (2009). Describing Linked Datasets. In Linked Data On the Web (LDOW). CEUR.
5. Auer, S., Demter, J., Martin, M., & Lehmann, J. (2012). {LODStats} -- An Extensible Framework for High-Performance Dataset Analytics. In Knowledge Engineering and Knowledge Management ({EKAW}) (pp. 353–362). Springer.