Affiliation:
1. INSIGHT Centre for Data Analytics, National University of Ireland, Ireland
2. Center for Semantic Web Research, University of Chile, Chile
Abstract
Access to hundreds of knowledge bases has been made available on the Web through public SPARQL endpoints. Unfortunately, few endpoints publish descriptions of their content (e.g., using VoID). It is thus unclear how agents can learn about the content of a given SPARQL endpoint or, relatedly, find SPARQL endpoints with content relevant to their needs. In this paper, the authors investigate the feasibility of a system that gathers information about public SPARQL endpoints by querying them directly about their own content. With the advent of SPARQL 1.1 and features such as aggregates, it is now possible to specify queries whose results would form a detailed profile of the content of the endpoint, comparable with a large subset of VoID. In theory it would thus be feasible to build a rich centralised catalogue describing the content indexed by individual endpoints by issuing them SPARQL (1.1) queries; this catalogue could then be searched and queried by agents looking for endpoints with content they are interested in. In practice, however, the coverage of the catalogue is bounded by the limitations of public endpoints themselves: some may not support SPARQL 1.1, some may return partial responses, some may throw exceptions for expensive aggregate queries, etc. The authors' goal in this paper is thus twofold: (i) using VoID as a bar, to empirically investigate the extent to which public endpoints can describe their own content, and (ii) to build and analyse the capabilities of a best-effort online catalogue of current endpoints based on the (partial) results collected.
Reference50 articles.
1. Profiling and mining RDF data with ProLOD++
2. ANAPSID: An Adaptive Query Processing Engine for SPARQL Endpoints
3. Akar, Z., Halaç, T. G., Ekinci, E. E., & Dikenelli, O. (2012). Querying the Web of Interlinked Datasets using VOID Descriptions. In Linked Data On the Web (LDOW). CEUR.
4. Alexander, K., Cyganiak, R., Hausenblas, M., & Zhao, J. (2009). Describing Linked Datasets. In Linked Data On the Web (LDOW). CEUR.
5. Auer, S., Demter, J., Martin, M., & Lehmann, J. (2012). LODStats – An Extensible Framework for High-Performance Dataset Analytics. In Knowledge Engineering and Knowledge Management EKAW (pp. 353–362). Springer.