The STIRData Approach to Interoperability of European Company High-Value Datasets
-
Published:2024-04-01
Issue:4
Volume:5
Page:
-
ISSN:2661-8907
-
Container-title:SN Computer Science
-
language:en
-
Short-container-title:SN COMPUT. SCI.
Author:
Klímek JakubORCID, Chortaras Alexandros, Míšek Jakub, Yang Jim J., Skagemo Steinar, Tzouvaras Vassilis
Abstract
AbstractThe European Commission has published a list of high-value datasets (HVDs) that public sector bodies must make available as open data as part of the Open Data Directive. One of the HVD topics is company data. Although the HVD description contains items that must be included in these datasets, it does not prescribe any technical means of how the data should be published. This is a major obstacle to the interoperability of the datasets once they are published. In this extended paper, we elaborate on the results of STIRData, a project co-financed by the Connecting Europe Facility Programme of the European Union, focusing on various aspects of data interoperability of open data from business registries, covering the company data HVDs topic. These aspects include the semantic, technical, and legal interoperability of this data. The results include a data architecture and a data specification to make the published data technically and semantically interoperable. In addition, we present basic legal interoperability guidelines to ensure legal interoperability of the published data, which is a topic often neglected by technically focused data experts. The project results include proof-of-concept transformations of data from selected European business registries using open source tools and in accordance with the data specification. Moreover, a user-orientated platform for browsing and analysing the data is presented as an example of the possibilities of using the data published in an interoperable way. Finally, we present an example of how compliant data can be processed by data experts for further analysis.
Funder
Connecting Europe Facility Charles University
Publisher
Springer Science and Business Media LLC
Reference14 articles.
1. Klímek J, et al. Semantic, Technical and Legal Interoperability of European Company Open Data in Practice: The STIRData Approach. In: Gusikhin O, Hammoudi S, Cuzzocrea A, editors., et al., Proceedings of the 12th International Conference on Data Science, Technology and Applications, DATA 2023, Rome, Italy, July 11-13, 2023. SCITEPRESS; 2023. p. 183–94. 2. Lanthaler M, Wood D, Cyganiak R. RDF 1.1 Concepts and Abstract Syntax. W3C Recommendation, W3C; 2014. https://www.w3.org/TR/2014/REC-rdf11-concepts-20140225/. 3. Harris S, Seaborne A. SPARQL 1.1 Query Language. W3C Recommendation, W3C; 2013. https://www.w3.org/TR/2013/REC-sparql11-query-20130321/. 4. Chortaras A, Stamou G, Berners-Lee T, et al. D2RML: Integrating Heterogeneous Data and Web Services into Custom RDF Graphs. In: Berners-Lee T, et al. (eds) Workshop on Linked Data on the Web co-located with The Web Conference 2018, LDOW@WWW 2018, Lyon, France April 23rd, 2018, Vol. 2073 of CEUR Workshop Proceedings (CEUR-WS.org, 2018). http://ceur-ws.org/Vol-2073/article-07.pdf. 5. Klímek J, Škoda P, Indrawan-Santiago M, Steinbauer M, Salvadori IL, Khalil I, Anderst-Kotsis G. LinkedPipes ETL in use: practical publication and consumption of linked data. In: Indrawan-Santiago M, Steinbauer M, Salvadori IL, Khalil I, Anderst-Kotsis G, editors. Proceedings of the 19th International Conference on Information Integration and Web-based Applications & Services, iiWAS 2017, Salzburg, Austria, December 4-6, 2017. ACM; 2017. p. 441–5. https://doi.org/10.1145/3151759.3151809.
|
|