What are Links in Linked Open Data? A Characterization and Evaluation of Links between Knowledge Graphs on the Web. Haller, A., Fernández, J. D., Kamdar, M. R., & Polleres, A. ACM Journal of Data and Information Quality (JDIQ), 2(2):1–-34, May, 2020. Pre-print available at ˘rlhttps://epub.wu.ac.at/7193/
doi  abstract   bibtex   
Linked Open Data promises to provide guiding principles to publish interlinked knowledge graphs on the Web in the form of findable, accessible, interoperable and reusable datasets. We argue that while as such, Linked Data may be viewed as a basis for instantiating the FAIR principles, there are still a number of open issues that cause significant data quality issues even when knowledge graphs are published as Linked Data. Firstly, in order to define boundaries of single coherent knowledge graphs within Linked Data, a principled notion of what a dataset is, or, respectively, what links within and between datasets are, has been missing. Secondly, we argue that in order to enable FAIR knowledge graphs, Linked Data misses standardised findability and accessability mechanism, via a single entry link. In order to address the first issue, we (i) propose a rigorous definition of a naming authority for a Linked Data dataset (ii) define different link types for data in Linked datasets, (iii) provide an empirical analysis of linkage among the datasets of the Linked Open Data cloud, and (iv) analyse the dereferenceability of those links. We base our analyses and link computations on a scalable mechanism implemented on top of the HDT format, which allows us to analyse quantity and quality of different link types at scale.
@article{hall-etal-2020JDIQ,
 author = {Armin Haller and Javier D. Fern{\'a}ndez and Maulik R. Kamdar and Axel Polleres},
 title = {What are Links in Linked Open Data? A Characterization and Evaluation of Links between Knowledge Graphs on the Web},
 journal = {ACM Journal of Data and Information Quality (JDIQ)},
 note = {Pre-print available at \url{https://epub.wu.ac.at/7193/}},
 abstract = {Linked Open Data promises to provide guiding principles to publish interlinked knowledge graphs on the Web in the form of findable, accessible, interoperable and reusable datasets. We argue that while as such, Linked Data may be viewed as a basis for instantiating the FAIR principles, there are still a number of open issues that cause significant data quality issues even when knowledge graphs are published as Linked Data. Firstly, in order to define boundaries of single coherent knowledge graphs within Linked Data, a principled notion of what a dataset is, or, respectively, what links within and between datasets are, has been missing. Secondly, we argue that in order to enable FAIR knowledge graphs, Linked Data misses standardised findability and accessability mechanism, via a single entry link. In order to address the first issue, we (i) propose a rigorous definition of a naming authority for a Linked Data dataset (ii) define different link types for data in Linked datasets, (iii) provide an empirical analysis of linkage among the datasets of the Linked Open Data cloud, and (iv) analyse the dereferenceability of those links. We base our analyses and link computations on a scalable mechanism implemented on top of the HDT format, which allows us to analyse quantity and quality of different link types at scale.},
 year = 2020,
 volume = 2,
 number = 2,
 pages = {1–-34},
 month = May,
 doi = {10.1145/3369875}
}

Downloads: 0