Data Integration for Open Data on the Web. Neumaier, S., Polleres, A., Steyskal, S., & Umbrich, J. In Ianni, G., Lembo, D., Bertossi, L. E., Faber, W., Glimm, B., Gottlob, G., & Staab, S., editors, Reasoning Web. Semantic Interoperability on the Web (Reasoning Web 2017), volume 10370 of Lecture Notes in Computer Science (LNCS), pages 1–28. Springer, London, United Kingdom, July 2017.
In this lecture we will introduce and discuss the challenges of integrating openly available Web data and how to solve them. Firstly, while we will address this topic from the viewpoint of Semantic Web research, not all data is readily available as RDF or Linked Data, so we will give an introduction to different data formats prevalent on the Web, namely, standard formats for publishing and exchanging tabular, tree-shaped, and graph data. Secondly, not all Open Data is really completely open, so we will discuss and address issues around licences and terms of usage associated with Open Data, as well as documentation of data provenance. Thirdly, we will discuss (meta-)data quality issues associated with Open Data on the Web and how Semantic Web techniques and vocabularies can be used to describe and remedy them. Fourthly, we will address issues of searchability and integration of Open Data and discuss to what extent semantic search can help to overcome these. We close by briefly summarizing further issues not covered explicitly herein, such as multi-linguality, temporal aspects (archiving, evolution, temporal querying), as well as how/whether OWL and RDFS reasoning on top of integrated open data could help.
@incollection{neum-etal-RW2017,
	Abstract = {In this lecture we will introduce and discuss the challenges of integrating openly available Web data and how to solve them. Firstly, while we will address this topic from the viewpoint of Semantic Web research, not all data is readily available as RDF or Linked Data, so we will give an introduction to different data formats prevalent on the Web, namely, standard formats for publishing and exchanging tabular, tree-shaped, and graph data. Secondly, not all Open Data is really completely open, so we will discuss and address issues around licences and terms of usage associated with Open Data, as well as documentation of data provenance. Thirdly, we will discuss (meta-)data quality issues associated with Open Data on the Web and how Semantic Web techniques and vocabularies can be used to describe and remedy them. Fourthly, we will address issues of searchability and integration of Open Data and discuss to what extent semantic search can help to overcome these. We close by briefly summarizing further issues not covered explicitly herein, such as multi-linguality, temporal aspects (archiving, evolution, temporal querying), as well as how/whether OWL and RDFS reasoning on top of integrated open data could help.},
	Address = {London, United Kingdom},
	Author = {Sebastian Neumaier and Axel Polleres and Simon Steyskal and J{\"u}rgen Umbrich},
	Booktitle = {Reasoning Web. Semantic Interoperability on the Web ({Reasoning Web 2017})},
	Day = {7--11},
	Month = JUL,
	Publisher = {Springer},
	Series = LNCS,
	Volume = 10370,
	Editor = {Giovambattista Ianni and Domenico Lembo and Leopoldo E. Bertossi and Wolfgang Faber and Birte Glimm and Georg Gottlob and Steffen Staab},
	Pages = {1--28},
	Title = {Data Integration for Open Data on the Web},
	Type = BC,
	Url = {http://www.polleres.net/publications/neum-etal-RW2017.pdf},
	Year = 2017,
}