Some entities are more equal than others: statistical methods to consolidate Linked Data. Hogan, A., Polleres, A., Umbrich, J., & Zimmermann, A. In Workshop on New Forms of Reasoning for the Semantic Web: Scalable & Dynamic (NeFoRS10), Heraklion, Greece, May, 2010.
Some entities are more equal than others: statistical methods to consolidate Linked Data [pdf]Paper  abstract   bibtex   
We propose a method for consolidating entities in RDF data on the Web. Our approach is based on a statistical analysis of the use of predicates and their associated values to identify ``quasi''-key properties. Compared to a purely symbolic based approach, we obtain promising results, retrieving more identical entities with a high precision. We also argue that our technique scales well – possibly to the size of the current Web of Data – as opposed to more expensive existing approaches.

Downloads: 0