Data Summaries for On-demand Queries over Linked Data. Harth, A., Hose, K., Karnstedt, M., Polleres, A., Sattler, K., & Umbrich, J. In Proceedings of the 19th World Wide Web Conference (WWW2010), Raleigh, NC, USA, April, 2010. Technical report version available at ˘rlhttp://polleres.net/publications/DERI-TR-2009-11-17.pdf
Data Summaries for On-demand Queries over Linked Data [pdf]Paper  abstract   bibtex   
Typical approaches for search and querying over structured Web Data collect (crawl) and pre-process (index) large amounts of data before allowing for query answering in a central data warehouse. This time-consuming pre-processing phase decreases the freshness of query results and only uses to a limited degree the benefits of Linked Data where structured data is accessible live and up-to-date at distributed Web resources that may change constantly. An ideal query answering system for Linked Data should return always current answers in a reasonable amount of time, even on corpora as large as the web. Query processors evaluating queries directly on the life sources require knowledge of the contents of data sources. In the current paper we develop and evaluate a probabilistic index structure for covering graph-structured content of sources adhering to Linked Data principles, provide an algorithm for answering conjunctive queries over Linked Data on the web exploiting this structure, and evaluate the system using synthetically generated queries. We find that our lightweight index structure enable more complete query results over Linked Data compared to direct lookup approaches, while keeping the overhead for additional lookups and index maintenance low.

Downloads: 0