Picture Tags and World Knowledge: Learning Tag Relations from Visual Semantic Sources. Xie, L. & He, X. In Proceedings of the 21st ACM International Conference on Multimedia, of MM '13, pages 967–976, New York, NY, USA, 2013. ACM.
Picture Tags and World Knowledge: Learning Tag Relations from Visual Semantic Sources [link]Paper  Picture Tags and World Knowledge: Learning Tag Relations from Visual Semantic Sources [pdf]Paper  Picture Tags and World Knowledge: Learning Tag Relations from Visual Semantic Sources [pdf]Slides  Picture Tags and World Knowledge: Learning Tag Relations from Visual Semantic Sources [link]Page  doi  abstract   bibtex   
This paper studies the use of everyday words to describe images. The common saying has it that 'a picture is worth a thousand words', here we ask which thousand? The proliferation of tagged social multimedia data presents a challenge to understanding collective tag-use at large scale – one can ask if patterns from photo tags help understand tag-tag relations, and how it can be leveraged to improve visual search and recognition. We propose a new method to jointly analyze three distinct visual knowledge resources: Flickr, ImageNet/WordNet, and ConceptNet. This allows us to quantify the visual relevance of both tags learn their relationships. We propose a novel network estimation algorithm, Inverse Concept Rank, to infer incomplete tag relationships. We then design an algorithm for image annotation that takes into account both image and tag features. We analyze over 5 million photos with over 20,000 visual tags. The statistics from this collection leads to good results for image tagging, relationship estimation, and generalizing to unseen tags. This is a first step in analyzing picture tags and everyday semantic knowledge. Potential other applications include generating natural language descriptions of pictures, as well as validating and supplementing knowledge databases.

Downloads: 0