{"_id":"orBnGGg9heviZab3d","bibbaseid":"znaidia-shabou-leborgne-hudelot-paragios-bagofmultimediawordsforimageclassification-2012","author_short":["Znaidia, A.","Shabou, A.","Le Borgne, H.","Hudelot, C.","Paragios, N."],"bibdata":{"bibtype":"inproceedings","type":"inproceedings","title":"Bag-of-multimedia-words for image classification","author":[{"propositions":[],"lastnames":["Znaidia"],"firstnames":["Amel"],"suffixes":[]},{"propositions":[],"lastnames":["Shabou"],"firstnames":["Aymen"],"suffixes":[]},{"propositions":[],"lastnames":["Le","Borgne"],"firstnames":["Hervé"],"suffixes":[]},{"propositions":[],"lastnames":["Hudelot"],"firstnames":["Céline"],"suffixes":[]},{"propositions":[],"lastnames":["Paragios"],"firstnames":["Nikos"],"suffixes":[]}],"booktitle":"Pattern Recognition (ICPR), 2012 21st International Conference on","pages":"1509–1512","year":"2012","url_pdf":"https://hleborgne.github.io/files/znaidia2012icpr.pdf","abstract":"We introduce the bag-of-multimedia-words model that tightly combines the heterogeneous information coming from the text and the pixel-based information of a multimedia document. The proposed multimedia feature generation process is generic for any multi-modality and aims at enriching a multimedia document description with compact and discriminative signatures well appropriate to linear classifiers. It is evaluated on the Pascal VOC 2007 classification challenge, outperforming the state-of-the-art bag-of-visual-words or bag-of-tag-words based classification approaches.","keywords":"vision-language","bibtex":"@inproceedings{znaidia2012icpr,\n title = {Bag-of-multimedia-words for image classification},\n author = {Znaidia, Amel and Shabou, Aymen and Le Borgne, Herv{\\'e} and Hudelot, C{\\'e}line and Paragios, Nikos},\n booktitle = {Pattern Recognition (ICPR), 2012 21st International Conference on},\n pages = {1509--1512},\n year = {2012},\n url_PDF = {https://hleborgne.github.io/files/znaidia2012icpr.pdf},\n abstract = {We introduce the bag-of-multimedia-words model that tightly combines the heterogeneous information coming from the text and the pixel-based information of a multimedia document. The proposed multimedia feature generation process is generic for any multi-modality and aims at enriching a multimedia document description with compact and discriminative signatures well appropriate to linear classifiers. It is evaluated on the Pascal VOC 2007 classification challenge, outperforming the state-of-the-art bag-of-visual-words or bag-of-tag-words based classification approaches.},\n keywords = {vision-language}\n}\n\n","author_short":["Znaidia, A.","Shabou, A.","Le Borgne, H.","Hudelot, C.","Paragios, N."],"key":"znaidia2012icpr","id":"znaidia2012icpr","bibbaseid":"znaidia-shabou-leborgne-hudelot-paragios-bagofmultimediawordsforimageclassification-2012","role":"author","urls":{" pdf":"https://hleborgne.github.io/files/znaidia2012icpr.pdf"},"keyword":["vision-language"],"metadata":{"authorlinks":{}},"downloads":1,"html":""},"bibtype":"inproceedings","biburl":"https://hleborgne.github.io/files/hleborgne-publications.bib","dataSources":["sJzmxoNKfHCgQoayi"],"keywords":["vision-language"],"search_terms":["bag","multimedia","words","image","classification","znaidia","shabou","le borgne","hudelot","paragios"],"title":"Bag-of-multimedia-words for image classification","year":2012,"downloads":1}