SentiCap: Generating Image Descriptions with Sentiments. Mathews, A., Xie, L., & He, X. In Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16), Phoenix, Arizona USA, 2016.
Abstract

The recent progress on image recognition and language modeling is making automatic description of image content a reality. However, stylized, non-factual aspects of the written description are missing from the current systems. One such style is descriptions with emotions, which is commonplace in everyday communication, and influences decision-making and interpersonal relationships. We design a system to describe an image with emotions, and present a model that automatically generates captions with positive or negative sentiments. We propose a novel switching recurrent neural network with word-level regularization, which is able to produce emotional image captions using only 2000+ training sentences containing sentiments. We evaluate the captions with different automatic and crowd-sourcing metrics. Our model compares favourably in common quality metrics for image captioning. In 84.6% of cases the generated positive captions were judged as being at least as descriptive as the factual captions. Of these positive captions 88% were confirmed by the crowd-sourced workers as having the appropriate sentiment.
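The switching recurrent network can be pictured as two parallel caption RNNs, one trained on factual captions and one on the small sentiment-bearing caption set, whose word distributions are mixed at every time step by a learned switch variable. The sketch below (PyTorch) illustrates that word-level mixing idea only; it is not the authors' implementation, and all module names, dimensions, the sigmoid gate, and the image-conditioned initial state are assumptions made for the example. The paper's word-level regularization of the switch is omitted here.

# Illustrative sketch, not the SentiCap code: two caption RNN streams mixed
# word by word through a learned switch probability.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SwitchingCaptionDecoder(nn.Module):
    def __init__(self, vocab_size, embed_dim=256, hidden_dim=512, image_dim=2048):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.img_proj = nn.Linear(image_dim, hidden_dim)
        # Two parallel RNN streams: a factual one and a sentiment one.
        self.factual_rnn = nn.LSTMCell(embed_dim, hidden_dim)
        self.sentiment_rnn = nn.LSTMCell(embed_dim, hidden_dim)
        self.factual_out = nn.Linear(hidden_dim, vocab_size)
        self.sentiment_out = nn.Linear(hidden_dim, vocab_size)
        # Word-level switch: per time step, the probability that the next
        # word comes from the sentiment stream rather than the factual one.
        self.switch = nn.Linear(2 * hidden_dim, 1)

    def forward(self, image_feat, captions):
        B, T = captions.shape
        h0 = torch.tanh(self.img_proj(image_feat))
        h_f, c_f, h_s, c_s = h0, h0, h0, h0
        emb = self.embed(captions)                    # (B, T, embed_dim)
        word_logprobs, switch_probs = [], []
        for t in range(T):
            h_f, c_f = self.factual_rnn(emb[:, t], (h_f, c_f))
            h_s, c_s = self.sentiment_rnn(emb[:, t], (h_s, c_s))
            gamma = torch.sigmoid(self.switch(torch.cat([h_f, h_s], dim=1)))  # (B, 1)
            p_f = F.softmax(self.factual_out(h_f), dim=1)
            p_s = F.softmax(self.sentiment_out(h_s), dim=1)
            # Mixture over the vocabulary; gamma acts as the word-level switch.
            word_logprobs.append(torch.log((1 - gamma) * p_f + gamma * p_s + 1e-8))
            switch_probs.append(gamma)
        return torch.stack(word_logprobs, dim=1), torch.stack(switch_probs, dim=1)

In this reading, training the sentiment stream on the roughly 2000 sentiment-bearing sentences while reusing a factual stream trained on large caption corpora is what lets the model inject sentiment without losing descriptiveness; the word-level regularization in the paper additionally guides when the switch should fire.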
@inproceedings{mathews2016senticap,
title={{SentiCap: Generating Image Descriptions with Sentiments}},
author={Mathews, Alexander and Xie, Lexing and He, Xuming},
booktitle={Thirtieth {AAAI} Conference on Artificial Intelligence ({AAAI-16})},
url_Abstract={http://arxiv.org/abs/1510.01431},
url_Paper={http://arxiv.org/pdf/1510.01431v2.pdf},
url_Slides={http://cm.cecs.anu.edu.au/documents/senticap_slides.pdf},
address={Phoenix, Arizona USA},
year={2016},
abstract={The recent progress on image recognition and language modeling is making automatic description of image content a reality. However, stylized, non-factual aspects of the written description are missing from the current systems. One such style is descriptions with emotions, which is commonplace in everyday communication, and influences decision-making and interpersonal relationships. We design a system to describe an image with emotions, and present a model that automatically generates captions with positive or negative sentiments. We propose a novel switching recurrent neural network with word-level regularization, which is able to produce emotional image captions using only 2000+ training sentences containing sentiments. We evaluate the captions with different automatic and crowd-sourcing metrics. Our model compares favourably in common quality metrics for image captioning. In 84.6\% of cases the generated positive captions were judged as being at least as descriptive as the factual captions. Of these positive captions 88\% were confirmed by the crowd-sourced workers as having the appropriate sentiment.},
}