Estimating perception of scene layout properties from global image features. Ross, M. G. & Oliva, A. Journal of Vision, January, 2010. PMID: 21216762
Estimating perception of scene layout properties from global image features [link]Paper  doi  abstract   bibtex   
The relationship between image features and scene structure is central to the study of human visual perception and computer vision, but many of the specifics of real-world layout perception remain unknown. We do not know which image features are relevant to perceiving layout properties, or whether those features provide the same information for every type of image. Furthermore, we do not know the spatial resolutions required for perceiving different properties. This paper describes an experiment and a computational model that provides new insights on these issues. Humans perceive the global spatial layout properties such as dominant depth, openness, and perspective, from a single image. This work describes an algorithm that reliably predicts human layout judgments. This model's predictions are general, not specific to the observers it trained on. Analysis reveals that the optimal spatial resolutions for determining layout vary with the content of the space and the property being estimated. Openness is best estimated at high resolution, depth is best estimated at medium resolution, and perspective is best estimated at low resolution. Given the reliability and simplicity of estimating the global layout of real-world environments, this model could help resolve perceptual ambiguities encountered by more detailed scene reconstruction schemas.
@article{ ross_estimating_2010,
  title = {Estimating perception of scene layout properties from global image features},
  volume = {10},
  issn = {, 1534-7362},
  url = {http://www.journalofvision.org/content/10/1/2},
  doi = {10.1167/10.1.2},
  abstract = {The relationship between image features and scene structure is central to the study of human visual perception and computer vision, but many of the specifics of real-world layout perception remain unknown. We do not know which image features are relevant to perceiving layout properties, or whether those features provide the same information for every type of image. Furthermore, we do not know the spatial resolutions required for perceiving different properties. This paper describes an experiment and a computational model that provides new insights on these issues. Humans perceive the global spatial layout properties such as dominant depth, openness, and perspective, from a single image. This work describes an algorithm that reliably predicts human layout judgments. This model's predictions are general, not specific to the observers it trained on. Analysis reveals that the optimal spatial resolutions for determining layout vary with the content of the space and the property being estimated. Openness is best estimated at high resolution, depth is best estimated at medium resolution, and perspective is best estimated at low resolution. Given the reliability and simplicity of estimating the global layout of real-world environments, this model could help resolve perceptual ambiguities encountered by more detailed scene reconstruction schemas.},
  language = {en},
  number = {1},
  urldate = {2013-06-13},
  journal = {Journal of Vision},
  author = {Ross, Michael G. and Oliva, Aude},
  month = {January},
  year = {2010},
  note = {{PMID:} 21216762},
  keywords = {computational modeling, depth, space and scene perception, structure of natural images}
}

Downloads: 0