Audio-Visual Integration in Stereoscopic 3D. Deas, L., Wilcox, L. M., Kazimi, A., & Allison, R. S. In Proceedings of the ACM Symposium on Applied Perception, Dublin, Ireland, pages 83-89, 09, 2013. Paper -1 -2 doi abstract bibtex The perception of synchronous, intelligible, speech is fundamental to a high-quality modern cinema experience. Surprisingly, this issue has remained relatively unexplored in stereoscopic 3D (S3D) media, despite its increasing popularity. Instead, visual parameters have been the primary focus of concern for those who create, and those who study the impact of, S3D content. In the work presented here we ask if ability to integrate audio and visual information is influenced by adding the third dimension to film. We also investigate the effects of known visual parameters (horizontal and vertical parallax), on audio-visual integration. To this end, we use an illusion of speech processing known as the McGurk effect as an objective measure of multi-modal integration. In the classic (2D) version of this phenomenon, discrepant auditory (/ba/) and visual (/ga/) information typically results in the perception of a unique `fusion' syllable (e.g. /da/). We extended this paradigm to measure the McGurk effect in a small theatre. We varied the horizontal (IA: 0, 6, 12, 18, 24 mm) and vertical (0, 0.5, 0.75, 1 deg) parallax from trial-to-trial and asked observers to report their percept of the phoneme. Our results show a consistently high proportion of the expected fusion responses, with no effect of horizontal or vertical offsets. These data are the first to show that the McGurk effect extends to stereoscopic stimuli and is not a phenomenon isolated to 2D media perception. Furthermore, the results show that audiences can tolerate a high level of both horizontal and vertical disparity and maintain veridical speech perception. We consider these results in terms of current stereoscopic filmmaking recommendations and practices.
@inproceedings{Deas:2013kx,
abstract = {The perception of synchronous, intelligible, speech is fundamental to a high-quality modern cinema experience. Surprisingly, this issue has remained relatively unexplored in stereoscopic 3D (S3D) media, despite its increasing popularity. Instead, visual parameters have been the primary focus of concern for those who create, and those who study the impact of, S3D content. In the work presented here we ask if ability to integrate audio and visual information is influenced by adding the third dimension to film. We also investigate the effects of known visual parameters (horizontal and vertical parallax), on audio-visual integration. To this end, we use an illusion of speech processing known as the McGurk effect as an objective measure of multi-modal integration. In the classic (2D) version of this phenomenon, discrepant auditory (/ba/) and visual (/ga/) information typically results in the perception of a unique `fusion' syllable (e.g. /da/). We extended this paradigm to measure the McGurk effect in a small theatre. We varied the horizontal (IA: 0, 6, 12, 18, 24 mm) and vertical (0, 0.5, 0.75, 1 deg) parallax from trial-to-trial and asked observers to report their percept of the phoneme. Our results show a consistently high proportion of the expected fusion responses, with no effect of horizontal or vertical offsets. These data are the first to show that the McGurk effect extends to stereoscopic stimuli and is not a phenomenon isolated to 2D media perception. Furthermore, the results show that audiences can tolerate a high level of both horizontal and vertical disparity and maintain veridical speech perception. We consider these results in terms of current stereoscopic filmmaking recommendations and practices.
},
annote = {Dublin, Sept 2013},
author = {Deas, L. and Wilcox, L. M. and Kazimi, A. and Allison, R. S.},
booktitle = {Proceedings of the ACM Symposium on Applied Perception, Dublin, Ireland},
date-added = {2013-06-12 23:20:52 +0000},
date-modified = {2019-02-03 09:36:53 -0500},
doi = {10.1145/2492494.2492506},
keywords = {Stereopsis},
month = {09},
pages = {83-89},
title = {Audio-Visual Integration in Stereoscopic 3D},
url = {http://percept.eecs.yorku.ca/papers/p83-deas.pdf},
year = {2013},
url-1 = {http://percept.eecs.yorku.ca/papers/p83-deas.pdf},
url-2 = {https://doi.org/10.1145/2492494.2492506}}
Downloads: 0
{"_id":{"_str":"5251d068c79ac1d32b000081"},"__v":2,"authorIDs":["5458111c2abc8e9f37000a4d","5e596c1656d60ade0100014f","vnY8GQ5AKXHNi7dqd"],"author_short":["Deas, L.","Wilcox, L. M.","Kazimi, A.","Allison, R. S."],"bibbaseid":"deas-wilcox-kazimi-allison-audiovisualintegrationinstereoscopic3d-2013","bibdata":{"bibtype":"inproceedings","type":"inproceedings","abstract":"The perception of synchronous, intelligible, speech is fundamental to a high-quality modern cinema experience. Surprisingly, this issue has remained relatively unexplored in stereoscopic 3D (S3D) media, despite its increasing popularity. Instead, visual parameters have been the primary focus of concern for those who create, and those who study the impact of, S3D content. In the work presented here we ask if ability to integrate audio and visual information is influenced by adding the third dimension to film. We also investigate the effects of known visual parameters (horizontal and vertical parallax), on audio-visual integration. To this end, we use an illusion of speech processing known as the McGurk effect as an objective measure of multi-modal integration. In the classic (2D) version of this phenomenon, discrepant auditory (/ba/) and visual (/ga/) information typically results in the perception of a unique `fusion' syllable (e.g. /da/). We extended this paradigm to measure the McGurk effect in a small theatre. We varied the horizontal (IA: 0, 6, 12, 18, 24 mm) and vertical (0, 0.5, 0.75, 1 deg) parallax from trial-to-trial and asked observers to report their percept of the phoneme. Our results show a consistently high proportion of the expected fusion responses, with no effect of horizontal or vertical offsets. These data are the first to show that the McGurk effect extends to stereoscopic stimuli and is not a phenomenon isolated to 2D media perception. Furthermore, the results show that audiences can tolerate a high level of both horizontal and vertical disparity and maintain veridical speech perception. We consider these results in terms of current stereoscopic filmmaking recommendations and practices. ","annote":"Dublin, Sept 2013","author":[{"propositions":[],"lastnames":["Deas"],"firstnames":["L."],"suffixes":[]},{"propositions":[],"lastnames":["Wilcox"],"firstnames":["L.","M."],"suffixes":[]},{"propositions":[],"lastnames":["Kazimi"],"firstnames":["A."],"suffixes":[]},{"propositions":[],"lastnames":["Allison"],"firstnames":["R.","S."],"suffixes":[]}],"booktitle":"Proceedings of the ACM Symposium on Applied Perception, Dublin, Ireland","date-added":"2013-06-12 23:20:52 +0000","date-modified":"2019-02-03 09:36:53 -0500","doi":"10.1145/2492494.2492506","keywords":"Stereopsis","month":"09","pages":"83-89","title":"Audio-Visual Integration in Stereoscopic 3D","url":"http://percept.eecs.yorku.ca/papers/p83-deas.pdf","year":"2013","url-1":"http://percept.eecs.yorku.ca/papers/p83-deas.pdf","url-2":"https://doi.org/10.1145/2492494.2492506","bibtex":"@inproceedings{Deas:2013kx,\n\tabstract = {The perception of synchronous, intelligible, speech is fundamental to a high-quality modern cinema experience. Surprisingly, this issue has remained relatively unexplored in stereoscopic 3D (S3D) media, despite its increasing popularity. Instead, visual parameters have been the primary focus of concern for those who create, and those who study the impact of, S3D content. In the work presented here we ask if ability to integrate audio and visual information is influenced by adding the third dimension to film. We also investigate the effects of known visual parameters (horizontal and vertical parallax), on audio-visual integration. To this end, we use an illusion of speech processing known as the McGurk effect as an objective measure of multi-modal integration. In the classic (2D) version of this phenomenon, discrepant auditory (/ba/) and visual (/ga/) information typically results in the perception of a unique `fusion' syllable (e.g. /da/). We extended this paradigm to measure the McGurk effect in a small theatre. We varied the horizontal (IA: 0, 6, 12, 18, 24 mm) and vertical (0, 0.5, 0.75, 1 deg) parallax from trial-to-trial and asked observers to report their percept of the phoneme. Our results show a consistently high proportion of the expected fusion responses, with no effect of horizontal or vertical offsets. These data are the first to show that the McGurk effect extends to stereoscopic stimuli and is not a phenomenon isolated to 2D media perception. Furthermore, the results show that audiences can tolerate a high level of both horizontal and vertical disparity and maintain veridical speech perception. We consider these results in terms of current stereoscopic filmmaking recommendations and practices.\n},\n\tannote = {Dublin, Sept 2013},\n\tauthor = {Deas, L. and Wilcox, L. M. and Kazimi, A. and Allison, R. S.},\n\tbooktitle = {Proceedings of the ACM Symposium on Applied Perception, Dublin, Ireland},\n\tdate-added = {2013-06-12 23:20:52 +0000},\n\tdate-modified = {2019-02-03 09:36:53 -0500},\n\tdoi = {10.1145/2492494.2492506},\n\tkeywords = {Stereopsis},\n\tmonth = {09},\n\tpages = {83-89},\n\ttitle = {Audio-Visual Integration in Stereoscopic 3D},\n\turl = {http://percept.eecs.yorku.ca/papers/p83-deas.pdf},\n\tyear = {2013},\n\turl-1 = {http://percept.eecs.yorku.ca/papers/p83-deas.pdf},\n\turl-2 = {https://doi.org/10.1145/2492494.2492506}}\n\n\n\n","author_short":["Deas, L.","Wilcox, L. M.","Kazimi, A.","Allison, R. S."],"key":"Deas:2013kx","id":"Deas:2013kx","bibbaseid":"deas-wilcox-kazimi-allison-audiovisualintegrationinstereoscopic3d-2013","role":"author","urls":{"Paper":"http://percept.eecs.yorku.ca/papers/p83-deas.pdf","-1":"http://percept.eecs.yorku.ca/papers/p83-deas.pdf","-2":"https://doi.org/10.1145/2492494.2492506"},"keyword":["Stereopsis"],"metadata":{"authorlinks":{"allison, r":"https://percept.eecs.yorku.ca/bibase%20pubs.shtml"}},"downloads":0},"bibtype":"inproceedings","biburl":"https://bibbase.org/network/files/ibWG96BS4w7ibooE9","downloads":0,"keywords":["stereopsis"],"search_terms":["audio","visual","integration","stereoscopic","deas","wilcox","kazimi","allison"],"title":"Audio-Visual Integration in Stereoscopic 3D","year":2013,"dataSources":["kmmXSosvtyJQxBtzs","BPKPSXjrbMGteC59J","MpMK4SvZzj5Fww5vJ","YbBWRH5Fc7xRr8ghk","szZaibkmSiiQBFQG8","DoyrDTpJ7HHCtki3q","JaoxzeTFRfvwgLoCW","XKwRm5Lx8Z9bzSzaP","AELuRZBpnp7nRDaqw"]}