Deeper depth prediction with fully convolutional residual networks

Deeper depth prediction with fully convolutional residual networks. Laina, I., Rupprecht, C., Belagiannis, V., Tombari, F., & Navab, N. Proceedings - 2016 4th International Conference on 3D Vision, 3DV 2016, 2016.

Paper doi abstract bibtex

This paper addresses the problem of estimating the depth map of a scene given a single RGB image. We propose a fully convolutional architecture, encompassing residual learning, to model the ambiguous mapping between monocular images and depth maps. In order to improve the output resolution, we present a novel way to efficiently learn feature map up-sampling within the network. For optimization, we introduce the reverse Huber loss that is particularly suited for the task at hand and driven by the value distributions commonly present in depth maps. Our model is composed of a single architecture that is trained end-to-end and does not rely on post-processing techniques, such as CRFs or other additional refinement steps. As a result, it runs in real-time on images or videos. In the evaluation, we show that the proposed model contains fewer parameters and requires fewer training data than the current state of the art, while outperforming all approaches on depth estimation. Code and models are publicly available.

@article{
 title = {Deeper depth prediction with fully convolutional residual networks},
 type = {article},
 year = {2016},
 keywords = {CNN,Depth prediction},
 pages = {239-248},
 id = {728305d5-6e7d-3893-a111-d47ab223136a},
 created = {2020-11-24T10:01:25.868Z},
 file_attached = {true},
 profile_id = {235249c2-3ed4-314a-b309-b1ea0330f5d9},
 group_id = {1ff583c0-be37-34fa-9c04-73c69437d354},
 last_modified = {2020-11-25T07:49:20.038Z},
 read = {true},
 starred = {false},
 authored = {false},
 confirmed = {true},
 hidden = {false},
 folder_uuids = {71ca8421-f528-4caf-a342-3c1291372174},
 private_publication = {false},
 abstract = {This paper addresses the problem of estimating the depth map of a scene given a single RGB image. We propose a fully convolutional architecture, encompassing residual learning, to model the ambiguous mapping between monocular images and depth maps. In order to improve the output resolution, we present a novel way to efficiently learn feature map up-sampling within the network. For optimization, we introduce the reverse Huber loss that is particularly suited for the task at hand and driven by the value distributions commonly present in depth maps. Our model is composed of a single architecture that is trained end-to-end and does not rely on post-processing techniques, such as CRFs or other additional refinement steps. As a result, it runs in real-time on images or videos. In the evaluation, we show that the proposed model contains fewer parameters and requires fewer training data than the current state of the art, while outperforming all approaches on depth estimation. Code and models are publicly available.},
 bibtype = {article},
 author = {Laina, Iro and Rupprecht, Christian and Belagiannis, Vasileios and Tombari, Federico and Navab, Nassir},
 doi = {10.1109/3DV.2016.32},
 journal = {Proceedings - 2016 4th International Conference on 3D Vision, 3DV 2016}
}

Downloads: 0

{"_id":"zAC3qJxwvH2QDJciB","bibbaseid":"laina-rupprecht-belagiannis-tombari-navab-deeperdepthpredictionwithfullyconvolutionalresidualnetworks-2016","authorIDs":["5e1a2c999fbdddde010000fc"],"author_short":["Laina, I.","Rupprecht, C.","Belagiannis, V.","Tombari, F.","Navab, N."],"bibdata":{"title":"Deeper depth prediction with fully convolutional residual networks","type":"article","year":"2016","keywords":"CNN,Depth prediction","pages":"239-248","id":"728305d5-6e7d-3893-a111-d47ab223136a","created":"2020-11-24T10:01:25.868Z","file_attached":"true","profile_id":"235249c2-3ed4-314a-b309-b1ea0330f5d9","group_id":"1ff583c0-be37-34fa-9c04-73c69437d354","last_modified":"2020-11-25T07:49:20.038Z","read":"true","starred":false,"authored":false,"confirmed":"true","hidden":false,"folder_uuids":"71ca8421-f528-4caf-a342-3c1291372174","private_publication":false,"abstract":"This paper addresses the problem of estimating the depth map of a scene given a single RGB image. We propose a fully convolutional architecture, encompassing residual learning, to model the ambiguous mapping between monocular images and depth maps. In order to improve the output resolution, we present a novel way to efficiently learn feature map up-sampling within the network. For optimization, we introduce the reverse Huber loss that is particularly suited for the task at hand and driven by the value distributions commonly present in depth maps. Our model is composed of a single architecture that is trained end-to-end and does not rely on post-processing techniques, such as CRFs or other additional refinement steps. As a result, it runs in real-time on images or videos. In the evaluation, we show that the proposed model contains fewer parameters and requires fewer training data than the current state of the art, while outperforming all approaches on depth estimation. Code and models are publicly available.","bibtype":"article","author":"Laina, Iro and Rupprecht, Christian and Belagiannis, Vasileios and Tombari, Federico and Navab, Nassir","doi":"10.1109/3DV.2016.32","journal":"Proceedings - 2016 4th International Conference on 3D Vision, 3DV 2016","bibtex":"@article{\n title = {Deeper depth prediction with fully convolutional residual networks},\n type = {article},\n year = {2016},\n keywords = {CNN,Depth prediction},\n pages = {239-248},\n id = {728305d5-6e7d-3893-a111-d47ab223136a},\n created = {2020-11-24T10:01:25.868Z},\n file_attached = {true},\n profile_id = {235249c2-3ed4-314a-b309-b1ea0330f5d9},\n group_id = {1ff583c0-be37-34fa-9c04-73c69437d354},\n last_modified = {2020-11-25T07:49:20.038Z},\n read = {true},\n starred = {false},\n authored = {false},\n confirmed = {true},\n hidden = {false},\n folder_uuids = {71ca8421-f528-4caf-a342-3c1291372174},\n private_publication = {false},\n abstract = {This paper addresses the problem of estimating the depth map of a scene given a single RGB image. We propose a fully convolutional architecture, encompassing residual learning, to model the ambiguous mapping between monocular images and depth maps. In order to improve the output resolution, we present a novel way to efficiently learn feature map up-sampling within the network. For optimization, we introduce the reverse Huber loss that is particularly suited for the task at hand and driven by the value distributions commonly present in depth maps. Our model is composed of a single architecture that is trained end-to-end and does not rely on post-processing techniques, such as CRFs or other additional refinement steps. As a result, it runs in real-time on images or videos. In the evaluation, we show that the proposed model contains fewer parameters and requires fewer training data than the current state of the art, while outperforming all approaches on depth estimation. Code and models are publicly available.},\n bibtype = {article},\n author = {Laina, Iro and Rupprecht, Christian and Belagiannis, Vasileios and Tombari, Federico and Navab, Nassir},\n doi = {10.1109/3DV.2016.32},\n journal = {Proceedings - 2016 4th International Conference on 3D Vision, 3DV 2016}\n}","author_short":["Laina, I.","Rupprecht, C.","Belagiannis, V.","Tombari, F.","Navab, N."],"urls":{"Paper":"https://bibbase.org/service/mendeley/bfbbf840-4c42-3914-a463-19024f50b30c/file/24208ac2-3949-929f-4ed2-ddf1ab153e82/Deeper_Depth_Prediction_with_Fully_Convolutional_Residual_Networks.pdf.pdf"},"biburl":"https://bibbase.org/service/mendeley/bfbbf840-4c42-3914-a463-19024f50b30c","bibbaseid":"laina-rupprecht-belagiannis-tombari-navab-deeperdepthpredictionwithfullyconvolutionalresidualnetworks-2016","role":"author","keyword":["CNN","Depth prediction"],"metadata":{"authorlinks":{}},"downloads":0},"bibtype":"article","biburl":"https://bibbase.org/service/mendeley/bfbbf840-4c42-3914-a463-19024f50b30c","creationDate":"2020-01-11T20:14:17.412Z","downloads":0,"keywords":["cnn","depth prediction"],"search_terms":["deeper","depth","prediction","fully","convolutional","residual","networks","laina","rupprecht","belagiannis","tombari","navab"],"title":"Deeper depth prediction with fully convolutional residual networks","year":2016,"dataSources":["7J2sSBScio8zjZfWf","ya2CyA73rpZseyrZ8","2252seNhipfTmjEBQ"]}