What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision. Malmaud, J., Huang, J., Rathod, V., Johnston, N., Rabinovich, A., & Murphy, K. In Mihalcea, R., Chai, J. Y., & Sarkar, A., editors, HLT-NAACL, pages 143-152, 2015. The Association for Computational Linguistics.
What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision. [pdf]Link  What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision. [link]Paper  bibtex   

Downloads: 0