Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence Choosing Linguistics over Vision to Describe Images Ankush Gupta, Yashaswi Verma, C. V. Jawahar International Institute of Information Technology, Hyderabad, India - 500032 {ankush.gupta@research., yashaswi.verma@research., jawahar@}iiit.ac.in § “This is a picture of one tree, one road and one person. The rusty tree is under the red road. The colorful person is near the rusty tree, and under the road.” (Kulkarni et al. 2011)…
Words 5461 - Pages 22