
Title |
|---|
Visual Translation Embedding Network for Visual Relation Detection. Computer Vision and Pattern Recognition (CVPR), 2017 |
Person Search with Natural Language Description. Computer Vision and Pattern Recognition (CVPR), 2017 |
Learning to Detect Human-Object Interactions. IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2017 |
Gated Multimodal Units for Information Fusion. International Conference on Learning Representations (ICLR), 2017 |
Learning Word-Like Units from Joint Audio-Visual Analysis. Annual Meeting of the Association for Computational Linguistics (ACL), 2017 |
Incremental Learning for Robot Perception through HRI. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2017 |
Comprehension-guided referring expressions. Computer Vision and Pattern Recognition (CVPR), 2017 |
A Joint Speaker-Listener-Reinforcer Model for Referring Expressions. Computer Vision and Pattern Recognition (CVPR), 2016 |
Top-down Visual Saliency Guided by Captions. Computer Vision and Pattern Recognition (CVPR), 2016 |
An Empirical Study of Language CNN for Image Captioning. IEEE International Conference on Computer Vision (ICCV), 2016 |
Automatic Generation of Grounded Visual Questions. International Joint Conference on Artificial Intelligence (IJCAI), 2016 |