COMIC: Towards A Compact Image Captioning Model with AttentionIEEE transactions on multimedia (IEEE TMM), 2019 |
Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding
for Video CaptioningComputer Vision and Pattern Recognition (CVPR), 2019 |
Taking a HINT: Leveraging Explanations to Make Vision and Language
Models More GroundedIEEE International Conference on Computer Vision (ICCV), 2019 |
Hierarchical Photo-Scene Encoder for Album StorytellingAAAI Conference on Artificial Intelligence (AAAI), 2019 |
Intention Oriented Image Captions with Guiding ObjectsComputer Vision and Pattern Recognition (CVPR), 2018 |
Y^2Seq2Seq: Cross-Modal Representation Learning for 3D Shape and Text by
Joint Reconstruction and Prediction of View and Word SequencesAAAI Conference on Artificial Intelligence (AAAI), 2018 |