ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1908.01482
  4. Cited By
Walking with MIND: Mental Imagery eNhanceD Embodied QA

Walking with MIND: Mental Imagery eNhanceD Embodied QA

ACM Multimedia (ACM MM), 2019
5 August 2019
Juncheng Li
Siliang Tang
Leilei Gan
Yueting Zhuang
ArXiv (abs)PDFHTML

Papers citing "Walking with MIND: Mental Imagery eNhanceD Embodied QA"

15 / 15 papers shown
Title
Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation
Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation
P. Zhang
Yifei Su
Pengyuan Wu
Dong An
Li Zhang
Zhigang Wang
Dong Wang
Yan Ding
Jiangwei Zhong
Xuelong Li
LM&Ro
360
2
0
27 May 2025
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
DREAMWALKER: Mental Planning for Continuous Vision-Language NavigationIEEE International Conference on Computer Vision (ICCV), 2023
Hanqing Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
192
73
0
14 Aug 2023
Global Structure Knowledge-Guided Relation Extraction Method for
  Visually-Rich Document
Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich DocumentConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xiangnan Chen
Qianwen Xiao
Juncheng Li
Duo Dong
Jun Lin
Xiaozhong Liu
Siliang Tang
174
6
0
23 May 2023
Dilated Context Integrated Network with Cross-Modal Consensus for
  Temporal Emotion Localization in Videos
Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in VideosACM Multimedia (ACM MM), 2022
Juncheng Billy Li
Junlin Xie
Linchao Zhu
Long Qian
Siliang Tang
...
Haochen Shi
Shengyu Zhang
Longhui Wei
Qi Tian
Yueting Zhuang
164
14
0
03 Aug 2022
BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid
  Counterfactual Training for Robust Content-based Image Retrieval
BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval
Wenqiao Zhang
Jiannan Guo
Meng Li
Haochen Shi
Shengyu Zhang
Juncheng Li
Siliang Tang
Yueting Zhuang
201
6
0
09 Jul 2022
Compositional Temporal Grounding with Structured Variational Cross-Graph
  Correspondence Learning
Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence LearningComputer Vision and Pattern Recognition (CVPR), 2022
Juncheng Li
Junlin Xie
Long Qian
Linchao Zhu
Siliang Tang
Leilei Gan
Yi Yang
Yueting Zhuang
Xinze Wang
213
80
0
24 Mar 2022
BoostMIS: Boosting Medical Image Semi-supervised Learning with Adaptive
  Pseudo Labeling and Informative Active Annotation
BoostMIS: Boosting Medical Image Semi-supervised Learning with Adaptive Pseudo Labeling and Informative Active AnnotationComputer Vision and Pattern Recognition (CVPR), 2022
Wenqiao Zhang
Lei Zhu
James Hallinan
A. Makmur
Shengyu Zhang
Qingpeng Cai
Beng Chin Ooi
318
114
0
04 Mar 2022
MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and
  Unpaired Text-based Image Captioning
MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-based Image Captioning
Wenqiao Zhang
Haochen Shi
Jiannan Guo
Shengyu Zhang
Qingpeng Cai
Juncheng Li
Sihui Luo
Yueting Zhuang
DiffM
223
48
0
13 Dec 2021
Explore before Moving: A Feasible Path Estimation and Memory Recalling
  Framework for Embodied Navigation
Explore before Moving: A Feasible Path Estimation and Memory Recalling Framework for Embodied Navigation
Yang Wu
Shirui Feng
Guanbin Li
Liang Lin
53
0
0
16 Oct 2021
Why Do We Click: Visual Impression-aware News Recommendation
Why Do We Click: Visual Impression-aware News RecommendationACM Multimedia (ACM MM), 2021
Jiahao Xun
Shengyu Zhang
Zhou Zhao
Jieming Zhu
Tao Gui
Jingjie Li
Xiuqiang He
Xiaofei He
Tat-Seng Chua
Leilei Gan
201
39
0
26 Sep 2021
Adaptive Hierarchical Graph Reasoning with Semantic Coherence for
  Video-and-Language Inference
Adaptive Hierarchical Graph Reasoning with Semantic Coherence for Video-and-Language InferenceIEEE International Conference on Computer Vision (ICCV), 2021
Juncheng Li
Siliang Tang
Linchao Zhu
Haochen Shi
Xuanwen Huang
Leilei Gan
Yi Yang
Yueting Zhuang
217
28
0
26 Jul 2021
Deep Learning for Embodied Vision Navigation: A Survey
Deep Learning for Embodied Vision Navigation: A Survey
Fengda Zhu
Yi Zhu
Vincent CS Lee
Xiaodan Liang
Xiaojun Chang
EgoVLM&Ro
445
0
0
07 Jul 2021
Multimodal Aggregation Approach for Memory Vision-Voice Indoor
  Navigation with Meta-Learning
Multimodal Aggregation Approach for Memory Vision-Voice Indoor Navigation with Meta-LearningIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2020
Liqi Yan
Dongfang Liu
Yaoxian Song
Changbin (Brad) Yu
118
18
0
01 Sep 2020
From Seeing to Moving: A Survey on Learning for Visual Indoor Navigation
  (VIN)
From Seeing to Moving: A Survey on Learning for Visual Indoor Navigation (VIN)
Xin Ye
Yezhou Yang
SSL
274
16
0
26 Feb 2020
Unsupervised Reinforcement Learning of Transferable Meta-Skills for
  Embodied Navigation
Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied NavigationComputer Vision and Pattern Recognition (CVPR), 2019
Juncheng Li
Xinze Wang
Siliang Tang
Haizhou Shi
Leilei Gan
Yueting Zhuang
William Yang Wang
SSL
290
77
0
18 Nov 2019
1