A Picture is Worth a Thousand Words: Language Models Plan from Pixels

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
16 March 2023
Anthony Z. Liu
Lajanugen Logeswaran
Sungryull Sohn
Honglak Lee
    LM&Ro

Papers citing "A Picture is Worth a Thousand Words: Language Models Plan from Pixels"

5 / 5 papers shown
ViMo: A Generative Visual GUI World Model for App Agents
Dezhao Luo
Bohan Tang
Kang Li
Georgios Papoudakis
Jifei Song
S. Gong
Haifeng Zhang
Jun Wang
Cheng Deng
LM&Ro
VGen
628
12
0
15 Apr 2025
Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Bingqian Lin
Yunshuang Nie
Ziming Wei
Yi Zhu
Hang Xu
Shikui Ma
Jianzhuang Liu
Xiaodan Liang
LM&Ro
392
24
0
29 May 2024
Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning
Rao Fu
Jingyu Liu
Xilun Chen
Yixin Nie
Wenhan Xiong
LM&Ro
LRM
314
168
0
18 Mar 2024
De-Diffusion Makes Text a Strong Cross-Modal Interface
Computer Vision and Pattern Recognition (CVPR), 2023
Chen Wei
Chenxi Liu
Siyuan Qiao
Zhishuai Zhang
Alan Yuille
Jiahui Yu
VLM
DiffM
331
19
0
01 Nov 2023
Compositional Abilities Emerge Multiplicatively: Exploring Diffusion Models on a Synthetic Task
Neural Information Processing Systems (NeurIPS), 2023
Maya Okawa
Ekdeep Singh Lubana
Robert P. Dick
Hidenori Tanaka
CoGe
DiffM
660
93
0
13 Oct 2023