ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.00804
  4. Cited By
ProTo: Program-Guided Transformer for Program-Guided Tasks

ProTo: Program-Guided Transformer for Program-Guided Tasks

2 October 2021
Zelin Zhao
Karan Samel
Binghong Chen
Le Song
    ViT
    LM&Ro
ArXivPDFHTML

Papers citing "ProTo: Program-Guided Transformer for Program-Guided Tasks"

11 / 11 papers shown
Title
Enhancing Visual Question Answering through Question-Driven Image
  Captions as Prompts
Enhancing Visual Question Answering through Question-Driven Image Captions as Prompts
Övgü Özdemir
Erdem Akagündüz
36
10
0
12 Apr 2024
E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel
  Program Guidance
E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance
C. Chang
Ni Mu
Jiajun Wu
Ling Pan
Huazhe Xu
48
7
0
05 Dec 2022
Towards Reasoning-Aware Explainable VQA
Towards Reasoning-Aware Explainable VQA
Rakesh Vaideeswaran
Feng Gao
Abhinav Mathur
Govind Thattai
LRM
27
3
0
09 Nov 2022
Tracking Objects as Pixel-wise Distributions
Tracking Objects as Pixel-wise Distributions
Zelin Zhao
Ze Wu
Yueqing Zhuang
Boxun Li
Jiaya Jia
VOT
26
54
0
12 Jul 2022
Multimodal Learning with Transformers: A Survey
Multimodal Learning with Transformers: A Survey
P. Xu
Xiatian Zhu
David A. Clifton
ViT
50
525
0
13 Jun 2022
LEMON: Language-Based Environment Manipulation via Execution-Guided
  Pre-training
LEMON: Language-Based Environment Manipulation via Execution-Guided Pre-training
Qi Shi
Qian Liu
Bei Chen
Yu Zhang
Ting Liu
Jian-Guang Lou
19
9
0
20 Jan 2022
Learning to Synthesize Programs as Interpretable and Generalizable
  Policies
Learning to Synthesize Programs as Interpretable and Generalizable Policies
Dweep Trivedi
Jesse Zhang
Shao-Hua Sun
Joseph J. Lim
NAI
9
71
0
31 Aug 2021
MST: Masked Self-Supervised Transformer for Visual Representation
MST: Masked Self-Supervised Transformer for Visual Representation
Zhaowen Li
Zhiyang Chen
Fan Yang
Wei Li
Yousong Zhu
...
Rui Deng
Liwei Wu
Rui Zhao
Ming Tang
Jinqiao Wang
ViT
30
161
0
10 Jun 2021
Dual-stream Network for Visual Recognition
Dual-stream Network for Visual Recognition
Mingyuan Mao
Renrui Zhang
Honghui Zheng
Peng Gao
Teli Ma
Yan Peng
Errui Ding
Baochang Zhang
Shumin Han
ViT
20
63
0
31 May 2021
ResT: An Efficient Transformer for Visual Recognition
ResT: An Efficient Transformer for Visual Recognition
Qing-Long Zhang
Yubin Yang
ViT
24
229
0
28 May 2021
Speaker-Follower Models for Vision-and-Language Navigation
Speaker-Follower Models for Vision-and-Language Navigation
Daniel Fried
Ronghang Hu
Volkan Cirik
Anna Rohrbach
Jacob Andreas
Louis-Philippe Morency
Taylor Berg-Kirkpatrick
Kate Saenko
Dan Klein
Trevor Darrell
LM&Ro
LRM
257
496
0
07 Jun 2018
1