Pretrained Transformers as Universal Computation Engines
arXiv:2103.05247 (v2, latest)
9 March 2021
Kevin Lu, Aditya Grover, Pieter Abbeel, Igor Mordatch

Papers citing "Pretrained Transformers as Universal Computation Engines"

50 of 151 citing papers shown
Training Transitive and Commutative Multimodal Transformers with LoReTTa (NeurIPS, 2023)
Manuel Tran, Yashin Dicente Cid, Amal Lahiani, Fabian J. Theis, Tingying Peng, Eldad Klaiman
23 May 2023

Introspective Tips: Large Language Model for In-Context Decision Making
Liting Chen, Lu Wang, Hang Dong, Yali Du, Jie Yan, ..., Lu Wang, Si Qin, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang
19 May 2023

Semantic Composition in Visually Grounded Language Models
Rohan Pandey
15 May 2023

Efficient Feature Distillation for Zero-shot Annotation Object Detection (WACV, 2023)
Zhuoming Liu, Xuefeng Hu, Ram Nevatia
21 Mar 2023

Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning (ICLR, 2023)
Zaid Khan, Yun Fu
21 Mar 2023

Merging Decision Transformers: Weight Averaging for Forming Multi-Task Policies (ICRA, 2023)
Daniel Lawson, A. H. Qureshi
14 Mar 2023

Foundation Models for Decision Making: Problems, Methods, and Opportunities
Sherry Yang, Ofir Nachum, Yilun Du, Jason W. Wei, Pieter Abbeel, Dale Schuurmans
07 Mar 2023

PaLM-E: An Embodied Multimodal Language Model (ICML, 2023)
Danny Driess, F. Xia, Mehdi S. M. Sajjadi, Corey Lynch, Aakanksha Chowdhery, ..., Marc Toussaint, Klaus Greff, Andy Zeng, Igor Mordatch, Peter R. Florence
06 Mar 2023

Optical Transformers
Maxwell G. Anderson, Shifan Ma, Tianyu Wang, Logan G. Wright, Peter L. McMahon
20 Feb 2023

Efficiency 360: Efficient Vision Transformers
Badri N. Patro, Vijay Srinivas Agneeswaran
16 Feb 2023

Knowledge from Large-Scale Protein Contact Prediction Models Can Be Transferred to the Data-Scarce RNA Contact Prediction Task (ICPR, 2023)
Yiren Jian, Chongyang Gao, Chen Zeng, Yunjie Zhao, Soroush Vosoughi
13 Feb 2023

Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment (NeurIPS, 2023)
Hao Liu, Wilson Yan, Pieter Abbeel
02 Feb 2023

Grounding Language Models to Images for Multimodal Inputs and Outputs (ICML, 2023)
Jing Yu Koh, Ruslan Salakhutdinov, Daniel Fried
31 Jan 2023

Continuous Spatiotemporal Transformers (ICML, 2023)
Antonio H. O. Fonseca, E. Zappala, J. O. Caro, David van Dijk
31 Jan 2023

ClimaX: A foundation model for weather and climate (ICML, 2023)
Tung Nguyen, Johannes Brandstetter, Ashish Kapoor, Jayesh K. Gupta, Aditya Grover
24 Jan 2023

A Survey on Transformers in Reinforcement Learning
Wenzhe Li, Hao Luo, Zichuan Lin, Chongjie Zhang, Zongqing Lu, Deheng Ye
08 Jan 2023

Evaluating Step-by-Step Reasoning through Symbolic Verification
Yi-Fan Zhang, Hanlin Zhang, Li Erran Li, Eric P. Xing
16 Dec 2022

Vision Transformers are Parameter-Efficient Audio-Visual Learners (CVPR, 2022)
Yan-Bo Lin, Yi-Lin Sung, Jie Lei, Joey Tianyi Zhou, Gedas Bertasius
15 Dec 2022

Deep representation learning: Fundamentals, Perspectives, Applications, and Open Challenges
K. T. Baghaei, Amirreza Payandeh, Pooya Fayyazsanavi, Shahram Rahimi, Zhiqian Chen, Somayeh Bakhtiari Ramezani
27 Nov 2022

I Can't Believe There's No Images! Learning Visual Tasks Using only Language Supervision (ICCV, 2022)
Sophia Gu, Christopher Clark, Aniruddha Kembhavi
17 Nov 2022

On the Effect of Pre-training for Transformer in Different Modality on Offline Reinforcement Learning (NeurIPS, 2022)
S. Takagi
17 Nov 2022

Metaphors We Learn By
Roland Memisevic
11 Nov 2022

What is Wrong with Language Models that Can Not Tell a Story?
Ivan P. Yamshchikov, Alexey Tikhonov
09 Nov 2022

Pretraining in Deep Reinforcement Learning: A Survey
Zhihui Xie, Zichuan Lin, Junyou Li, Shuai Li, Deheng Ye
08 Nov 2022

LMPriors: Pre-Trained Language Models as Task-Specific Priors
Kristy Choi, Chris Cundy, Sanjari Srivastava, Stefano Ermon
22 Oct 2022

Equi-Tuning: Group Equivariant Fine-Tuning of Pretrained Models (AAAI, 2022)
Sourya Basu, P. Sattigeri, Karthikeyan N. Ramamurthy, Vijil Chenthamarakshan, Kush R. Varshney, Lav Varshney, Payel Das
13 Oct 2022

Reliable Conditioning of Behavioral Cloning for Offline Reinforcement Learning
Tung Nguyen, Qinqing Zheng, Aditya Grover
11 Oct 2022

Generating Executable Action Plans with Environmentally-Aware Language Models (IROS, 2022)
Maitrey Gramopadhye, D. Szafir
10 Oct 2022

Understanding HTML with Large Language Models (EMNLP, 2022)
Izzeddin Gur, Ofir Nachum, Yingjie Miao, Mustafa Safdari, Austin Huang, Aakanksha Chowdhery, Sharan Narang, Noah Fiedel, Aleksandra Faust
08 Oct 2022

Linearly Mapping from Image to Text Space (ICLR, 2022)
Jack Merullo, Louis Castricato, Carsten Eickhoff, Ellie Pavlick
30 Sep 2022

Downstream Datasets Make Surprisingly Good Pretraining Corpora (ACL, 2022)
Kundan Krishna, Saurabh Garg, Jeffrey P. Bigham, Zachary Chase Lipton
28 Sep 2022

Disentangling Transfer in Continual Reinforcement Learning (NeurIPS, 2022)
Maciej Wołczyk, Michał Zając, Razvan Pascanu, Łukasz Kuciński, Piotr Miłoś
28 Sep 2022

MonoByte: A Pool of Monolingual Byte-level Language Models (COLING, 2022)
Hugo Queiroz Abonizio, Leandro Rodrigues de Souza, R. Lotufo, Rodrigo Nogueira
22 Sep 2022

Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings (NeurIPS, 2022)
Yiren Jian, Chongyang Gao, Soroush Vosoughi
20 Sep 2022

OmniVL: One Foundation Model for Image-Language and Video-Language Tasks (NeurIPS, 2022)
Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Luowei Zhou, Yucheng Zhao, Yujia Xie, Ce Liu, Yu-Gang Jiang, Lu Yuan
15 Sep 2022

Foundations and Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions (ACM Computing Surveys, 2022)
Paul Pu Liang, Amir Zadeh, Louis-Philippe Morency
07 Sep 2022

Exploring and Evaluating Personalized Models for Code Generation
Andrei Zlotchevski, Dawn Drain, Alexey Svyatkovskiy, Colin B. Clement, Neel Sundaresan, Michele Tufano
29 Aug 2022

What Can Transformers Learn In-Context? A Case Study of Simple Function Classes (NeurIPS, 2022)
Shivam Garg, Dimitris Tsipras, Abigail Z. Jacobs, Gregory Valiant
01 Aug 2022

Unsupervised Domain Adaptation for Video Transformers in Action Recognition (ICPR, 2022)
Victor G. Turrisi da Costa, Giacomo Zara, Paolo Rota, Thiago Oliveira-Santos, Andrii Zadaianchuk, Vittorio Murino, Elisa Ricci
26 Jul 2022

Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training (ECCV, 2022)
Haoxuan You, Luowei Zhou, Bin Xiao, Noel Codella, Yu Cheng, Ruochen Xu, Shih-Fu Chang, Lu Yuan
26 Jul 2022

Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit (NeurIPS, 2022)
Boaz Barak, Benjamin L. Edelman, Surbhi Goel, Sham Kakade, Eran Malach, Cyril Zhang
18 Jul 2022

Transformer Neural Processes: Uncertainty-Aware Meta Learning Via Sequence Modeling (ICML, 2022)
Tung Nguyen, Aditya Grover
09 Jul 2022

Big Learning
Yulai Cong, Miaoyun Zhao
08 Jul 2022

CASHformer: Cognition Aware SHape Transformer for Longitudinal Analysis (MICCAI, 2022)
Ignacio Sarasua, Sebastian Pölsterl, Christian Wachinger
05 Jul 2022

TTS-CGAN: A Transformer Time-Series Conditional GAN for Biosignal Data Augmentation
Xiaomin Li, Anne H. H. Ngu, V. Metsis
28 Jun 2022

ProGen2: Exploring the Boundaries of Protein Language Models (Cell Systems, 2022)
Erik Nijkamp, Jeffrey A. Ruffolo, Eli N. Weinstein, Nikhil Naik, Ali Madani
27 Jun 2022

LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks (NeurIPS, 2022)
Tuan Dinh, Yuchen Zeng, Ruisu Zhang, Ziqian Lin, Michael Gira, Shashank Rajput, Jy-yong Sohn, Dimitris Papailiopoulos, Kangwook Lee
14 Jun 2022

CyCLIP: Cyclic Contrastive Language-Image Pretraining (NeurIPS, 2022)
Shashank Goel, Hritik Bansal, S. Bhatia, Ryan Rossi, Vishwa Vinay, Aditya Grover
28 May 2022

History Compression via Language Models in Reinforcement Learning (ICML, 2022)
Fabian Paischer, Thomas Adler, Vihang Patil, Angela Bitto-Nemling, Markus Holzleitner, Sebastian Lehner, Hamid Eghbalzadeh, Sepp Hochreiter
24 May 2022

Housekeep: Tidying Virtual Households using Commonsense Reasoning (ECCV, 2022)
Yash Kant, Arun Ramachandran, Sriram Yenamandra, Igor Gilitschenski, Dhruv Batra, Andrew Szot, Harsh Agrawal
22 May 2022

Page 2 of 4