2103.05247
Cited By
Pretrained Transformers as Universal Computation Engines
9 March 2021
Kevin Lu
Aditya Grover
Pieter Abbeel
Igor Mordatch
Papers citing
"Pretrained Transformers as Universal Computation Engines"
50 / 149 papers shown
Title
HiPerRAG: High-Performance Retrieval Augmented Generation for Scientific Insights
Ozan Gokdemir
Carlo Siebenschuh
Alexander Brace
Azton Wells
Brian Hsu
...
A. Anandkumar
Ian Foster
R. Stevens
V. Vishwanath
A. Ramanathan
VLM
32
0
0
07 May 2025
Shape Modeling of Longitudinal Medical Images: From Diffeomorphic Metric Mapping to Deep Learning
Edwin Tay
Nazli Tümer
Amir A. Zadpoor
MedIm
47
0
0
27 Mar 2025
General Reasoning Requires Learning to Reason from the Get-go
Seungwook Han
Jyothish Pari
Samuel J. Gershman
Pulkit Agrawal
LRM
96
0
0
26 Feb 2025
ECG-Expert-QA: A Benchmark for Evaluating Medical Large Language Models in Heart Disease Diagnosis
Xu Wang
Jiaju Kang
Puyu Han
Yubao Zhao
Qian Liu
Liwenfei He
Lingqiong Zhang
Lingyun Dai
Yongcheng Wang
Jie Tao
LM&MA
60
1
0
16 Feb 2025
Better Prompt Compression Without Multi-Layer Perceptrons
Edouardo Honig
Andrew Lizarraga
Zijun Zhang
Ying Nian Wu
MQ
87
0
0
12 Jan 2025
OneLLM: One Framework to Align All Modalities with Language
Jiaming Han
Kaixiong Gong
Yiyuan Zhang
Jiaqi Wang
Kaipeng Zhang
D. Lin
Yu Qiao
Peng Gao
Xiangyu Yue
MLLM
104
107
0
10 Jan 2025
The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities
Zhaofeng Wu
Xinyan Velocity Yu
Dani Yogatama
Jiasen Lu
Yoon Kim
AIFin
46
10
0
07 Nov 2024
Tri-Level Navigator: LLM-Empowered Tri-Level Learning for Time Series OOD Generalization
Chengtao Jian
Kai Yang
Yang Jiao
AI4TS
24
3
0
09 Oct 2024
ESQA: Event Sequences Question Answering
Irina Abdullaeva
Andrei Filatov
Mikhail Orlov
Ivan Karpukhin
Viacheslav Vasilev
Denis Dimitrov
Andrey Kuznetsov
Ivan A Kireev
Andrey Savchenko
39
0
0
03 Jul 2024
From CNNs to Transformers in Multimodal Human Action Recognition: A Survey
Muhammad Bilal Shaikh
Syed Mohammed Shamsul Islam
Douglas Chai
Naveed Akhtar
30
9
0
22 May 2024
Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities
Hao Zhou
Chengming Hu
Ye Yuan
Yufei Cui
Yili Jin
...
Di Wu
Xue Liu
Charlie Zhang
Xianbin Wang
Jiangchuan Liu
35
55
0
17 May 2024
The Platonic Representation Hypothesis
Minyoung Huh
Brian Cheung
Tongzhou Wang
Phillip Isola
72
109
0
13 May 2024
What explains the success of cross-modal fine-tuning with ORCA?
Paloma García-de-Herreros
Vagrant Gautam
Philipp Slusallek
Dietrich Klakow
Marius Mosbach
35
0
0
20 Mar 2024
In-context Exploration-Exploitation for Reinforcement Learning
Zhenwen Dai
Federico Tomasi
Sina Ghiassian
OffRL
OnRL
33
3
0
11 Mar 2024
TPLLM: A Traffic Prediction Framework Based on Pretrained Large Language Models
Yilong Ren
Yue Chen
Shuai Liu
Boyue Wang
Haiyang Yu
Zhiyong Cui
AI4TS
59
18
0
04 Mar 2024
LSTPrompt: Large Language Models as Zero-Shot Time Series Forecasters by Long-Short-Term Prompting
Haoxin Liu
Zhiyuan Zhao
Jindong Wang
Harshavardhan Kamarthi
B. A. Prakash
AI4TS
LRM
VLM
58
24
0
25 Feb 2024
MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces
Tianyu Zheng
Ge Zhang
Xingwei Qu
Ming Kuang
Stephen W. Huang
Zhaofeng He
OffRL
35
1
0
20 Feb 2024
Do Large Language Models Understand Logic or Just Mimick Context?
Junbing Yan
Chengyu Wang
Junyuan Huang
Wei Zhang
ReLM
ELM
LRM
14
5
0
19 Feb 2024
Show Me How It's Done: The Role of Explanations in Fine-Tuning Language Models
Mohamad Ballout
U. Krumnack
Gunther Heidemann
Kai-Uwe Kuehnberger
LRM
11
3
0
12 Feb 2024
Empowering Time Series Analysis with Large Language Models: A Survey
Yushan Jiang
Zijie Pan
Xikun Zhang
Sahil Garg
Anderson Schneider
Yuriy Nevmyvaka
Dongjin Song
AI4TS
AIFin
87
47
0
05 Feb 2024
How Can Large Language Models Understand Spatial-Temporal Data?
Lei Liu
Shuo Yu
Runze Wang
Zhenxun Ma
Yanming Shen
AI4TS
19
23
0
25 Jan 2024
Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
Anirudh S. Sundar
Chao-Han Huck Yang
David M. Chan
Shalini Ghosh
Venkatesh Ravichandran
P. S. Nidadavolu
MoMe
33
8
0
22 Dec 2023
How to guess a gradient
Utkarsh Singhal
Brian Cheung
Kartik Chandra
Jonathan Ragan-Kelley
Joshua B. Tenenbaum
Tomaso Poggio
Stella X. Yu
ODL
18
3
0
07 Dec 2023
Guided Flows for Generative Modeling and Decision Making
Qinqing Zheng
Matt Le
Neta Shaul
Y. Lipman
Aditya Grover
Ricky T. Q. Chen
24
35
0
22 Nov 2023
Unified machine learning tasks and datasets for enhancing renewable energy
Arsam Aryandoust
Thomas Rigoni
Francesco di Stefano
Anthony Patt
30
0
0
12 Nov 2023
The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining
Ting-Rui Chiang
Dani Yogatama
20
1
0
25 Oct 2023
UniTime: A Language-Empowered Unified Model for Cross-Domain Time Series Forecasting
Xu Liu
Junfeng Hu
Yuan N. Li
Shizhe Diao
Yuxuan Liang
Bryan Hooi
Roger Zimmermann
AI4TS
20
75
0
15 Oct 2023
Data-Centric Financial Large Language Models
Zhixuan Chu
Huaiyu Guo
Xinyuan Zhou
Yijia Wang
Fei Yu
...
Xin Lu
Qing Cui
Longfei Li
Junqing Zhou
Sheng R. Li
AIFin
9
7
0
07 Oct 2023
One for All: Towards Training One Graph Model for All Classification Tasks
Hao Liu
Jiarui Feng
Lecheng Kong
Ningyue Liang
Dacheng Tao
Yixin Chen
Muhan Zhang
AI4CE
12
108
0
29 Sep 2023
Towards Green AI in Fine-tuning Large Language Models via Adaptive Backpropagation
Kai Huang
Hanyu Yin
Heng Huang
Wei Gao
25
10
0
22 Sep 2023
The first step is the hardest: Pitfalls of Representing and Tokenizing Temporal Data for Large Language Models
Dimitris Spathis
F. Kawsar
AI4TS
21
17
0
12 Sep 2023
Internal Cross-layer Gradients for Extending Homogeneity to Heterogeneity in Federated Learning
Yun-Hin Chan
Rui Zhou
Running Zhao
Zhihan Jiang
Edith C. H. Ngai
FedML
20
8
0
22 Aug 2023
Can Language Models Learn to Listen?
Evonne Ng
Sanjay Subramanian
Dan Klein
Angjoo Kanazawa
Trevor Darrell
Shiry Ginosar
22
16
0
21 Aug 2023
V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models
Heng Wang
Jianbo Ma
Santiago Pascual
Richard Cartwright
Weidong (Tom) Cai
VGen
19
37
0
18 Aug 2023
FoodSAM: Any Food Segmentation
Xing Lan
Jiayi Lyu
Han Jiang
Kunkun Dong
Zehai Niu
Yi Zhang
Jian Xue
VLM
19
25
0
11 Aug 2023
OpenProteinSet: Training data for structural biology at scale
Gustaf Ahdritz
N. Bouatta
S. Kadyan
Lukas Jarosch
Daniel Berenberg
Ian Fisk
Andrew Watkins
Stephen Ra
Richard Bonneau
Mohammed AlQuraishi
AI4CE
8
10
0
10 Aug 2023
Multimodal Neurons in Pretrained Text-Only Transformers
Sarah Schwettmann
Neil Chowdhury
Samuel J. Klein
David Bau
Antonio Torralba
MILM
17
27
0
03 Aug 2023
Transformers are Universal Predictors
Sourya Basu
Moulik Choraria
L. Varshney
18
4
0
15 Jul 2023
Large Language Models as General Pattern Machines
Suvir Mirchandani
F. Xia
Peter R. Florence
Brian Ichter
Danny Driess
Montse Gonzalez Arenas
Kanishka Rao
Dorsa Sadigh
Andy Zeng
LLMAG
39
183
0
10 Jul 2023
Mx2M: Masked Cross-Modality Modeling in Domain Adaptation for 3D Semantic Segmentation
Boxiang Zhang
Zunran Wang
Yonggen Ling
Yuanyuan Guan
Shenghao Zhang
Wenhui Li
30
6
0
09 Jul 2023
Investigating Pre-trained Language Models on Cross-Domain Datasets, a Step Closer to General AI
Mohamad Ballout
U. Krumnack
Gunther Heidemann
Kai-Uwe Kühnberger
11
3
0
21 Jun 2023
Opening the Black Box: Analyzing Attention Weights and Hidden States in Pre-trained Language Models for Non-language Tasks
Mohamad Ballout
U. Krumnack
Gunther Heidemann
Kai-Uwe Kühnberger
19
2
0
21 Jun 2023
Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability
Jiacheng Ye
Xijia Tao
Lingpeng Kong
LRM
28
22
0
11 Jun 2023
Large Language Models are In-Context Semantic Reasoners rather than Symbolic Reasoners
Xiaojuan Tang
Zilong Zheng
Jiaqi Li
Fanxu Meng
Song-Chun Zhu
Yitao Liang
Muhan Zhang
ReLM
LRM
16
53
0
24 May 2023
Training Transitive and Commutative Multimodal Transformers with LoReTTa
Manuel Tran
Yashin Dicente Cid
Amal Lahiani
Fabian J. Theis
Tingying Peng
Eldad Klaiman
13
2
0
23 May 2023
Introspective Tips: Large Language Model for In-Context Decision Making
Liting Chen
Lu Wang
Hang Dong
Yali Du
Jie Yan
...
Pu Zhao
Si Qin
Saravan Rajmohan
Qingwei Lin
Dongmei Zhang
LLMAG
LRM
30
23
0
19 May 2023
Semantic Composition in Visually Grounded Language Models
Rohan Pandey
CoGe
16
1
0
15 May 2023
Efficient Feature Distillation for Zero-shot Annotation Object Detection
Zhuoming Liu
Xuefeng Hu
Ram Nevatia
VLM
ObjD
11
1
0
21 Mar 2023
Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning
Zaid Khan
Yun Fu
VLM
28
12
0
21 Mar 2023
Merging Decision Transformers: Weight Averaging for Forming Multi-Task Policies
Daniel Lawson
A. H. Qureshi
MoMe
OffRL
14
13
0
14 Mar 2023