ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.05247
  4. Cited By
Pretrained Transformers as Universal Computation Engines
v1v2 (latest)

Pretrained Transformers as Universal Computation Engines

9 March 2021
Kevin Lu
Aditya Grover
Pieter Abbeel
Igor Mordatch
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "Pretrained Transformers as Universal Computation Engines"

50 / 151 papers shown
Energy-Efficient Domain-Specific Artificial Intelligence Models and Agents: Pathways and Paradigms
Energy-Efficient Domain-Specific Artificial Intelligence Models and Agents: Pathways and Paradigms
Abhijit Chatterjee
N. Jha
Jonathan D. Cohen
Thomas Griffiths
Hongjing Lu
Diana Marculescu
Ashiqur Rasul
Keshab K. Parhi
LLMAGAI4CE
411
1
0
24 Oct 2025
TimeCopilot
TimeCopilot
Azul Garza
Reneé Rosillo
AI4TS
203
1
0
30 Aug 2025
T3Time: Tri-Modal Time Series Forecasting via Adaptive Multi-Head Alignment and Residual Fusion
T3Time: Tri-Modal Time Series Forecasting via Adaptive Multi-Head Alignment and Residual Fusion
Abdul Monaf Chowdhury
Rabeya Akter
S. Arib
AI4TS
126
1
0
06 Aug 2025
Transfer of Structural Knowledge from Synthetic Languages
Transfer of Structural Knowledge from Synthetic Languages
Mikhail Budnikov
Ivan Yamshchikov
216
0
0
21 May 2025
Large Language Models Implicitly Learn to See and Hear Just By Reading
Large Language Models Implicitly Learn to See and Hear Just By Reading
Prateek Verma
Mert Pilanci
390
1
0
20 May 2025
An empirical study of task and feature correlations in the reuse of pre-trained models
An empirical study of task and feature correlations in the reuse of pre-trained models
Jama Hussein Mohamud
Willie Brink
172
0
0
15 May 2025
HiPerRAG: High-Performance Retrieval Augmented Generation for Scientific Insights
HiPerRAG: High-Performance Retrieval Augmented Generation for Scientific InsightsPlatform for Advanced Scientific Computing Conference (PASC), 2025
Ozan Gokdemir
Carlo Siebenschuh
Alexander Brace
Azton Wells
Brian Hsu
...
A. Anandkumar
Ian Foster
R. Stevens
V. Vishwanath
A. Ramanathan
VLM
231
10
0
07 May 2025
Shape Modeling of Longitudinal Medical Images: From Diffeomorphic Metric Mapping to Deep Learning
Shape Modeling of Longitudinal Medical Images: From Diffeomorphic Metric Mapping to Deep Learning
Edwin Tay
Nazli Tümer
Amir A. Zadpoor
MedIm
524
0
0
27 Mar 2025
General Intelligence Requires Reward-based Pretraining
General Intelligence Requires Reward-based Pretraining
Seungwook Han
Jyothish Pari
Samuel J. Gershman
Pulkit Agrawal
LRM
824
2
0
26 Feb 2025
ECG-Expert-QA: A Benchmark for Evaluating Medical Large Language Models in Heart Disease Diagnosis
ECG-Expert-QA: A Benchmark for Evaluating Medical Large Language Models in Heart Disease Diagnosis
Xu Wang
Jiaju Kang
Puyu Han
Yubao Zhao
Qian Liu
Liwenfei He
Lingqiong Zhang
Lingyun Dai
Yongcheng Wang
Jie Tao
LM&MA
493
3
0
16 Feb 2025
Better Prompt Compression Without Multi-Layer Perceptrons
Better Prompt Compression Without Multi-Layer Perceptrons
Edouardo Honig
Andrew Lizarraga
Zijun Zhang
Ying Nian Wu
MQ
941
2
0
12 Jan 2025
OneLLM: One Framework to Align All Modalities with Language
OneLLM: One Framework to Align All Modalities with LanguageComputer Vision and Pattern Recognition (CVPR), 2023
Jiaming Han
Kaixiong Gong
Yiyuan Zhang
Yuan Liu
Kaipeng Zhang
Dahua Lin
Yu Qiao
Shiyang Feng
Xiangyu Yue
MLLM
577
198
0
10 Jan 2025
The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities
The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and ModalitiesInternational Conference on Learning Representations (ICLR), 2024
Zhaofeng Wu
Xinyan Velocity Yu
Dani Yogatama
Jiasen Lu
Yoon Kim
AIFin
533
39
0
07 Nov 2024
Tri-Level Navigator: LLM-Empowered Tri-Level Learning for Time Series
  OOD Generalization
Tri-Level Navigator: LLM-Empowered Tri-Level Learning for Time Series OOD GeneralizationNeural Information Processing Systems (NeurIPS), 2024
Chengtao Jian
Kai Yang
Yang Jiao
AI4TS
418
13
0
09 Oct 2024
ESQA: Event Sequences Question Answering
ESQA: Event Sequences Question Answering
Irina Abdullaeva
Andrei Filatov
Mikhail Orlov
Ivan Karpukhin
Viacheslav Vasilev
Denis Dimitrov
Andrey Kuznetsov
Ivan A Kireev
Ivan A Kireev
227
1
0
03 Jul 2024
From CNNs to Transformers in Multimodal Human Action Recognition: A
  Survey
From CNNs to Transformers in Multimodal Human Action Recognition: A Survey
Muhammad Bilal Shaikh
Syed Mohammed Shamsul Islam
Douglas Chai
Naveed Akhtar
347
31
0
22 May 2024
Large Language Model (LLM) for Telecommunications: A Comprehensive
  Survey on Principles, Key Techniques, and Opportunities
Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and OpportunitiesIEEE Communications Surveys and Tutorials (COMST), 2024
Hao Zhou
Chengming Hu
Ye Yuan
Yufei Cui
Yili Jin
...
Di Wu
Xue Liu
Charlie Zhang
Xianbin Wang
Jiangchuan Liu
324
188
0
17 May 2024
The Platonic Representation Hypothesis
The Platonic Representation HypothesisInternational Conference on Machine Learning (ICML), 2024
Minyoung Huh
Brian Cheung
Tongzhou Wang
Phillip Isola
883
240
0
13 May 2024
What explains the success of cross-modal fine-tuning with ORCA?
What explains the success of cross-modal fine-tuning with ORCA?
Paloma García-de-Herreros
Vagrant Gautam
Philipp Slusallek
Dietrich Klakow
Marius Mosbach
243
1
0
20 Mar 2024
In-context Exploration-Exploitation for Reinforcement Learning
In-context Exploration-Exploitation for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2024
Zhenwen Dai
Federico Tomasi
Sina Ghiassian
OffRLOnRL
220
12
0
11 Mar 2024
TPLLM: A Traffic Prediction Framework Based on Pretrained Large Language
  Models
TPLLM: A Traffic Prediction Framework Based on Pretrained Large Language Models
Yilong Ren
Yue Chen
Shuai Liu
Boyue Wang
Haiyang Yu
Zhiyong Cui
AI4TS
325
37
0
04 Mar 2024
LSTPrompt: Large Language Models as Zero-Shot Time Series Forecasters by
  Long-Short-Term Prompting
LSTPrompt: Large Language Models as Zero-Shot Time Series Forecasters by Long-Short-Term Prompting
Haoxin Liu
Zhiyuan Zhao
Jindong Wang
Harshavardhan Kamarthi
B. A. Prakash
AI4TSLRMVLM
252
59
0
25 Feb 2024
MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared
  Semantic Spaces
MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces
Tianyu Zheng
Ge Zhang
Xingwei Qu
Ming Kuang
Stephen W. Huang
Zhaofeng He
OffRL
231
2
0
20 Feb 2024
Do Large Language Models Understand Logic or Just Mimick Context?
Do Large Language Models Understand Logic or Just Mimick Context?
Junbing Yan
Chengyu Wang
Junyuan Huang
Wei Zhang
ReLMELMLRM
215
10
0
19 Feb 2024
Show Me How It's Done: The Role of Explanations in Fine-Tuning Language
  Models
Show Me How It's Done: The Role of Explanations in Fine-Tuning Language ModelsAsian Conference on Machine Learning (ACML), 2024
Mohamad Ballout
U. Krumnack
Gunther Heidemann
Kai-Uwe Kuehnberger
LRM
280
5
0
12 Feb 2024
Empowering Time Series Analysis with Large Language Models: A Survey
Empowering Time Series Analysis with Large Language Models: A SurveyInternational Joint Conference on Artificial Intelligence (IJCAI), 2024
Yushan Jiang
Zijie Pan
Xikun Zhang
Sahil Garg
Anderson Schneider
Yuriy Nevmyvaka
Dongjin Song
AI4TSAIFin
383
83
0
05 Feb 2024
How Can Large Language Models Understand Spatial-Temporal Data?
How Can Large Language Models Understand Spatial-Temporal Data?
Lei Liu
Shuo Yu
Runze Wang
Zhenxun Ma
Yanming Shen
AI4TS
326
43
0
25 Jan 2024
Multimodal Attention Merging for Improved Speech Recognition and Audio
  Event Classification
Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
Anirudh S. Sundar
Chao-Han Huck Yang
David M. Chan
Shalini Ghosh
Venkatesh Ravichandran
P. S. Nidadavolu
MoMe
295
12
0
22 Dec 2023
How to guess a gradient
How to guess a gradient
Utkarsh Singhal
Brian Cheung
Kartik Chandra
Jonathan Ragan-Kelley
Joshua B. Tenenbaum
Tomaso Poggio
Stella X. Yu
ODL
145
7
0
07 Dec 2023
Guided Flows for Generative Modeling and Decision Making
Guided Flows for Generative Modeling and Decision Making
Qinqing Zheng
Matt Le
Neta Shaul
Y. Lipman
Aditya Grover
Ricky T. Q. Chen
329
73
0
22 Nov 2023
Unified machine learning tasks and datasets for enhancing renewable
  energy
Unified machine learning tasks and datasets for enhancing renewable energy
Arsam Aryandoust
Thomas Rigoni
Francesco di Stefano
Anthony Patt
208
0
0
12 Nov 2023
The Distributional Hypothesis Does Not Fully Explain the Benefits of
  Masked Language Model Pretraining
The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model PretrainingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Ting-Rui Chiang
Dani Yogatama
169
1
0
25 Oct 2023
UniTime: A Language-Empowered Unified Model for Cross-Domain Time Series
  Forecasting
UniTime: A Language-Empowered Unified Model for Cross-Domain Time Series Forecasting
Xu Liu
Junfeng Hu
Yuan N. Li
Shizhe Diao
Yuxuan Liang
Bryan Hooi
Roger Zimmermann
AI4TS
297
150
0
15 Oct 2023
Data-Centric Financial Large Language Models
Data-Centric Financial Large Language Models
Zhixuan Chu
Huaiyu Guo
Xinyuan Zhou
Yijia Wang
Fei Yu
...
Xin Lu
Daixin Wang
Longfei Li
Junqing Zhou
Sheng Li
AIFin
318
10
0
07 Oct 2023
One for All: Towards Training One Graph Model for All Classification
  Tasks
One for All: Towards Training One Graph Model for All Classification TasksInternational Conference on Learning Representations (ICLR), 2023
Hao Liu
Jiarui Feng
Lecheng Kong
Ningyue Liang
Dacheng Tao
Yixin Chen
Muhan Zhang
AI4CE
511
211
0
29 Sep 2023
Towards Green AI in Fine-tuning Large Language Models via Adaptive
  Backpropagation
Towards Green AI in Fine-tuning Large Language Models via Adaptive BackpropagationInternational Conference on Learning Representations (ICLR), 2023
Kai Huang
Hanyu Yin
Heng Huang
Wei Gao
264
17
0
22 Sep 2023
The first step is the hardest: Pitfalls of Representing and Tokenizing
  Temporal Data for Large Language Models
The first step is the hardest: Pitfalls of Representing and Tokenizing Temporal Data for Large Language Models
Dimitris Spathis
F. Kawsar
AI4TS
201
42
0
12 Sep 2023
Internal Cross-layer Gradients for Extending Homogeneity to
  Heterogeneity in Federated Learning
Internal Cross-layer Gradients for Extending Homogeneity to Heterogeneity in Federated LearningInternational Conference on Learning Representations (ICLR), 2023
Yun-Hin Chan
Rui Zhou
Running Zhao
Zhihan Jiang
Edith C.H. Ngai
FedML
248
11
0
22 Aug 2023
Can Language Models Learn to Listen?
Can Language Models Learn to Listen?IEEE International Conference on Computer Vision (ICCV), 2023
Evonne Ng
Sanjay Subramanian
Dan Klein
Angjoo Kanazawa
Trevor Darrell
Shiry Ginosar
275
37
0
21 Aug 2023
V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by
  Connecting Foundation Models
V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation ModelsAAAI Conference on Artificial Intelligence (AAAI), 2023
Heng Wang
Jianbo Ma
Santiago Pascual
Richard Cartwright
Weidong (Tom) Cai
VGen
397
75
0
18 Aug 2023
FoodSAM: Any Food Segmentation
FoodSAM: Any Food SegmentationIEEE transactions on multimedia (IEEE TMM), 2023
Xing Lan
Jiayi Lyu
Han Jiang
Kunkun Dong
Zehai Niu
Yi Zhang
Jian Xue
VLM
285
40
0
11 Aug 2023
OpenProteinSet: Training data for structural biology at scale
OpenProteinSet: Training data for structural biology at scaleNeural Information Processing Systems (NeurIPS), 2023
Gustaf Ahdritz
N. Bouatta
S. Kadyan
Lukas Jarosch
Daniel Berenberg
Ian Fisk
Andrew Watkins
Stephen Ra
Richard Bonneau
Mohammed AlQuraishi
AI4CE
232
23
0
10 Aug 2023
Multimodal Neurons in Pretrained Text-Only Transformers
Multimodal Neurons in Pretrained Text-Only Transformers
Sarah Schwettmann
Neil Chowdhury
Samuel J. Klein
David Bau
Antonio Torralba
MILM
272
43
0
03 Aug 2023
Transformers are Universal Predictors
Transformers are Universal Predictors
Sourya Basu
Moulik Choraria
Lav Varshney
150
6
0
15 Jul 2023
Large Language Models as General Pattern Machines
Large Language Models as General Pattern MachinesConference on Robot Learning (CoRL), 2023
Suvir Mirchandani
F. Xia
Peter R. Florence
Brian Ichter
Danny Driess
Montse Gonzalez Arenas
Kanishka Rao
Dorsa Sadigh
Andy Zeng
LLMAG
330
260
0
10 Jul 2023
Mx2M: Masked Cross-Modality Modeling in Domain Adaptation for 3D
  Semantic Segmentation
Mx2M: Masked Cross-Modality Modeling in Domain Adaptation for 3D Semantic SegmentationAAAI Conference on Artificial Intelligence (AAAI), 2023
Boxiang Zhang
Zunran Wang
Yonggen Ling
Yuanyuan Guan
Shenghao Zhang
Wenhui Li
219
10
0
09 Jul 2023
Investigating Pre-trained Language Models on Cross-Domain Datasets, a
  Step Closer to General AI
Investigating Pre-trained Language Models on Cross-Domain Datasets, a Step Closer to General AI
Mohamad Ballout
U. Krumnack
Gunther Heidemann
Kai-Uwe Kühnberger
151
6
0
21 Jun 2023
Opening the Black Box: Analyzing Attention Weights and Hidden States in
  Pre-trained Language Models for Non-language Tasks
Opening the Black Box: Analyzing Attention Weights and Hidden States in Pre-trained Language Models for Non-language Tasks
Mohamad Ballout
U. Krumnack
Gunther Heidemann
Kai-Uwe Kühnberger
120
2
0
21 Jun 2023
Language Versatilists vs. Specialists: An Empirical Revisiting on
  Multilingual Transfer Ability
Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability
Jiacheng Ye
Xijia Tao
Lingpeng Kong
LRM
222
33
0
11 Jun 2023
Large Language Models are In-Context Semantic Reasoners rather than
  Symbolic Reasoners
Large Language Models are In-Context Semantic Reasoners rather than Symbolic Reasoners
Xiaojuan Tang
Zilong Zheng
Jiaqi Li
Fanxu Meng
Song-Chun Zhu
Yitao Liang
Muhan Zhang
ReLMLRM
305
76
0
24 May 2023
1234
Next
Page 1 of 4