Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Journal of machine learning research (JMLR), 2019
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 11,958 papers shown
Title
PromptSep: Generative Audio Separation via Multimodal Prompting
Yutong Wen
Ke Chen
Prem Seetharaman
Oriol Nieto
Jiaqi Su
Rithesh Kumar
Minje Kim
Paris Smaragdis
Zeyu Jin
Justin Salamon
DiffM
261
0
0
06 Nov 2025
RISE-T2V: Rephrasing and Injecting Semantics with LLM for Expansive Text-to-Video Generation
Xiangjun Zhang
Litong Gong
Yinglin Zheng
Yansong Liu
Wentao Jiang
Mingyi Xu
Biao Wang
Tiezheng Ge
Ming Zeng
DiffM
VGen
124
1
0
06 Nov 2025
DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization
Yuantian Shao
Yuanteng Chen
Peisong Wang
Jianlin Yu
Jing Lin
Yiwu Yao
Zhihui Wei
Jian Cheng
MQ
260
0
0
06 Nov 2025
MIDI-LLM: Adapting Large Language Models for Text-to-MIDI Music Generation
Shih-Lun Wu
Yoon Kim
Cheng-Zhi Anna Huang
242
0
0
06 Nov 2025
ScaleDL: Towards Scalable and Efficient Runtime Prediction for Distributed Deep Learning Workloads
Xiaokai Wang
Shaoyuan Huang
Yuting Li
Xiaofei Wang
GNN
AI4CE
256
0
0
06 Nov 2025
E-CARE: An Efficient LLM-based Commonsense-Augmented Framework for E-Commerce
Ge Zhang
Rohan Deepak Ajwani
Tony Zheng
Hongjian Gu
Yaochen Hu
Wei Guo
Mark Coates
Yingxue Zhang
LRM
144
0
0
06 Nov 2025
Reusing Pre-Training Data at Test Time is a Compute Multiplier
Alex Fang
Thomas Voice
Ruoming Pang
Ludwig Schmidt
Tom Gunter
102
0
0
06 Nov 2025
Promoting Sustainable Web Agents: Benchmarking and Estimating Energy Consumption through Empirical and Theoretical Analysis
Lars Krupp
Daniel Geißler
Vishal Banwari
P. Lukowicz
Jakob Karolus
LLMAG
LM&Ro
164
1
0
06 Nov 2025
PuzzleMoE: Efficient Compression of Large Mixture-of-Experts Models via Sparse Expert Merging and Bit-packed inference
Yushu Zhao
Zheng Wang
Minjia Zhang
MoE
141
0
0
06 Nov 2025
Diffusion Language Models are Super Data Learners
Jinjie Ni
Qian Liu
Longxu Dou
Chao Du
Zili Wang
Hang Yan
Tianyu Pang
Michael Shieh
AI4CE
125
9
0
05 Nov 2025
OMPILOT: Harnessing Transformer Models for Auto Parallelization to Shared Memory Computing Paradigms
Arijit Bhattacharjee
Ali TehraniJamsaz
Le Chen
N. Hasabnis
Mihai Capota
Nesreen K. Ahmed
Ali Jannesari
116
0
0
05 Nov 2025
Divide, Cache, Conquer: Dichotomic Prompting for Efficient Multi-Label LLM-Based Classification
Mikołaj Langner
Jan Eliasz
Ewa Rudnicka
Jan Kocoń
72
0
0
05 Nov 2025
AyurParam: A State-of-the-Art Bilingual Language Model for Ayurveda
Mohd Nauman
Sravan Gvm
Vijay Devane
Shyam Pawar
Viraj Thakur
Kundeshwar Pundalik
Piyush Sawarkar
Rohit Saluja
Maunendra Sankar Desarkar
Ganesh Ramakrishnan
LM&MA
ELM
232
0
0
04 Nov 2025
Dynamic Reflections: Probing Video Representations with Text Alignment
Tyler Zhu
Tengda Han
Leonidas Guibas
Viorica Patraucean
M. Ovsjanikov
VGen
233
0
0
04 Nov 2025
Data-Efficient Adaptation and a Novel Evaluation Method for Aspect-based Sentiment Analysis
Y. Hua
Paul Denny
Jorg Wicker
Katerina Taskova
84
0
0
04 Nov 2025
Memory-Efficient Training with In-Place FFT Implementation
Xinyu Ding
Bangtian Liu
Siyu Liao
Zhongfeng Wang
193
0
0
03 Nov 2025
Random Initialization of Gated Sparse Adapters
Vi Retault
Yohaï-Eliel Berreby
CLL
MoE
184
0
0
03 Nov 2025
CMI-MTL: Cross-Mamba interaction based multi-task learning for medical visual question answering
Qiangguo Jin
Xianyao Zheng
Hui Cui
Changming Sun
Yuqi Fang
Cong Cong
R. Su
Leyi Wei
Ping Xuan
Junbo Wang
100
0
0
03 Nov 2025
ExplicitLM: Decoupling Knowledge from Parameters via Explicit Memory Banks
Chengzhang Yu
Zening Lu
Chenyang Zheng
C. Wang
Yiming Zhang
Zhanpeng Jin
KELM
107
0
0
03 Nov 2025
AraFinNews: Arabic Financial Summarisation with Domain-Adapted LLMs
Mo El-Haj
Paul Rayson
AIFin
386
0
0
03 Nov 2025
CAT-ID
2
^2
2
: Category-Tree Integrated Document Identifier Learning for Generative Retrieval In E-commerce
Xiaoyu Liu
Fuwei Zhang
Yiqing Wu
Xinyu Jia
Zenghua Xia
Fuzhen Zhuang
Zhao Zhang
Fei Jiang
Wei Lin
VLM
215
1
0
03 Nov 2025
HPLT 3.0: Very Large-Scale Multilingual Resources for LLM and MT. Mono- and Bi-lingual Data, Multilingual Evaluation, and Pre-Trained Models
Stephan Oepen
Nikolay Arefev
Mikko Aulamo
Marta Bañón
Maja Buljan
...
Teemu Vahtola
Dušan Variš
Fedor Vitiugin
Tea Vojtěchová
Jaume Zaragoza
166
0
0
02 Nov 2025
Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models
Panwang Pan
Chenguo Lin
Jingjing Zhao
Chenxin Li
Yuchen Lin
...
Honglei Yan
Kairun Wen
Yunlong Lin
Yixuan Yuan
Yadong Mu
3DGS
VGen
125
1
0
01 Nov 2025
Reviving Stale Updates: Data-Free Knowledge Distillation for Asynchronous Federated Learning
Baris Askin
Holger Roth
Zhenyu Sun
Carlee Joe-Wong
Gauri Joshi
Ziyue Xu
FedML
192
0
0
01 Nov 2025
Isotropic Curvature Model for Understanding Deep Learning Optimization: Is Gradient Orthogonalization Optimal?
Weijie Su
132
0
0
01 Nov 2025
Air Pollution Forecasting in Bucharest
Dragoş-Andrei Şerban
Razvan-Alexandru Smadu
Dumitru-Clementin Cercel
AI4TS
85
0
0
01 Nov 2025
Listwise Preference Diffusion Optimization for User Behavior Trajectories Prediction
Hongtao Huang
Chengkai Huang
Junda Wu
Tong Yu
Julian McAuley
Lina Yao
148
0
0
01 Nov 2025
ID-Crafter: VLM-Grounded Online RL for Compositional Multi-Subject Video Generation
Panwang Pan
Jingjing Zhao
Yuchen Lin
Chenguo Lin
Chenxin Li
Haopeng Li
Honglei Yan
Tingting Shen
DiffM
VGen
304
0
0
01 Nov 2025
Generative Semantic Coding for Ultra-Low Bitrate Visual Communication and Analysis
Weiming Chen
Yijia Wang
Zhihan Zhu
Z. He
DiffM
112
0
0
31 Oct 2025
E-MMDiT: Revisiting Multimodal Diffusion Transformer Design for Fast Image Synthesis under Limited Resources
Tong Shen
Jingai Yu
Dong Zhou
Dong Li
E. Barsoum
DiffM
95
0
0
31 Oct 2025
EBT-Policy: Energy Unlocks Emergent Physical Reasoning Capabilities
Travis Davies
Yiqi Huang
Alexi Gladstone
Yunxin Liu
Xiang Chen
Heng Ji
Huxian Liu
Luhui Hu
OffRL
148
1
0
31 Oct 2025
BiSparse-AAS: Bilinear Sparse Attention and Adaptive Spans Framework for Scalable and Efficient Text Summarization
D. Hagos
Legand L. Burge
Anietie Andy
Anis Yazidi
Vladimir Vlassov
136
0
0
31 Oct 2025
InertialAR: Autoregressive 3D Molecule Generation with Inertial Frames
Haorui Li
Weitao Du
Yuqiang Li
Hongyu Guo
Shengchao Liu
128
1
0
31 Oct 2025
Don't Let It Fade: Preserving Edits in Diffusion Language Models via Token Timestep Allocation
Woojin Kim
Jaeyoung Do
124
0
0
30 Oct 2025
An All-Reduce Compatible Top-K Compressor for Communication-Efficient Distributed Learning
Chuyan Chen
Chenyang Ma
Zhangxin Li
Yutong He
Yanjie Dong
Kun Yuan
244
0
0
30 Oct 2025
Bridging the Gap Between Molecule and Textual Descriptions via Substructure-aware Alignment
Hyuntae Park
Yeachan Kim
SangKeun Lee
56
1
0
30 Oct 2025
SecureReviewer: Enhancing Large Language Models for Secure Code Review through Secure-aware Fine-tuning
Fang Liu
Simiao Liu
Yinghao Zhu
Xiaoli Lian
Li Zhang
AAML
110
0
0
30 Oct 2025
Mixture-of-Transformers Learn Faster: A Theoretical Study on Classification Problems
Hongbo Li
Qinhang Wu
Sen-Fon Lin
Yingbin Liang
Ness B. Shroff
MoE
124
0
0
30 Oct 2025
Integrating Ontologies with Large Language Models for Enhanced Control Systems in Chemical Engineering
Crystal Su
Kuai Yu
Jingrui Zhang
Mingyuan Shao
Daniel Bauer
AI4CE
73
0
0
30 Oct 2025
UniTok-Audio: A Unified Audio Generation Framework via Generative Modeling on Discrete Codec Tokens
Chengwei Liu
Haoyin Yan
Shaofei Xue
Xiaotao Liang
Yinghao Liu
Zheng Xue
Gang Song
Boyang Zhou
203
2
0
30 Oct 2025
The Quest for Generalizable Motion Generation: Data, Model, and Evaluation
Jing Lin
R. Wang
Junzhe Lu
Ziqi Huang
Guorui Song
...
Wanqi Yin
Qingping Sun
Zhongang Cai
Lei Yang
Ziwei Liu
DiffM
VGen
133
2
0
30 Oct 2025
Jasmine: A Simple, Performant and Scalable JAX-based World Modeling Codebase
Mihir Mahajan
Alfred Nguyen
Franz Srambical
Stefan Bauer
160
0
0
30 Oct 2025
Knowledge-Guided Textual Reasoning for Explainable Video Anomaly Detection via LLMs
Hari Lee
40
0
0
30 Oct 2025
Towards Scaling Laws for Symbolic Regression
David Otte
Jörg Franke
Frank Hutter
109
0
0
30 Oct 2025
Encoder-Decoder or Decoder-Only? Revisiting Encoder-Decoder Large Language Model
Biao Zhang
Yong Cheng
Siamak Shakeri
Xinyi Wang
Min Ma
Orhan Firat
125
1
0
30 Oct 2025
1+1>2: A Synergistic Sparse and Low-Rank Compression Method for Large Language Models
Zeliang Zong
Kai Zhang
Zheyang Li
Wenming Tan
Ye Ren
Yiyan Zhai
Jilin Hu
120
0
0
30 Oct 2025
Reasoning Up the Instruction Ladder for Controllable Language Models
Zishuo Zheng
Vidhisha Balachandran
Chan Young Park
Faeze Brahman
Sachin Kumar
LRM
211
0
0
30 Oct 2025
Benchmarking Generative AI Against Bayesian Optimization for Constrained Multi-Objective Inverse Design
Muhammad Bilal Awan
Abdul Razzaq
Abdul Shahid
65
0
0
29 Oct 2025
Beyond One-Size-Fits-All: Personalized Harmful Content Detection with In-Context Learning
Rufan Zhang
Lin Zhang
Xianghang Mi
64
0
0
29 Oct 2025
PSTF-AttControl: Per-Subject-Tuning-Free Personalized Image Generation with Controllable Face Attributes
Image and Vision Computing (IVC), 2025
Xiang Liu
Zhaoxiang Liu
Huan Hu
Zipeng Wang
Ping Chen
Z. Chen
Kai Wang
Shiguo Lian
97
0
0
29 Oct 2025
Previous
1
2
3
4
5
...
238
239
240
Next