Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Journal of machine learning research (JMLR), 2019
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 11,958 papers shown
Title
CreBench: Human-Aligned Creativity Evaluation from Idea to Process to Product
Kaiwen Xue
Chenglong Li
Zhonghong Ou
Guoxin Zhang
Kaoyan Lu
...
Xinyu Liu
Qunlin Chen
Weiwei Qin
Yiran Shen
Jiayi Cen
96
0
0
17 Nov 2025
Uni-Hema: Unified Model for Digital Hematopathology
Abdul Rehman
Iqra Rasool
Ayisha Imran
Mohsen Ali
Waqas Sultani
VLM
128
0
0
17 Nov 2025
Tokenize Once, Recommend Anywhere: Unified Item Tokenization for Multi-domain LLM-based Recommendation
Yu Hou
Won-Yong Shin
49
0
0
17 Nov 2025
Scalable and Efficient Large-Scale Log Analysis with LLMs: An IT Software Support Case Study
Pranjal Gupta
Karan Bhukar
H. Kumar
Seema Nagar
P. Mohapatra
Debanjana Kar
52
0
0
17 Nov 2025
Infinite-Story: A Training-Free Consistent Text-to-Image Generation
Jihun Park
Kyoungmin Lee
Jongmin Gim
Hyeonseo Jo
Minseok Oh
Wonhyeok Choi
K. Hwang
Jaeyeul Kim
Minwoo Choi
S. Im
103
0
1
17 Nov 2025
HiGFA: Hierarchical Guidance for Fine-grained Data Augmentation with Diffusion Models
Zhiguang Lu
Qianqian Xu
Peisong Wen
Siran Da
Qingming Huang
DiffM
593
0
0
16 Nov 2025
Multivariate Diffusion Transformer with Decoupled Attention for High-Fidelity Mask-Text Collaborative Facial Generation
Yushe Cao
Dianxi Shi
Xing Fu
Xuechao Zou
Haikuo Peng
Xueqi Li
Chun Yu
Junliang Xing
DiffM
192
0
0
16 Nov 2025
Do LLMs and Humans Find the Same Questions Difficult? A Case Study on Japanese Quiz Answering
Naoya Sugiura
Kosuke Yamada
Yasuhiro Ogawa
Katsuhiko Toyama
Ryohei Sasano
77
0
0
15 Nov 2025
GeoMVD: Geometry-Enhanced Multi-View Generation Model Based on Geometric Information Extraction
Jiaqi Wu
Yaosen Chen
Shuyuan Zhu
VGen
280
0
0
15 Nov 2025
OAD-Promoter: Enhancing Zero-shot VQA using Large Language Models with Object Attribute Description
Quanxing Xu
Ling Zhou
Feifei Zhang
Jinyu Tian
Rubing Huang
VLM
180
0
0
15 Nov 2025
Mixture of States: Routing Token-Level Dynamics for Multimodal Generation
Haozhe Liu
Ding Liu
Mingchen Zhuge
Zijian Zhou
Tian Xie
...
Juan-Manuel Perez-Rua
Tao Xiang
Wei Liu
Shikun Liu
Jürgen Schmidhuber
84
0
0
15 Nov 2025
MAVIS: A Benchmark for Multimodal Source Attribution in Long-form Visual Question Answering
Seokwon Song
Minsu Park
Gunhee Kim
72
0
0
15 Nov 2025
KVSwap: Disk-aware KV Cache Offloading for Long-Context On-device Inference
H. Zhang
Chunwei Xia
Zheng Wang
SyDa
252
1
0
14 Nov 2025
Large Language Models and 3D Vision for Intelligent Robotic Perception and Autonomy
Italian National Conference on Sensors (INS), 2025
Vinit Mehta
Charu Sharma
Karthick Thiyagarajan
LM&Ro
356
1
0
14 Nov 2025
Improving LLM's Attachment to External Knowledge In Dialogue Generation Tasks Through Entity Anonymization
Hadi Sheikhi
Chenyang Huang
Osmar R. Zaiane
88
0
0
14 Nov 2025
Selective Sinkhorn Routing for Improved Sparse Mixture of Experts
Duc Nguyen
Huu Binh Ta
Nhuan Le Duc
T. Nguyen
T. Tran
MoE
349
0
0
12 Nov 2025
Not Everything That Counts Can Be Counted: A Case for Safe Qualitative AI
SoftwareX (SoftwareX), 2025
Stine Beltoft
Lukas Galke
45
0
0
12 Nov 2025
Beyond Randomness: Understand the Order of the Noise in Diffusion
Song Yan
Min Li
Bi Xinliang
J. Yang
Yusen Zhang
Guanye Xiong
Yunwei Lan
Tao Zhang
Wei Zhai
Zheng-jun Zha
DiffM
272
0
0
11 Nov 2025
WaterMod: Modular Token-Rank Partitioning for Probability-Balanced LLM Watermarking
Shinwoo Park
Hyejin Park
Hyeseon Ahn
Yo-Sub Han
227
0
0
11 Nov 2025
ProbSelect: Stochastic Client Selection for GPU-Accelerated Compute Devices in the 3D Continuum
Andrija Stanisic
Stefan Nastic
73
0
0
11 Nov 2025
A Unified Geometric Field Theory Framework for Transformers: From Manifold Embeddings to Kernel Modulation
Xianshuai Shi
Jianfeng Zhu
Leibo Liu
82
0
0
11 Nov 2025
Laytrol: Preserving Pretrained Knowledge in Layout Control for Multimodal Diffusion Transformers
Sida Huang
Siqi Huang
Ping Luo
Hongyuan Zhang
DiffM
232
2
0
11 Nov 2025
A Circular Argument : Does RoPE need to be Equivariant for Vision?
Chase van de Geijn
Timo Lüddecke
Polina Turishcheva
Alexander S. Ecker
118
2
0
11 Nov 2025
Compression then Matching: An Efficient Pre-training Paradigm for Multimodal Embedding
Da Li
Yuxiao Luo
Keping Bi
Jiafeng Guo
Wei Yuan
B. Yang
Yan Wang
Fan Yang
Tingting Gao
Guorui Zhou
VLM
225
0
0
11 Nov 2025
Introducing A Bangla Sentence - Gloss Pair Dataset for Bangla Sign Language Translation and Research
Neelavro Saha
Rafi Shahriyar
Nafis Ashraf Roudra
Saadman Sakib
Annajiat Alim Rasel
128
1
0
11 Nov 2025
CleverBirds: A Multiple-Choice Benchmark for Fine-grained Human Knowledge Tracing
Leonie Bossemeyer
Samuel Heinrich
Grant Van Horn
Oisin Mac Aodha
84
0
0
11 Nov 2025
SWAN - Enabling Fast and Mobile Histopathology Image Annotation through Swipeable Interfaces
Sweta Banerjee
Timo Gosch
Sara Hester
Viktoria Weiss
Thomas Conrad
...
R. Klopfleisch
C. Kaltenecker
C. Bertram
Katharina Breininger
Marc Aubreville
32
0
0
11 Nov 2025
Majority Rules: LLM Ensemble is a Winning Approach for Content Categorization
Ariel Kamen
Yakov Kamen
44
0
0
11 Nov 2025
oboro: Text-to-Image Synthesis on Limited Data using Flow-based Diffusion Transformer with MMH Attention
Ryusuke Mizutani
Kazuaki Matano
Tsugumi Kadowaki
Haruki Tenya
Layris
nuigurumi
Koki Hashimoto
Yu Tanaka
146
0
0
11 Nov 2025
LLM Optimization Unlocks Real-Time Pairwise Reranking
Jingyu Wu
Aditya Shrivastava
Jing Zhu
Alfy Samuel
Anoop Kumar
Daben Liu
92
1
0
10 Nov 2025
Generating an Image From 1,000 Words: Enhancing Text-to-Image With Structured Captions
Eyal Gutflaish
Eliran Kachlon
Hezi Zisman
Tal Hacham
Nimrod Sarid
...
Saar Huberman
Gal Davidi
Guy Bukchin
Kfir Goldberg
Ron Mokady
DiffM
VLM
197
1
0
10 Nov 2025
Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
Yingfeng Luo
Ziqiang Xu
Yuxuan Ouyang
Murun Yang
Dingyang Lin
...
Bei Li
Peinan Feng
Quan Du
Tong Xiao
Jingbo Zhu
LRM
219
0
0
10 Nov 2025
FedRW: Efficient Privacy-Preserving Data Reweighting for Enhancing Federated Learning of Language Models
Pukang Ye
Junwei Luo
Xiaolei Dong
Yunbo Yang
101
0
0
10 Nov 2025
Rethinking Parameter Sharing as Graph Coloring for Structured Compression
Boyang Zhang
Daning Cheng
Yunquan Zhang
168
0
0
10 Nov 2025
A Decentralized Retrieval Augmented Generation System with Source Reliabilities Secured on Blockchain
Yining Lu
Wenyi Tang
Max Johnson
Taeho Jung
Meng Jiang
72
0
0
10 Nov 2025
Reaction Prediction via Interaction Modeling of Symmetric Difference Shingle Sets
Runhan Shi
Letian Chen
Gufeng Yu
Yang Yang
AAML
207
0
0
09 Nov 2025
Vocabulary In-Context Learning in Transformers: Benefits of Positional Encoding
Qian Ma
Ruoxiang Xu
Yongqiang Cai
72
0
0
09 Nov 2025
TinyChemVL: Advancing Chemical Vision-Language Models via Efficient Visual Token Reduction and Complex Reaction Tasks
Xuanle Zhao
Shuxin Zeng
Yinyuan Cai
Xiang Cheng
Duzhen Zhang
Xiuyi Chen
Bo Xu
132
0
0
09 Nov 2025
Seq2Seq Models Reconstruct Visual Jigsaw Puzzles without Seeing Them
Gur Elkn
Ofir Itzhak Shahar
Ohad Ben-Shahar
VLM
80
0
0
09 Nov 2025
LLaDA-Rec: Discrete Diffusion for Parallel Semantic ID Generation in Generative Recommendation
Teng Shi
Chenglei Shen
Weijie Yu
Shen Nie
Chongxuan Li
Xiao Zhang
Ming He
Yan Han
Jun Xu
DiffM
96
0
0
09 Nov 2025
FLEX: Continuous Agent Evolution via Forward Learning from Experience
Zhicheng Cai
Xinyuan Guo
Yu Pei
Jiangtao Feng
Jiangjie Chen
Ya Zhang
Wei-Ying Ma
Mingxuan Wang
Hao Zhou
Hao Zhou
CLL
LLMAG
LRM
250
3
0
09 Nov 2025
CGCE: Classifier-Guided Concept Erasure in Generative Models
Viet Nguyen
Vishal M. Patel
144
0
0
08 Nov 2025
MOSS: Efficient and Accurate FP8 LLM Training with Microscaling and Automatic Scaling
Yu Zhang
Hui-Ling Zhen
Mingxuan Yuan
Bei Yu
MQ
273
0
0
08 Nov 2025
From Hubs to Deserts: Urban Cultural Accessibility Patterns with Explainable AI
Protik Bose Pranto
Minhazul Islam
Ripon Kumar Saha
Abimelec Mercado Rivera
Namig Abbasov
76
0
0
08 Nov 2025
Attention and Compression is all you need for Controllably Efficient Language Models
Jatin Prakash
A. Puli
Rajesh Ranganath
MQ
VLM
434
0
0
07 Nov 2025
Reasoning-Guided Claim Normalization for Noisy Multilingual Social Media Posts
Manan Sharma
Arya Suneesh
Manish Jain
Pawan Kumar Rajpoot
Prasanna Devadiga
Bharatdeep Hazarika
Ashish Shrivastava
Kishan Gurumurthy
Anshuman B Suresh
Aditya U Baliga
108
0
0
07 Nov 2025
Search Is Not Retrieval: Decoupling Semantic Matching from Contextual Assembly in RAG
Harshit Nainwani
Hediyeh Baban
AI4TS
232
0
0
07 Nov 2025
ManufactuBERT: Efficient Continual Pretraining for Manufacturing
Robin Armingaud
Romaric Besançon
68
0
0
07 Nov 2025
A Representation Sharpening Framework for Zero Shot Dense Retrieval
Dhananjay Ashok
Suraj Nair
Mutasem Al-Darabsah
C. Teo
Tarun Agarwal
Jonathan May
96
0
0
07 Nov 2025
TabDistill: Distilling Transformers into Neural Nets for Few-Shot Tabular Classification
Pasan Dissanayake
Sanghamitra Dutta
LMTD
309
0
0
07 Nov 2025
Previous
1
2
3
4
5
6
...
238
239
240
Next