ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Journal of machine learning research (JMLR), 2019
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 12,040 papers shown
LAOF: Robust Latent Action Learning with Optical Flow Constraints
Xizhou Bu
Jiexi Lyu
Fulei Sun
R. G. Yang
Zhiqiang Ma
Wei Li
112
0
0
20 Nov 2025
NLP Datasets for Idiom and Figurative Language Tasks
NLP Datasets for Idiom and Figurative Language Tasks
Blake Matheny
Phuong Minh Nguyen
Minh Le Nguyen
Stephanie Reynolds
125
0
0
20 Nov 2025
AskDB: An LLM Agent for Natural Language Interaction with Relational Databases
Xuan-Quang Phan
Tan-Ha Mai
Thai-Duy Dinh
Minh-Thuan Nguyen
Lam-Son Lê
108
0
0
20 Nov 2025
When Structure Doesn't Help: LLMs Do Not Read Text-Attributed Graphs as Effectively as We Expected
When Structure Doesn't Help: LLMs Do Not Read Text-Attributed Graphs as Effectively as We Expected
Haotian Xu
Yuning You
Tengfei Ma
116
0
0
20 Nov 2025
You Only Forward Once: An Efficient Compositional Judging Paradigm
You Only Forward Once: An Efficient Compositional Judging Paradigm
Tianlong Zhang
Hongwei Xue
Shilin Yan
Di Wu
Chen Xu
Y. Yang
139
0
0
20 Nov 2025
Sparse Autoencoders are Topic Models
Leander Girrbach
Zeynep Akata
119
0
0
20 Nov 2025
Text2Loc++: Generalizing 3D Point Cloud Localization from Natural Language
Text2Loc++: Generalizing 3D Point Cloud Localization from Natural Language
Yan Xia
Letian Shi
Yilin Di
João F. Henriques
Daniel Cremers
3DPC
156
0
0
19 Nov 2025
Walrus: A Cross-Domain Foundation Model for Continuum Dynamics
Walrus: A Cross-Domain Foundation Model for Continuum Dynamics
Michael McCabe
Payel Mukhopadhyay
Tanya Marwah
Bruno Régaldo-Saint Blancard
François Rozet
...
Mariel Pettee
Jeff Shen
Kyunghyun Cho
M. Cranmer
S. Ho
AI4CE
243
3
0
19 Nov 2025
Insert In Style: A Zero-Shot Generative Framework for Harmonious Cross-Domain Object Composition
Insert In Style: A Zero-Shot Generative Framework for Harmonious Cross-Domain Object Composition
Raghu Chittersu
Yuvraj Singh Rathore
Pranav Adlinge
Kunal Swami
DiffM
268
0
0
19 Nov 2025
What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity
What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity
Alexis Audran-Reiss
Jordi Armengol-Estapé
Karen Hambardzumyan
Amar Budhiraja
Martin Josifoski
...
Jenny Zhang
Taco Cohen
Yossi Adi
Tatiana Shavrina
Yoram Bachrach
175
2
0
19 Nov 2025
UniFit: Towards Universal Virtual Try-on with MLLM-Guided Semantic Alignment
W. Zhang
Yeying Jin
Xin Li
Yan Zhang
Xiaofeng Cong
Cong Wang
Fengcai Qiao
zhichao Lian
94
0
0
19 Nov 2025
PocketLLM: Ultimate Compression of Large Language Models via Meta Networks
PocketLLM: Ultimate Compression of Large Language Models via Meta Networks
Ye Tian
Chengcheng Wang
Jing Han
Yehui Tang
Kai Han
MQ
124
0
0
19 Nov 2025
IPR-1: Interactive Physical Reasoner
IPR-1: Interactive Physical Reasoner
Mingyu Zhang
Lifeng Zhuo
Tianxi Tan
Guocan Xie
Xian Nie
...
Renjie Zhao
Zizhu He
Z. Wang
Jiting Cai
Yong-Lu Li
PINNLRMAI4CE
407
0
0
19 Nov 2025
UniHOI: Unified Human-Object Interaction Understanding via Unified Token Space
UniHOI: Unified Human-Object Interaction Understanding via Unified Token Space
Panqi Yang
Haodong Jing
Nanning Zheng
Yongqiang Ma
216
0
0
19 Nov 2025
Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
Samih Fadli
65
0
0
19 Nov 2025
Effective Code Membership Inference for Code Completion Models via Adversarial Prompts
Effective Code Membership Inference for Code Completion Models via Adversarial Prompts
Yuan Jiang
Zehao Li
Shan Huang
Christoph Treude
Xiaohong Su
Tiantian Wang
AAML
264
1
0
19 Nov 2025
SplitFlux: Learning to Decouple Content and Style from a Single Image
SplitFlux: Learning to Decouple Content and Style from a Single Image
Yitong Yang
Y Samuel Wang
Changshuo Wang
Yongjun Zhang
Ziyang Chen
Shuting He
223
0
0
19 Nov 2025
DEVAL: A Framework for Evaluating and Improving the Derivation Capability of Large Language Models
DEVAL: A Framework for Evaluating and Improving the Derivation Capability of Large Language Models
Y. Li
Qin Li
Min Zhang
Min Zhang
LRM
213
0
0
18 Nov 2025
Foundational Question Generation for Video Question Answering via an Embedding-Integrated Approach
Foundational Question Generation for Video Question Answering via an Embedding-Integrated Approach
Ju-Young Oh
106
0
0
18 Nov 2025
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
Mingyue Cheng
Jie Ouyang
Shuo Yu
Ruiran Yan
Yucong Luo
Zirui Liu
Daoyu Wang
Qi Liu
Enhong Chen
146
7
0
18 Nov 2025
ArbESC+: Arabic Enhanced Edit Selection System Combination for Grammatical Error Correction Resolving conflict and improving system combination in Arabic GEC
ArbESC+: Arabic Enhanced Edit Selection System Combination for Grammatical Error Correction Resolving conflict and improving system combination in Arabic GEC
Ahlam Alrehili
Areej Alhothali
KELM
134
0
0
18 Nov 2025
Scalable and Efficient Large-Scale Log Analysis with LLMs: An IT Software Support Case Study
Scalable and Efficient Large-Scale Log Analysis with LLMs: An IT Software Support Case Study
Pranjal Gupta
Karan Bhukar
H. Kumar
Seema Nagar
P. Mohapatra
Debanjana Kar
68
0
0
17 Nov 2025
Uni-Hema: Unified Model for Digital Hematopathology
Uni-Hema: Unified Model for Digital Hematopathology
Abdul Rehman
Iqra Rasool
Ayisha Imran
Mohsen Ali
Waqas Sultani
VLM
153
0
0
17 Nov 2025
Translation Entropy: A Statistical Framework for Evaluating Translation Systems
Translation Entropy: A Statistical Framework for Evaluating Translation Systems
Ronit D. Gross
Yanir Harel
Ido Kanter
62
1
0
17 Nov 2025
CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving
CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving
Enhui Ma
Lijun Zhou
Tao Tang
Jiahuan Zhang
Junpeng Jiang
...
Xianpeng Lang
Haiyang Sun
Xia Zhou
Di Lin
Kaicheng Yu
252
0
0
17 Nov 2025
Infinite-Story: A Training-Free Consistent Text-to-Image Generation
Infinite-Story: A Training-Free Consistent Text-to-Image Generation
Jihun Park
Kyoungmin Lee
Jongmin Gim
Hyeonseo Jo
Minseok Oh
Wonhyeok Choi
K. Hwang
Jaeyeul Kim
Minwoo Choi
S. Im
112
1
1
17 Nov 2025
NeuroLex: A Lightweight Domain Language Model for EEG Report Understanding and Generation
NeuroLex: A Lightweight Domain Language Model for EEG Report Understanding and Generation
Kang Yin
Hye-Bin Shin
144
0
0
17 Nov 2025
CreBench: Human-Aligned Creativity Evaluation from Idea to Process to Product
CreBench: Human-Aligned Creativity Evaluation from Idea to Process to Product
Kaiwen Xue
Chenglong Li
Zhonghong Ou
Guoxin Zhang
Kaoyan Lu
...
Xinyu Liu
Qunlin Chen
Weiwei Qin
Yiran Shen
Jiayi Cen
123
0
0
17 Nov 2025
Tokenize Once, Recommend Anywhere: Unified Item Tokenization for Multi-domain LLM-based Recommendation
Tokenize Once, Recommend Anywhere: Unified Item Tokenization for Multi-domain LLM-based Recommendation
Yu Hou
Won-Yong Shin
74
0
0
17 Nov 2025
Multivariate Diffusion Transformer with Decoupled Attention for High-Fidelity Mask-Text Collaborative Facial Generation
Multivariate Diffusion Transformer with Decoupled Attention for High-Fidelity Mask-Text Collaborative Facial Generation
Yushe Cao
Dianxi Shi
Xing Fu
Xuechao Zou
Haikuo Peng
Xueqi Li
Chun Yu
Junliang Xing
DiffM
230
0
0
16 Nov 2025
HiGFA: Hierarchical Guidance for Fine-grained Data Augmentation with Diffusion Models
HiGFA: Hierarchical Guidance for Fine-grained Data Augmentation with Diffusion Models
Zhiguang Lu
Qianqian Xu
Peisong Wen
Siran Da
Qingming Huang
DiffM
705
0
0
16 Nov 2025
MAVIS: A Benchmark for Multimodal Source Attribution in Long-form Visual Question Answering
MAVIS: A Benchmark for Multimodal Source Attribution in Long-form Visual Question Answering
Seokwon Song
Minsu Park
Gunhee Kim
106
0
0
15 Nov 2025
GeoMVD: Geometry-Enhanced Multi-View Generation Model Based on Geometric Information Extraction
GeoMVD: Geometry-Enhanced Multi-View Generation Model Based on Geometric Information Extraction
Jiaqi Wu
Yaosen Chen
Shuyuan Zhu
VGen
315
0
0
15 Nov 2025
OAD-Promoter: Enhancing Zero-shot VQA using Large Language Models with Object Attribute Description
OAD-Promoter: Enhancing Zero-shot VQA using Large Language Models with Object Attribute Description
Quanxing Xu
Ling Zhou
Feifei Zhang
Jinyu Tian
Rubing Huang
VLM
263
0
0
15 Nov 2025
Do LLMs and Humans Find the Same Questions Difficult? A Case Study on Japanese Quiz Answering
Do LLMs and Humans Find the Same Questions Difficult? A Case Study on Japanese Quiz Answering
Naoya Sugiura
Kosuke Yamada
Yasuhiro Ogawa
Katsuhiko Toyama
Ryohei Sasano
105
0
0
15 Nov 2025
Mixture of States: Routing Token-Level Dynamics for Multimodal Generation
Mixture of States: Routing Token-Level Dynamics for Multimodal Generation
Haozhe Liu
Ding Liu
Mingchen Zhuge
Zijian Zhou
Tian Xie
...
Juan-Manuel Perez-Rua
Tao Xiang
Wei Liu
Shikun Liu
Jürgen Schmidhuber
105
0
0
15 Nov 2025
Large Language Models and 3D Vision for Intelligent Robotic Perception and Autonomy
Large Language Models and 3D Vision for Intelligent Robotic Perception and AutonomyItalian National Conference on Sensors (INS), 2025
Vinit Mehta
Charu Sharma
Karthick Thiyagarajan
LM&Ro
375
2
0
14 Nov 2025
Improving LLM's Attachment to External Knowledge In Dialogue Generation Tasks Through Entity Anonymization
Improving LLM's Attachment to External Knowledge In Dialogue Generation Tasks Through Entity Anonymization
Hadi Sheikhi
Chenyang Huang
Osmar R. Zaiane
113
0
0
14 Nov 2025
KVSwap: Disk-aware KV Cache Offloading for Long-Context On-device Inference
KVSwap: Disk-aware KV Cache Offloading for Long-Context On-device Inference
H. Zhang
Chunwei Xia
Zheng Wang
SyDa
351
1
0
14 Nov 2025
Selective Sinkhorn Routing for Improved Sparse Mixture of Experts
Selective Sinkhorn Routing for Improved Sparse Mixture of Experts
Duc Nguyen
Huu Binh Ta
Nhuan Le Duc
T. Nguyen
T. Tran
MoE
457
0
0
12 Nov 2025
Not Everything That Counts Can Be Counted: A Case for Safe Qualitative AI
Not Everything That Counts Can Be Counted: A Case for Safe Qualitative AISoftwareX (SoftwareX), 2025
Stine Beltoft
Lukas Galke
81
0
0
12 Nov 2025
CleverBirds: A Multiple-Choice Benchmark for Fine-grained Human Knowledge Tracing
CleverBirds: A Multiple-Choice Benchmark for Fine-grained Human Knowledge Tracing
Leonie Bossemeyer
Samuel Heinrich
Grant Van Horn
Oisin Mac Aodha
103
0
0
11 Nov 2025
A Unified Geometric Field Theory Framework for Transformers: From Manifold Embeddings to Kernel Modulation
A Unified Geometric Field Theory Framework for Transformers: From Manifold Embeddings to Kernel Modulation
Xianshuai Shi
Jianfeng Zhu
Leibo Liu
151
0
0
11 Nov 2025
Compression then Matching: An Efficient Pre-training Paradigm for Multimodal Embedding
Compression then Matching: An Efficient Pre-training Paradigm for Multimodal Embedding
Da Li
Yuxiao Luo
Keping Bi
Jiafeng Guo
Wei Yuan
B. Yang
Yan Wang
Fan Yang
Tingting Gao
Guorui Zhou
VLM
254
0
0
11 Nov 2025
ProbSelect: Stochastic Client Selection for GPU-Accelerated Compute Devices in the 3D Continuum
ProbSelect: Stochastic Client Selection for GPU-Accelerated Compute Devices in the 3D Continuum
Andrija Stanisic
Stefan Nastic
116
0
0
11 Nov 2025
Laytrol: Preserving Pretrained Knowledge in Layout Control for Multimodal Diffusion Transformers
Laytrol: Preserving Pretrained Knowledge in Layout Control for Multimodal Diffusion Transformers
Sida Huang
Siqi Huang
Ping Luo
Hongyuan Zhang
DiffM
294
3
0
11 Nov 2025
A Circular Argument : Does RoPE need to be Equivariant for Vision?
A Circular Argument : Does RoPE need to be Equivariant for Vision?
Chase van de Geijn
Timo Lüddecke
Polina Turishcheva
Alexander S. Ecker
160
2
0
11 Nov 2025
WaterMod: Modular Token-Rank Partitioning for Probability-Balanced LLM Watermarking
WaterMod: Modular Token-Rank Partitioning for Probability-Balanced LLM Watermarking
Shinwoo Park
Hyejin Park
Hyeseon Ahn
Yo-Sub Han
256
0
0
11 Nov 2025
Majority Rules: LLM Ensemble is a Winning Approach for Content Categorization
Ariel Kamen
Yakov Kamen
72
0
0
11 Nov 2025
Introducing A Bangla Sentence - Gloss Pair Dataset for Bangla Sign Language Translation and Research
Introducing A Bangla Sentence - Gloss Pair Dataset for Bangla Sign Language Translation and Research
Neelavro Saha
Rafi Shahriyar
Nafis Ashraf Roudra
Saadman Sakib
Annajiat Alim Rasel
154
1
0
11 Nov 2025
Previous
123456...239240241
Next
Page 3 of 241
Pageof 241