ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Journal of machine learning research (JMLR), 2019
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 12,021 papers shown
Cisco Time Series Model Technical Report
Cisco Time Series Model Technical Report
Liang Gou
Archit Khare
Praneet Pabolu
Prachi Patel
Joseph Ross
...
Jingze Sun
Kristal Curtis
Vedant Dharnidharka
Abhinav Mathur
Hao Yang
AI4TS
126
0
0
25 Nov 2025
Text-Guided Semantic Image Encoder
Text-Guided Semantic Image Encoder
Raghuveer Thirukovalluru
Xiaochuang Han
Bhuwan Dhingra
Emily Dinan
Maha Elbayad
VLM
152
0
0
25 Nov 2025
Dynamical Properties of Tokens in Self-Attention and Effects of Positional Encoding
Dynamical Properties of Tokens in Self-Attention and Effects of Positional Encoding
Duy-Tung Pham
A. Nguyen
Viet-Hoang Tran
Nhan-Phu Chung
Xin T. Tong
T. Nguyen
Thieu N. Vo
50
0
0
25 Nov 2025
CrossEarth-Gate: Fisher-Guided Adaptive Tuning Engine for Efficient Adaptation of Cross-Domain Remote Sensing Semantic Segmentation
CrossEarth-Gate: Fisher-Guided Adaptive Tuning Engine for Efficient Adaptation of Cross-Domain Remote Sensing Semantic Segmentation
Shilei Cao
Ziyang Gong
Hehai Lin
Yang Liu
Jiashun Cheng
...
C. Qin
Hong Cheng
Xue Yang
Juepeng Zheng
Haohuan Fu
228
0
0
25 Nov 2025
Copyright Detection in Large Language Models: An Ethical Approach to Generative AI Development
Copyright Detection in Large Language Models: An Ethical Approach to Generative AI Development
David Szczecina
Senan Gaffori
Edmond Li
DeLMO
357
0
0
25 Nov 2025
HiCoGen: Hierarchical Compositional Text-to-Image Generation in Diffusion Models via Reinforcement Learning
HiCoGen: Hierarchical Compositional Text-to-Image Generation in Diffusion Models via Reinforcement Learning
Hongji Yang
Yucheng Zhou
Wencheng Han
Runzhou Tao
Zhongying Qiu
Jianfei Yang
Jianbing Shen
DiffMEGVM
330
0
0
25 Nov 2025
Decoupling and Damping: Structurally-Regularized Gradient Matching for Multimodal Graph Condensation
Decoupling and Damping: Structurally-Regularized Gradient Matching for Multimodal Graph Condensation
Lian Shen
Zhendan Chen
Yinhui jiang
Meijia Song
Ziming Su
Juan Liu
Xiangrong Liu
84
0
0
25 Nov 2025
GigaWorld-0: World Models as Data Engine to Empower Embodied AI
GigaWorld-0: World Models as Data Engine to Empower Embodied AI
GigaWorld Team
Angen Ye
Boyuan Wang
Chaojun Ni
Guan Huang
...
Yang Wang
Yukun Zhou
Z. Zhang
Z. Dong
Zheng Zhu
VGenLM&Ro
360
2
0
25 Nov 2025
FastForward Pruning: Efficient LLM Pruning via Single-Step Reinforcement Learning
FastForward Pruning: Efficient LLM Pruning via Single-Step Reinforcement Learning
Xin Yuan
S. Li
Jiateng Wei
Chengrui Zhu
Yanming Wu
Qingpeng Li
Jiajun Lv
Xiaoke Lan
Jun Chen
Yong-Jin Liu
OffRL
368
0
0
24 Nov 2025
ABM-LoRA: Activation Boundary Matching for Fast Convergence in Low-Rank Adaptation
ABM-LoRA: Activation Boundary Matching for Fast Convergence in Low-Rank Adaptation
Dongha Lee
Jinhee Park
Minjun Kim
Junseok Kwon
AI4CE
397
0
0
24 Nov 2025
Growing with the Generator: Self-paced GRPO for Video Generation
Growing with the Generator: Self-paced GRPO for Video Generation
Rui Li
Yuanzhi Liang
Ziqi Ni
H. Huang
Chi Zhang
Xuelong Li
EGVMVGen
120
0
0
24 Nov 2025
An Invariant Latent Space Perspective on Language Model Inversion
An Invariant Latent Space Perspective on Language Model Inversion
Wentao Ye
Jiaqi Hu
Haobo Wang
Xinpeng Ti
Zhiqing Xiao
Hao Chen
Liyao Li
Lei Feng
Sai Wu
Junbo Zhao
72
0
0
24 Nov 2025
CafeQ: Calibration-free Quantization via Learned Transformations and Adaptive Rounding
CafeQ: Calibration-free Quantization via Learned Transformations and Adaptive Rounding
Ziteng Sun
Adrian Benton
Samuel Kushnir
Asher Trockman
Vikas Singh
Suhas Diggavi
A. Suresh
MQ
154
0
0
24 Nov 2025
Now You See It, Now You Don't - Instant Concept Erasure for Safe Text-to-Image and Video Generation
Now You See It, Now You Don't - Instant Concept Erasure for Safe Text-to-Image and Video Generation
Shristi Das Biswas
Arani Roy
Kaushik Roy
VGen
262
0
0
24 Nov 2025
LAA3D: A Benchmark of Detecting and Tracking Low-Altitude Aircraft in 3D Space
LAA3D: A Benchmark of Detecting and Tracking Low-Altitude Aircraft in 3D Space
Hai Wu
Shuai Tang
Jiale Wang
Longkun Zou
Mingyue Guo
Rongqin Liang
Ke Chen
Yaowei Wang
140
1
0
24 Nov 2025
IRSDA: An Agent-Orchestrated Framework for Enterprise Intrusion Response
IRSDA: An Agent-Orchestrated Framework for Enterprise Intrusion Response
Damodar Panigrahi
Raj Patel
Shaswata Mitra
Sudip Mittal
Shahram Rahimi
80
0
0
24 Nov 2025
A symbolic Perl algorithm for the unification of Nahuatl word spellings
A symbolic Perl algorithm for the unification of Nahuatl word spellingsMexican International Conference on Artificial Intelligence (MICAI), 2025
Juan-José Guzmán-Landa
Jesús Vázquez-Osorio
Juan-Manuel Torres-Moreno
Ligia Quintana-Torres
Miguel Figueroa-Saavedra
Martha Lorena Avendaño Garrido
Graham Ranger
Patricia Velázquez-Morales
Gerardo Eugenio Sierra Martínez
84
0
0
24 Nov 2025
Think Before You Prune: Selective Self-Generated Calibration for Pruning Large Reasoning Models
Think Before You Prune: Selective Self-Generated Calibration for Pruning Large Reasoning Models
Yang Xiang
Yixin Ji
Juntao Li
Min Zhang
LRM
108
0
0
24 Nov 2025
ProxT2I: Efficient Reward-Guided Text-to-Image Generation via Proximal Diffusion
ProxT2I: Efficient Reward-Guided Text-to-Image Generation via Proximal Diffusion
Zhenghan Fang
Jian Zheng
Qiaozi Gao
Xiaofeng Gao
Jeremias Sulam
212
0
0
24 Nov 2025
EEG-VLM: A Hierarchical Vision-Language Model with Multi-Level Feature Alignment and Visually Enhanced Language-Guided Reasoning for EEG Image-Based Sleep Stage Prediction
EEG-VLM: A Hierarchical Vision-Language Model with Multi-Level Feature Alignment and Visually Enhanced Language-Guided Reasoning for EEG Image-Based Sleep Stage Prediction
Xihe Qiu
Gengchen Ma
Haoyu Wang
Chen Zhan
Xiaoyu Tan
Shuo Li
VLM
143
0
0
24 Nov 2025
Cross Domain Evaluation of Multimodal Chain-of-Thought Reasoning of different datasets into the Amazon CoT Framework
Cross Domain Evaluation of Multimodal Chain-of-Thought Reasoning of different datasets into the Amazon CoT Framework
Nitya Tiwari
Parv Maheshwari
Vidisha Agarwal
LRM
100
0
0
24 Nov 2025
FineXtrol: Controllable Motion Generation via Fine-Grained Text
FineXtrol: Controllable Motion Generation via Fine-Grained Text
Keming Shen
Bizhu Wu
Junliang Chen
Xiaoqin Wang
Linlin Shen
VGen
116
0
0
24 Nov 2025
Zero-Shot Video Deraining with Video Diffusion Models
Zero-Shot Video Deraining with Video Diffusion Models
Tuomas Varanka
Juan Luis Gonzalez
Hyeongwoo Kim
Pablo Garrido
Xu Yao
DiffMVGen
148
0
0
23 Nov 2025
A Systematic Study of Compression Ordering for Large Language Models
A Systematic Study of Compression Ordering for Large Language Models
Shivansh Chhawri
Rahul Mahadik
Suparna Rooj
MQ
128
0
0
23 Nov 2025
CrossJEPA: Cross-Modal Joint-Embedding Predictive Architecture for Efficient 3D Representation Learning from 2D Images
CrossJEPA: Cross-Modal Joint-Embedding Predictive Architecture for Efficient 3D Representation Learning from 2D Images
Avishka Perera
Kumal Hewagamage
Saeedha Nazar
Kavishka Abeywardana
Hasitha Gallella
Ranga Rodrigo
Mohamed Afham
3DV
175
0
0
23 Nov 2025
Foundations of Artificial Intelligence Frameworks: Notion and Limits of AGI
Foundations of Artificial Intelligence Frameworks: Notion and Limits of AGI
Khanh Gia Bui
NAIAI4CE
357
0
0
23 Nov 2025
SmolKalam: Ensemble Quality-Filtered Translation at Scale for High Quality Arabic Post-Training Data
SmolKalam: Ensemble Quality-Filtered Translation at Scale for High Quality Arabic Post-Training Data
Sultan AlRashed
Chadi Helwe
Francesco Orabona
MoE
104
0
0
23 Nov 2025
Generative Adversarial Post-Training Mitigates Reward Hacking in Live Human-AI Music Interaction
Generative Adversarial Post-Training Mitigates Reward Hacking in Live Human-AI Music Interaction
Yusong Wu
Stephen Brade
Teng Ma
Tia-Jane Fowler
Enning Yang
Berker Banar
Aaron Courville
Natasha Jaques
Cheng-Zhi Anna Huang
AAML
136
0
0
22 Nov 2025
Blu-WERP (Web Extraction and Refinement Pipeline): A Scalable Pipeline for Preprocessing Large Language Model Datasets
Blu-WERP (Web Extraction and Refinement Pipeline): A Scalable Pipeline for Preprocessing Large Language Model Datasets
Gowtham
Sai Rupesh
Sanjay Kumar
Saravanan
Venkata Chaithanya
VLM
205
0
0
22 Nov 2025
DELTA: Language Diffusion-based EEG-to-Text Architecture
DELTA: Language Diffusion-based EEG-to-Text Architecture
Mingyu Jeon
Hyobin Kim
DiffM
68
0
0
22 Nov 2025
Plan-X: Instruct Video Generation via Semantic Planning
Plan-X: Instruct Video Generation via Semantic Planning
Lun Huang
You Xie
Hongyi Xu
Tianpei Gu
Chenxu Zhang
Guoxian Song
Zenan Li
Xiaochen Zhao
Linjie Luo
Guillermo Sapiro
DiffMVGen
88
0
0
22 Nov 2025
Layer-Wise High-Impact Parameter Ratio Optimization in Post-Training Quantization for Large Language Models
Layer-Wise High-Impact Parameter Ratio Optimization in Post-Training Quantization for Large Language Models
Cuong Pham
Hoang Anh Dung
Cuong C. Nguyen
Trung Le
G. Carneiro
Thanh-Toan Do
MQ
134
0
0
21 Nov 2025
Energy Scaling Laws for Diffusion Models: Quantifying Compute and Carbon Emissions in Image Generation
Energy Scaling Laws for Diffusion Models: Quantifying Compute and Carbon Emissions in Image Generation
Aniketh Iyengar
Jiaqi Han
Boris Ruf
Vincent Grari
Marcin Detyniecki
Stefano Ermon
DiffM
185
0
0
21 Nov 2025
Supervised Fine Tuning of Large Language Models for Domain Specific Knowledge Graph Construction:A Case Study on Hunan's Historical Celebrities
Supervised Fine Tuning of Large Language Models for Domain Specific Knowledge Graph Construction:A Case Study on Hunan's Historical Celebrities
Junjie Hao
Chun Wang
Ying Qiao
Qiuyue Zuo
Qiya Song
Hua Ma
Xieping Gao
104
0
0
21 Nov 2025
CREST: Improving Interpretability and Effectiveness of Troubleshooting at Ericsson through Criterion-Specific Trouble Report Retrieval
CREST: Improving Interpretability and Effectiveness of Troubleshooting at Ericsson through Criterion-Specific Trouble Report RetrievalJournal of Systems and Software (JSS), 2025
Soroush Javdan
Pragash Krishnamoorthy
Olga Baysal
169
0
0
21 Nov 2025
Adaptive Layer-Wise Transformations for Post-Training Quantization of Large Language Models
Adaptive Layer-Wise Transformations for Post-Training Quantization of Large Language Models
Cuong Pham
Hoang Anh Dung
Cuong C. Nguyen
Trung Le
G. Carneiro
Jianfei Cai
Thanh-Toan Do
MQ
138
0
0
21 Nov 2025
Don't Learn, Ground: A Case for Natural Language Inference with Visual Grounding
Don't Learn, Ground: A Case for Natural Language Inference with Visual Grounding
Daniil Ignatev
Ayman Santeer
Albert Gatt
Denis Paperno
148
0
0
21 Nov 2025
Learning to Compress: Unlocking the Potential of Large Language Models for Text Representation
Learning to Compress: Unlocking the Potential of Large Language Models for Text Representation
Y. Zhang
Yizheng Zhao
Chen-Hao Hu
Binxing Jiao
Daxin Jiang
Ruihang Miao
Cam-Tu Nguyen
181
0
0
21 Nov 2025
Masked-and-Reordered Self-Supervision for Reinforcement Learning from Verifiable Rewards
Masked-and-Reordered Self-Supervision for Reinforcement Learning from Verifiable Rewards
Zhen Wang
Zhifeng Gao
Guolin Ke
OffRLLRM
253
0
0
21 Nov 2025
Intervene-All-Paths: Unified Mitigation of LVLM Hallucinations across Alignment Formats
Intervene-All-Paths: Unified Mitigation of LVLM Hallucinations across Alignment Formats
Jiaye Qian
Ge Zheng
Yuchen Zhu
Sibei Yang
MLLM
289
1
0
21 Nov 2025
Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination
Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination
Y. Tang
Daiki Shimada
Hang Hua
Chao Huang
Jing Bi
Rogerio Feris
Chenliang Xu
233
0
0
21 Nov 2025
R2Q: Towards Robust 2-Bit Large Language Models via Residual Refinement Quantization
R2Q: Towards Robust 2-Bit Large Language Models via Residual Refinement Quantization
Jiayi Chen
Jieqi Shi
Jing Huo
Chen Wu
MQ
171
0
0
21 Nov 2025
AskDB: An LLM Agent for Natural Language Interaction with Relational Databases
Xuan-Quang Phan
Tan-Ha Mai
Thai-Duy Dinh
Minh-Thuan Nguyen
Lam-Son Lê
96
0
0
20 Nov 2025
Sparse Autoencoders are Topic Models
Leander Girrbach
Zeynep Akata
113
0
0
20 Nov 2025
LAOF: Robust Latent Action Learning with Optical Flow Constraints
Xizhou Bu
Jiexi Lyu
Fulei Sun
R. G. Yang
Zhiqiang Ma
Wei Li
108
0
0
20 Nov 2025
NLP Datasets for Idiom and Figurative Language Tasks
NLP Datasets for Idiom and Figurative Language Tasks
Blake Matheny
Phuong Minh Nguyen
Minh Le Nguyen
Stephanie Reynolds
112
0
0
20 Nov 2025
You Only Forward Once: An Efficient Compositional Judging Paradigm
You Only Forward Once: An Efficient Compositional Judging Paradigm
Tianlong Zhang
Hongwei Xue
Shilin Yan
Di Wu
Chen Xu
Y. Yang
126
0
0
20 Nov 2025
When Structure Doesn't Help: LLMs Do Not Read Text-Attributed Graphs as Effectively as We Expected
When Structure Doesn't Help: LLMs Do Not Read Text-Attributed Graphs as Effectively as We Expected
Haotian Xu
Yuning You
Tengfei Ma
108
0
0
20 Nov 2025
Text2Loc++: Generalizing 3D Point Cloud Localization from Natural Language
Text2Loc++: Generalizing 3D Point Cloud Localization from Natural Language
Yan Xia
Letian Shi
Yilin Di
João F. Henriques
Daniel Cremers
3DPC
148
0
0
19 Nov 2025
What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity
What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity
Alexis Audran-Reiss
Jordi Armengol-Estapé
Karen Hambardzumyan
Amar Budhiraja
Martin Josifoski
...
Jenny Zhang
Taco Cohen
Yossi Adi
Tatiana Shavrina
Yoram Bachrach
165
2
0
19 Nov 2025
Previous
12345...239240241
Next