ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Journal of machine learning research (JMLR), 2019
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 11,958 papers shown
Title
A symbolic Perl algorithm for the unification of Nahuatl word spellings
A symbolic Perl algorithm for the unification of Nahuatl word spellingsMexican International Conference on Artificial Intelligence (MICAI), 2025
Juan-José Guzmán-Landa
Jesús Vázquez-Osorio
Juan-Manuel Torres-Moreno
Ligia Quintana-Torres
Miguel Figueroa-Saavedra
Martha Lorena Avendaño Garrido
Graham Ranger
Patricia Velázquez-Morales
Gerardo Eugenio Sierra Martínez
60
0
0
24 Nov 2025
An Invariant Latent Space Perspective on Language Model Inversion
An Invariant Latent Space Perspective on Language Model Inversion
Wentao Ye
Jiaqi Hu
Haobo Wang
Xinpeng Ti
Zhiqing Xiao
Hao Chen
Liyao Li
Lei Feng
Sai Wu
Junbo Zhao
60
0
0
24 Nov 2025
ABM-LoRA: Activation Boundary Matching for Fast Convergence in Low-Rank Adaptation
ABM-LoRA: Activation Boundary Matching for Fast Convergence in Low-Rank Adaptation
Dongha Lee
Jinhee Park
Minjun Kim
Junseok Kwon
AI4CE
243
0
0
24 Nov 2025
Now You See It, Now You Don't - Instant Concept Erasure for Safe Text-to-Image and Video Generation
Now You See It, Now You Don't - Instant Concept Erasure for Safe Text-to-Image and Video Generation
Shristi Das Biswas
Arani Roy
Kaushik Roy
VGen
224
0
0
24 Nov 2025
Growing with the Generator: Self-paced GRPO for Video Generation
Growing with the Generator: Self-paced GRPO for Video Generation
Rui Li
Yuanzhi Liang
Ziqi Ni
H. Huang
Chi Zhang
Xuelong Li
EGVMVGen
104
0
0
24 Nov 2025
IRSDA: An Agent-Orchestrated Framework for Enterprise Intrusion Response
IRSDA: An Agent-Orchestrated Framework for Enterprise Intrusion Response
Damodar Panigrahi
Raj Patel
Shaswata Mitra
Sudip Mittal
Shahram Rahimi
60
0
0
24 Nov 2025
FastForward Pruning: Efficient LLM Pruning via Single-Step Reinforcement Learning
FastForward Pruning: Efficient LLM Pruning via Single-Step Reinforcement Learning
Xin Yuan
S. Li
Jiateng Wei
Chengrui Zhu
Yanming Wu
Qingpeng Li
Jiajun Lv
Xiaoke Lan
Jun Chen
Yong-Jin Liu
OffRL
348
0
0
24 Nov 2025
Cross Domain Evaluation of Multimodal Chain-of-Thought Reasoning of different datasets into the Amazon CoT Framework
Cross Domain Evaluation of Multimodal Chain-of-Thought Reasoning of different datasets into the Amazon CoT Framework
Nitya Tiwari
Parv Maheshwari
Vidisha Agarwal
LRM
92
0
0
24 Nov 2025
CrossJEPA: Cross-Modal Joint-Embedding Predictive Architecture for Efficient 3D Representation Learning from 2D Images
CrossJEPA: Cross-Modal Joint-Embedding Predictive Architecture for Efficient 3D Representation Learning from 2D Images
Avishka Perera
Kumal Hewagamage
Saeedha Nazar
Kavishka Abeywardana
Hasitha Gallella
Ranga Rodrigo
Mohamed Afham
3DV
167
0
0
23 Nov 2025
A Systematic Study of Compression Ordering for Large Language Models
A Systematic Study of Compression Ordering for Large Language Models
Shivansh Chhawri
Rahul Mahadik
Suparna Rooj
MQ
80
0
0
23 Nov 2025
SmolKalam: Ensemble Quality-Filtered Translation at Scale for High Quality Arabic Post-Training Data
SmolKalam: Ensemble Quality-Filtered Translation at Scale for High Quality Arabic Post-Training Data
Sultan AlRashed
Chadi Helwe
Francesco Orabona
MoE
88
0
0
23 Nov 2025
Foundations of Artificial Intelligence Frameworks: Notion and Limits of AGI
Foundations of Artificial Intelligence Frameworks: Notion and Limits of AGI
Khanh Gia Bui
NAIAI4CE
345
0
0
23 Nov 2025
Zero-Shot Video Deraining with Video Diffusion Models
Zero-Shot Video Deraining with Video Diffusion Models
Tuomas Varanka
Juan Luis Gonzalez
Hyeongwoo Kim
Pablo Garrido
Xu Yao
DiffMVGen
132
0
0
23 Nov 2025
Blu-WERP (Web Extraction and Refinement Pipeline): A Scalable Pipeline for Preprocessing Large Language Model Datasets
Blu-WERP (Web Extraction and Refinement Pipeline): A Scalable Pipeline for Preprocessing Large Language Model Datasets
Gowtham
Sai Rupesh
Sanjay Kumar
Saravanan
Venkata Chaithanya
VLM
189
0
0
22 Nov 2025
Generative Adversarial Post-Training Mitigates Reward Hacking in Live Human-AI Music Interaction
Generative Adversarial Post-Training Mitigates Reward Hacking in Live Human-AI Music Interaction
Yusong Wu
Stephen Brade
Teng Ma
Tia-Jane Fowler
Enning Yang
Berker Banar
Aaron Courville
Natasha Jaques
Cheng-Zhi Anna Huang
AAML
112
0
0
22 Nov 2025
DELTA: Language Diffusion-based EEG-to-Text Architecture
DELTA: Language Diffusion-based EEG-to-Text Architecture
Mingyu Jeon
Hyobin Kim
DiffM
32
0
0
22 Nov 2025
Plan-X: Instruct Video Generation via Semantic Planning
Plan-X: Instruct Video Generation via Semantic Planning
Lun Huang
You Xie
Hongyi Xu
Tianpei Gu
Chenxu Zhang
Guoxian Song
Zenan Li
Xiaochen Zhao
Linjie Luo
Guillermo Sapiro
DiffMVGen
84
0
0
22 Nov 2025
Don't Learn, Ground: A Case for Natural Language Inference with Visual Grounding
Don't Learn, Ground: A Case for Natural Language Inference with Visual Grounding
Daniil Ignatev
Ayman Santeer
Albert Gatt
Denis Paperno
140
0
0
21 Nov 2025
Masked-and-Reordered Self-Supervision for Reinforcement Learning from Verifiable Rewards
Masked-and-Reordered Self-Supervision for Reinforcement Learning from Verifiable Rewards
Zhen Wang
Zhifeng Gao
Guolin Ke
OffRLLRM
237
0
0
21 Nov 2025
CREST: Improving Interpretability and Effectiveness of Troubleshooting at Ericsson through Criterion-Specific Trouble Report Retrieval
CREST: Improving Interpretability and Effectiveness of Troubleshooting at Ericsson through Criterion-Specific Trouble Report RetrievalJournal of Systems and Software (JSS), 2025
Soroush Javdan
Pragash Krishnamoorthy
Olga Baysal
149
0
0
21 Nov 2025
Adaptive Layer-Wise Transformations for Post-Training Quantization of Large Language Models
Adaptive Layer-Wise Transformations for Post-Training Quantization of Large Language Models
Cuong Pham
Hoang Anh Dung
Cuong C. Nguyen
Trung Le
G. Carneiro
Jianfei Cai
Thanh-Toan Do
MQ
122
0
0
21 Nov 2025
R2Q: Towards Robust 2-Bit Large Language Models via Residual Refinement Quantization
R2Q: Towards Robust 2-Bit Large Language Models via Residual Refinement Quantization
Jiayi Chen
Jieqi Shi
Jing Huo
Chen Wu
MQ
125
0
0
21 Nov 2025
Intervene-All-Paths: Unified Mitigation of LVLM Hallucinations across Alignment Formats
Intervene-All-Paths: Unified Mitigation of LVLM Hallucinations across Alignment Formats
Jiaye Qian
Ge Zheng
Yuchen Zhu
Sibei Yang
MLLM
208
1
0
21 Nov 2025
Layer-Wise High-Impact Parameter Ratio Optimization in Post-Training Quantization for Large Language Models
Layer-Wise High-Impact Parameter Ratio Optimization in Post-Training Quantization for Large Language Models
Cuong Pham
Hoang Anh Dung
Cuong C. Nguyen
Trung Le
G. Carneiro
Thanh-Toan Do
MQ
125
0
0
21 Nov 2025
Learning to Compress: Unlocking the Potential of Large Language Models for Text Representation
Learning to Compress: Unlocking the Potential of Large Language Models for Text Representation
Y. Zhang
Yizheng Zhao
Chen-Hao Hu
Binxing Jiao
Daxin Jiang
Ruihang Miao
Cam-Tu Nguyen
133
0
0
21 Nov 2025
Supervised Fine Tuning of Large Language Models for Domain Specific Knowledge Graph Construction:A Case Study on Hunan's Historical Celebrities
Supervised Fine Tuning of Large Language Models for Domain Specific Knowledge Graph Construction:A Case Study on Hunan's Historical Celebrities
Junjie Hao
Chun Wang
Ying Qiao
Qiuyue Zuo
Qiya Song
Hua Ma
Xieping Gao
76
0
0
21 Nov 2025
Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination
Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination
Y. Tang
Daiki Shimada
Hang Hua
Chao Huang
Jing Bi
Rogerio Feris
Chenliang Xu
221
0
0
21 Nov 2025
Energy Scaling Laws for Diffusion Models: Quantifying Compute and Carbon Emissions in Image Generation
Energy Scaling Laws for Diffusion Models: Quantifying Compute and Carbon Emissions in Image Generation
Aniketh Iyengar
Jiaqi Han
Boris Ruf
Vincent Grari
Marcin Detyniecki
Stefano Ermon
DiffM
136
0
0
21 Nov 2025
When Structure Doesn't Help: LLMs Do Not Read Text-Attributed Graphs as Effectively as We Expected
When Structure Doesn't Help: LLMs Do Not Read Text-Attributed Graphs as Effectively as We Expected
Haotian Xu
Yuning You
Tengfei Ma
96
0
0
20 Nov 2025
LAOF: Robust Latent Action Learning with Optical Flow Constraints
Xizhou Bu
Jiexi Lyu
Fulei Sun
R. G. Yang
Zhiqiang Ma
Wei Li
72
0
0
20 Nov 2025
AskDB: An LLM Agent for Natural Language Interaction with Relational Databases
Xuan-Quang Phan
Tan-Ha Mai
Thai-Duy Dinh
Minh-Thuan Nguyen
Lam-Son Lê
72
0
0
20 Nov 2025
You Only Forward Once: An Efficient Compositional Judging Paradigm
You Only Forward Once: An Efficient Compositional Judging Paradigm
Tianlong Zhang
Hongwei Xue
Shilin Yan
Di Wu
Chen Xu
Y. Yang
126
0
0
20 Nov 2025
NLP Datasets for Idiom and Figurative Language Tasks
NLP Datasets for Idiom and Figurative Language Tasks
Blake Matheny
Phuong Minh Nguyen
Minh Le Nguyen
Stephanie Reynolds
104
0
0
20 Nov 2025
Sparse Autoencoders are Topic Models
Leander Girrbach
Zeynep Akata
97
0
0
20 Nov 2025
PocketLLM: Ultimate Compression of Large Language Models via Meta Networks
PocketLLM: Ultimate Compression of Large Language Models via Meta Networks
Ye Tian
Chengcheng Wang
Jing Han
Yehui Tang
Kai Han
MQ
100
0
0
19 Nov 2025
Insert In Style: A Zero-Shot Generative Framework for Harmonious Cross-Domain Object Composition
Insert In Style: A Zero-Shot Generative Framework for Harmonious Cross-Domain Object Composition
Raghu Chittersu
Yuvraj Singh Rathore
Pranav Adlinge
Kunal Swami
DiffM
220
0
0
19 Nov 2025
Walrus: A Cross-Domain Foundation Model for Continuum Dynamics
Walrus: A Cross-Domain Foundation Model for Continuum Dynamics
Michael McCabe
Payel Mukhopadhyay
Tanya Marwah
Bruno Régaldo-Saint Blancard
François Rozet
...
Mariel Pettee
Jeff Shen
Kyunghyun Cho
M. Cranmer
S. Ho
AI4CE
208
0
0
19 Nov 2025
What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity
What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity
Alexis Audran-Reiss
Jordi Armengol-Estapé
Karen Hambardzumyan
Amar Budhiraja
Martin Josifoski
...
Jenny Zhang
Taco Cohen
Yossi Adi
Tatiana Shavrina
Yoram Bachrach
108
2
0
19 Nov 2025
Effective Code Membership Inference for Code Completion Models via Adversarial Prompts
Effective Code Membership Inference for Code Completion Models via Adversarial Prompts
Yuan Jiang
Zehao Li
Shan Huang
Christoph Treude
Xiaohong Su
Tiantian Wang
AAML
233
0
0
19 Nov 2025
Text2Loc++: Generalizing 3D Point Cloud Localization from Natural Language
Text2Loc++: Generalizing 3D Point Cloud Localization from Natural Language
Yan Xia
Letian Shi
Yilin Di
João F. Henriques
Daniel Cremers
3DPC
128
0
0
19 Nov 2025
IPR-1: Interactive Physical Reasoner
IPR-1: Interactive Physical Reasoner
Mingyu Zhang
Lifeng Zhuo
Tianxi Tan
Guocan Xie
Xian Nie
...
Renjie Zhao
Zizhu He
Z. Wang
Jiting Cai
Yong-Lu Li
PINNLRMAI4CE
330
0
0
19 Nov 2025
SplitFlux: Learning to Decouple Content and Style from a Single Image
SplitFlux: Learning to Decouple Content and Style from a Single Image
Yitong Yang
Y Samuel Wang
Changshuo Wang
Yongjun Zhang
Ziyang Chen
Shuting He
184
0
0
19 Nov 2025
UniHOI: Unified Human-Object Interaction Understanding via Unified Token Space
UniHOI: Unified Human-Object Interaction Understanding via Unified Token Space
Panqi Yang
Haodong Jing
Nanning Zheng
Yongqiang Ma
194
0
0
19 Nov 2025
UniFit: Towards Universal Virtual Try-on with MLLM-Guided Semantic Alignment
W. Zhang
Yeying Jin
Xin Li
Yan Zhang
Xiaofeng Cong
Cong Wang
Fengcai Qiao
zhichao Lian
69
0
0
19 Nov 2025
Foundational Question Generation for Video Question Answering via an Embedding-Integrated Approach
Foundational Question Generation for Video Question Answering via an Embedding-Integrated Approach
Ju-Young Oh
71
0
0
18 Nov 2025
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
Mingyue Cheng
Jie Ouyang
Shuo Yu
Ruiran Yan
Yucong Luo
Zirui Liu
Daoyu Wang
Qi Liu
Enhong Chen
116
3
0
18 Nov 2025
DEVAL: A Framework for Evaluating and Improving the Derivation Capability of Large Language Models
DEVAL: A Framework for Evaluating and Improving the Derivation Capability of Large Language Models
Y. Li
Qin Li
Min Zhang
Min Zhang
LRM
201
0
0
18 Nov 2025
ArbESC+: Arabic Enhanced Edit Selection System Combination for Grammatical Error Correction Resolving conflict and improving system combination in Arabic GEC
ArbESC+: Arabic Enhanced Edit Selection System Combination for Grammatical Error Correction Resolving conflict and improving system combination in Arabic GEC
Ahlam Alrehili
Areej Alhothali
KELM
128
0
0
18 Nov 2025
CreBench: Human-Aligned Creativity Evaluation from Idea to Process to Product
CreBench: Human-Aligned Creativity Evaluation from Idea to Process to Product
Kaiwen Xue
Chenglong Li
Zhonghong Ou
Guoxin Zhang
Kaoyan Lu
...
Xinyu Liu
Qunlin Chen
Weiwei Qin
Yiran Shen
Jiayi Cen
96
0
0
17 Nov 2025
Infinite-Story: A Training-Free Consistent Text-to-Image Generation
Infinite-Story: A Training-Free Consistent Text-to-Image Generation
Jihun Park
Kyoungmin Lee
Jongmin Gim
Hyeonseo Jo
Minseok Oh
Wonhyeok Choi
K. Hwang
Jaeyeul Kim
Minwoo Choi
S. Im
103
0
1
17 Nov 2025
Previous
12345...238239240
Next