Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Journal of Machine Learning Research (JMLR), 2019
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 11,939 papers shown
Low-Rank Prehab: Preparing Neural Networks for SVD Compression
Haoran Qin
Shansita D. Sharma
Ali Abbasi
Chayne Thrash
Soheil Kolouri
76
0
0
01 Dec 2025
TokenPure: Watermark Removal through Tokenized Appearance and Structural Guidance
Pei Yang
Y. Liu
Kelly Peng
Yuan Gao
Yiren Song
WIGM
125
0
0
01 Dec 2025
Reconstructing Multi-Scale Physical Fields from Extremely Sparse Measurements with an Autoencoder-Diffusion Cascade
Letian Yi
Tingpeng Zhang
Mingyuan Zhou
Guannan Wang
Quanke Su
Zhilu Lai
DiffM
8
0
0
01 Dec 2025
On The Finetuning of MLIPs Through the Lens of Iterated Maps With BPTT
Evan Dramko
Yizhi Zhu
Aleksandar Krivokapic
Geoffroy Hautier
Thomas Reps
C. Jermaine
Anastasios Kyrillidis
8
0
0
30 Nov 2025
Table as a Modality for Large Language Models
Liyao Li
Chao Ye
Wentao Ye
Y. Sun
Zhe Jiang
...
Yiming Zhang
Ningtao Wang
Xing Fu
Gang Chen
Junbo Zhao
LMTD
84
0
0
30 Nov 2025
WaterSearch: A Quality-Aware Search-based Watermarking Framework for Large Language Models
Yukang Lin
Jiahao Shao
Shuoran Jiang
Wentao Zhu
Bingjie Lu
Xiangping Wu
Joanna Siebert
Qingcai Chen
WaLM
168
0
0
30 Nov 2025
Low-Bitrate Video Compression through Semantic-Conditioned Diffusion
Lingdong Wang
Guan-Ming Su
D. Kothandaraman
Tsung-Wei Huang
Mohammad Hajiesmaili
R. Sitaraman
DiffM, VGen
64
0
0
29 Nov 2025
Comparative Analysis of 47 Context-Based Question Answer Models Across 8 Diverse Datasets
Muhammad Muneeb
David B. Ascher
Ahsan Baidar Bakht
12
0
0
29 Nov 2025
Financial Text Classification Based On rLoRA Finetuning On Qwen3-8B model
Zhiming Lian
8
0
0
29 Nov 2025
Tourism Question Answer System in Indian Language using Domain-Adapted Foundation Models
Praveen Gatla
Anushka
Nikita Kanwar
Gouri Sahoo
Rajesh Kumar Mundotiya
48
1
0
28 Nov 2025
Dripper: Token-Efficient Main HTML Extraction with a Lightweight LM
Mengjie Liu
Jiahui Peng
Pei Chu
Jiantao Qiu
Ren Ma
...
Zhenxiang Li
Chao Xu
Zhongying Tu
Wentao Zhang
Conghui He
88
0
0
28 Nov 2025
AnyTalker: Scaling Multi-Person Talking Video Generation with Interactivity Refinement
Zhizhou Zhong
Yicheng Ji
Zhe Kong
Y. Liu
Jiarui Wang
...
Ying Qin
Huan Li
Shuiyang Mao
W. Liu
Wenhan Luo
DiffM, VGen
60
0
0
28 Nov 2025
Towards Improving Interpretability of Language Model Generation through a Structured Knowledge Discovery Approach
IEEE Journal on Selected Topics in Signal Processing (JSTSP), 2025
Shuqi Liu
Han Wu
Guanzhi Deng
Jianshu Chen
Xiaoyang Wang
Linqi Song
32
0
0
28 Nov 2025
Language-conditioned world model improves policy generalization by reading environmental descriptions
Anh Nguyen
Stefan Lee
LM&Ro
102
0
0
28 Nov 2025
Decoding the Past: Explainable Machine Learning Models for Dating Historical Texts
Paulo J. N. Pinto
A. Pinho
Diogo Pratas
AI4CE
183
0
0
28 Nov 2025
CoFiRec: Coarse-to-Fine Tokenization for Generative Recommendation
Tianxin Wei
Xuying Ning
Xuxing Chen
Ruizhong Qiu
Yupeng Hou
Yan Xie
Shuang Yang
Zhigang Hua
Jingrui He
AI4TS
8
0
0
27 Nov 2025
Ghosting Your LLM: Without The Knowledge of Your Gradient and Data
Abeer Matar A. Almalky
Ziyan Wang
Mohaiminul Al Nahian
Li Yang
Adnan Siraj Rakin
AAML
124
0
0
27 Nov 2025
Exploring Performance Variations in Finetuned Translators of Ultra-Low Resource Languages: Do Linguistic Differences Matter?
Isabel Gonçalves
Paulo Cavalin
Claudio S. Pinhanez
8
0
0
27 Nov 2025
Beyond URLs: Metadata Diversity and Position for Efficient LLM Pretraining
Dongyang Fan
Diba Hashemi
Sai Praneeth Karimireddy
Martin Jaggi
97
0
0
26 Nov 2025
On the Origin of Algorithmic Progress in AI
Hans Gundlach
Alex Fogelson
Jayson Lynch
Ana Trisovic
Jonathan Rosenfeld
Anmol Sandhu
Neil Thompson
56
0
0
26 Nov 2025
TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos
Seungjae Lee
Yoonkyo Jung
Inkook Chun
Yao-Chih Lee
Zikui Cai
...
Aayush Talreja
Tan Dat Dao
Yongyuan Liang
Jia-Bin Huang
Furong Huang
52
0
0
26 Nov 2025
Chatty-KG: A Multi-Agent AI System for On-Demand Conversational Question Answering over Knowledge Graphs
Reham Omar
Abdelghny Orogat
Ibrahim Abdelaziz
Omij Mangukiya
Panos Kalnis
Essam Mansour
142
0
0
26 Nov 2025
3MDiT: Unified Tri-Modal Diffusion Transformer for Text-Driven Synchronized Audio-Video Generation
Y. Li
Heyu Si
Federico Landi
Pilar Oplustil Gallegos
Ioannis Koutsoumpas
...
Ruiju Fu
Qi Guo
Xin Jin
Shunyu Liu
Mingli Song
DiffM, VGen
128
0
0
26 Nov 2025
PEFT-Bench: A Parameter-Efficient Fine-Tuning Methods Benchmark
Róbert Belanec
Branislav Pecher
Ivan Srba
Maria Bielikova
103
1
0
26 Nov 2025
A Systematic Study of Model Merging Techniques in Large Language Models
Oğuz Kağan Hitit
Leander Girrbach
Zeynep Akata
MoMe
213
0
0
26 Nov 2025
Towards Audio Token Compression in Large Audio Language Models
Saurabhchand Bhati
Samuel Thomas
Hilde Kuehne
Rogerio Feris
James R. Glass
AuLLM
193
0
0
26 Nov 2025
Odin: Oriented Dual-module Integration for Text-rich Network Representation Learning
Kaifeng Hong
Yinglong Zhang
Xiaoying Hong
Xuewen Xia
Xing Xu
146
0
0
26 Nov 2025
CameraMaster: Unified Camera Semantic-Parameter Control for Photography Retouching
Qirui Yang
Yang Yang
Ying Zeng
Xiaobin Hu
Bo Li
Huanjing Yue
Jingyu Yang
P. Jiang
DiffM, VGen
271
0
0
26 Nov 2025
Text-Guided Semantic Image Encoder
Raghuveer Thirukovalluru
Xiaochuang Han
Bhuwan Dhingra
Emily Dinan
Maha Elbayad
VLM
124
0
0
25 Nov 2025
Cisco Time Series Model Technical Report
Liang Gou
Archit Khare
Praneet Pabolu
Prachi Patel
Joseph Ross
...
Jingze Sun
Kristal Curtis
Vedant Dharnidharka
Abhinav Mathur
Hao Yang
AI4TS
106
0
0
25 Nov 2025
Copyright Detection in Large Language Models: An Ethical Approach to Generative AI Development
David Szczecina
Senan Gaffori
Edmond Li
DeLMO
297
0
0
25 Nov 2025
HiCoGen: Hierarchical Compositional Text-to-Image Generation in Diffusion Models via Reinforcement Learning
Hongji Yang
Yucheng Zhou
Wencheng Han
Runzhou Tao
Zhongying Qiu
Jianfei Yang
Jianbing Shen
DiffM, EGVM
298
0
0
25 Nov 2025
Decoupling and Damping: Structurally-Regularized Gradient Matching for Multimodal Graph Condensation
Lian Shen
Zhendan Chen
Yinhui jiang
Meijia Song
Ziming Su
Juan Liu
Xiangrong Liu
56
0
0
25 Nov 2025
CrossEarth-Gate: Fisher-Guided Adaptive Tuning Engine for Efficient Adaptation of Cross-Domain Remote Sensing Semantic Segmentation
Shilei Cao
Ziyang Gong
Hehai Lin
Yang Liu
Jiashun Cheng
...
C. Qin
Hong Cheng
Xue Yang
Juepeng Zheng
Haohuan Fu
196
0
0
25 Nov 2025
GigaWorld-0: World Models as Data Engine to Empower Embodied AI
GigaWorld Team
Angen Ye
Boyuan Wang
Chaojun Ni
Guan Huang
...
Yang Wang
Yukun Zhou
Z. Zhang
Z. Dong
Zheng Zhu
VGen, LM&Ro
168
0
0
25 Nov 2025
ProxT2I: Efficient Reward-Guided Text-to-Image Generation via Proximal Diffusion
Zhenghan Fang
Jian Zheng
Qiaozi Gao
Xiaofeng Gao
Jeremias Sulam
128
0
0
24 Nov 2025
IRSDA: An Agent-Orchestrated Framework for Enterprise Intrusion Response
Damodar Panigrahi
Raj Patel
Shaswata Mitra
Sudip Mittal
Shahram Rahimi
52
0
0
24 Nov 2025
CafeQ: Calibration-free Quantization via Learned Transformations and Adaptive Rounding
Ziteng Sun
Adrian Benton
Samuel Kushnir
Asher Trockman
Vikas Singh
Suhas Diggavi
A. Suresh
MQ
122
0
0
24 Nov 2025
Cross Domain Evaluation of Multimodal Chain-of-Thought Reasoning of different datasets into the Amazon CoT Framework
Nitya Tiwari
Parv Maheshwari
Vidisha Agarwal
LRM
84
0
0
24 Nov 2025
EEG-VLM: A Hierarchical Vision-Language Model with Multi-Level Feature Alignment and Visually Enhanced Language-Guided Reasoning for EEG Image-Based Sleep Stage Prediction
Xihe Qiu
Gengchen Ma
Haoyu Wang
Chen Zhan
Xiaoyu Tan
Shuo Li
VLM
111
0
0
24 Nov 2025
FastForward Pruning: Efficient LLM Pruning via Single-Step Reinforcement Learning
Xin Yuan
S. Li
Jiateng Wei
Chengrui Zhu
Yanming Wu
Qingpeng Li
Jiajun Lv
Xiaoke Lan
Jun Chen
Yong-Jin Liu
OffRL
324
0
0
24 Nov 2025
FineXtrol: Controllable Motion Generation via Fine-Grained Text
Keming Shen
Bizhu Wu
Junliang Chen
Xiaoqin Wang
Linlin Shen
VGen
96
0
0
24 Nov 2025
Now You See It, Now You Don't - Instant Concept Erasure for Safe Text-to-Image and Video Generation
Shristi Das Biswas
Arani Roy
Kaushik Roy
VGen
200
0
0
24 Nov 2025
A symbolic Perl algorithm for the unification of Nahuatl word spellings
Mexican International Conference on Artificial Intelligence (MICAI), 2025
Juan-José Guzmán-Landa
Jesús Vázquez-Osorio
Juan-Manuel Torres-Moreno
Ligia Quintana-Torres
Miguel Figueroa-Saavedra
Martha Lorena Avendaño Garrido
Graham Ranger
Patricia Velázquez-Morales
Gerardo Eugenio Sierra Martínez
48
0
0
24 Nov 2025
LAA3D: A Benchmark of Detecting and Tracking Low-Altitude Aircraft in 3D Space
Hai Wu
Shuai Tang
Jiale Wang
Longkun Zou
Mingyue Guo
Rongqin Liang
Ke Chen
Yaowei Wang
120
0
0
24 Nov 2025
Think Before You Prune: Selective Self-Generated Calibration for Pruning Large Reasoning Models
Yang Xiang
Yixin Ji
Juntao Li
Min Zhang
LRM
76
0
0
24 Nov 2025
ABM-LoRA: Activation Boundary Matching for Fast Convergence in Low-Rank Adaptation
Dongha Lee
Jinhee Park
Minjun Kim
Junseok Kwon
AI4CE
215
0
0
24 Nov 2025
An Invariant Latent Space Perspective on Language Model Inversion
Wentao Ye
Jiaqi Hu
Haobo Wang
Xinpeng Ti
Zhiqing Xiao
Hao Chen
Liyao Li
Lei Feng
Sai Wu
Junbo Zhao
52
0
0
24 Nov 2025
Growing with the Generator: Self-paced GRPO for Video Generation
Rui Li
Yuanzhi Liang
Ziqi Ni
H. Huang
Chi Zhang
Xuelong Li
EGVM, VGen
84
0
0
24 Nov 2025
A Systematic Study of Compression Ordering for Large Language Models
Shivansh Chhawri
Rahul Mahadik
Suparna Rooj
MQ
80
0
0
23 Nov 2025