Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Journal of machine learning research (JMLR), 2019
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 11,958 papers shown
Title
SoK: Are Watermarks in LLMs Ready for Deployment?
Kieu Dang
Phung Lai
Nhathai Phan
Yelong Shen
Ruoming Jin
Abdallah Khreishah
My T. Thai
143
1
0
24 Dec 2025
LAMIC: Layout-Aware Multi-Image Composition via Scalability of Multimodal Diffusion Transformer
Yuzhuo Chen
Zehua Ma
Jianhua Wang
Kai Kang
Shunyu Yao
Weiming Zhang
VLM
125
2
0
24 Dec 2025
AI-based Traffic Modeling for Network Security and Privacy: Challenges Ahead
Dinil Mon Divakaran
AAML
260
2
0
24 Dec 2025
Decoupling the "What" and "Where" With Polar Coordinate Positional Embeddings
Anand Gopalakrishnan
Róbert Csordás
Jürgen Schmidhuber
M. C. Mozer
108
1
0
24 Dec 2025
Generative Retrieval with Few-shot Indexing
Arian Askari
Chuan Meng
Mohammad Aliannejadi
Zhaochun Ren
Evangelos Kanoulas
Suzan Verberne
RALM
305
6
0
24 Dec 2025
Video2Act: A Dual-System Video Diffusion Policy with Robotic Spatio-Motional Modeling
Yueru Jia
Jiaming Liu
Shengbang Liu
Rui Zhou
W. Yu
Yuyang Yan
Xiaowei Chi
Yandong Guo
Boxin Shi
Shanghang Zhang
VGen
204
1
0
02 Dec 2025
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework
Qinghe Wang
Xiaoyu Shi
Baolu Li
Weikang Bian
Quande Liu
Huchuan Lu
Xintao Wang
Pengfei Wan
Kun Gai
Xu Jia
VGen
158
1
0
02 Dec 2025
PEFT-Factory: Unified Parameter-Efficient Fine-Tuning of Autoregressive Large Language Models
Róbert Belanec
Ivan Srba
Maria Bielikova
ALM
324
0
0
02 Dec 2025
TokenPure: Watermark Removal through Tokenized Appearance and Structural Guidance
Pei Yang
Y. Liu
Kelly Peng
Yuan Gao
Yiren Song
WIGM
141
0
0
01 Dec 2025
Think Before You Prune: Self-Reflective Structured Pruning for Reasoning Language Models
Ziyan Wang
Enmao Diao
Qi Le
Pu Wang
G. Wang
Minwoo Lee
Shu-ping Yeh
Li Yang
ReLM
LRM
VLM
100
0
0
01 Dec 2025
Reconstructing Multi-Scale Physical Fields from Extremely Sparse Measurements with an Autoencoder-Diffusion Cascade
Letian Yi
Tingpeng Zhang
Mingyuan Zhou
Guannan Wang
Quanke Su
Zhilu Lai
DiffM
32
0
0
01 Dec 2025
Low-Rank Prehab: Preparing Neural Networks for SVD Compression
Haoran Qin
Shansita D. Sharma
Ali Abbasi
Chayne Thrash
Soheil Kolouri
88
0
0
01 Dec 2025
On The Finetuning of MLIPs Through the Lens of Iterated Maps With BPTT
Evan Dramko
Yizhi Zhu
Aleksandar Krivokapic
Geoffroy Hautier
Thomas Reps
C. Jermaine
Anastasios Kyrillidis
44
0
0
30 Nov 2025
WaterSearch: A Quality-Aware Search-based Watermarking Framework for Large Language Models
Yukang Lin
Jiahao Shao
Shuoran Jiang
Wentao Zhu
Bingjie Lu
Xiangping Wu
Joanna Siebert
Qingcai Chen
WaLM
224
0
0
30 Nov 2025
Table as a Modality for Large Language Models
Liyao Li
Chao Ye
Wentao Ye
Y. Sun
Zhe Jiang
...
Yiming Zhang
Ningtao Wang
Xing Fu
Gang Chen
Junbo Zhao
LMTD
104
1
0
30 Nov 2025
Comparative Analysis of 47 Context-Based Question Answer Models Across 8 Diverse Datasets
Muhammad Muneeb
David B. Ascher
Ahsan Baidar Bakht
32
0
0
29 Nov 2025
Low-Bitrate Video Compression through Semantic-Conditioned Diffusion
Lingdong Wang
Guan-Ming Su
D. Kothandaraman
Tsung-Wei Huang
Mohammad Hajiesmaili
R. Sitaraman
DiffM
VGen
92
0
0
29 Nov 2025
Financial Text Classification Based On rLoRA Finetuning On Qwen3-8B model
Zhiming Lian
20
0
0
29 Nov 2025
Tourism Question Answer System in Indian Language using Domain-Adapted Foundation Models
Praveen Gatla
Anushka
Nikita Kanwar
Gouri Sahoo
Rajesh Kumar Mundotiya
64
1
0
28 Nov 2025
Language-conditioned world model improves policy generalization by reading environmental descriptions
Anh Nguyen
Stefan Lee
LM&Ro
124
0
0
28 Nov 2025
Dripper: Token-Efficient Main HTML Extraction with a Lightweight LM
Mengjie Liu
Jiahui Peng
Pei Chu
Jiantao Qiu
Ren Ma
...
Zhenxiang Li
Chao Xu
Zhongying Tu
Wentao Zhang
Conghui He
104
0
0
28 Nov 2025
Decoding the Past: Explainable Machine Learning Models for Dating Historical Texts
Paulo J. N. Pinto
A. Pinho
Diogo Pratas
AI4CE
203
0
0
28 Nov 2025
AnyTalker: Scaling Multi-Person Talking Video Generation with Interactivity Refinement
Zhizhou Zhong
Yicheng Ji
Zhe Kong
Y. Liu
Jiarui Wang
...
Ying Qin
Huan Li
Shuiyang Mao
W. Liu
Wenhan Luo
DiffM
VGen
64
0
0
28 Nov 2025
Towards Improving Interpretability of Language Model Generation through a Structured Knowledge Discovery Approach
IEEE Journal on Selected Topics in Signal Processing (JSTSP), 2025
Shuqi Liu
Han Wu
Guanzhi Deng
Jianshu Chen
Xiaoyang Wang
Linqi Song
52
0
0
28 Nov 2025
Exploring Performance Variations in Finetuned Translators of Ultra-Low Resource Languages: Do Linguistic Differences Matter?
Isabel Gonçalves
Paulo Cavalin
Claudio S. Pinhanez
12
0
0
27 Nov 2025
CoFiRec: Coarse-to-Fine Tokenization for Generative Recommendation
Tianxin Wei
Xuying Ning
Xuxing Chen
Ruizhong Qiu
Yupeng Hou
Yan Xie
Shuang Yang
Zhigang Hua
Jingrui He
AI4TS
16
0
0
27 Nov 2025
Ghosting Your LLM: Without The Knowledge of Your Gradient and Data
Abeer Matar A. Almalky
Ziyan Wang
Mohaiminul Al Nahian
Li Yang
Adnan Siraj Rakin
AAML
156
0
0
27 Nov 2025
Towards Audio Token Compression in Large Audio Language Models
Saurabhchand Bhati
Samuel Thomas
Hilde Kuehne
Rogerio Feris
James R. Glass
AuLLM
233
0
0
26 Nov 2025
Chatty-KG: A Multi-Agent AI System for On-Demand Conversational Question Answering over Knowledge Graphs
Reham Omar
Abdelghny Orogat
Ibrahim Abdelaziz
Omij Mangukiya
Panos Kalnis
Essam Mansour
150
0
0
26 Nov 2025
CameraMaster: Unified Camera Semantic-Parameter Control for Photography Retouching
Qirui Yang
Yang Yang
Ying Zeng
Xiaobin Hu
Bo Li
Huanjing Yue
Jingyu Yang
P. Jiang
DiffM
VGen
283
0
0
26 Nov 2025
Odin: Oriented Dual-module Integration for Text-rich Network Representation Learning
Kaifeng Hong
Yinglong Zhang
Xiaoying Hong
Xuewen Xia
Xing Xu
174
0
0
26 Nov 2025
TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos
Seungjae Lee
Yoonkyo Jung
Inkook Chun
Yao-Chih Lee
Zikui Cai
...
Aayush Talreja
Tan Dat Dao
Yongyuan Liang
Jia-Bin Huang
Furong Huang
68
0
0
26 Nov 2025
A Systematic Study of Model Merging Techniques in Large Language Models
Oğuz Kağan Hitit
Leander Girrbach
Zeynep Akata
MoMe
261
0
0
26 Nov 2025
PEFT-Bench: A Parameter-Efficient Fine-Tuning Methods Benchmark
Róbert Belanec
Branislav Pecher
Ivan Srba
Maria Bielikova
103
1
0
26 Nov 2025
Beyond URLs: Metadata Diversity and Position for Efficient LLM Pretraining
Dongyang Fan
Diba Hashemi
Sai Praneeth Karimireddy
Martin Jaggi
97
0
0
26 Nov 2025
3MDiT: Unified Tri-Modal Diffusion Transformer for Text-Driven Synchronized Audio-Video Generation
Y. Li
Heyu Si
Federico Landi
Pilar Oplustil Gallegos
Ioannis Koutsoumpas
...
Ruiju Fu
Qi Guo
Xin Jin
Shunyu Liu
Mingli Song
DiffM
VGen
136
0
0
26 Nov 2025
On the Origin of Algorithmic Progress in AI
Hans Gundlach
Alex Fogelson
Jayson Lynch
Ana Trisovic
Jonathan Rosenfeld
Anmol Sandhu
Neil Thompson
68
0
0
26 Nov 2025
Text-Guided Semantic Image Encoder
Raghuveer Thirukovalluru
Xiaochuang Han
Bhuwan Dhingra
Emily Dinan
Maha Elbayad
VLM
136
0
0
25 Nov 2025
Copyright Detection in Large Language Models: An Ethical Approach to Generative AI Development
David Szczecina
Senan Gaffori
Edmond Li
DeLMO
313
0
0
25 Nov 2025
Decoupling and Damping: Structurally-Regularized Gradient Matching for Multimodal Graph Condensation
Lian Shen
Zhendan Chen
Yinhui jiang
Meijia Song
Ziming Su
Juan Liu
Xiangrong Liu
60
0
0
25 Nov 2025
CrossEarth-Gate: Fisher-Guided Adaptive Tuning Engine for Efficient Adaptation of Cross-Domain Remote Sensing Semantic Segmentation
Shilei Cao
Ziyang Gong
Hehai Lin
Yang Liu
Jiashun Cheng
...
C. Qin
Hong Cheng
Xue Yang
Juepeng Zheng
Haohuan Fu
212
0
0
25 Nov 2025
HiCoGen: Hierarchical Compositional Text-to-Image Generation in Diffusion Models via Reinforcement Learning
Hongji Yang
Yucheng Zhou
Wencheng Han
Runzhou Tao
Zhongying Qiu
Jianfei Yang
Jianbing Shen
DiffM
EGVM
306
0
0
25 Nov 2025
Cisco Time Series Model Technical Report
Liang Gou
Archit Khare
Praneet Pabolu
Prachi Patel
Joseph Ross
...
Jingze Sun
Kristal Curtis
Vedant Dharnidharka
Abhinav Mathur
Hao Yang
AI4TS
110
0
0
25 Nov 2025
GigaWorld-0: World Models as Data Engine to Empower Embodied AI
GigaWorld Team
Angen Ye
Boyuan Wang
Chaojun Ni
Guan Huang
...
Yang Wang
Yukun Zhou
Z. Zhang
Z. Dong
Zheng Zhu
VGen
LM&Ro
232
2
0
25 Nov 2025
ProxT2I: Efficient Reward-Guided Text-to-Image Generation via Proximal Diffusion
Zhenghan Fang
Jian Zheng
Qiaozi Gao
Xiaofeng Gao
Jeremias Sulam
184
0
0
24 Nov 2025
FastForward Pruning: Efficient LLM Pruning via Single-Step Reinforcement Learning
Xin Yuan
S. Li
Jiateng Wei
Chengrui Zhu
Yanming Wu
Qingpeng Li
Jiajun Lv
Xiaoke Lan
Jun Chen
Yong-Jin Liu
OffRL
348
0
0
24 Nov 2025
An Invariant Latent Space Perspective on Language Model Inversion
Wentao Ye
Jiaqi Hu
Haobo Wang
Xinpeng Ti
Zhiqing Xiao
Hao Chen
Liyao Li
Lei Feng
Sai Wu
Junbo Zhao
60
0
0
24 Nov 2025
IRSDA: An Agent-Orchestrated Framework for Enterprise Intrusion Response
Damodar Panigrahi
Raj Patel
Shaswata Mitra
Sudip Mittal
Shahram Rahimi
60
0
0
24 Nov 2025
CafeQ: Calibration-free Quantization via Learned Transformations and Adaptive Rounding
Ziteng Sun
Adrian Benton
Samuel Kushnir
Asher Trockman
Vikas Singh
Suhas Diggavi
A. Suresh
MQ
126
0
0
24 Nov 2025
Growing with the Generator: Self-paced GRPO for Video Generation
Rui Li
Yuanzhi Liang
Ziqi Ni
H. Huang
Chi Zhang
Xuelong Li
EGVM
VGen
104
0
0
24 Nov 2025
1
2
3
4
...
238
239
240
Next