v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Journal of machine learning research (JMLR), 2019

23 October 2019

Sharan Narang

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 12,026 papers shown

Flight Delay Prediction via Cross-Modality Adaptation of Large Language Models and Aircraft Trajectory Representation

Thaweerath Phisannupawong

J. J. Damanik

Han-Lim Choi

159

24 Oct 2025

Bi-Level Optimization for Generative Recommendation: Bridging Tokenization and Generation

122

24 Oct 2025

PARL: Prompt-based Agents for Reinforcement Learning

Yarik Menchaca Resendiz

Roman Klinger

LLMAG LRM

164

24 Oct 2025

Evaluating Latent Knowledge of Public Tabular Datasets in Large Language Models

139

23 Oct 2025

Relative-Based Scaling Law for Neural Language Models

144

23 Oct 2025

Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging

Ibrahim Ethem Hamamci

145

23 Oct 2025

A Reinforcement Learning Framework for Robust and Secure LLM Watermarking

153

23 Oct 2025

Plan Then Retrieve: Reinforcement Learning-Guided Complex Reasoning over Knowledge Graphs

Yanlin Song

Ben Liu

Víctor Gutiérrez-Basulto

279

23 Oct 2025

RECALL: REpresentation-aligned Catastrophic-forgetting ALLeviation via Hierarchical Model Merging

...

220

23 Oct 2025

From Masks to Worlds: A Hitchhiker's Guide to World Models

182

23 Oct 2025

Learning Grouped Lattice Vector Quantizers for Low-Bit LLM Compression

200

23 Oct 2025

Video Prediction of Dynamic Physical Simulations With Pixel-Space Spatiotemporal TransformersIEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2025

138

23 Oct 2025

xMem: A CPU-Based Approach for Accurate Estimation of GPU Memory in Deep Learning Training Workloads

Jiabo Shi

Dimitrios Pezaros

Yehia Elkhatib

102

23 Oct 2025

Data Efficient Any Transformer-to-Mamba Distillation via Attention Bridge

294

22 Oct 2025

Machine Text Detectors are Membership Inference Attacks

163

22 Oct 2025

Data-Centric Lessons To Improve Speech-Language Pretraining

136

22 Oct 2025

Restoring Pruned Large Language Models via Lost Component Compensation

141

22 Oct 2025

A Concrete Roadmap towards Safety Cases based on Chain-of-Thought Monitoring

Julian Schulz

LRM

122

22 Oct 2025

ARA: Adaptive Rank Allocation for Efficient Large Language Model SVD Compression

123

22 Oct 2025

Difficulty-Controllable Multiple-Choice Question Generation Using Large Language Models and Direct Preference Optimization

Yuto Tomikawa

Masaki Uto

140

22 Oct 2025

Beyond Uniform SVD:Dual-Level Optimization across Columns and Modules for LLM Compression

22 Oct 2025

ELUTQ: Optimizing Quantization Accuracy under LUT-Based Computation for Edge LLMs

456

22 Oct 2025

Energy-Efficient and Dequantization-Free Q-LLMs: A Spiking Neural Network Approach to Salient Value Mitigation

153

22 Oct 2025

Identity-Aware Large Language Models require Cultural Reasoning

21 Oct 2025

Towards Faithful and Controllable Personalization via Critique-Post-Edit Reinforcement Learning

152

21 Oct 2025

Learning from the Best, Differently: A Diversity-Driven Rethinking on Data Selection

185

21 Oct 2025

ssToken: Self-modulated and Semantic-aware Token Selection for LLM Fine-tuning

138

21 Oct 2025

Reasoning Language Model Inference Serving Unveiled: An Empirical Study

256

21 Oct 2025

Large language models for folktale type automation based on motifs: Cinderella case study

Tjaša Arčon

Marko Robnik-Šikonja

Polona Tratnik

21 Oct 2025

CAGE: Curvature-Aware Gradient Estimation For Accurate Quantization-Aware Training

211

21 Oct 2025

From Retrieval to Generation: Unifying External and Parametric Knowledge for Medical Question Answering

156

21 Oct 2025

CircuitSeer: Mining High-Quality Data by Probing Mathematical Reasoning Circuits in LLMs

165

21 Oct 2025

Rethinking On-policy Optimization for Query Augmentation

182

20 Oct 2025

Benchmarking Probabilistic Time Series Forecasting Models on Neural Activity

Nicholas A. Steinmetz

BDL AI4TS

170

20 Oct 2025

Foundational Automatic Evaluators: Scaling Multi-Task Generative Evaluator Training for Reasoning-Centric Domains

221

20 Oct 2025

DSEBench: A Test Collection for Explainable Dataset Search with Examples

142

20 Oct 2025

Annotation-Efficient Universal Honesty Alignment

156

20 Oct 2025

Generation then Reconstruction: Accelerating Masked Autoregressive Models via Two-Stage Sampling

133

20 Oct 2025

Unbiased Gradient Low-Rank Projection

148

20 Oct 2025

AFRICAPTION: Establishing a New Paradigm for Image Captioning in African Languages

108

20 Oct 2025

Contextual Attention Modulation: Towards Efficient Multi-Task Adaptation in Large Language Models

124

20 Oct 2025

DETree: DEtecting Human-AI Collaborative Texts via Tree-Structured Hierarchical Representation Learning

243

20 Oct 2025

Watermark Robustness and Radioactivity May Be at Odds in Federated Learning

219

19 Oct 2025

Online Learning Defense against Iterative Jailbreak Attacks via Prompt Optimization

141

19 Oct 2025

Neuronal Group Communication for Efficient Neural representation

Zhengqi Pei

Qingming Huang

Shuhui Wang

111

19 Oct 2025

All You Need is One: Capsule Prompt Tuning with a Single Vector

142

19 Oct 2025

Mixed-Precision Quantization for Language Models: Techniques and Prospects

233

19 Oct 2025

Improving Model Representation and Reducing KV Cache via Skip Connections with First Value Heads

153

19 Oct 2025

Uncovering Brain-Like Hierarchical Patterns in Vision-Language Models through fMRI-Based Neural Encoding

19 Oct 2025

Back to Bytes: Revisiting Tokenization Through UTF-8

127

19 Oct 2025