2001.06782
Cited By
v4 (latest)
Gradient Surgery for Multi-Task Learning
Neural Information Processing Systems (NeurIPS), 2020
19 January 2020
Tianhe Yu
Saurabh Kumar
Abhishek Gupta
Sergey Levine
Karol Hausman
Chelsea Finn
Papers citing "Gradient Surgery for Multi-Task Learning"
50 / 741 papers shown
Title
Scalable Multi-Objective Robot Reinforcement Learning through Gradient Conflict Resolution
Humphrey Munn
Brendan Tidd
Peter Böhm
M. Gallagher
David Howard
37
0
0
18 Sep 2025
MultiEdit: Advancing Instruction-based Image Editing on Diverse and Challenging Tasks
Mingsong Li
Lin Liu
Hongjun Wang
Haoxing Chen
Xijun Gu
Shizhan Liu
Dong Gong
Junbo Zhao
Zhenzhong Lan
Jianguo Li
32
0
0
18 Sep 2025
GTA: Supervised-Guided Reinforcement Learning for Text Classification with Large Language Models
Min Zeng
Jingfei Sun
Xueyou Luo
Caiquan Liu
Shiqi Zhang
Li Xie
Xiaoxin Chen
AI4TS
VLM
32
0
0
15 Sep 2025
Prediction Loss Guided Decision-Focused Learning
Haeun Jeon
Hyunglip Bae
Chanyeong Kim
Yongjae Lee
Woo Chang Kim
20
0
0
10 Sep 2025
One Model for All Tasks: Leveraging Efficient World Models in Multi-Task Planning
Yuan Pu
Yazhe Niu
Jia Tang
Junyu Xiong
Shuai Hu
Hongsheng Li
MoMe
90
0
0
09 Sep 2025
How Far Are We from True Unlearnability?
International Conference on Learning Representations (ICLR), 2025
Kai Ye
Liangcai Su
Chenxiong Qian
54
3
0
09 Sep 2025
GCond: Gradient Conflict Resolution via Accumulation-based Stabilization for Large-Scale Multi-Task Learning
Evgeny Alves Limarenko
Anastasiia Alexandrovna Studenikina
32
0
0
08 Sep 2025
Simple Optimizers for Convex Aligned Multi-Objective Optimization
Ben Kretzu
Karen Ullrich
Yonathan Efroni
12
1
0
06 Sep 2025
DivMerge: A divergence-based model merging method for multi-tasking
Brahim Touayouch
Loïc Fosse
Géraldine Damnati
Gwénolé Lecorvé
MoMe
133
0
0
02 Sep 2025
Non-conflicting Energy Minimization in Reinforcement Learning based Robot Control
Skand Peri
Akhil Perincherry
Bikram Pandit
Stefan Lee
28
0
0
01 Sep 2025
Not All Parameters Are Created Equal: Smart Isolation Boosts Fine-Tuning Performance
Yao Wang
Di Liang
Minlong Peng
MoMe
189
3
0
29 Aug 2025
Enhancing Mamba Decoder with Bidirectional Interaction in Multi-Task Dense Prediction
Mang Cao
Sanping Zhou
Yizhe Li
Ye Deng
Wenli Huang
Le Wang
Mamba
73
0
0
28 Aug 2025
Linear-Time Demonstration Selection for In-Context Learning via Gradient Estimation
Ziniu Zhang
Zhenshuo Zhang
Dongyue Li
Lu Wang
Jennifer Dy
Hongyang R. Zhang
56
0
0
27 Aug 2025
Gradient Rectification for Robust Calibration under Distribution Shift
Yilin Zhang
Cai Xu
Y. Wu
Ziyu Guan
Wei Zhao
60
0
0
27 Aug 2025
Enhancing Speech Emotion Recognition with Multi-Task Learning and Dynamic Feature Fusion
Honghong Wang
Jing Deng
Fanqin Meng
Rong Zheng
28
0
0
25 Aug 2025
On Task Vectors and Gradients
Luca Zhou
Daniele Solombrino
Donato Crisostomi
Maria Sofia Bucarelli
Giuseppe Alessio D’Inverno
Fabrizio Silvestri
Emanuele Rodolà
MoMe
233
0
0
22 Aug 2025
Seeing Further on the Shoulders of Giants: Knowledge Inheritance for Vision Foundation Models
Jiabo Huang
Chen Chen
Lingjuan Lyu
VLM
64
0
0
20 Aug 2025
Learning to See Through Flare
Xiaopeng Peng
Heath Gemar
Erin F. Fleet
Kyle Novak
A. Watnik
Grover A. Swartzlander
44
0
0
19 Aug 2025
Two Birds with One Stone: Multi-Task Detection and Attribution of LLM-Generated Text
Zixin Rao
Youssef Mohamed
Shang Liu
Zeyan Liu
DeLMO
92
0
0
19 Aug 2025
AutoScale: Linear Scalarization Guided by Multi-Task Optimization Metrics
Yi Yang
Kei Ikemura
Qingwen Zhang
Xiaomeng Zhu
Ci Li
Nazre Batool
Sina Sharif Mansouri
John Folkesson
MoMe
56
1
0
19 Aug 2025
Hierarchy-Consistent Learning and Adaptive Loss Balancing for Hierarchical Multi-Label Classification
Ruobing Jiang
Mengzhe Liu
Haobing Liu
Yanwei Yu
24
0
0
19 Aug 2025
A robust and compliant robotic assembly control strategy for batch precision assembly task with uncertain fit types and fit amounts
Bin Wang
Jiwen Zhang
Song Wang
Dan Wu
32
0
0
17 Aug 2025
J6: Jacobian-Driven Role Attribution for Multi-Objective Prompt Optimization in LLMs
Yao Wu
52
0
0
16 Aug 2025
UMRE: A Unified Monotonic Transformation for Ranking Ensemble in Recommender Systems
Zhengrui Xu
Zhe Yang
Zhengxiao Guo
Shukai Liu
Luocheng Lin
Xiaoyan Liu
Y. Liu
Han Li
40
0
0
11 Aug 2025
Pareto Multi-Objective Alignment for Language Models
Qiang He
S. Maghsudi
44
2
0
11 Aug 2025
Gradient Surgery for Safe LLM Fine-Tuning
Biao Yi
Jiahao Li
Baolei Zhang
Lihai Nie
Tong Li
Tiansheng Huang
Zheli Liu
40
1
0
10 Aug 2025
Stackelberg Coupling of Online Representation Learning and Reinforcement Learning
Fernando Martinez
Tao Li
Y. Lu
Juntao Chen
OffRL
60
0
0
10 Aug 2025
Sparsity-Driven Plasticity in Multi-Task Reinforcement Learning
Aleksandar Todorov
Juan Cardenas-Cartagena
Rafael F. Cunha
Marco Zullich
Matthia Sabatelli
CLL
48
1
0
09 Aug 2025
TurboTrain: Towards Efficient and Balanced Multi-Task Learning for Multi-Agent Perception and Prediction
Zewei Zhou
Seth Z. Zhao
Tianhui Cai
Zhiyu Huang
Bolei Zhou
Jiaqi Ma
40
3
0
06 Aug 2025
Efficient Inter-Task Attention for Multitask Transformer Models
Christian Bohn
Thomas Kurbiel
Klaus Friedrichs
Hasan Tercan
Tobias Meisen
58
0
0
06 Aug 2025
Momentum-integrated Multi-task Stock Recommendation with Converge-based Optimization
Hao Wang
Jingshu Peng
Yanyan Shen
Xujia Li
Lei Chen
AI4TS
AIFin
40
0
0
05 Aug 2025
Learning Dynamics of Meta-Learning in Small Model Pretraining
David Demitri Africa
Yuval Weiss
P. Buttery
Richard Diehl Martinez
AI4CE
118
2
0
04 Aug 2025
Revisiting Replay and Gradient Alignment for Continual Pre-Training of Large Language Models
Istabrak Abbes
G. Subbaraj
Matthew D Riemer
Nizar Islah
Benjamin Thérien
Tsuguchika Tabaru
Hiroaki Kingetsu
Sarath Chandar
Irina Rish
CLL
51
0
0
03 Aug 2025
Benchmarking Massively Parallelized Multi-Task Reinforcement Learning for Robotics Tasks
Vira Joshi
Zifan Xu
Bo Liu
Peter Stone
Amy Zhang
OffRL
131
4
0
31 Jul 2025
Suppressing Gradient Conflict for Generalizable Deepfake Detection
Ming-Hui Liu
Harry Cheng
Xin Luo
Xin-Shun Xu
72
0
0
29 Jul 2025
Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning
Zedong Wang
Siyuan Li
Dan Xu
59
0
0
28 Jul 2025
Enhancing Stability of Physics-Informed Neural Network Training Through Saddle-Point Reformulation
Dmitry Bylinkin
Mikhail Aleksandrov
S. Chezhegov
Aleksandr Beznosikov
28
0
0
21 Jul 2025
Resolving Token-Space Gradient Conflicts: Token Space Manipulation for Transformer-Based Multi-Task Learning
Wooseong Jeong
Kuk-Jin Yoon
152
0
0
10 Jul 2025
Not Only Consistency: Enhance Test-Time Adaptation with Spatio-temporal Inconsistency for Remote Physiological Measurement
Xiao Yang
Jiyao Wang
Yuxuan Fan
Can Liu
Houcheng Su
Weichen Guo
Zitong Yu
Dengbo He
Kaishun Wu
TTA
115
2
0
10 Jul 2025
Interaction-Merged Motion Planning: Effectively Leveraging Diverse Motion Datasets for Robust Planning
Giwon Lee
Wooseong Jeong
Daehee Park
Jaewoo Jeong
Kuk-Jin Yoon
158
0
0
07 Jul 2025
Dealing with the Evil Twins: Improving Random Augmentation by Addressing Catastrophic Forgetting of Diverse Augmentations
Dongkyu Cho
Rumi Chunara
150
0
0
01 Jul 2025
LW2G: Learning Whether to Grow for Prompt-based Continual Learning
Qian Feng
Dawei Zhou
Hanbin Zhao
Chao Zhang
Jiahua Dong
Dengxin Dai
Hui Qian
VLM
CLL
182
7
0
01 Jul 2025
MotionGPT3: Human Motion as a Second Modality
Bingfan Zhu
Biao Jiang
S. Wang
Shixiang Tang
Tao Chen
Linjie Luo
Youyi Zheng
Xin Chen
63
0
0
30 Jun 2025
Vision Generalist Model: A Survey
International Journal of Computer Vision (IJCV), 2025
Ziyi Wang
Yongming Rao
Shuofeng Sun
Xinrun Liu
Yi Wei
...
Zuyan Liu
Yanbo Wang
Hongmin Liu
Jie Zhou
Jiwen Lu
181
0
0
11 Jun 2025
Intention-Conditioned Flow Occupancy Models
Chongyi Zheng
S. Park
Sergey Levine
Benjamin Eysenbach
AI4TS
OffRL
AI4CE
161
1
0
10 Jun 2025
StableMTL: Repurposing Latent Diffusion Models for Multi-Task Learning from Partially Annotated Synthetic Datasets
Anh-Quan Cao
Ivan Lopes
Raoul de Charette
123
0
0
09 Jun 2025
A Framework for Controllable Multi-objective Learning with Annealed Stein Variational Hypernetworks
Minh-Duc Nguyen
Dung D. Le
138
0
0
07 Jun 2025
AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification
Geonwoo Cho
Jaemoon Lee
Jaegyun Im
Subi Lee
Jihwan Lee
Sundong Kim
107
0
0
06 Jun 2025
Gradient Similarity Surgery in Multi-Task Deep Learning
Thomas Borsani
Andrea Rosani
Giuseppe Nicosia
Giuseppe Di Fatta
MedIm
117
2
0
06 Jun 2025
Multilevel neural simulation-based inference
Yuga Hikida
Ayush Bharti
Niall Jeffrey
F. Briol
176
2
0
06 Jun 2025