ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2001.06782
  4. Cited By
Gradient Surgery for Multi-Task Learning
v1v2v3v4 (latest)

Gradient Surgery for Multi-Task Learning

Neural Information Processing Systems (NeurIPS), 2020
19 January 2020
Tianhe Yu
Saurabh Kumar
Abhishek Gupta
Sergey Levine
Karol Hausman
Chelsea Finn
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "Gradient Surgery for Multi-Task Learning"

50 / 741 papers shown
Title
Scalable Multi-Objective Robot Reinforcement Learning through Gradient Conflict Resolution
Scalable Multi-Objective Robot Reinforcement Learning through Gradient Conflict Resolution
Humphrey Munn
Brendan Tidd
Peter Böhm
M. Gallagher
David Howard
37
0
0
18 Sep 2025
MultiEdit: Advancing Instruction-based Image Editing on Diverse and Challenging Tasks
MultiEdit: Advancing Instruction-based Image Editing on Diverse and Challenging Tasks
Mingsong Li
Lin Liu
Hongjun Wang
Haoxing Chen
Xijun Gu
Shizhan Liu
Dong Gong
Junbo Zhao
Zhenzhong Lan
Jianguo Li
32
0
0
18 Sep 2025
GTA: Supervised-Guided Reinforcement Learning for Text Classification with Large Language Models
GTA: Supervised-Guided Reinforcement Learning for Text Classification with Large Language Models
Min Zeng
Jingfei Sun
Xueyou Luo
Caiquan Liu
Shiqi Zhang
Li Xie
Xiaoxin Chen
AI4TSVLM
32
0
0
15 Sep 2025
Prediction Loss Guided Decision-Focused Learning
Prediction Loss Guided Decision-Focused Learning
Haeun Jeon
Hyunglip Bae
Chanyeong Kim
Yongjae Lee
Woo Chang Kim
20
0
0
10 Sep 2025
One Model for All Tasks: Leveraging Efficient World Models in Multi-Task Planning
One Model for All Tasks: Leveraging Efficient World Models in Multi-Task Planning
Yuan Pu
Yazhe Niu
Jia Tang
Junyu Xiong
Shuai Hu
Hongsheng Li
MoMe
90
0
0
09 Sep 2025
How Far Are We from True Unlearnability?
How Far Are We from True Unlearnability?International Conference on Learning Representations (ICLR), 2025
Kai Ye
Liangcai Su
Chenxiong Qian
54
3
0
09 Sep 2025
GCond: Gradient Conflict Resolution via Accumulation-based Stabilization for Large-Scale Multi-Task Learning
GCond: Gradient Conflict Resolution via Accumulation-based Stabilization for Large-Scale Multi-Task Learning
Evgeny Alves Limarenko
Anastasiia Alexandrovna Studenikina
32
0
0
08 Sep 2025
Simple Optimizers for Convex Aligned Multi-Objective Optimization
Simple Optimizers for Convex Aligned Multi-Objective Optimization
Ben Kretzu
Karen Ullrich
Yonathan Efroni
12
1
0
06 Sep 2025
DivMerge: A divergence-based model merging method for multi-tasking
DivMerge: A divergence-based model merging method for multi-tasking
Brahim Touayouch
Loïc Fosse
Géraldine Damnati
Gwénolé Lecorvé
MoMe
133
0
0
02 Sep 2025
Non-conflicting Energy Minimization in Reinforcement Learning based Robot Control
Non-conflicting Energy Minimization in Reinforcement Learning based Robot Control
Skand Peri
Akhil Perincherry
Bikram Pandit
Stefan Lee
28
0
0
01 Sep 2025
Not All Parameters Are Created Equal: Smart Isolation Boosts Fine-Tuning Performance
Not All Parameters Are Created Equal: Smart Isolation Boosts Fine-Tuning Performance
Yao Wang
Di Liang
Minlong Peng
MoMe
189
3
0
29 Aug 2025
Enhancing Mamba Decoder with Bidirectional Interaction in Multi-Task Dense Prediction
Enhancing Mamba Decoder with Bidirectional Interaction in Multi-Task Dense Prediction
Mang Cao
Sanping Zhou
Yizhe Li
Ye Deng
Wenli Huang
Le Wang
Mamba
73
0
0
28 Aug 2025
Linear-Time Demonstration Selection for In-Context Learning via Gradient Estimation
Linear-Time Demonstration Selection for In-Context Learning via Gradient Estimation
Ziniu Zhang
Zhenshuo Zhang
Dongyue Li
Lu Wang
Jennifer Dy
Hongyang R. Zhang
56
0
0
27 Aug 2025
Gradient Rectification for Robust Calibration under Distribution Shift
Gradient Rectification for Robust Calibration under Distribution Shift
Yilin Zhang
Cai Xu
Y. Wu
Ziyu Guan
Wei Zhao
60
0
0
27 Aug 2025
Enhancing Speech Emotion Recognition with Multi-Task Learning and Dynamic Feature Fusion
Enhancing Speech Emotion Recognition with Multi-Task Learning and Dynamic Feature Fusion
Honghong Wang
Jing Deng
Fanqin Meng
Rong Zheng
28
0
0
25 Aug 2025
On Task Vectors and Gradients
On Task Vectors and Gradients
Luca Zhou
Daniele Solombrino
Donato Crisostomi
Maria Sofia Bucarelli
Giuseppe Alessio D’Inverno
Fabrizio Silvestri
Emanuele Rodolà
MoMe
233
0
0
22 Aug 2025
Seeing Further on the Shoulders of Giants: Knowledge Inheritance for Vision Foundation Models
Seeing Further on the Shoulders of Giants: Knowledge Inheritance for Vision Foundation Models
Jiabo Huang
Chen Chen
Lingjuan Lyu
VLM
64
0
0
20 Aug 2025
Learning to See Through Flare
Learning to See Through Flare
Xiaopeng Peng
Heath Gemar
Erin F. Fleet
Kyle Novak
A. Watnik
Grover A. Swartzlander
44
0
0
19 Aug 2025
Two Birds with One Stone: Multi-Task Detection and Attribution of LLM-Generated Text
Two Birds with One Stone: Multi-Task Detection and Attribution of LLM-Generated Text
Zixin Rao
Youssef Mohamed
Shang Liu
Zeyan Liu
DeLMO
92
0
0
19 Aug 2025
AutoScale: Linear Scalarization Guided by Multi-Task Optimization Metrics
AutoScale: Linear Scalarization Guided by Multi-Task Optimization Metrics
Yi Yang
Kei Ikemura
Qingwen Zhang
Xiaomeng Zhu
Ci Li
Nazre Batool
Sina Sharif Mansouri
John Folkesson
MoMe
56
1
0
19 Aug 2025
Hierarchy-Consistent Learning and Adaptive Loss Balancing for Hierarchical Multi-Label Classification
Hierarchy-Consistent Learning and Adaptive Loss Balancing for Hierarchical Multi-Label Classification
Ruobing Jiang
Mengzhe Liu
Haobing Liu
Yanwei Yu
24
0
0
19 Aug 2025
A robust and compliant robotic assembly control strategy for batch precision assembly task with uncertain fit types and fit amounts
A robust and compliant robotic assembly control strategy for batch precision assembly task with uncertain fit types and fit amounts
Bin Wang
Jiwen Zhang
Song Wang
Dan Wu
32
0
0
17 Aug 2025
J6: Jacobian-Driven Role Attribution for Multi-Objective Prompt Optimization in LLMs
J6: Jacobian-Driven Role Attribution for Multi-Objective Prompt Optimization in LLMs
Yao Wu
52
0
0
16 Aug 2025
UMRE: A Unified Monotonic Transformation for Ranking Ensemble in Recommender Systems
UMRE: A Unified Monotonic Transformation for Ranking Ensemble in Recommender Systems
Zhengrui Xu
Zhe Yang
Zhengxiao Guo
Shukai Liu
Luocheng Lin
Xiaoyan Liu
Y. Liu
Han Li
40
0
0
11 Aug 2025
Pareto Multi-Objective Alignment for Language Models
Pareto Multi-Objective Alignment for Language Models
Qiang He
S. Maghsudi
44
2
0
11 Aug 2025
Gradient Surgery for Safe LLM Fine-Tuning
Gradient Surgery for Safe LLM Fine-Tuning
Biao Yi
Jiahao Li
Baolei Zhang
Lihai Nie
Tong Li
Tiansheng Huang
Zheli Liu
40
1
0
10 Aug 2025
Stackelberg Coupling of Online Representation Learning and Reinforcement Learning
Stackelberg Coupling of Online Representation Learning and Reinforcement Learning
Fernando Martinez
Tao Li
Y. Lu
Juntao Chen
OffRL
60
0
0
10 Aug 2025
Sparsity-Driven Plasticity in Multi-Task Reinforcement Learning
Sparsity-Driven Plasticity in Multi-Task Reinforcement Learning
Aleksandar Todorov
Juan Cardenas-Cartagena
Rafael F. Cunha
Marco Zullich
Matthia Sabatelli
CLL
48
1
0
09 Aug 2025
TurboTrain: Towards Efficient and Balanced Multi-Task Learning for Multi-Agent Perception and Prediction
TurboTrain: Towards Efficient and Balanced Multi-Task Learning for Multi-Agent Perception and Prediction
Zewei Zhou
Seth Z. Zhao
Tianhui Cai
Zhiyu Huang
Bolei Zhou
Jiaqi Ma
40
3
0
06 Aug 2025
Efficient Inter-Task Attention for Multitask Transformer Models
Efficient Inter-Task Attention for Multitask Transformer Models
Christian Bohn
Thomas Kurbiel
Klaus Friedrichs
Hasan Tercan
Tobias Meisen
58
0
0
06 Aug 2025
Momentum-integrated Multi-task Stock Recommendation with Converge-based Optimization
Momentum-integrated Multi-task Stock Recommendation with Converge-based Optimization
Hao Wang
Jingshu Peng
Yanyan Shen
Xujia Li
Lei Chen
AI4TSAIFin
40
0
0
05 Aug 2025
Learning Dynamics of Meta-Learning in Small Model Pretraining
Learning Dynamics of Meta-Learning in Small Model Pretraining
David Demitri Africa
Yuval Weiss
P. Buttery
Richard Diehl Martinez
AI4CE
118
2
0
04 Aug 2025
Revisiting Replay and Gradient Alignment for Continual Pre-Training of Large Language Models
Revisiting Replay and Gradient Alignment for Continual Pre-Training of Large Language Models
Istabrak Abbes
G. Subbaraj
Matthew D Riemer
Nizar Islah
Benjamin Thérien
Tsuguchika Tabaru
Hiroaki Kingetsu
Sarath Chandar
Irina Rish
CLL
51
0
0
03 Aug 2025
Benchmarking Massively Parallelized Multi-Task Reinforcement Learning for Robotics Tasks
Benchmarking Massively Parallelized Multi-Task Reinforcement Learning for Robotics Tasks
Vira Joshi
Zifan Xu
Bo Liu
Peter Stone
Amy Zhang
OffRL
131
4
0
31 Jul 2025
Suppressing Gradient Conflict for Generalizable Deepfake Detection
Suppressing Gradient Conflict for Generalizable Deepfake Detection
Ming-Hui Liu
Harry Cheng
Xin Luo
Xin-Shun Xu
72
0
0
29 Jul 2025
Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning
Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning
Zedong Wang
Siyuan Li
Dan Xu
59
0
0
28 Jul 2025
Enhancing Stability of Physics-Informed Neural Network Training Through Saddle-Point Reformulation
Enhancing Stability of Physics-Informed Neural Network Training Through Saddle-Point Reformulation
Dmitry Bylinkin
Mikhail Aleksandrov
S. Chezhegov
Aleksandr Beznosikov
28
0
0
21 Jul 2025
Resolving Token-Space Gradient Conflicts: Token Space Manipulation for Transformer-Based Multi-Task Learning
Resolving Token-Space Gradient Conflicts: Token Space Manipulation for Transformer-Based Multi-Task Learning
Wooseong Jeong
Kuk-Jin Yoon
152
0
0
10 Jul 2025
Not Only Consistency: Enhance Test-Time Adaptation with Spatio-temporal Inconsistency for Remote Physiological Measurement
Not Only Consistency: Enhance Test-Time Adaptation with Spatio-temporal Inconsistency for Remote Physiological Measurement
Xiao Yang
Jiyao Wang
Yuxuan Fan
Can Liu
Houcheng Su
Weichen Guo
Zitong Yu
Dengbo He
Kaishun Wu
TTA
115
2
0
10 Jul 2025
Interaction-Merged Motion Planning: Effectively Leveraging Diverse Motion Datasets for Robust Planning
Interaction-Merged Motion Planning: Effectively Leveraging Diverse Motion Datasets for Robust Planning
Giwon Lee
Wooseong Jeong
Daehee Park
Jaewoo Jeong
Kuk-Jin Yoon
158
0
0
07 Jul 2025
Dealing with the Evil Twins: Improving Random Augmentation by Addressing Catastrophic Forgetting of Diverse Augmentations
Dealing with the Evil Twins: Improving Random Augmentation by Addressing Catastrophic Forgetting of Diverse Augmentations
Dongkyu Cho
Rumi Chunara
150
0
0
01 Jul 2025
LW2G: Learning Whether to Grow for Prompt-based Continual Learning
LW2G: Learning Whether to Grow for Prompt-based Continual Learning
Qian Feng
Dawei Zhou
Hanbin Zhao
Chao Zhang
Jiahua Dong
Dengxin Dai
Hui Qian
VLMCLL
182
7
0
01 Jul 2025
MotionGPT3: Human Motion as a Second Modality
MotionGPT3: Human Motion as a Second Modality
Bingfan Zhu
Biao Jiang
S. Wang
Shixiang Tang
Tao Chen
Linjie Luo
Youyi Zheng
Xin Chen
63
0
0
30 Jun 2025
Vision Generalist Model: A Survey
Vision Generalist Model: A SurveyInternational Journal of Computer Vision (IJCV), 2025
Ziyi Wang
Yongming Rao
Shuofeng Sun
Xinrun Liu
Yi Wei
...
Zuyan Liu
Yanbo Wang
Hongmin Liu
Jie Zhou
Jiwen Lu
181
0
0
11 Jun 2025
Intention-Conditioned Flow Occupancy Models
Chongyi Zheng
S. Park
Sergey Levine
Benjamin Eysenbach
AI4TSOffRLAI4CE
161
1
0
10 Jun 2025
StableMTL: Repurposing Latent Diffusion Models for Multi-Task Learning from Partially Annotated Synthetic Datasets
StableMTL: Repurposing Latent Diffusion Models for Multi-Task Learning from Partially Annotated Synthetic Datasets
Anh-Quan Cao
Ivan Lopes
Raoul de Charette
123
0
0
09 Jun 2025
A Framework for Controllable Multi-objective Learning with Annealed Stein Variational Hypernetworks
A Framework for Controllable Multi-objective Learning with Annealed Stein Variational Hypernetworks
Minh-Duc Nguyen
Dung D. Le
138
0
0
07 Jun 2025
AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification
AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification
Geonwoo Cho
Jaemoon Lee
Jaegyun Im
Subi Lee
Jihwan Lee
Sundong Kim
107
0
0
06 Jun 2025
Gradient Similarity Surgery in Multi-Task Deep Learning
Gradient Similarity Surgery in Multi-Task Deep Learning
Thomas Borsani
Andrea Rosani
Giuseppe Nicosia
Giuseppe Di Fatta
MedIm
117
2
0
06 Jun 2025
Multilevel neural simulation-based inference
Multilevel neural simulation-based inference
Yuga Hikida
Ayush Bharti
Niall Jeffrey
F. Briol
176
2
0
06 Jun 2025
Previous
12345...131415
Next