Papers
Communities
Organizations
Events
Blog
Pricing
Feedback
Contact Sales
Search
Open menu
Home
Papers
2001.06782
Cited By
v1
v2
v3
v4 (latest)
Gradient Surgery for Multi-Task Learning
19 January 2020
Tianhe Yu
Saurabh Kumar
Abhishek Gupta
Sergey Levine
Karol Hausman
Chelsea Finn
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Gradient Surgery for Multi-Task Learning"
50 / 694 papers shown
Title
Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline Data
Chongyi Zheng
Benjamin Eysenbach
Homer Walke
Patrick Yin
Kuan Fang
Ruslan Salakhutdinov
Sergey Levine
OffRL
SSL
174
7
0
06 Jun 2023
MultiAdam: Parameter-wise Scale-invariant Optimizer for Multiscale Training of Physics-informed Neural Networks
J. Yao
Chang Su
Zhongkai Hao
Songming Liu
Hang Su
Jun Zhu
ODL
PINN
AI4CE
103
17
0
05 Jun 2023
Biologically-Motivated Learning Model for Instructed Visual Processing
R. Abel
S. Ullman
111
0
0
04 Jun 2023
Finding the SWEET Spot: Analysis and Improvement of Adaptive Inference in Low Resource Settings
Daniel Rotem
Michael Hassid
Jonathan Mamou
Roy Schwartz
109
6
0
04 Jun 2023
Efficient Multi-Task and Transfer Reinforcement Learning with Parameter-Compositional Framework
Lingfeng Sun
Haichao Zhang
Wei Xu
Masayoshi Tomizuka
165
10
0
02 Jun 2023
Addressing Negative Transfer in Diffusion Models
Hyojun Go
Jinyoung Kim
Yunsung Lee
Seunghyun Lee
Shinhyeok Oh
Hyeongdon Moon
Seungtaek Choi
DiffM
VLM
244
27
0
01 Jun 2023
Three-Way Trade-Off in Multi-Objective Learning: Optimization, Generalization and Conflict-Avoidance
Lisha Chen
H. Fernando
Yiming Ying
Tianyi Chen
139
28
0
31 May 2023
Learning Task-preferred Inference Routes for Gradient De-conflict in Multi-output DNNs
Yi Sun
Xin Xu
Jiaqiang Li
Xiaochang Hu
Yifei Shi
L. Zeng
310
3
0
31 May 2023
Independent Component Alignment for Multi-Task Learning
Dmitry Senushkin
Nikolay Patakin
Arseny Kuznetsov
Anton Konushin
CVBM
120
62
0
30 May 2023
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
Haoran He
Chenjia Bai
Kang Xu
Zhuoran Yang
Weinan Zhang
Dong Wang
Bingyan Zhao
Xuelong Li
DiffM
OffRL
146
107
0
29 May 2023
Direction-oriented Multi-objective Learning: Simple and Provable Stochastic Algorithms
Peiyao Xiao
Hao Ban
Kaiyi Ji
181
25
0
28 May 2023
Meta-learning For Vision-and-language Cross-lingual Transfer
Hanxu Hu
Frank Keller
VLM
96
3
0
24 May 2023
When Does Aggregating Multiple Skills with Multi-Task Learning Work? A Case Study in Financial NLP
Jingwei Ni
Zhijing Jin
Qian Wang
Mrinmaya Sachan
Markus Leippold
AIFin
106
6
0
23 May 2023
How do languages influence each other? Studying cross-lingual data sharing during LM fine-tuning
Rochelle Choenni
Dan Garrette
Ekaterina Shutova
135
19
0
22 May 2023
Multi-behavior Self-supervised Learning for Recommendation
Jingcao Xu
Chaokun Wang
Cheng Wu
Yang Song
Kai Zheng
Xiaowei Wang
Changping Wang
Guorui Zhou
Kun Gai
112
50
0
22 May 2023
Mitigating Catastrophic Forgetting in Task-Incremental Continual Learning with Adaptive Classification Criterion
Yun Luo
Xiaotian Lin
Zhen Yang
Fandong Meng
Jie Zhou
Yue Zhang
CLL
92
6
0
20 May 2023
Multi-Task Models Adversarial Attacks
Lijun Zhang
Xiao Liu
Kaleel Mahmood
Caiwen Ding
Hui Guan
AAML
155
0
0
20 May 2023
FedAds: A Benchmark for Privacy-Preserving CVR Estimation with Vertical Federated Learning
Penghui Wei
Hongjian Dou
Shaoguo Liu
Rong Tang
Li Liu
Liangji Wang
Bo Zheng
FedML
104
14
0
15 May 2023
Meta Omnium: A Benchmark for General-Purpose Learning-to-Learn
Ondrej Bohdal
Yinbing Tian
Yongshuo Zong
Ruchika Chavhan
Da Li
Henry Gouk
Li Guo
Timothy M. Hospedales
163
6
0
12 May 2023
Multi-Task Structural Learning using Local Task Similarity induced Neuron Creation and Removal
Naresh Gurulingan
Bahram Zonooz
Elahe Arani
103
2
0
30 Apr 2023
EDAPS: Enhanced Domain-Adaptive Panoptic Segmentation
Suman Saha
Lukas Hoyer
Anton Obukhov
Dengxin Dai
Luc Van Gool
158
16
0
27 Apr 2023
Generating Adversarial Examples with Task Oriented Multi-Objective Optimization
Anh-Vu Bui
Trung Le
He Zhao
Quan Hung Tran
Paul Montague
Dinh Q. Phung
AAML
93
2
0
26 Apr 2023
Provably Feedback-Efficient Reinforcement Learning via Active Reward Learning
Dingwen Kong
Lin F. Yang
132
14
0
18 Apr 2023
Chinese Open Instruction Generalist: A Preliminary Release
Ge Zhang
Yemin Shi
Ruibo Liu
Ruibin Yuan
Yizhi Li
...
Zhaoqun Li
Zekun Wang
Chenghua Lin
Wen-Fen Huang
Jie Fu
ALM
175
33
0
17 Apr 2023
Structured Pruning for Multi-Task Deep Neural Networks
Siddhant Garg
Lijun Zhang
Hui Guan
75
1
0
13 Apr 2023
On the Pareto Front of Multilingual Neural Machine Translation
Liang Chen
Shuming Ma
Dongdong Zhang
Furu Wei
Baobao Chang
MoE
112
7
0
06 Apr 2023
A Transformer-Based Deep Learning Approach for Fairly Predicting Post-Liver Transplant Risk Factors
Can Li
Xiaoqian Jiang
Kai Zhang
MedIm
96
18
0
05 Apr 2023
CGDTest: A Constrained Gradient Descent Algorithm for Testing Neural Networks
Vineel Nagisetty
Laura Graves
Guanting Pan
Piyush Jha
Vijay Ganesh
AAML
OOD
84
1
0
04 Apr 2023
Implicit Visual Bias Mitigation by Posterior Estimate Sharpening of a Bayesian Neural Network
Rebecca S Stone
Nishant Ravikumar
A. Bulpitt
David C. Hogg
BDL
121
0
0
29 Mar 2023
Identification of Negative Transfers in Multitask Learning Using Surrogate Models
Dongyue Li
Huy Le Nguyen
Hongyang R. Zhang
113
15
0
25 Mar 2023
BigSmall: Efficient Multi-Task Learning for Disparate Spatial and Temporal Physiological Measurements
Girish Narayanswamy
Yujia Liu
Yuzhe Yang
Chengqian Ma
Xin Liu
Daniel J. McDuff
Shwetak N. Patel
136
23
0
21 Mar 2023
Efficient Diffusion Training via Min-SNR Weighting Strategy
Tiankai Hang
Shuyang Gu
Chen Li
Jianmin Bao
Dong Chen
Han Hu
Xin Geng
B. Guo
205
181
0
16 Mar 2023
Improving physics-informed neural networks with meta-learned optimization
Alexander Bihlo
PINN
144
23
0
13 Mar 2023
ViM: Vision Middleware for Unified Downstream Transferring
Yutong Feng
Biao Gong
Jianwen Jiang
Yiliang Lv
Yujun Shen
Deli Zhao
Jingren Zhou
125
2
0
13 Mar 2023
RotoGBML: Towards Out-of-Distribution Generalization for Gradient-Based Meta-Learning
Min Zhang
Zifeng Zhuang
Zhitao Wang
Xuetao Zhang
Wen-Bin Li
97
5
0
12 Mar 2023
Continual Visual Reinforcement Learning with A Life-Long World Model
Wendong Zhang
Wendong Zhang
Geng Chen
Siyu Gao
Yunbo Wang
Xiaokang Yang
Xiaokang Yang
CLL
167
4
0
12 Mar 2023
Gradient Coordination for Quantifying and Maximizing Knowledge Transference in Multi-Task Learning
Xuanhua Yang
Jianxin R. Zhao
Shaoguo Liu
Liangji Wang
Bo Zheng
87
1
0
10 Mar 2023
HumanBench: Towards General Human-centric Perception with Projector Assisted Pretraining
Weizhen He
Cheng Chen
Qingsong Xie
Meilin Chen
Yizhou Wang
...
Feng Zhu
Haiyang Yang
Li Yi
Rui Zhao
Wanli Ouyang
VLM
154
41
0
10 Mar 2023
SemEval-2023 Task 10: Explainable Detection of Online Sexism
Hannah Rose Kirk
Wenjie Yin
Bertie Vidgen
Paul Röttger
135
129
0
07 Mar 2023
Meta-Learning with Adaptive Weighted Loss for Imbalanced Cold-Start Recommendation
Minchang Kim
Yongjin Yang
Jung Hyun Ryu
Taesup Kim
OffRL
68
7
0
28 Feb 2023
Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations
Ziyu Jiang
Yinpeng Chen
Mengchen Liu
Dongdong Chen
Xiyang Dai
Lu Yuan
Zicheng Liu
Zhangyang Wang
SSL
VLM
CLIP
126
19
0
27 Feb 2023
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition
Yuchen Hu
Chen Chen
Ruizhe Li
Qiu-shi Zhu
Eng Siong Chng
152
15
0
22 Feb 2023
Recon: Reducing Conflicting Gradients from the Root for Multi-Task Learning
Guangyuan Shi
Qimai Li
Wenlong Zhang
Jiaxin Chen
Xiao-Ming Wu
164
47
0
22 Feb 2023
MaxGNR: A Dynamic Weight Strategy via Maximizing Gradient-to-Noise Ratio for Multi-Task Learning
Caoyun Fan
Wenqing Chen
Jidong Tian
Yitian Li
Hao He
Yaohui Jin
56
3
0
18 Feb 2023
Task-Specific Skill Localization in Fine-tuned Language Models
A. Panigrahi
Nikunj Saunshi
Haoyu Zhao
Sanjeev Arora
MoMe
143
85
0
13 Feb 2023
GAT: Guided Adversarial Training with Pareto-optimal Auxiliary Tasks
Salah Ghamizi
Jingfeng Zhang
Maxime Cordy
Mike Papadakis
Masashi Sugiyama
Yves Le Traon
AAML
96
5
0
06 Feb 2023
Multipath agents for modular multitask ML systems
Andrea Gesmundo
109
1
0
06 Feb 2023
Gradient Estimation for Unseen Domain Risk Minimization with Pre-Trained Models
Byunggyu Lew
Donghyun Son
Buru Chang
103
10
0
03 Feb 2023
Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary Data
Alon Albalak
Colin Raffel
William Yang Wang
145
13
0
01 Feb 2023
QMP: Q-switch Mixture of Policies for Multi-Task Behavior Sharing
Grace Zhang
Ayush Jain
Injune Hwang
Shao-Hua Sun
Joseph J. Lim
130
5
0
01 Feb 2023
Previous
1
2
3
...
8
9
10
...
12
13
14
Next