Papers
Communities
Organizations
Events
Blog
Pricing
Feedback
Contact Sales
Search
Open menu
Home
Papers
2001.06782
Cited By
v1
v2
v3
v4 (latest)
Gradient Surgery for Multi-Task Learning
19 January 2020
Tianhe Yu
Saurabh Kumar
Abhishek Gupta
Sergey Levine
Karol Hausman
Chelsea Finn
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Gradient Surgery for Multi-Task Learning"
50 / 694 papers shown
Title
Distribution Matching for Multi-Task Learning of Classification Tasks: a Large-Scale Study on Faces & Beyond
D. Kollias
V. Sharmanska
Stefanos Zafeiriou
CVBM
182
59
0
02 Jan 2024
Elastic Multi-Gradient Descent for Parallel Continual Learning
Fan Lyu
Wei Feng
Yuepan Li
Qing Sun
Fanhua Shang
Liang Wan
Liang Wang
107
2
0
02 Jan 2024
Masked Modeling for Self-supervised Representation Learning on Vision and Beyond
Siyuan Li
Luyuan Zhang
Zedong Wang
Di Wu
Lirong Wu
...
Jun Xia
Cheng Tan
Yang Liu
Baigui Sun
Stan Z. Li
SSL
149
22
0
31 Dec 2023
Gradient Shaping for Multi-Constraint Safe Reinforcement Learning
Yi-Fan Yao
Zuxin Liu
Zhepeng Cen
Peide Huang
Tingnan Zhang
Wenhao Yu
Ding Zhao
OffRL
203
7
0
23 Dec 2023
FoodLMM: A Versatile Food Assistant using Large Multi-modal Model
Yuehao Yin
Huiyan Qi
B. Zhu
Jingjing Chen
Yu-Gang Jiang
Chong-Wah Ngo
124
24
0
22 Dec 2023
Not All Tasks Are Equally Difficult: Multi-Task Deep Reinforcement Learning with Dynamic Depth Routing
Jinmin He
Kai Li
Yifan Zang
Haobo Fu
Qiang Fu
Junliang Xing
Jian Cheng
MoE
134
8
0
22 Dec 2023
E2E-AT: A Unified Framework for Tackling Uncertainty in Task-aware End-to-end Learning
Wangkun Xu
Jianhong Wang
Fei Teng
54
5
0
17 Dec 2023
Multitask Learning Can Improve Worst-Group Outcomes
Atharva Kulkarni
Lucio Dery
Amrith Rajagopal Setlur
Aditi Raghunathan
Ameet Talwalkar
Graham Neubig
112
2
0
05 Dec 2023
Learning to Compose SuperWeights for Neural Parameter Allocation Search
Piotr Teterwak
Soren Nelson
Nikoli Dryden
D. Bashkirova
Kate Saenko
Bryan A. Plummer
128
2
0
03 Dec 2023
SEPSIS: I Can Catch Your Lies -- A New Paradigm for Deception Detection
Anku Rani
Dwip Dalal
Shreya Gautam
Pankaj Gupta
Vinija Jain
Aman Chadha
Amit P. Sheth
Amitava Das
124
1
0
01 Dec 2023
Handling Cost and Constraints with Off-Policy Deep Reinforcement Learning
Jared Markowitz
Jesse Silverberg
Gary Collins
OffRL
62
0
0
30 Nov 2023
Enhancing the Performance of Neural Networks Through Causal Discovery and Integration of Domain Knowledge
Xiaoge Zhang
Xiao-Lin Wang
Fenglei Fan
Yiu-ming Cheung
Indranil Bose
150
2
0
29 Nov 2023
MultiGPrompt for Multi-Task Pre-Training and Prompting on Graphs
Xingtong Yu
Chang Zhou
Yuan Fang
Xinming Zhang
217
43
0
28 Nov 2023
Exactly conservative physics-informed neural networks and deep operator networks for dynamical systems
E. Cardoso-Bihlo
Alex Bihlo
AI4CE
PINN
122
10
0
23 Nov 2023
FedHCA
2
^2
2
: Towards Hetero-Client Federated Multi-Task Learning
Yuxiang Lu
Suizhi Huang
Yuwen Yang
Shalayiding Sirejiding
Yue Ding
Hongtao Lu
FedML
151
8
0
22 Nov 2023
Fast-Slow Test-Time Adaptation for Online Vision-and-Language Navigation
Junyu Gao
Xuan Yao
Changsheng Xu
TTA
208
10
0
22 Nov 2023
Attention Deficit is Ordered! Fooling Deformable Vision Transformers with Collaborative Adversarial Patches
Quazi Mishkatul Alam
Bilel Tarchoun
Ihsen Alouani
Nael B. Abu-Ghazaleh
AAML
ViT
99
1
0
21 Nov 2023
Adaptive Training Distributions with Scalable Online Bilevel Optimization
David Grangier
Pierre Ablin
Awni Y. Hannun
126
11
0
20 Nov 2023
Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts
Ahmed Hendawy
Jan Peters
Carlo DÉramo
MoE
104
27
0
19 Nov 2023
MELA: Multilingual Evaluation of Linguistic Acceptability
Ziyin Zhang
Yikang Liu
Wei-Ping Huang
Junyu Mao
Rui Wang
Hai Hu
105
10
0
15 Nov 2023
Examining Modularity in Multilingual LMs via Language-Specialized Subnetworks
Rochelle Choenni
Ekaterina Shutova
Daniel H Garrette
117
10
0
14 Nov 2023
Examining Common Paradigms in Multi-Task Learning
Cathrin Elich
Lukas Kirchdorfer
Jan M. Kohler
Lukas Schott
160
3
0
08 Nov 2023
Massive Editing for Large Language Models via Meta Learning
Chenmien Tan
Ge Zhang
Jie Fu
KELM
153
48
0
08 Nov 2023
Diffusion Models for Reinforcement Learning: A Survey
Zhengbang Zhu
Hanye Zhao
Haoran He
Yichao Zhong
Shenyu Zhang
Haoquan Guo
Tingting Chen
Weinan Zhang
243
76
0
02 Nov 2023
Learning to optimize by multi-gradient for multi-objective optimization
Linxi Yang
Xinmin Yang
L. Tang
128
1
0
01 Nov 2023
Implicit biases in multitask and continual learning from a backward error analysis perspective
Benoit Dherin
144
3
0
01 Nov 2023
Episodic Multi-Task Learning with Heterogeneous Neural Processes
Jiayi Shen
Xiantong Zhen
Qi
Qi Wang
M. Worring
101
14
0
28 Oct 2023
From Molecules to Materials: Pre-training Large Generalizable Models for Atomic Property Prediction
Nima Shoghi
Adeesh Kolluru
John R. Kitchin
Zachary W. Ulissi
C. L. Zitnick
Brandon M. Wood
AI4CE
179
41
0
25 Oct 2023
Mitigate Domain Shift by Primary-Auxiliary Objectives Association for Generalizing Person ReID
Qilei Li
Shaogang Gong
107
5
0
24 Oct 2023
NetDistiller: Empowering Tiny Deep Learning via In-Situ Distillation
Shunyao Zhang
Y. Fu
Shang Wu
Jyotikrishna Dass
Haoran You
Yingyan Lin
Lin
UQCV
FedML
128
0
0
24 Oct 2023
GradSim: Gradient-Based Language Grouping for Effective Multilingual Training
Mingyang Wang
Heike Adel
Lukas Lange
Jannik Strötgen
Hinrich Schütze
122
4
0
23 Oct 2023
FairBranch: Fairness Conflict Correction on Task-group Branches for Fair Multi-Task Learning
Arjun Roy
C. Koutlis
Symeon Papadopoulos
Eirini Ntoutsi
85
0
0
20 Oct 2023
Adaptive Neural Ranking Framework: Toward Maximized Business Goal for Cascade Ranking Systems
Yunli Wang
Zhiqiang Wang
Jian Yang
Shiyang Wen
Dongying Kong
Han Li
Kun Gai
125
14
0
16 Oct 2023
Scalarization for Multi-Task and Multi-Domain Learning at Scale
Amelie Royer
Tijmen Blankevoort
B. Bejnordi
113
22
0
13 Oct 2023
PolyTask: Learning Unified Policies through Behavior Distillation
Siddhant Haldar
Lerrel Pinto
108
9
0
12 Oct 2023
Denoising Task Routing for Diffusion Models
Byeongjun Park
Sangmin Woo
Hyojun Go
Jin-Young Kim
Changick Kim
DiffM
161
19
0
11 Oct 2023
Factorized Tensor Networks for Multi-Task and Multi-Domain Learning
Yash Garg
Nebiyou Yismaw
Rakib Hyder
Ashley Prater-Bennette
M. Salman Asif
74
2
0
09 Oct 2023
Multiple Physics Pretraining for Physical Surrogate Models
Michael McCabe
Bruno Régaldo-Saint Blancard
Liam Parker
Ruben Ohana
M. Cranmer
...
Francois Lanusse
Mariel Pettee
Tiberiu Teşileanu
Kyunghyun Cho
Shirley Ho
PINN
AI4CE
136
66
0
04 Oct 2023
AdaMerging: Adaptive Model Merging for Multi-Task Learning
Enneng Yang
Zhenyi Wang
Li Shen
Shiwei Liu
Guibing Guo
Xingwei Wang
Dacheng Tao
MoMe
176
143
0
04 Oct 2023
Modularity in Deep Learning: A Survey
Haozhe Sun
Isabelle Guyon
MoMe
151
3
0
02 Oct 2023
Multi-task Learning with 3D-Aware Regularization
Weihong Li
Jingyu Sun
A. Leonardis
Hakan Bilen
95
4
0
02 Oct 2023
Error Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation Models
Tianjian Li
Haoran Xu
Philipp Koehn
Daniel Khashabi
Kenton W. Murray
96
4
0
02 Oct 2023
HarmonyDream: Task Harmonization Inside World Models
Haoyu Ma
Jialong Wu
Ningya Feng
Chenjun Xiao
Dong Li
Jianye Hao
Jianmin Wang
Mingsheng Long
88
12
0
30 Sep 2023
MORPH: Design Co-optimization with Reinforcement Learning via a Differentiable Hardware Model Proxy
Zhanpeng He
Hanze Dong
71
6
0
29 Sep 2023
Distill Knowledge in Multi-task Reinforcement Learning with Optimal-Transport Regularization
Bang Giang Le
Viet-Cuong Ta
OT
119
1
0
27 Sep 2023
Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model
Jiamin Xie
Ke Li
Jinxi Guo
Andros Tjandra
Shangguan Yuan
Leda Sari
Chunyang Wu
Junteng Jia
Jay Mahadeokar
Ozlem Kalinli
167
3
0
22 Sep 2023
Multi-Task Cooperative Learning via Searching for Flat Minima
Fuping Wu
Le Zhang
Yang Sun
Yuanhan Mo
Thomas Nichols
Bartłomiej W. Papież
115
2
0
21 Sep 2023
GCL: Gradient-Guided Contrastive Learning for Medical Image Segmentation with Multi-Perspective Meta Labels
YiXuan Wu
Jintai Chen
Jiahuan Yan
Yiheng Zhu
Benlin Liu
Jian Wu
VLM
107
6
0
16 Sep 2023
Projected Task-Specific Layers for Multi-Task Reinforcement Learning
Josselin Somerville Roberts
Julia Di
89
1
0
15 Sep 2023
Tackling the Non-IID Issue in Heterogeneous Federated Learning by Gradient Harmonization
Xinyu Zhang
Weiyu Sun
Ying-Cong Chen
FedML
136
9
0
13 Sep 2023
Previous
1
2
3
...
6
7
8
...
12
13
14
Next