Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2009.09139
Cited By
v1
v2
v3 (latest)
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data
International Conference on Learning Representations (ICLR), 2020
19 September 2020
Jonathan Pilault
Amine Elhattami
C. Pal
CLL
MoE
Re-assign community
ArXiv (abs)
PDF
HTML
Github (56★)
Papers citing
"Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data"
50 / 53 papers shown
NTKMTL: Mitigating Task Imbalance in Multi-Task Learning from Neural Tangent Kernel Perspective
Xiaohan Qin
Xiaoxing Wang
Ning Liao
Junchi Yan
128
0
0
21 Oct 2025
Multi-task Learning with Active Learning for Arabic Offensive Speech Detection
Aisha Alansari
Hamzah Luqman
216
0
0
03 Jun 2025
GeneralizeFormer: Layer-Adaptive Model Generation across Test-Time Distribution Shifts
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Sameer Ambekar
Zehao Xiao
Xiantong Zhen
Cees G. M. Snoek
OOD
432
2
0
15 Feb 2025
SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASR
IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2024
Pengcheng Guo
Xuankai Chang
Hang Lv
Shinji Watanabe
Lei Xie
271
6
0
07 Dec 2024
USTCCTSU at SemEval-2024 Task 1: Reducing Anisotropy for Cross-lingual Semantic Textual Relatedness Task
International Workshop on Semantic Evaluation (SemEval), 2024
Jianjian Li
Shengwei Liang
Yong Liao
Hongping Deng
Haiyang Yu
346
2
0
28 Nov 2024
Designing Domain-Specific Large Language Models: The Critical Role of Fine-Tuning in Public Opinion Simulation
Haocheng Lin
ALM
141
3
0
28 Sep 2024
GO4Align: Group Optimization for Multi-Task Alignment
Neural Information Processing Systems (NeurIPS), 2024
Jiayi Shen
Cheems Wang
Zehao Xiao
Nanne van Noord
M. Worring
188
13
0
09 Apr 2024
A Cross-View Hierarchical Graph Learning Hypernetwork for Skill Demand-Supply Joint Prediction
Wenshuo Chao
Zhaopeng Qiu
Likang Wu
Zhuoning Guo
Zhi Zheng
Hengshu Zhu
Hao Liu
367
6
0
31 Jan 2024
Natural Language Processing Through Transfer Learning: A Case Study on Sentiment Analysis
Aman Yadav
A. Vichare
117
1
0
28 Nov 2023
Dynamics Generalisation in Reinforcement Learning via Adaptive Context-Aware Policies
Neural Information Processing Systems (NeurIPS), 2023
Michael Beukman
Devon Jarvis
Richard Klein
Steven D. James
Benjamin Rosman
288
21
0
25 Oct 2023
Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Chong Li
Shaonan Wang
Yunhao Zhang
Jiajun Zhang
Chengqing Zong
212
7
0
16 Oct 2023
Denoising Task Routing for Diffusion Models
International Conference on Learning Representations (ICLR), 2023
Byeongjun Park
Sangmin Woo
Hyojun Go
Jin-Young Kim
Changick Kim
DiffM
531
25
0
11 Oct 2023
ScaLearn: Simple and Highly Parameter-Efficient Task Transfer by Learning to Scale
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Markus Frohmann
Carolin Holtermann
Shahed Masoudian
Anne Lauscher
Navid Rekabsaz
342
2
0
02 Oct 2023
Challenges and Opportunities of Using Transformer-Based Multi-Task Learning in NLP Through ML Lifecycle: A Survey
Lovre Torbarina
Tin Ferkovic
Lukasz Roguski
Velimir Mihelčić
Bruno Šarlija
Z. Kraljevic
216
6
0
16 Aug 2023
When Multi-Task Learning Meets Partial Supervision: A Computer Vision Review
Proceedings of the IEEE (Proc. IEEE), 2023
Maxime Fontana
Michael W. Spratling
Miaojing Shi
270
10
0
25 Jul 2023
SINC: Self-Supervised In-Context Learning for Vision-Language Tasks
IEEE International Conference on Computer Vision (ICCV), 2023
Yi-Syuan Chen
Yun-Zhu Song
Cheng Yu Yeo
Bei Liu
Jianlong Fu
Hong-Han Shuai
VLM
LRM
260
7
0
15 Jul 2023
NatLogAttack: A Framework for Attacking Natural Language Inference Models with Natural Logic
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Zióu Zheng
Xiao-Dan Zhu
AAML
LRM
282
6
0
06 Jul 2023
On Conditional and Compositional Language Model Differentiable Prompting
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Jonathan Pilault
Can Liu
Joey Tianyi Zhou
Markus Dreyer
183
1
0
04 Jul 2023
From the One, Judge of the Whole: Typed Entailment Graph Construction with Predicate Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Zhibin Chen
Yansong Feng
Dongyan Zhao
121
0
0
07 Jun 2023
Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition
Interspeech (Interspeech), 2023
Wangyou Zhang
Y. Qian
242
12
0
25 May 2023
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Heqing Zou
Meng Shen
Chen Chen
Yuchen Hu
D. Rajan
Chng Eng Siong
SSL
225
26
0
16 May 2023
Modular Deep Learning
Jonas Pfeiffer
Sebastian Ruder
Ivan Vulić
Edoardo Ponti
MoMe
OOD
437
103
0
22 Feb 2023
PrefixMol: Target- and Chemistry-aware Molecule Design via Prefix Embedding
Zhangyang Gao
Yuqi Hu
Cheng Tan
Stan Z. Li
279
17
0
14 Feb 2023
UniSumm and SummZoo: Unified Model and Diverse Benchmark for Few-Shot Summarization
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Yulong Chen
Yang Liu
Ruochen Xu
Ziyi Yang
Chenguang Zhu
Michael Zeng
Yue Zhang
320
21
0
17 Nov 2022
Adapting self-supervised models to multi-talker speech recognition using speaker embeddings
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zili Huang
Desh Raj
Leibny Paola García-Perera
Sanjeev Khudanpur
324
36
0
01 Nov 2022
M
3
^3
3
ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design
Neural Information Processing Systems (NeurIPS), 2022
Hanxue Liang
Zhiwen Fan
Rishov Sarkar
Ziyu Jiang
Tianlong Chen
Kai Zou
Yu Cheng
Cong Hao
Zinan Lin
MoE
247
130
0
26 Oct 2022
Using Graph Algorithms to Pretrain Graph Completion Transformers
Jonathan Pilault
Mikhail Galkin
Bahare Fatemi
Perouz Taslakian
David Vasquez
C. Pal
192
0
0
14 Oct 2022
Modularized Transfer Learning with Multiple Knowledge Graphs for Zero-shot Commonsense Reasoning
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Yu Jin Kim
Beong-woo Kwak
Youngwook Kim
Reinald Kim Amplayo
Seung-won Hwang
Jinyoung Yeo
LRM
212
16
0
08 Jun 2022
All Birds with One Stone: Multi-task Text Classification for Efficient Inference with One Forward Pass
Jiaxin Huang
Tianqi Liu
Jialu Liu
Á. Lelkes
Cong Yu
Jiawei Han
205
1
0
22 May 2022
Hyperdecoders: Instance-specific decoders for multi-task NLP
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Michal Guerquin
Matthew E. Peters
AI4CE
349
23
0
15 Mar 2022
Parameter-Efficient Mixture-of-Experts Architecture for Pre-trained Language Models
International Conference on Computational Linguistics (COLING), 2022
Ze-Feng Gao
Peiyu Liu
Wayne Xin Zhao
Zhong-Yi Lu
Ji-Rong Wen
MoE
279
31
0
02 Mar 2022
HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning
International Conference on Machine Learning (ICML), 2022
A. Zhmoginov
Mark Sandler
Max Vladymyrov
ViT
334
78
0
11 Jan 2022
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks
Yi-Lin Sung
Jaemin Cho
Joey Tianyi Zhou
VLM
VPVLM
338
433
0
13 Dec 2021
Analysis and Prediction of NLP Models Via Task Embeddings
Damien Sileo
Marie-Francine Moens
107
6
0
10 Dec 2021
Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask Architecture
Daria Bakshandaeva
Denis Dimitrov
V.Ya. Arkhipkin
Alex Shonenkov
M. Potanin
...
Mikhail Martynov
Anton Voronov
Vera Davydova
E. Tutubalina
Aleksandr Petiushko
353
0
0
22 Nov 2021
Kronecker Factorization for Preventing Catastrophic Forgetting in Large-scale Medical Entity Linking
Denis Jered McInerney
Luyang Kong
Kristjan Arumae
Byron C. Wallace
Parminder Bhatia
CLL
158
1
0
11 Nov 2021
BERT-DRE: BERT with Deep Recursive Encoder for Natural Language Sentence Matching
Ehsan Tavan
A. Rahmati
M. Najafi
Saeed Bibak
Zahed Rahmati
257
6
0
03 Nov 2021
Investigating the Effect of Natural Language Explanations on Out-of-Distribution Generalization in Few-shot NLI
First Workshop on Insights from Negative Results in NLP (Insights), 2021
Yangqiaoyu Zhou
Chenhao Tan
96
10
0
12 Oct 2021
CoRGi: Content-Rich Graph Neural Networks with Attention
Knowledge Discovery and Data Mining (KDD), 2021
Jooyeon Kim
A. Lamb
Simon Woodhead
Simon L. Peyton Jones
Cheng Zheng
Miltiadis Allamanis
178
6
0
10 Oct 2021
Sequential Reptile: Inter-Task Gradient Alignment for Multilingual Learning
Seanie Lee
Haebeom Lee
Juho Lee
Sung Ju Hwang
MoMe
CLL
372
19
0
06 Oct 2021
BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief
Nora Kassner
Oyvind Tafjord
Hinrich Schütze
Peter Clark
KELM
LRM
463
68
0
29 Sep 2021
The Trade-offs of Domain Adaptation for Neural Language Models
David Grangier
Dan Iter
178
22
0
21 Sep 2021
Multi-Task Learning in Natural Language Processing: An Overview
Shijie Chen
Yu Zhang
Qiang Yang
AIMat
267
160
0
19 Sep 2021
Improving Scheduled Sampling with Elastic Weight Consolidation for Neural Machine Translation
Michalis Korakakis
Andreas Vlachos
CLL
200
2
0
13 Sep 2021
Are Training Resources Insufficient? Predict First Then Explain!
Myeongjun Jang
Thomas Lukasiewicz
LRM
174
7
0
29 Aug 2021
Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning Skills
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Ori Yoran
Alon Talmor
Jonathan Berant
ReLM
LRM
368
55
0
15 Jul 2021
Compacter: Efficient Low-Rank Hypercomplex Adapter Layers
Neural Information Processing Systems (NeurIPS), 2021
Rabeeh Karimi Mahabadi
James Henderson
Sebastian Ruder
MoE
404
582
0
08 Jun 2021
A Survey of Transformers
AI Open (AO), 2021
Tianyang Lin
Yuxin Wang
Xiangyang Liu
Xipeng Qiu
ViT
441
1,380
0
08 Jun 2021
Multi-hop Graph Convolutional Network with High-order Chebyshev Approximation for Text Reasoning
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Shuoran Jiang
Qingcai Chen
Xin Liu
Baotian Hu
Lisai Zhang
111
3
0
08 Jun 2021
NLP-IIS@UT at SemEval-2021 Task 4: Machine Reading Comprehension using the Long Document Transformer
International Workshop on Semantic Evaluation (SemEval), 2021
Hossein Basafa
Sajad Movahedi
A. Ebrahimi
A. Shakery
Heshaam Faili
RALM
168
2
0
08 May 2021
1
2
Next