ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.09139
  4. Cited By
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning
  in NLP Using Fewer Parameters & Less Data
v1v2v3 (latest)

Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data

International Conference on Learning Representations (ICLR), 2020
19 September 2020
Jonathan Pilault
Amine Elhattami
C. Pal
    CLLMoE
ArXiv (abs)PDFHTMLGithub (56★)

Papers citing "Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data"

50 / 53 papers shown
NTKMTL: Mitigating Task Imbalance in Multi-Task Learning from Neural Tangent Kernel Perspective
NTKMTL: Mitigating Task Imbalance in Multi-Task Learning from Neural Tangent Kernel Perspective
Xiaohan Qin
Xiaoxing Wang
Ning Liao
Junchi Yan
128
0
0
21 Oct 2025
Multi-task Learning with Active Learning for Arabic Offensive Speech Detection
Multi-task Learning with Active Learning for Arabic Offensive Speech Detection
Aisha Alansari
Hamzah Luqman
216
0
0
03 Jun 2025
GeneralizeFormer: Layer-Adaptive Model Generation across Test-Time Distribution Shifts
GeneralizeFormer: Layer-Adaptive Model Generation across Test-Time Distribution ShiftsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Sameer Ambekar
Zehao Xiao
Xiantong Zhen
Cees G. M. Snoek
OOD
432
2
0
15 Feb 2025
SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASR
SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASRIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2024
Pengcheng Guo
Xuankai Chang
Hang Lv
Shinji Watanabe
Lei Xie
271
6
0
07 Dec 2024
USTCCTSU at SemEval-2024 Task 1: Reducing Anisotropy for Cross-lingual
  Semantic Textual Relatedness Task
USTCCTSU at SemEval-2024 Task 1: Reducing Anisotropy for Cross-lingual Semantic Textual Relatedness TaskInternational Workshop on Semantic Evaluation (SemEval), 2024
Jianjian Li
Shengwei Liang
Yong Liao
Hongping Deng
Haiyang Yu
346
2
0
28 Nov 2024
Designing Domain-Specific Large Language Models: The Critical Role of
  Fine-Tuning in Public Opinion Simulation
Designing Domain-Specific Large Language Models: The Critical Role of Fine-Tuning in Public Opinion Simulation
Haocheng Lin
ALM
141
3
0
28 Sep 2024
GO4Align: Group Optimization for Multi-Task Alignment
GO4Align: Group Optimization for Multi-Task AlignmentNeural Information Processing Systems (NeurIPS), 2024
Jiayi Shen
Cheems Wang
Zehao Xiao
Nanne van Noord
M. Worring
188
13
0
09 Apr 2024
A Cross-View Hierarchical Graph Learning Hypernetwork for Skill
  Demand-Supply Joint Prediction
A Cross-View Hierarchical Graph Learning Hypernetwork for Skill Demand-Supply Joint Prediction
Wenshuo Chao
Zhaopeng Qiu
Likang Wu
Zhuoning Guo
Zhi Zheng
Hengshu Zhu
Hao Liu
367
6
0
31 Jan 2024
Natural Language Processing Through Transfer Learning: A Case Study on
  Sentiment Analysis
Natural Language Processing Through Transfer Learning: A Case Study on Sentiment Analysis
Aman Yadav
A. Vichare
117
1
0
28 Nov 2023
Dynamics Generalisation in Reinforcement Learning via Adaptive
  Context-Aware Policies
Dynamics Generalisation in Reinforcement Learning via Adaptive Context-Aware PoliciesNeural Information Processing Systems (NeurIPS), 2023
Michael Beukman
Devon Jarvis
Richard Klein
Steven D. James
Benjamin Rosman
288
21
0
25 Oct 2023
Interpreting and Exploiting Functional Specialization in Multi-Head
  Attention under Multi-task Learning
Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Chong Li
Shaonan Wang
Yunhao Zhang
Jiajun Zhang
Chengqing Zong
212
7
0
16 Oct 2023
Denoising Task Routing for Diffusion Models
Denoising Task Routing for Diffusion ModelsInternational Conference on Learning Representations (ICLR), 2023
Byeongjun Park
Sangmin Woo
Hyojun Go
Jin-Young Kim
Changick Kim
DiffM
531
25
0
11 Oct 2023
ScaLearn: Simple and Highly Parameter-Efficient Task Transfer by
  Learning to Scale
ScaLearn: Simple and Highly Parameter-Efficient Task Transfer by Learning to ScaleAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Markus Frohmann
Carolin Holtermann
Shahed Masoudian
Anne Lauscher
Navid Rekabsaz
342
2
0
02 Oct 2023
Challenges and Opportunities of Using Transformer-Based Multi-Task
  Learning in NLP Through ML Lifecycle: A Survey
Challenges and Opportunities of Using Transformer-Based Multi-Task Learning in NLP Through ML Lifecycle: A Survey
Lovre Torbarina
Tin Ferkovic
Lukasz Roguski
Velimir Mihelčić
Bruno Šarlija
Z. Kraljevic
216
6
0
16 Aug 2023
When Multi-Task Learning Meets Partial Supervision: A Computer Vision
  Review
When Multi-Task Learning Meets Partial Supervision: A Computer Vision ReviewProceedings of the IEEE (Proc. IEEE), 2023
Maxime Fontana
Michael W. Spratling
Miaojing Shi
270
10
0
25 Jul 2023
SINC: Self-Supervised In-Context Learning for Vision-Language Tasks
SINC: Self-Supervised In-Context Learning for Vision-Language TasksIEEE International Conference on Computer Vision (ICCV), 2023
Yi-Syuan Chen
Yun-Zhu Song
Cheng Yu Yeo
Bei Liu
Jianlong Fu
Hong-Han Shuai
VLMLRM
260
7
0
15 Jul 2023
NatLogAttack: A Framework for Attacking Natural Language Inference
  Models with Natural Logic
NatLogAttack: A Framework for Attacking Natural Language Inference Models with Natural LogicAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Zióu Zheng
Xiao-Dan Zhu
AAMLLRM
282
6
0
06 Jul 2023
On Conditional and Compositional Language Model Differentiable Prompting
On Conditional and Compositional Language Model Differentiable PromptingInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Jonathan Pilault
Can Liu
Joey Tianyi Zhou
Markus Dreyer
183
1
0
04 Jul 2023
From the One, Judge of the Whole: Typed Entailment Graph Construction
  with Predicate Generation
From the One, Judge of the Whole: Typed Entailment Graph Construction with Predicate GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Zhibin Chen
Yansong Feng
Dongyan Zhao
121
0
0
07 Jun 2023
Weakly-Supervised Speech Pre-training: A Case Study on Target Speech
  Recognition
Weakly-Supervised Speech Pre-training: A Case Study on Target Speech RecognitionInterspeech (Interspeech), 2023
Wangyou Zhang
Y. Qian
242
12
0
25 May 2023
UniS-MMC: Multimodal Classification via Unimodality-supervised
  Multimodal Contrastive Learning
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Heqing Zou
Meng Shen
Chen Chen
Yuchen Hu
D. Rajan
Chng Eng Siong
SSL
225
26
0
16 May 2023
Modular Deep Learning
Modular Deep Learning
Jonas Pfeiffer
Sebastian Ruder
Ivan Vulić
Edoardo Ponti
MoMeOOD
437
103
0
22 Feb 2023
PrefixMol: Target- and Chemistry-aware Molecule Design via Prefix
  Embedding
PrefixMol: Target- and Chemistry-aware Molecule Design via Prefix Embedding
Zhangyang Gao
Yuqi Hu
Cheng Tan
Stan Z. Li
279
17
0
14 Feb 2023
UniSumm and SummZoo: Unified Model and Diverse Benchmark for Few-Shot
  Summarization
UniSumm and SummZoo: Unified Model and Diverse Benchmark for Few-Shot SummarizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Yulong Chen
Yang Liu
Ruochen Xu
Ziyi Yang
Chenguang Zhu
Michael Zeng
Yue Zhang
320
21
0
17 Nov 2022
Adapting self-supervised models to multi-talker speech recognition using
  speaker embeddings
Adapting self-supervised models to multi-talker speech recognition using speaker embeddingsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zili Huang
Desh Raj
Leibny Paola García-Perera
Sanjeev Khudanpur
324
36
0
01 Nov 2022
M$^3$ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task
  Learning with Model-Accelerator Co-design
M3^33ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-designNeural Information Processing Systems (NeurIPS), 2022
Hanxue Liang
Zhiwen Fan
Rishov Sarkar
Ziyu Jiang
Tianlong Chen
Kai Zou
Yu Cheng
Cong Hao
Zinan Lin
MoE
247
130
0
26 Oct 2022
Using Graph Algorithms to Pretrain Graph Completion Transformers
Using Graph Algorithms to Pretrain Graph Completion Transformers
Jonathan Pilault
Mikhail Galkin
Bahare Fatemi
Perouz Taslakian
David Vasquez
C. Pal
192
0
0
14 Oct 2022
Modularized Transfer Learning with Multiple Knowledge Graphs for
  Zero-shot Commonsense Reasoning
Modularized Transfer Learning with Multiple Knowledge Graphs for Zero-shot Commonsense ReasoningNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022
Yu Jin Kim
Beong-woo Kwak
Youngwook Kim
Reinald Kim Amplayo
Seung-won Hwang
Jinyoung Yeo
LRM
212
16
0
08 Jun 2022
All Birds with One Stone: Multi-task Text Classification for Efficient
  Inference with One Forward Pass
All Birds with One Stone: Multi-task Text Classification for Efficient Inference with One Forward Pass
Jiaxin Huang
Tianqi Liu
Jialu Liu
Á. Lelkes
Cong Yu
Jiawei Han
205
1
0
22 May 2022
Hyperdecoders: Instance-specific decoders for multi-task NLP
Hyperdecoders: Instance-specific decoders for multi-task NLPConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Michal Guerquin
Matthew E. Peters
AI4CE
349
23
0
15 Mar 2022
Parameter-Efficient Mixture-of-Experts Architecture for Pre-trained
  Language Models
Parameter-Efficient Mixture-of-Experts Architecture for Pre-trained Language ModelsInternational Conference on Computational Linguistics (COLING), 2022
Ze-Feng Gao
Peiyu Liu
Wayne Xin Zhao
Zhong-Yi Lu
Ji-Rong Wen
MoE
279
31
0
02 Mar 2022
HyperTransformer: Model Generation for Supervised and Semi-Supervised
  Few-Shot Learning
HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot LearningInternational Conference on Machine Learning (ICML), 2022
A. Zhmoginov
Mark Sandler
Max Vladymyrov
ViT
334
78
0
11 Jan 2022
VL-Adapter: Parameter-Efficient Transfer Learning for
  Vision-and-Language Tasks
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks
Yi-Lin Sung
Jaemin Cho
Joey Tianyi Zhou
VLMVPVLM
338
433
0
13 Dec 2021
Analysis and Prediction of NLP Models Via Task Embeddings
Analysis and Prediction of NLP Models Via Task Embeddings
Damien Sileo
Marie-Francine Moens
107
6
0
10 Dec 2021
Many Heads but One Brain: Fusion Brain -- a Competition and a Single
  Multimodal Multitask Architecture
Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask Architecture
Daria Bakshandaeva
Denis Dimitrov
V.Ya. Arkhipkin
Alex Shonenkov
M. Potanin
...
Mikhail Martynov
Anton Voronov
Vera Davydova
E. Tutubalina
Aleksandr Petiushko
353
0
0
22 Nov 2021
Kronecker Factorization for Preventing Catastrophic Forgetting in
  Large-scale Medical Entity Linking
Kronecker Factorization for Preventing Catastrophic Forgetting in Large-scale Medical Entity Linking
Denis Jered McInerney
Luyang Kong
Kristjan Arumae
Byron C. Wallace
Parminder Bhatia
CLL
158
1
0
11 Nov 2021
BERT-DRE: BERT with Deep Recursive Encoder for Natural Language Sentence
  Matching
BERT-DRE: BERT with Deep Recursive Encoder for Natural Language Sentence Matching
Ehsan Tavan
A. Rahmati
M. Najafi
Saeed Bibak
Zahed Rahmati
257
6
0
03 Nov 2021
Investigating the Effect of Natural Language Explanations on
  Out-of-Distribution Generalization in Few-shot NLI
Investigating the Effect of Natural Language Explanations on Out-of-Distribution Generalization in Few-shot NLIFirst Workshop on Insights from Negative Results in NLP (Insights), 2021
Yangqiaoyu Zhou
Chenhao Tan
96
10
0
12 Oct 2021
CoRGi: Content-Rich Graph Neural Networks with Attention
CoRGi: Content-Rich Graph Neural Networks with AttentionKnowledge Discovery and Data Mining (KDD), 2021
Jooyeon Kim
A. Lamb
Simon Woodhead
Simon L. Peyton Jones
Cheng Zheng
Miltiadis Allamanis
178
6
0
10 Oct 2021
Sequential Reptile: Inter-Task Gradient Alignment for Multilingual
  Learning
Sequential Reptile: Inter-Task Gradient Alignment for Multilingual Learning
Seanie Lee
Haebeom Lee
Juho Lee
Sung Ju Hwang
MoMeCLL
372
19
0
06 Oct 2021
BeliefBank: Adding Memory to a Pre-Trained Language Model for a
  Systematic Notion of Belief
BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief
Nora Kassner
Oyvind Tafjord
Hinrich Schütze
Peter Clark
KELMLRM
463
68
0
29 Sep 2021
The Trade-offs of Domain Adaptation for Neural Language Models
The Trade-offs of Domain Adaptation for Neural Language Models
David Grangier
Dan Iter
178
22
0
21 Sep 2021
Multi-Task Learning in Natural Language Processing: An Overview
Multi-Task Learning in Natural Language Processing: An Overview
Shijie Chen
Yu Zhang
Qiang Yang
AIMat
267
160
0
19 Sep 2021
Improving Scheduled Sampling with Elastic Weight Consolidation for
  Neural Machine Translation
Improving Scheduled Sampling with Elastic Weight Consolidation for Neural Machine Translation
Michalis Korakakis
Andreas Vlachos
CLL
200
2
0
13 Sep 2021
Are Training Resources Insufficient? Predict First Then Explain!
Are Training Resources Insufficient? Predict First Then Explain!
Myeongjun Jang
Thomas Lukasiewicz
LRM
174
7
0
29 Aug 2021
Turning Tables: Generating Examples from Semi-structured Tables for
  Endowing Language Models with Reasoning Skills
Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning SkillsAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Ori Yoran
Alon Talmor
Jonathan Berant
ReLMLRM
368
55
0
15 Jul 2021
Compacter: Efficient Low-Rank Hypercomplex Adapter Layers
Compacter: Efficient Low-Rank Hypercomplex Adapter LayersNeural Information Processing Systems (NeurIPS), 2021
Rabeeh Karimi Mahabadi
James Henderson
Sebastian Ruder
MoE
404
582
0
08 Jun 2021
A Survey of Transformers
A Survey of TransformersAI Open (AO), 2021
Tianyang Lin
Yuxin Wang
Xiangyang Liu
Xipeng Qiu
ViT
441
1,380
0
08 Jun 2021
Multi-hop Graph Convolutional Network with High-order Chebyshev
  Approximation for Text Reasoning
Multi-hop Graph Convolutional Network with High-order Chebyshev Approximation for Text ReasoningAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Shuoran Jiang
Qingcai Chen
Xin Liu
Baotian Hu
Lisai Zhang
111
3
0
08 Jun 2021
NLP-IIS@UT at SemEval-2021 Task 4: Machine Reading Comprehension using
  the Long Document Transformer
NLP-IIS@UT at SemEval-2021 Task 4: Machine Reading Comprehension using the Long Document TransformerInternational Workshop on Semantic Evaluation (SemEval), 2021
Hossein Basafa
Sajad Movahedi
A. Ebrahimi
A. Shakery
Heshaam Faili
RALM
168
2
0
08 May 2021
12
Next