ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.09139
  4. Cited By
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning
  in NLP Using Fewer Parameters & Less Data

Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data

19 September 2020
Jonathan Pilault
Amine Elhattami
C. Pal
    CLL
    MoE
ArXivPDFHTML

Papers citing "Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data"

50 / 52 papers shown
Title
GeneralizeFormer: Layer-Adaptive Model Generation across Test-Time Distribution Shifts
GeneralizeFormer: Layer-Adaptive Model Generation across Test-Time Distribution Shifts
Sameer Ambekar
Zehao Xiao
Xiantong Zhen
Cees G. M. Snoek
OOD
60
0
0
15 Feb 2025
SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASR
SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASR
Pengcheng Guo
Xuankai Chang
Hang Lv
Shinji Watanabe
Lei Xie
54
0
0
07 Dec 2024
USTCCTSU at SemEval-2024 Task 1: Reducing Anisotropy for Cross-lingual
  Semantic Textual Relatedness Task
USTCCTSU at SemEval-2024 Task 1: Reducing Anisotropy for Cross-lingual Semantic Textual Relatedness Task
Jianjian Li
Shengwei Liang
Yong Liao
Hongping Deng
Haiyang Yu
63
2
0
28 Nov 2024
Designing Domain-Specific Large Language Models: The Critical Role of
  Fine-Tuning in Public Opinion Simulation
Designing Domain-Specific Large Language Models: The Critical Role of Fine-Tuning in Public Opinion Simulation
Haocheng Lin
ALM
19
0
0
28 Sep 2024
GO4Align: Group Optimization for Multi-Task Alignment
GO4Align: Group Optimization for Multi-Task Alignment
Jiayi Shen
Cheems Wang
Zehao Xiao
N. V. Noord
M. Worring
27
1
0
09 Apr 2024
A Cross-View Hierarchical Graph Learning Hypernetwork for Skill
  Demand-Supply Joint Prediction
A Cross-View Hierarchical Graph Learning Hypernetwork for Skill Demand-Supply Joint Prediction
Wenshuo Chao
Zhaopeng Qiu
Likang Wu
Zhuoning Guo
Zhi Zheng
Hengshu Zhu
Hao Liu
32
5
0
31 Jan 2024
Natural Language Processing Through Transfer Learning: A Case Study on
  Sentiment Analysis
Natural Language Processing Through Transfer Learning: A Case Study on Sentiment Analysis
Aman Yadav
A. Vichare
14
1
0
28 Nov 2023
Dynamics Generalisation in Reinforcement Learning via Adaptive
  Context-Aware Policies
Dynamics Generalisation in Reinforcement Learning via Adaptive Context-Aware Policies
Michael Beukman
Devon Jarvis
Richard Klein
Steven D. James
Benjamin Rosman
15
10
0
25 Oct 2023
Interpreting and Exploiting Functional Specialization in Multi-Head
  Attention under Multi-task Learning
Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning
Chong Li
Shaonan Wang
Yunhao Zhang
Jiajun Zhang
Chengqing Zong
17
4
0
16 Oct 2023
Denoising Task Routing for Diffusion Models
Denoising Task Routing for Diffusion Models
Byeongjun Park
Sangmin Woo
Hyojun Go
Jin-Young Kim
Changick Kim
DiffM
14
18
0
11 Oct 2023
ScaLearn: Simple and Highly Parameter-Efficient Task Transfer by
  Learning to Scale
ScaLearn: Simple and Highly Parameter-Efficient Task Transfer by Learning to Scale
Markus Frohmann
Carolin Holtermann
Shahed Masoudian
Anne Lauscher
Navid Rekabsaz
13
2
0
02 Oct 2023
Challenges and Opportunities of Using Transformer-Based Multi-Task
  Learning in NLP Through ML Lifecycle: A Survey
Challenges and Opportunities of Using Transformer-Based Multi-Task Learning in NLP Through ML Lifecycle: A Survey
Lovre Torbarina
Tin Ferkovic
Lukasz Roguski
Velimir Mihelčić
Bruno Šarlija
Z. Kraljevic
19
5
0
16 Aug 2023
When Multi-Task Learning Meets Partial Supervision: A Computer Vision
  Review
When Multi-Task Learning Meets Partial Supervision: A Computer Vision Review
Maxime Fontana
Michael W. Spratling
Miaojing Shi
28
5
0
25 Jul 2023
SINC: Self-Supervised In-Context Learning for Vision-Language Tasks
SINC: Self-Supervised In-Context Learning for Vision-Language Tasks
Yi-Syuan Chen
Yun-Zhu Song
Cheng Yu Yeo
Bei Liu
Jianlong Fu
Hong-Han Shuai
VLM
LRM
21
4
0
15 Jul 2023
NatLogAttack: A Framework for Attacking Natural Language Inference
  Models with Natural Logic
NatLogAttack: A Framework for Attacking Natural Language Inference Models with Natural Logic
Zióu Zheng
Xiao-Dan Zhu
AAML
LRM
36
5
0
06 Jul 2023
On Conditional and Compositional Language Model Differentiable Prompting
On Conditional and Compositional Language Model Differentiable Prompting
Jonathan Pilault
Can Liu
Mohit Bansal
Markus Dreyer
22
1
0
04 Jul 2023
From the One, Judge of the Whole: Typed Entailment Graph Construction
  with Predicate Generation
From the One, Judge of the Whole: Typed Entailment Graph Construction with Predicate Generation
Zhibin Chen
Yansong Feng
Dongyan Zhao
11
0
0
07 Jun 2023
Weakly-Supervised Speech Pre-training: A Case Study on Target Speech
  Recognition
Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition
Wangyou Zhang
Y. Qian
30
10
0
25 May 2023
UniS-MMC: Multimodal Classification via Unimodality-supervised
  Multimodal Contrastive Learning
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning
Heqing Zou
Meng Shen
Chen Chen
Yuchen Hu
D. Rajan
Chng Eng Siong
SSL
29
15
0
16 May 2023
Modular Deep Learning
Modular Deep Learning
Jonas Pfeiffer
Sebastian Ruder
Ivan Vulić
E. Ponti
MoMe
OOD
19
73
0
22 Feb 2023
PrefixMol: Target- and Chemistry-aware Molecule Design via Prefix
  Embedding
PrefixMol: Target- and Chemistry-aware Molecule Design via Prefix Embedding
Zhangyang Gao
Yuqi Hu
Cheng Tan
Stan Z. Li
18
13
0
14 Feb 2023
UniSumm and SummZoo: Unified Model and Diverse Benchmark for Few-Shot
  Summarization
UniSumm and SummZoo: Unified Model and Diverse Benchmark for Few-Shot Summarization
Yulong Chen
Yang Liu
Ruochen Xu
Ziyi Yang
Chenguang Zhu
Michael Zeng
Yue Zhang
24
17
0
17 Nov 2022
Adapting self-supervised models to multi-talker speech recognition using
  speaker embeddings
Adapting self-supervised models to multi-talker speech recognition using speaker embeddings
Zili Huang
Desh Raj
Leibny Paola García-Perera
Sanjeev Khudanpur
73
21
0
01 Nov 2022
M$^3$ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task
  Learning with Model-Accelerator Co-design
M3^33ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design
Hanxue Liang
Zhiwen Fan
Rishov Sarkar
Ziyu Jiang
Tianlong Chen
Kai Zou
Yu Cheng
Cong Hao
Zhangyang Wang
MoE
19
79
0
26 Oct 2022
Using Graph Algorithms to Pretrain Graph Completion Transformers
Using Graph Algorithms to Pretrain Graph Completion Transformers
Jonathan Pilault
Mikhail Galkin
Bahare Fatemi
Perouz Taslakian
David Vasquez
C. Pal
17
0
0
14 Oct 2022
Modularized Transfer Learning with Multiple Knowledge Graphs for
  Zero-shot Commonsense Reasoning
Modularized Transfer Learning with Multiple Knowledge Graphs for Zero-shot Commonsense Reasoning
Yu Jin Kim
Beong-woo Kwak
Youngwook Kim
Reinald Kim Amplayo
Seung-won Hwang
Jinyoung Yeo
LRM
11
12
0
08 Jun 2022
All Birds with One Stone: Multi-task Text Classification for Efficient
  Inference with One Forward Pass
All Birds with One Stone: Multi-task Text Classification for Efficient Inference with One Forward Pass
Jiaxin Huang
Tianqi Liu
Jialu Liu
Á. Lelkes
Cong Yu
Jiawei Han
27
1
0
22 May 2022
Hyperdecoders: Instance-specific decoders for multi-task NLP
Hyperdecoders: Instance-specific decoders for multi-task NLP
Hamish Ivison
Matthew E. Peters
AI4CE
17
20
0
15 Mar 2022
Parameter-Efficient Mixture-of-Experts Architecture for Pre-trained
  Language Models
Parameter-Efficient Mixture-of-Experts Architecture for Pre-trained Language Models
Ze-Feng Gao
Peiyu Liu
Wayne Xin Zhao
Zhong-Yi Lu
Ji-Rong Wen
MoE
14
25
0
02 Mar 2022
HyperTransformer: Model Generation for Supervised and Semi-Supervised
  Few-Shot Learning
HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning
A. Zhmoginov
Mark Sandler
Max Vladymyrov
ViT
20
68
0
11 Jan 2022
VL-Adapter: Parameter-Efficient Transfer Learning for
  Vision-and-Language Tasks
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks
Yi-Lin Sung
Jaemin Cho
Mohit Bansal
VLM
VPVLM
16
339
0
13 Dec 2021
Analysis and Prediction of NLP Models Via Task Embeddings
Analysis and Prediction of NLP Models Via Task Embeddings
Damien Sileo
Marie-Francine Moens
8
3
0
10 Dec 2021
Many Heads but One Brain: Fusion Brain -- a Competition and a Single
  Multimodal Multitask Architecture
Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask Architecture
Daria Bakshandaeva
Denis Dimitrov
V.Ya. Arkhipkin
Alex Shonenkov
M. Potanin
...
Mikhail Martynov
Anton Voronov
Vera Davydova
E. Tutubalina
Aleksandr Petiushko
33
0
0
22 Nov 2021
Kronecker Factorization for Preventing Catastrophic Forgetting in
  Large-scale Medical Entity Linking
Kronecker Factorization for Preventing Catastrophic Forgetting in Large-scale Medical Entity Linking
Denis Jered McInerney
Luyang Kong
Kristjan Arumae
Byron C. Wallace
Parminder Bhatia
CLL
14
1
0
11 Nov 2021
BERT-DRE: BERT with Deep Recursive Encoder for Natural Language Sentence
  Matching
BERT-DRE: BERT with Deep Recursive Encoder for Natural Language Sentence Matching
Ehsan Tavan
A. Rahmati
M. Najafi
Saeed Bibak
Zahed Rahmati
25
5
0
03 Nov 2021
Investigating the Effect of Natural Language Explanations on
  Out-of-Distribution Generalization in Few-shot NLI
Investigating the Effect of Natural Language Explanations on Out-of-Distribution Generalization in Few-shot NLI
Yangqiaoyu Zhou
Chenhao Tan
11
8
0
12 Oct 2021
CoRGi: Content-Rich Graph Neural Networks with Attention
CoRGi: Content-Rich Graph Neural Networks with Attention
Jooyeon Kim
A. Lamb
Simon Woodhead
Simon L. Peyton Jones
Cheng Zheng
Miltiadis Allamanis
28
6
0
10 Oct 2021
Sequential Reptile: Inter-Task Gradient Alignment for Multilingual
  Learning
Sequential Reptile: Inter-Task Gradient Alignment for Multilingual Learning
Seanie Lee
Haebeom Lee
Juho Lee
S. Hwang
MoMe
CLL
19
16
0
06 Oct 2021
BeliefBank: Adding Memory to a Pre-Trained Language Model for a
  Systematic Notion of Belief
BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief
Nora Kassner
Oyvind Tafjord
Hinrich Schütze
Peter Clark
KELM
LRM
223
64
0
29 Sep 2021
The Trade-offs of Domain Adaptation for Neural Language Models
The Trade-offs of Domain Adaptation for Neural Language Models
David Grangier
Dan Iter
19
21
0
21 Sep 2021
Multi-Task Learning in Natural Language Processing: An Overview
Multi-Task Learning in Natural Language Processing: An Overview
Shijie Chen
Yu Zhang
Qiang Yang
AIMat
39
98
0
19 Sep 2021
Improving Scheduled Sampling with Elastic Weight Consolidation for
  Neural Machine Translation
Improving Scheduled Sampling with Elastic Weight Consolidation for Neural Machine Translation
Michalis Korakakis
Andreas Vlachos
CLL
13
2
0
13 Sep 2021
Are Training Resources Insufficient? Predict First Then Explain!
Are Training Resources Insufficient? Predict First Then Explain!
Myeongjun Jang
Thomas Lukasiewicz
LRM
13
7
0
29 Aug 2021
Turning Tables: Generating Examples from Semi-structured Tables for
  Endowing Language Models with Reasoning Skills
Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning Skills
Ori Yoran
Alon Talmor
Jonathan Berant
ReLM
LRM
172
53
0
15 Jul 2021
Compacter: Efficient Low-Rank Hypercomplex Adapter Layers
Compacter: Efficient Low-Rank Hypercomplex Adapter Layers
Rabeeh Karimi Mahabadi
James Henderson
Sebastian Ruder
MoE
17
463
0
08 Jun 2021
A Survey of Transformers
A Survey of Transformers
Tianyang Lin
Yuxin Wang
Xiangyang Liu
Xipeng Qiu
ViT
8
1,077
0
08 Jun 2021
Multi-hop Graph Convolutional Network with High-order Chebyshev
  Approximation for Text Reasoning
Multi-hop Graph Convolutional Network with High-order Chebyshev Approximation for Text Reasoning
Shuoran Jiang
Qingcai Chen
Xin Liu
Baotian Hu
Lisai Zhang
13
3
0
08 Jun 2021
NLP-IIS@UT at SemEval-2021 Task 4: Machine Reading Comprehension using
  the Long Document Transformer
NLP-IIS@UT at SemEval-2021 Task 4: Machine Reading Comprehension using the Long Document Transformer
Hossein Basafa
Sajad Movahedi
A. Ebrahimi
A. Shakery
Heshaam Faili
RALM
8
2
0
08 May 2021
Supervising Model Attention with Human Explanations for Robust Natural
  Language Inference
Supervising Model Attention with Human Explanations for Robust Natural Language Inference
Joe Stacey
Yonatan Belinkov
Marek Rei
15
44
0
16 Apr 2021
Self-Explaining Structures Improve NLP Models
Self-Explaining Structures Improve NLP Models
Zijun Sun
Chun Fan
Qinghong Han
Xiaofei Sun
Yuxian Meng
Fei Wu
Jiwei Li
MILM
XAI
LRM
FAtt
23
38
0
03 Dec 2020
12
Next