Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2009.09139
Cited By
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data
19 September 2020
Jonathan Pilault
Amine Elhattami
C. Pal
CLL
MoE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data"
50 / 52 papers shown
Title
GeneralizeFormer: Layer-Adaptive Model Generation across Test-Time Distribution Shifts
Sameer Ambekar
Zehao Xiao
Xiantong Zhen
Cees G. M. Snoek
OOD
60
0
0
15 Feb 2025
SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASR
Pengcheng Guo
Xuankai Chang
Hang Lv
Shinji Watanabe
Lei Xie
54
0
0
07 Dec 2024
USTCCTSU at SemEval-2024 Task 1: Reducing Anisotropy for Cross-lingual Semantic Textual Relatedness Task
Jianjian Li
Shengwei Liang
Yong Liao
Hongping Deng
Haiyang Yu
63
2
0
28 Nov 2024
Designing Domain-Specific Large Language Models: The Critical Role of Fine-Tuning in Public Opinion Simulation
Haocheng Lin
ALM
19
0
0
28 Sep 2024
GO4Align: Group Optimization for Multi-Task Alignment
Jiayi Shen
Cheems Wang
Zehao Xiao
N. V. Noord
M. Worring
27
1
0
09 Apr 2024
A Cross-View Hierarchical Graph Learning Hypernetwork for Skill Demand-Supply Joint Prediction
Wenshuo Chao
Zhaopeng Qiu
Likang Wu
Zhuoning Guo
Zhi Zheng
Hengshu Zhu
Hao Liu
32
5
0
31 Jan 2024
Natural Language Processing Through Transfer Learning: A Case Study on Sentiment Analysis
Aman Yadav
A. Vichare
14
1
0
28 Nov 2023
Dynamics Generalisation in Reinforcement Learning via Adaptive Context-Aware Policies
Michael Beukman
Devon Jarvis
Richard Klein
Steven D. James
Benjamin Rosman
15
10
0
25 Oct 2023
Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning
Chong Li
Shaonan Wang
Yunhao Zhang
Jiajun Zhang
Chengqing Zong
17
4
0
16 Oct 2023
Denoising Task Routing for Diffusion Models
Byeongjun Park
Sangmin Woo
Hyojun Go
Jin-Young Kim
Changick Kim
DiffM
14
18
0
11 Oct 2023
ScaLearn: Simple and Highly Parameter-Efficient Task Transfer by Learning to Scale
Markus Frohmann
Carolin Holtermann
Shahed Masoudian
Anne Lauscher
Navid Rekabsaz
13
2
0
02 Oct 2023
Challenges and Opportunities of Using Transformer-Based Multi-Task Learning in NLP Through ML Lifecycle: A Survey
Lovre Torbarina
Tin Ferkovic
Lukasz Roguski
Velimir Mihelčić
Bruno Šarlija
Z. Kraljevic
19
5
0
16 Aug 2023
When Multi-Task Learning Meets Partial Supervision: A Computer Vision Review
Maxime Fontana
Michael W. Spratling
Miaojing Shi
28
5
0
25 Jul 2023
SINC: Self-Supervised In-Context Learning for Vision-Language Tasks
Yi-Syuan Chen
Yun-Zhu Song
Cheng Yu Yeo
Bei Liu
Jianlong Fu
Hong-Han Shuai
VLM
LRM
21
4
0
15 Jul 2023
NatLogAttack: A Framework for Attacking Natural Language Inference Models with Natural Logic
Zióu Zheng
Xiao-Dan Zhu
AAML
LRM
36
5
0
06 Jul 2023
On Conditional and Compositional Language Model Differentiable Prompting
Jonathan Pilault
Can Liu
Mohit Bansal
Markus Dreyer
22
1
0
04 Jul 2023
From the One, Judge of the Whole: Typed Entailment Graph Construction with Predicate Generation
Zhibin Chen
Yansong Feng
Dongyan Zhao
11
0
0
07 Jun 2023
Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition
Wangyou Zhang
Y. Qian
30
10
0
25 May 2023
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning
Heqing Zou
Meng Shen
Chen Chen
Yuchen Hu
D. Rajan
Chng Eng Siong
SSL
29
15
0
16 May 2023
Modular Deep Learning
Jonas Pfeiffer
Sebastian Ruder
Ivan Vulić
E. Ponti
MoMe
OOD
19
73
0
22 Feb 2023
PrefixMol: Target- and Chemistry-aware Molecule Design via Prefix Embedding
Zhangyang Gao
Yuqi Hu
Cheng Tan
Stan Z. Li
18
13
0
14 Feb 2023
UniSumm and SummZoo: Unified Model and Diverse Benchmark for Few-Shot Summarization
Yulong Chen
Yang Liu
Ruochen Xu
Ziyi Yang
Chenguang Zhu
Michael Zeng
Yue Zhang
24
17
0
17 Nov 2022
Adapting self-supervised models to multi-talker speech recognition using speaker embeddings
Zili Huang
Desh Raj
Leibny Paola García-Perera
Sanjeev Khudanpur
73
21
0
01 Nov 2022
M
3
^3
3
ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design
Hanxue Liang
Zhiwen Fan
Rishov Sarkar
Ziyu Jiang
Tianlong Chen
Kai Zou
Yu Cheng
Cong Hao
Zhangyang Wang
MoE
19
79
0
26 Oct 2022
Using Graph Algorithms to Pretrain Graph Completion Transformers
Jonathan Pilault
Mikhail Galkin
Bahare Fatemi
Perouz Taslakian
David Vasquez
C. Pal
17
0
0
14 Oct 2022
Modularized Transfer Learning with Multiple Knowledge Graphs for Zero-shot Commonsense Reasoning
Yu Jin Kim
Beong-woo Kwak
Youngwook Kim
Reinald Kim Amplayo
Seung-won Hwang
Jinyoung Yeo
LRM
11
12
0
08 Jun 2022
All Birds with One Stone: Multi-task Text Classification for Efficient Inference with One Forward Pass
Jiaxin Huang
Tianqi Liu
Jialu Liu
Á. Lelkes
Cong Yu
Jiawei Han
27
1
0
22 May 2022
Hyperdecoders: Instance-specific decoders for multi-task NLP
Hamish Ivison
Matthew E. Peters
AI4CE
17
20
0
15 Mar 2022
Parameter-Efficient Mixture-of-Experts Architecture for Pre-trained Language Models
Ze-Feng Gao
Peiyu Liu
Wayne Xin Zhao
Zhong-Yi Lu
Ji-Rong Wen
MoE
14
25
0
02 Mar 2022
HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning
A. Zhmoginov
Mark Sandler
Max Vladymyrov
ViT
20
68
0
11 Jan 2022
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks
Yi-Lin Sung
Jaemin Cho
Mohit Bansal
VLM
VPVLM
16
339
0
13 Dec 2021
Analysis and Prediction of NLP Models Via Task Embeddings
Damien Sileo
Marie-Francine Moens
8
3
0
10 Dec 2021
Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask Architecture
Daria Bakshandaeva
Denis Dimitrov
V.Ya. Arkhipkin
Alex Shonenkov
M. Potanin
...
Mikhail Martynov
Anton Voronov
Vera Davydova
E. Tutubalina
Aleksandr Petiushko
33
0
0
22 Nov 2021
Kronecker Factorization for Preventing Catastrophic Forgetting in Large-scale Medical Entity Linking
Denis Jered McInerney
Luyang Kong
Kristjan Arumae
Byron C. Wallace
Parminder Bhatia
CLL
14
1
0
11 Nov 2021
BERT-DRE: BERT with Deep Recursive Encoder for Natural Language Sentence Matching
Ehsan Tavan
A. Rahmati
M. Najafi
Saeed Bibak
Zahed Rahmati
25
5
0
03 Nov 2021
Investigating the Effect of Natural Language Explanations on Out-of-Distribution Generalization in Few-shot NLI
Yangqiaoyu Zhou
Chenhao Tan
11
8
0
12 Oct 2021
CoRGi: Content-Rich Graph Neural Networks with Attention
Jooyeon Kim
A. Lamb
Simon Woodhead
Simon L. Peyton Jones
Cheng Zheng
Miltiadis Allamanis
28
6
0
10 Oct 2021
Sequential Reptile: Inter-Task Gradient Alignment for Multilingual Learning
Seanie Lee
Haebeom Lee
Juho Lee
S. Hwang
MoMe
CLL
19
16
0
06 Oct 2021
BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief
Nora Kassner
Oyvind Tafjord
Hinrich Schütze
Peter Clark
KELM
LRM
223
64
0
29 Sep 2021
The Trade-offs of Domain Adaptation for Neural Language Models
David Grangier
Dan Iter
19
21
0
21 Sep 2021
Multi-Task Learning in Natural Language Processing: An Overview
Shijie Chen
Yu Zhang
Qiang Yang
AIMat
39
98
0
19 Sep 2021
Improving Scheduled Sampling with Elastic Weight Consolidation for Neural Machine Translation
Michalis Korakakis
Andreas Vlachos
CLL
13
2
0
13 Sep 2021
Are Training Resources Insufficient? Predict First Then Explain!
Myeongjun Jang
Thomas Lukasiewicz
LRM
13
7
0
29 Aug 2021
Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning Skills
Ori Yoran
Alon Talmor
Jonathan Berant
ReLM
LRM
172
53
0
15 Jul 2021
Compacter: Efficient Low-Rank Hypercomplex Adapter Layers
Rabeeh Karimi Mahabadi
James Henderson
Sebastian Ruder
MoE
17
463
0
08 Jun 2021
A Survey of Transformers
Tianyang Lin
Yuxin Wang
Xiangyang Liu
Xipeng Qiu
ViT
8
1,077
0
08 Jun 2021
Multi-hop Graph Convolutional Network with High-order Chebyshev Approximation for Text Reasoning
Shuoran Jiang
Qingcai Chen
Xin Liu
Baotian Hu
Lisai Zhang
13
3
0
08 Jun 2021
NLP-IIS@UT at SemEval-2021 Task 4: Machine Reading Comprehension using the Long Document Transformer
Hossein Basafa
Sajad Movahedi
A. Ebrahimi
A. Shakery
Heshaam Faili
RALM
8
2
0
08 May 2021
Supervising Model Attention with Human Explanations for Robust Natural Language Inference
Joe Stacey
Yonatan Belinkov
Marek Rei
15
44
0
16 Apr 2021
Self-Explaining Structures Improve NLP Models
Zijun Sun
Chun Fan
Qinghong Han
Xiaofei Sun
Yuxian Meng
Fei Wu
Jiwei Li
MILM
XAI
LRM
FAtt
23
38
0
03 Dec 2020
1
2
Next