ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.05246
  4. Cited By
Learning to select data for transfer learning with Bayesian Optimization

Learning to select data for transfer learning with Bayesian Optimization

17 July 2017
Sebastian Ruder
Barbara Plank
ArXiv (abs)PDFHTML

Papers citing "Learning to select data for transfer learning with Bayesian Optimization"

50 / 93 papers shown
Title
Dynamic Fisher-weighted Model Merging via Bayesian Optimization
Dynamic Fisher-weighted Model Merging via Bayesian Optimization
Sanwoo Lee
Jiahao Liu
Qifan Wang
Jiadong Wang
Xunliang Cai
Yunfang Wu
MoMe
462
1
0
26 Apr 2025
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
Shizhe Diao
Yu Yang
Y. Fu
Xin Dong
Dan Su
...
Hongxu Yin
M. Patwary
Yingyan
Jan Kautz
Pavlo Molchanov
122
2
0
17 Apr 2025
Gradual Fine-Tuning with Graph Routing for Multi-Source Unsupervised
  Domain Adaptation
Gradual Fine-Tuning with Graph Routing for Multi-Source Unsupervised Domain Adaptation
Yao Ma
Samuel Louvan
Z. Wang
46
0
0
11 Nov 2024
Proxy-informed Bayesian transfer learning with unknown sources
Proxy-informed Bayesian transfer learning with unknown sources
Sabina J. Sloman
Julien Martinelli
Samuel Kaski
136
1
0
05 Nov 2024
TSDS: Data Selection for Task-Specific Model Finetuning
TSDS: Data Selection for Task-Specific Model Finetuning
Zifan Liu
Amin Karbasi
Theodoros Rekatsinas
73
6
0
15 Oct 2024
Task-Adaptive Pretrained Language Models via Clustered-Importance Sampling
Task-Adaptive Pretrained Language Models via Clustered-Importance Sampling
David Grangier
Simin Fan
Skyler Seto
Pierre Ablin
205
5
0
30 Sep 2024
Intertwined Biases Across Social Media Spheres: Unpacking Correlations
  in Media Bias Dimensions
Intertwined Biases Across Social Media Spheres: Unpacking Correlations in Media Bias Dimensions
Yifan Liu
Yike Li
Dong Wang
64
0
0
27 Aug 2024
Domain-specific or Uncertainty-aware models: Does it really make a
  difference for biomedical text classification?
Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification?
Aman Sinha
Timothee Mickus
Marianne Clausel
Mathieu Constant
X. Coubez
74
0
0
17 Jul 2024
MultiADE: A Multi-domain Benchmark for Adverse Drug Event Extraction
MultiADE: A Multi-domain Benchmark for Adverse Drug Event Extraction
Xiang Dai
Sarvnaz Karimi
Abeed Sarker
Ben Hachey
Cécile Paris
71
3
0
28 May 2024
Decomposing the Neurons: Activation Sparsity via Mixture of Experts for
  Continual Test Time Adaptation
Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation
Rongyu Zhang
Aosong Cheng
Yulin Luo
Gaole Dai
Huanrui Yang
...
Ran Xu
Li Du
Yuan Du
Yanbing Jiang
Shanghang Zhang
MoETTA
95
6
0
26 May 2024
Improve Knowledge Distillation via Label Revision and Data Selection
Improve Knowledge Distillation via Label Revision and Data Selection
Weichao Lan
Yiu-ming Cheung
Qing Xu
Buhua Liu
Zhikai Hu
Mengke Li
Zhenghua Chen
69
3
0
03 Apr 2024
Can Humans Identify Domains?
Can Humans Identify Domains?
Maria Barrett
Max Müller-Eberstein
Elisa Bassignana
Amalie Brogaard Pauli
Mike Zhang
Rob van der Goot
104
1
0
02 Apr 2024
Checkpoint Merging via Bayesian Optimization in LLM Pretraining
Checkpoint Merging via Bayesian Optimization in LLM Pretraining
Deyuan Liu
Zecheng Wang
Bingning Wang
Weipeng Chen
Chunshan Li
Zhiying Tu
Dianhui Chu
Bo Li
Dianbo Sui
MoMe
94
18
0
28 Mar 2024
MAGPIE: Multi-Task Media-Bias Analysis Generalization for Pre-Trained Identification of Expressions
MAGPIE: Multi-Task Media-Bias Analysis Generalization for Pre-Trained Identification of Expressions
Tomávs Horych
Martin Wessel
Jan Philip Wahle
Terry Ruas
Jerome Wassmuth
André Greiner-Petter
Akiko Aizawa
Bela Gipp
Timo Spinde
66
3
0
27 Feb 2024
How Useful is Continued Pre-Training for Generative Unsupervised Domain Adaptation?
How Useful is Continued Pre-Training for Generative Unsupervised Domain Adaptation?
Rheeya Uppaal
Yixuan Li
Junjie Hu
146
6
0
31 Jan 2024
Selecting Subsets of Source Data for Transfer Learning with Applications
  in Metal Additive Manufacturing
Selecting Subsets of Source Data for Transfer Learning with Applications in Metal Additive Manufacturing
Yifan Tang
M. Rahmani Dehaghani
Pouyan Sajadi
G. G. Wang
27
13
0
16 Jan 2024
Plug-and-Play Transformer Modules for Test-Time Adaptation
Plug-and-Play Transformer Modules for Test-Time Adaptation
Xiangyu Chang
Sk. Miraj Ahmed
S. Krishnamurthy
Başak Güler
A. Swami
Samet Oymak
Amit K. Roy-Chowdhury
120
0
0
06 Jan 2024
Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual
  Test-Time Adaptation
Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation
Jiaming Liu
Ran Xu
Senqiao Yang
Renrui Zhang
Qizhe Zhang
Zehui Chen
Yandong Guo
Shanghang Zhang
TTA
77
12
0
19 Dec 2023
Efficient Continual Pre-training for Building Domain Specific Large
  Language Models
Efficient Continual Pre-training for Building Domain Specific Large Language Models
Yong Xie
Karan Aggarwal
Aitzaz Ahmad
CLL
100
24
0
14 Nov 2023
Skill-it! A Data-Driven Skills Framework for Understanding and Training
  Language Models
Skill-it! A Data-Driven Skills Framework for Understanding and Training Language Models
Mayee F. Chen
Nicholas Roberts
Kush S. Bhatia
Jue Wang
Ce Zhang
Frederic Sala
Christopher Ré
SyDa
88
65
0
26 Jul 2023
Towards Robust and Efficient Continual Language Learning
Towards Robust and Efficient Continual Language Learning
Adam Fisch
Amal Rannen-Triki
Razvan Pascanu
J. Bornschein
Angeliki Lazaridou
E. Gribovskaya
MarcÁurelio Ranzato
CLL
51
1
0
11 Jul 2023
ViDA: Homeostatic Visual Domain Adapter for Continual Test Time
  Adaptation
ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation
Jiaming Liu
Senqiao Yang
Peidong Jia
Renrui Zhang
Ming Lu
Yandong Guo
Wei Xue
Shanghang Zhang
TTAOODVLM
103
40
0
07 Jun 2023
Uncertainty in Natural Language Processing: Sources, Quantification, and
  Applications
Uncertainty in Natural Language Processing: Sources, Quantification, and Applications
Mengting Hu
Zhen Zhang
Shiwan Zhao
Minlie Huang
Bingzhe Wu
BDL
98
39
0
05 Jun 2023
NLNDE at SemEval-2023 Task 12: Adaptive Pretraining and Source Language
  Selection for Low-Resource Multilingual Sentiment Analysis
NLNDE at SemEval-2023 Task 12: Adaptive Pretraining and Source Language Selection for Low-Resource Multilingual Sentiment Analysis
Mingyang Wang
Heike Adel
Lukas Lange
Jannik Strötgen
Hinrich Schütze
104
19
0
28 Apr 2023
A Data-centric Framework for Improving Domain-specific Machine Reading
  Comprehension Datasets
A Data-centric Framework for Improving Domain-specific Machine Reading Comprehension Datasets
I. Bojić
Josef Halim
Verena Suharman
Sreeja Tar
Qi Chwen Ong
Duy Phung
Mathieu Ravaut
Shafiq Joty
Josip Car
FedML
80
3
0
02 Apr 2023
Divergence-Based Domain Transferability for Zero-Shot Classification
Divergence-Based Domain Transferability for Zero-Shot Classification
Alexander Pugantsov
R. McCreadie
VLM
26
0
0
11 Feb 2023
Data Selection for Language Models via Importance Resampling
Data Selection for Language Models via Importance Resampling
Sang Michael Xie
Shibani Santurkar
Tengyu Ma
Percy Liang
131
196
0
06 Feb 2023
Rationale-Guided Few-Shot Classification to Detect Abusive Language
Rationale-Guided Few-Shot Classification to Detect Abusive Language
Punyajoy Saha
Divyanshu Sheth
Kushal Kedia
Binny Mathew
Animesh Mukherjee
49
3
0
30 Nov 2022
Analyzing Multi-Task Learning for Abstractive Text Summarization
Analyzing Multi-Task Learning for Abstractive Text Summarization
Frederic Kirstein
Jan Philip Wahle
Terry Ruas
Bela Gipp
68
4
0
26 Oct 2022
Automatic Document Selection for Efficient Encoder Pretraining
Automatic Document Selection for Efficient Encoder Pretraining
Yukun Feng
Patrick Xia
Benjamin Van Durme
João Sedoc
110
11
0
20 Oct 2022
Navigating Memory Construction by Global Pseudo-Task Simulation for
  Continual Learning
Navigating Memory Construction by Global Pseudo-Task Simulation for Continual Learning
Yejia Liu
Wang Zhu
Shaolei Ren
CLL
67
3
0
16 Oct 2022
Task Formulation Matters When Learning Continually: A Case Study in
  Visual Question Answering
Task Formulation Matters When Learning Continually: A Case Study in Visual Question Answering
Mavina Nikandrou
Lu Yu
Alessandro Suglia
Ioannis Konstas
Verena Rieser
OOD
76
5
0
30 Sep 2022
AdaPrompt: Adaptive Model Training for Prompt-based NLP
AdaPrompt: Adaptive Model Training for Prompt-based NLP
Yulong Chen
Yang Liu
Li Dong
Shuohang Wang
Chenguang Zhu
Michael Zeng
Yue Zhang
VLM
102
48
0
10 Feb 2022
Active Learning Over Multiple Domains in Natural Language Tasks
Active Learning Over Multiple Domains in Natural Language Tasks
Shayne Longpre
Julia Reisler
E. G. Huang
Yi Lu
Andrew J. Frank
Nikhil Ramesh
Chris DuBois
OOD
97
13
0
01 Feb 2022
A Survey on Visual Transfer Learning using Knowledge Graphs
A Survey on Visual Transfer Learning using Knowledge Graphs
Sebastian Monka
Lavdim Halilaj
Achim Rettinger
108
23
0
27 Jan 2022
AutoDistill: an End-to-End Framework to Explore and Distill
  Hardware-Efficient Language Models
AutoDistill: an End-to-End Framework to Explore and Distill Hardware-Efficient Language Models
Xiaofan Zhang
Zongwei Zhou
Deming Chen
Yu Emma Wang
81
11
0
21 Jan 2022
How Universal is Genre in Universal Dependencies?
How Universal is Genre in Universal Dependencies?
Max Müller-Eberstein
Rob van der Goot
Barbara Plank
51
6
0
09 Dec 2021
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning
V. Aribandi
Yi Tay
Tal Schuster
J. Rao
H. Zheng
...
Jianmo Ni
Jai Gupta
Kai Hui
Sebastian Ruder
Donald Metzler
MoE
123
216
0
22 Nov 2021
ClimateBert: A Pretrained Language Model for Climate-Related Text
ClimateBert: A Pretrained Language Model for Climate-Related Text
Nicolas Webersinke
Mathias Kraus
Jiabo Huang
Markus Leippold
AI4CE
107
144
0
22 Oct 2021
Focus on the Common Good: Group Distributional Robustness Follows
Focus on the Common Good: Group Distributional Robustness Follows
Vihari Piratla
Praneeth Netrapalli
Sunita Sarawagi
OOD
92
26
0
06 Oct 2021
Identifying Untrustworthy Samples: Data Filtering for Open-domain
  Dialogues with Bayesian Optimization
Identifying Untrustworthy Samples: Data Filtering for Open-domain Dialogues with Bayesian Optimization
Lei Shen
Haolan Zhan
Xin Shen
Hongshen Chen
Xiaofang Zhao
Xiao-Dan Zhu
83
17
0
14 Sep 2021
Few-Shot Cross-Lingual Stance Detection with Sentiment-Based
  Pre-Training
Few-Shot Cross-Lingual Stance Detection with Sentiment-Based Pre-Training
Momchil Hardalov
Arnav Arora
Preslav Nakov
Isabelle Augenstein
88
63
0
13 Sep 2021
GradTS: A Gradient-Based Automatic Auxiliary Task Selection Method Based
  on Transformer Networks
GradTS: A Gradient-Based Automatic Auxiliary Task Selection Method Based on Transformer Networks
Weicheng Ma
Renze Lou
Kai Zhang
Lili Wang
Soroush Vosoughi
51
8
0
13 Sep 2021
Genre as Weak Supervision for Cross-lingual Dependency Parsing
Genre as Weak Supervision for Cross-lingual Dependency Parsing
Max Müller-Eberstein
Rob van der Goot
Barbara Plank
225
19
0
10 Sep 2021
Contextualizing Variation in Text Style Transfer Datasets
Contextualizing Variation in Text Style Transfer Datasets
S. Schoch
Wanyu Du
Yangfeng Ji
62
5
0
17 Aug 2021
Cats, not CAT scans: a study of dataset similarity in transfer learning
  for 2D medical image classification
Cats, not CAT scans: a study of dataset similarity in transfer learning for 2D medical image classification
Irma van den Brandt
F. Fok
B. Mulders
Joaquin Vanschoren
Veronika Cheplygina
38
4
0
13 Jul 2021
Continual Learning in the Teacher-Student Setup: Impact of Task
  Similarity
Continual Learning in the Teacher-Student Setup: Impact of Task Similarity
Sebastian Lee
Sebastian Goldt
Andrew M. Saxe
CLL
86
74
0
09 Jul 2021
Adversarial Learning for Zero-Shot Stance Detection on Social Media
Adversarial Learning for Zero-Shot Stance Detection on Social Media
Emily Allaway
Malavika Srikanth
Kathleen McKeown
ObjDVLM
64
95
0
14 May 2021
Evaluating the Values of Sources in Transfer Learning
Evaluating the Values of Sources in Transfer Learning
Md. Rizwan Parvez
Kai-Wei Chang
73
18
0
26 Apr 2021
To Share or not to Share: Predicting Sets of Sources for Model Transfer
  Learning
To Share or not to Share: Predicting Sets of Sources for Model Transfer Learning
Lukas Lange
Jannik Strötgen
Heike Adel
Dietrich Klakow
68
12
0
16 Apr 2021
12
Next