Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2004.10964
Cited By
v1
v2
v3 (latest)
Don't Stop Pretraining: Adapt Language Models to Domains and Tasks
23 April 2020
Suchin Gururangan
Ana Marasović
Swabha Swayamdipta
Kyle Lo
Iz Beltagy
Doug Downey
Noah A. Smith
VLM
AI4CE
CLL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Don't Stop Pretraining: Adapt Language Models to Domains and Tasks"
50 / 1,369 papers shown
Enhancing Domain-Specific Encoder Models with LLM-Generated Data: How to Leverage Ontologies, and How to Do Without Them
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Marc Felix Brinner
Tarek Al Mustafa
Sina Zarrieß
296
5
0
27 Mar 2025
Low-resource Information Extraction with the European Clinical Case Corpus
Soumitra Ghosh
Begona Altuna
Saeed Farzi
Pietro Ferrazzi
A. Lavelli
Giulia Mezzanotte
Manuela Speranza
Bernardo Magnini
229
1
0
26 Mar 2025
AfroXLMR-Social: Adapting Pre-trained Language Models for African Languages Social Media Text
Tadesse Destaw Belay
Israel Abebe Azime
Ibrahim Said Ahmad
David Ifeoluwa Adelani
Idris Abdulmumin
Abinew Ali Ayele
Shamsuddeen Hassan Muhammad
Seid Muhie Yimam
452
1
0
24 Mar 2025
OmniScience: A Domain-Specialized LLM for Scientific Reasoning and Discovery
Vignesh Prabhakar
Md Amirul Islam
Adam Atanas
Longji Xu
J. N. Han
...
Rucha Apte
Robert Clark
Kang Xu
Zihan Wang
Kai Liu
LRM
545
15
0
22 Mar 2025
Towards Automatic Continual Learning: A Self-Adaptive Framework for Continual Instruction Tuning
Peiyi Lin
Fukai Zhang
Kai Niu
Hao Fu
CLL
301
0
0
20 Mar 2025
Covering Cracks in Content Moderation: Delexicalized Distant Supervision for Illicit Drug Jargon Detection
Knowledge Discovery and Data Mining (KDD), 2025
Minkyoo Song
Eugene Jang
Jaehan Kim
Seungwon Shin
180
0
0
19 Mar 2025
Fragile Mastery: Are Domain-Specific Trade-Offs Undermining On-Device Language Models?
Basab Jha
Firoj Paudel
195
0
0
16 Mar 2025
Neutralizing Bias in LLM Reasoning using Entailment Graphs
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Liang Cheng
Tianyi Li
Zhaowei Wang
Tianyang Liu
Mark Steedman
219
3
0
14 Mar 2025
Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection
Romain Thoreau
Valerio Marsocci
Dawa Derksen
AI4CE
330
6
0
12 Mar 2025
Domain Adaptation for Japanese Sentence Embeddings with Contrastive Learning based on Synthetic Sentence Generation
Zihao Chen
H. Handa
Miho Ohsaki
Kimiaki Shirahama
254
1
0
12 Mar 2025
Identity Lock: Locking API Fine-tuned LLMs With Identity-based Wake Words
Hongyu Su
Yifeng Gao
Yifan Ding
Jie Zhang
338
1
0
10 Mar 2025
From Style to Facts: Mapping the Boundaries of Knowledge Injection with Finetuning
Eric Zhao
Pranjal Awasthi
Nika Haghtalab
172
4
0
07 Mar 2025
A Dataset for Analysing News Framing in Chinese Media
International Conference on Web and Social Media (ICWSM), 2025
Owen Cook
Yida Mu
Xinye Yang
Xingyi Song
Kalina Bontcheva
265
1
0
06 Mar 2025
CareerBERT: Matching Resumes to ESCO Jobs in a Shared Embedding Space for Generic Job Recommendations
Expert systems with applications (ESWA), 2025
Julian Rosenberger
Lukas Wolfrum
Sven Weinzierl
Mathias Kraus
Patrick Zschech
239
12
0
03 Mar 2025
GRNFormer: A Biologically-Guided Framework for Integrating Gene Regulatory Networks into RNA Foundation Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Mufan Qiu
Xinyu Hu
Fengwei Zhan
Sukwon Yun
Jie Peng
Ruichen Zhang
B. Kailkhura
Jiekun Yang
Tianlong Chen
129
1
0
03 Mar 2025
Alchemist: Towards the Design of Efficient Online Continual Learning System
Yuyang Huang
Yuhan Liu
Haryadi S. Gunawi
Beibin Li
Changho Hwang
CLL
OnRL
397
1
0
03 Mar 2025
DUAL: Diversity and Uncertainty Active Learning for Text Summarization
Petros Stylianos Giouroukis
Alexios Gidiotis
Grigorios Tsoumakas
213
1
0
02 Mar 2025
Personalize Your LLM: Fake it then Align it
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Yijing Zhang
Dyah Adila
Changho Shin
Frederic Sala
503
6
0
02 Mar 2025
Autoencoder-Based Framework to Capture Vocabulary Quality in NLP
Vu Minh Hoang Dang
Rakesh M. Verma
142
0
0
28 Feb 2025
Unsupervised Parameter Efficient Source-free Post-pretraining
Abhishek Jha
Tinne Tuytelaars
Yuki M. Asano
OOD
272
0
0
28 Feb 2025
Neuroplasticity and Corruption in Model Mechanisms: A Case Study Of Indirect Object Identification
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Vishnu Kabir Chhabra
Ding Zhu
Mohammad Mahdi Khalili
325
5
0
27 Feb 2025
NaijaNLP: A Survey of Nigerian Low-Resource Languages
Isa Inuwa-Dutse
355
2
0
27 Feb 2025
A Survey of Model Architectures in Information Retrieval
Zhichao Xu
Fengran Mo
Zhiqi Huang
Crystina Zhang
Puxuan Yu
Bei Wang
Jimmy J. Lin
Vivek Srikumar
3DV
KELM
578
17
0
20 Feb 2025
Why does my medical AI look at pictures of birds? Exploring the efficacy of transfer learning across domain boundaries
F. Jonske
M. Kim
Enrico Nasca
J. Evers
Johannes Haubold
...
F. Nensa
Michael Kamp
C. Seibold
Jan Egger
Jens Kleesiek
331
4
0
17 Feb 2025
FinMTEB: Finance Massive Text Embedding Benchmark
Yixuan Tang
Yi Yang
AIFin
391
10
0
16 Feb 2025
Primus: A Pioneering Collection of Open-Source Datasets for Cybersecurity LLM Training
Yao-Ching Yu
Tsun-Han Chiang
Cheng-Wei Tsai
Chien-Ming Huang
Wen-Kwang Tsao
363
11
0
16 Feb 2025
Assessing the Impact of the Quality of Textual Data on Feature Representation and Machine Learning Models
Tabinda Sarwar
Antonio Jose Jimeno Yepes
Lawrence Cavedon
297
0
0
12 Feb 2025
RideKE: Leveraging Low-Resource, User-Generated Twitter Content for Sentiment and Emotion Detection in Kenyan Code-Switched Dataset
Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), 2025
Naome A. Etori
Maria Gini
666
5
0
10 Feb 2025
Privacy-Preserving Dataset Combination
Keren Fuentes
Mimee Xu
Irene Chen
351
0
0
09 Feb 2025
BTS: Harmonizing Specialized Experts into a Generalist LLM
Qizhen Zhang
Prajjwal Bhargava
Chloe Bi
Chris Cai
Jakob N. Foerster
...
Ruan Silva
Sheng Shen
Emily Dinan
Suchin Gururangan
M. Lewis
MoMe
153
2
0
31 Jan 2025
Detecting harassment and defamation in cyberbullying with emotion-adaptive training
International Conference on Web and Social Media (ICWSM), 2025
Peiling Yi
A. Zubiaga
Yunfei Long
388
2
0
28 Jan 2025
Speech Translation Refinement using Large Language Models
Huaixia Dou
Xinyu Tian
Xinglin Lyu
Jie Zhu
Junhui Li
Lifan Guo
958
1
0
28 Jan 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
Information Fusion (Inf. Fusion), 2023
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Xiaoshi Zhong
LM&MA
AILaw
724
267
0
28 Jan 2025
Distributional Surgery for Language Model Activations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Bao Nguyen
Binh Nguyen
Duy Nguyen
V. Nguyen
314
3
0
27 Jan 2025
Addressing Bias in Generative AI: Challenges and Research Opportunities in Information Management
Information Manager (The) (TIM), 2025
Xiahua Wei
Naveen Kumar
Han Zhang
299
41
0
22 Jan 2025
The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better
Neural Information Processing Systems (NeurIPS), 2024
Scott Geng
Cheng-Yu Hsieh
Vivek Ramanujan
Matthew Wallingford
Chun-Liang Li
Pang Wei Koh
Ranjay Krishna
DiffM
774
15
0
03 Jan 2025
INSIGHTBUDDY-AI: Medication Extraction and Entity Linking using Large Language Models and Ensemble Learning
Pablo Romero
Lifeng Han
Goran Nenadic
LM&MA
187
1
0
31 Dec 2024
Multimodal Fusion and Coherence Modeling for Video Topic Segmentation
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Hai Yu
Chong Deng
Qinglin Zhang
Jiaqing Liu
Qian Chen
Wen Wang
430
0
0
31 Dec 2024
On Adversarial Robustness of Language Models in Transfer Learning
Bohdan Turbal
Anastasiia Mazur
Jiaxu Zhao
Mykola Pechenizkiy
AAML
369
0
0
29 Dec 2024
SILC-EFSA: Self-aware In-context Learning Correction for Entity-level Financial Sentiment Analysis
International Conference on Computational Linguistics (COLING), 2024
Senbin Zhu
Chenyuan He
Hongde Liu
Pengcheng Dong
Hanjie Zhao
Yuchen Yan
Yuxiang Jia
Hongying Zan
Min Peng
166
0
0
26 Dec 2024
Evaluating Self-Supervised Learning in Medical Imaging: A Benchmark for Robustness, Generalizability, and Multi-Domain Impact
Valay Bundele
Karahan Sarıtaş
Bora Kargi
Oğuz Ata Çal
Kıvanç Tezören
Zohreh Ghaderi
Hendrik Lensch
OOD
232
6
0
26 Dec 2024
Reversed Attention: On The Gradient Descent Of Attention Layers In GPT
Shahar Katz
Lior Wolf
142
0
0
22 Dec 2024
Enriching Social Science Research via Survey Item Linking
Tornike Tsereteli
Daniel Ruffinelli
Simone Paolo Ponzetto
LRM
309
0
0
20 Dec 2024
HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding
Computer Vision and Pattern Recognition (CVPR), 2024
Chenxin Tao
Shiqian Su
X. Zhu
Chenyu Zhang
Zhe Chen
...
Wenhai Wang
Lewei Lu
Gao Huang
Yu Qiao
Jifeng Dai
MLLM
VLM
500
5
0
20 Dec 2024
ORBIT: Cost-Effective Dataset Curation for Large Language Model Domain Adaptation with an Astronomy Case Study
Eric Modesitt
Ke Yang
Spencer Hulsey
Chengxiang Zhai
Volodymyr Kindratenko
165
2
0
19 Dec 2024
Domain-adaptative Continual Learning for Low-resource Tasks: Evaluation on Nepali
Sharad Duwal
Suraj Prasai
Suresh Manandhar
CLL
301
3
0
18 Dec 2024
A recent evaluation on the performance of LLMs on radiation oncology physics using questions of randomly shuffled options
Frontiers in Oncology (Front Oncol), 2024
Peilong Wang
J. Holmes
Ziqiang Liu
Dequan Chen
Tianming Liu
Jiajian Shen
Wen Liu
LRM
LM&MA
ELM
443
9
0
14 Dec 2024
Adaptive Two-Phase Finetuning LLMs for Japanese Legal Text Retrieval
Quang Hoang Trung
Nguyen Van Hoang Phuc
Le Trung Hoang
Quang Huu Hieu
Vo Nguyen Le Duy
AILaw
RALM
297
1
0
03 Dec 2024
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Hai Ye
Yixin Ji
Ziyang Chen
Qiang Wang
Cunxiang Wang
...
Jia Xu
Zhongyi Liu
Jinjie Gu
Yuan Zhou
Linjian Mo
KELM
CLL
618
1
0
02 Dec 2024
Adapting Large Language Models to Log Analysis with Interpretable Domain Knowledge
Yuhe Ji
Yilun Liu
Feiyu Yao
Minggui He
Shimin Tao
...
Weibin Meng
Yuming Xie
Boxing Chen
Hao Yang
Yongqian Sun
405
14
0
02 Dec 2024
Previous
1
2
3
4
5
...
26
27
28
Next