Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2004.10964
Cited By
v1
v2
v3 (latest)
Don't Stop Pretraining: Adapt Language Models to Domains and Tasks
23 April 2020
Suchin Gururangan
Ana Marasović
Swabha Swayamdipta
Kyle Lo
Iz Beltagy
Doug Downey
Noah A. Smith
VLM
AI4CE
CLL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Don't Stop Pretraining: Adapt Language Models to Domains and Tasks"
50 / 1,369 papers shown
Enhancing Domain-Specific Encoder Models with LLM-Generated Data: How to Leverage Ontologies, and How to Do Without Them
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Marc Felix Brinner
Tarek Al Mustafa
Sina Zarrieß
308
5
0
27 Mar 2025
Low-resource Information Extraction with the European Clinical Case Corpus
Soumitra Ghosh
Begona Altuna
Saeed Farzi
Pietro Ferrazzi
A. Lavelli
Giulia Mezzanotte
Manuela Speranza
Bernardo Magnini
240
1
0
26 Mar 2025
AfroXLMR-Social: Adapting Pre-trained Language Models for African Languages Social Media Text
Tadesse Destaw Belay
Israel Abebe Azime
Ibrahim Said Ahmad
David Ifeoluwa Adelani
Idris Abdulmumin
Abinew Ali Ayele
Shamsuddeen Hassan Muhammad
Seid Muhie Yimam
455
1
0
24 Mar 2025
OmniScience: A Domain-Specialized LLM for Scientific Reasoning and Discovery
Vignesh Prabhakar
Md Amirul Islam
Adam Atanas
Longji Xu
J. N. Han
...
Rucha Apte
Robert Clark
Kang Xu
Zihan Wang
Kai Liu
LRM
561
15
0
22 Mar 2025
Towards Automatic Continual Learning: A Self-Adaptive Framework for Continual Instruction Tuning
Peiyi Lin
Fukai Zhang
Kai Niu
Hao Fu
CLL
303
0
0
20 Mar 2025
Covering Cracks in Content Moderation: Delexicalized Distant Supervision for Illicit Drug Jargon Detection
Knowledge Discovery and Data Mining (KDD), 2025
Minkyoo Song
Eugene Jang
Jaehan Kim
Seungwon Shin
190
0
0
19 Mar 2025
Fragile Mastery: Are Domain-Specific Trade-Offs Undermining On-Device Language Models?
Basab Jha
Firoj Paudel
195
0
0
16 Mar 2025
Neutralizing Bias in LLM Reasoning using Entailment Graphs
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Liang Cheng
Tianyi Li
Zhaowei Wang
Tianyang Liu
Mark Steedman
220
3
0
14 Mar 2025
Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection
Romain Thoreau
Valerio Marsocci
Dawa Derksen
AI4CE
332
6
0
12 Mar 2025
Domain Adaptation for Japanese Sentence Embeddings with Contrastive Learning based on Synthetic Sentence Generation
Zihao Chen
H. Handa
Miho Ohsaki
Kimiaki Shirahama
264
1
0
12 Mar 2025
Identity Lock: Locking API Fine-tuned LLMs With Identity-based Wake Words
Hongyu Su
Yifeng Gao
Yifan Ding
Jie Zhang
342
1
0
10 Mar 2025
From Style to Facts: Mapping the Boundaries of Knowledge Injection with Finetuning
Eric Zhao
Pranjal Awasthi
Nika Haghtalab
179
4
0
07 Mar 2025
A Dataset for Analysing News Framing in Chinese Media
International Conference on Web and Social Media (ICWSM), 2025
Owen Cook
Yida Mu
Xinye Yang
Xingyi Song
Kalina Bontcheva
270
1
0
06 Mar 2025
CareerBERT: Matching Resumes to ESCO Jobs in a Shared Embedding Space for Generic Job Recommendations
Expert systems with applications (ESWA), 2025
Julian Rosenberger
Lukas Wolfrum
Sven Weinzierl
Mathias Kraus
Patrick Zschech
249
14
0
03 Mar 2025
GRNFormer: A Biologically-Guided Framework for Integrating Gene Regulatory Networks into RNA Foundation Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Mufan Qiu
Xinyu Hu
Fengwei Zhan
Sukwon Yun
Jie Peng
Ruichen Zhang
B. Kailkhura
Jiekun Yang
Tianlong Chen
129
1
0
03 Mar 2025
Alchemist: Towards the Design of Efficient Online Continual Learning System
Yuyang Huang
Yuhan Liu
Haryadi S. Gunawi
Beibin Li
Changho Hwang
CLL
OnRL
397
1
0
03 Mar 2025
DUAL: Diversity and Uncertainty Active Learning for Text Summarization
Petros Stylianos Giouroukis
Alexios Gidiotis
Grigorios Tsoumakas
223
1
0
02 Mar 2025
Personalize Your LLM: Fake it then Align it
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Yijing Zhang
Dyah Adila
Changho Shin
Frederic Sala
519
6
0
02 Mar 2025
Autoencoder-Based Framework to Capture Vocabulary Quality in NLP
Vu Minh Hoang Dang
Rakesh M. Verma
145
0
0
28 Feb 2025
Unsupervised Parameter Efficient Source-free Post-pretraining
Abhishek Jha
Tinne Tuytelaars
Yuki M. Asano
OOD
272
0
0
28 Feb 2025
Neuroplasticity and Corruption in Model Mechanisms: A Case Study Of Indirect Object Identification
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Vishnu Kabir Chhabra
Ding Zhu
Mohammad Mahdi Khalili
326
5
0
27 Feb 2025
NaijaNLP: A Survey of Nigerian Low-Resource Languages
Isa Inuwa-Dutse
356
2
0
27 Feb 2025
A Survey of Model Architectures in Information Retrieval
Zhichao Xu
Fengran Mo
Zhiqi Huang
Crystina Zhang
Puxuan Yu
Bei Wang
Jimmy J. Lin
Vivek Srikumar
3DV
KELM
584
18
0
20 Feb 2025
Why does my medical AI look at pictures of birds? Exploring the efficacy of transfer learning across domain boundaries
F. Jonske
M. Kim
Enrico Nasca
J. Evers
Johannes Haubold
...
F. Nensa
Michael Kamp
C. Seibold
Jan Egger
Jens Kleesiek
333
4
0
17 Feb 2025
FinMTEB: Finance Massive Text Embedding Benchmark
Yixuan Tang
Yi Yang
AIFin
391
11
0
16 Feb 2025
Primus: A Pioneering Collection of Open-Source Datasets for Cybersecurity LLM Training
Yao-Ching Yu
Tsun-Han Chiang
Cheng-Wei Tsai
Chien-Ming Huang
Wen-Kwang Tsao
367
11
0
16 Feb 2025
Assessing the Impact of the Quality of Textual Data on Feature Representation and Machine Learning Models
Tabinda Sarwar
Antonio Jose Jimeno Yepes
Lawrence Cavedon
301
0
0
12 Feb 2025
RideKE: Leveraging Low-Resource, User-Generated Twitter Content for Sentiment and Emotion Detection in Kenyan Code-Switched Dataset
Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), 2025
Naome A. Etori
Maria Gini
668
5
0
10 Feb 2025
Privacy-Preserving Dataset Combination
Keren Fuentes
Mimee Xu
Irene Chen
357
0
0
09 Feb 2025
BTS: Harmonizing Specialized Experts into a Generalist LLM
Qizhen Zhang
Prajjwal Bhargava
Chloe Bi
Chris Cai
Jakob N. Foerster
...
Ruan Silva
Sheng Shen
Emily Dinan
Suchin Gururangan
M. Lewis
MoMe
155
2
0
31 Jan 2025
Detecting harassment and defamation in cyberbullying with emotion-adaptive training
International Conference on Web and Social Media (ICWSM), 2025
Peiling Yi
A. Zubiaga
Yunfei Long
391
2
0
28 Jan 2025
Speech Translation Refinement using Large Language Models
Huaixia Dou
Xinyu Tian
Xinglin Lyu
Jie Zhu
Junhui Li
Lifan Guo
962
1
0
28 Jan 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
Information Fusion (Inf. Fusion), 2023
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Xiaoshi Zhong
LM&MA
AILaw
726
269
0
28 Jan 2025
Distributional Surgery for Language Model Activations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Bao Nguyen
Binh Nguyen
Duy Nguyen
V. Nguyen
317
3
0
27 Jan 2025
Addressing Bias in Generative AI: Challenges and Research Opportunities in Information Management
Information Manager (The) (TIM), 2025
Xiahua Wei
Naveen Kumar
Han Zhang
323
42
0
22 Jan 2025
The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better
Neural Information Processing Systems (NeurIPS), 2024
Scott Geng
Cheng-Yu Hsieh
Vivek Ramanujan
Matthew Wallingford
Chun-Liang Li
Pang Wei Koh
Ranjay Krishna
DiffM
787
15
0
03 Jan 2025
INSIGHTBUDDY-AI: Medication Extraction and Entity Linking using Large Language Models and Ensemble Learning
Pablo Romero
Lifeng Han
Goran Nenadic
LM&MA
188
1
0
31 Dec 2024
Multimodal Fusion and Coherence Modeling for Video Topic Segmentation
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Hai Yu
Chong Deng
Qinglin Zhang
Jiaqing Liu
Qian Chen
Wen Wang
435
0
0
31 Dec 2024
On Adversarial Robustness of Language Models in Transfer Learning
Bohdan Turbal
Anastasiia Mazur
Jiaxu Zhao
Mykola Pechenizkiy
AAML
370
0
0
29 Dec 2024
SILC-EFSA: Self-aware In-context Learning Correction for Entity-level Financial Sentiment Analysis
International Conference on Computational Linguistics (COLING), 2024
Senbin Zhu
Chenyuan He
Hongde Liu
Pengcheng Dong
Hanjie Zhao
Yuchen Yan
Yuxiang Jia
Hongying Zan
Min Peng
180
0
0
26 Dec 2024
Evaluating Self-Supervised Learning in Medical Imaging: A Benchmark for Robustness, Generalizability, and Multi-Domain Impact
Valay Bundele
Karahan Sarıtaş
Bora Kargi
Oğuz Ata Çal
Kıvanç Tezören
Zohreh Ghaderi
Hendrik Lensch
OOD
232
6
0
26 Dec 2024
Reversed Attention: On The Gradient Descent Of Attention Layers In GPT
Shahar Katz
Lior Wolf
150
0
0
22 Dec 2024
Enriching Social Science Research via Survey Item Linking
Tornike Tsereteli
Daniel Ruffinelli
Simone Paolo Ponzetto
LRM
310
0
0
20 Dec 2024
HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding
Computer Vision and Pattern Recognition (CVPR), 2024
Chenxin Tao
Shiqian Su
X. Zhu
Chenyu Zhang
Zhe Chen
...
Wenhai Wang
Lewei Lu
Gao Huang
Yu Qiao
Jifeng Dai
MLLM
VLM
513
5
0
20 Dec 2024
ORBIT: Cost-Effective Dataset Curation for Large Language Model Domain Adaptation with an Astronomy Case Study
Eric Modesitt
Ke Yang
Spencer Hulsey
Chengxiang Zhai
Volodymyr Kindratenko
168
2
0
19 Dec 2024
Domain-adaptative Continual Learning for Low-resource Tasks: Evaluation on Nepali
Sharad Duwal
Suraj Prasai
Suresh Manandhar
CLL
309
3
0
18 Dec 2024
A recent evaluation on the performance of LLMs on radiation oncology physics using questions of randomly shuffled options
Frontiers in Oncology (Front Oncol), 2024
Peilong Wang
J. Holmes
Ziqiang Liu
Dequan Chen
Tianming Liu
Jiajian Shen
Wen Liu
LRM
LM&MA
ELM
456
9
0
14 Dec 2024
Adaptive Two-Phase Finetuning LLMs for Japanese Legal Text Retrieval
Quang Hoang Trung
Nguyen Van Hoang Phuc
Le Trung Hoang
Quang Huu Hieu
Vo Nguyen Le Duy
AILaw
RALM
298
1
0
03 Dec 2024
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Hai Ye
Yixin Ji
Ziyang Chen
Qiang Wang
Cunxiang Wang
...
Jia Xu
Zhongyi Liu
Jinjie Gu
Yuan Zhou
Linjian Mo
KELM
CLL
628
1
0
02 Dec 2024
Adapting Large Language Models to Log Analysis with Interpretable Domain Knowledge
Yuhe Ji
Yilun Liu
Feiyu Yao
Minggui He
Shimin Tao
...
Weibin Meng
Yuming Xie
Boxing Chen
Hao Yang
Yongqian Sun
405
14
0
02 Dec 2024
Previous
1
2
3
4
5
...
26
27
28
Next