v1v2v3 (latest)

Don't Stop Pretraining: Adapt Language Models to Domains and Tasks

23 April 2020

Kyle Lo

Papers citing "Don't Stop Pretraining: Adapt Language Models to Domains and Tasks"

50 / 1,369 papers shown

Enhancing Domain-Specific Encoder Models with LLM-Generated Data: How to Leverage Ontologies, and How to Do Without ThemConference on Empirical Methods in Natural Language Processing (EMNLP), 2025

Marc Felix Brinner

Tarek Al Mustafa

Sina Zarrieß

308

27 Mar 2025

Low-resource Information Extraction with the European Clinical Case Corpus

240

26 Mar 2025

AfroXLMR-Social: Adapting Pre-trained Language Models for African Languages Social Media Text

Tadesse Destaw Belay

Israel Abebe Azime

Ibrahim Said Ahmad

David Ifeoluwa Adelani

Idris Abdulmumin

Abinew Ali Ayele

Shamsuddeen Hassan Muhammad

Seid Muhie Yimam

455

24 Mar 2025

OmniScience: A Domain-Specialized LLM for Scientific Reasoning and Discovery

...

561

22 Mar 2025

Towards Automatic Continual Learning: A Self-Adaptive Framework for Continual Instruction Tuning

303

20 Mar 2025

Covering Cracks in Content Moderation: Delexicalized Distant Supervision for Illicit Drug Jargon DetectionKnowledge Discovery and Data Mining (KDD), 2025

190

19 Mar 2025

Fragile Mastery: Are Domain-Specific Trade-Offs Undermining On-Device Language Models?

Basab Jha

Firoj Paudel

195

16 Mar 2025

Neutralizing Bias in LLM Reasoning using Entailment GraphsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

220

14 Mar 2025

Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection

332

12 Mar 2025

Domain Adaptation for Japanese Sentence Embeddings with Contrastive Learning based on Synthetic Sentence Generation

264

12 Mar 2025

Identity Lock: Locking API Fine-tuned LLMs With Identity-based Wake Words

342

10 Mar 2025

From Style to Facts: Mapping the Boundaries of Knowledge Injection with Finetuning

Eric Zhao

Pranjal Awasthi

Nika Haghtalab

179

07 Mar 2025

A Dataset for Analysing News Framing in Chinese MediaInternational Conference on Web and Social Media (ICWSM), 2025

270

06 Mar 2025

CareerBERT: Matching Resumes to ESCO Jobs in a Shared Embedding Space for Generic Job RecommendationsExpert systems with applications (ESWA), 2025

249

03 Mar 2025

GRNFormer: A Biologically-Guided Framework for Integrating Gene Regulatory Networks into RNA Foundation ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

129

03 Mar 2025

Alchemist: Towards the Design of Efficient Online Continual Learning System

397

03 Mar 2025

DUAL: Diversity and Uncertainty Active Learning for Text Summarization

Petros Stylianos Giouroukis

Alexios Gidiotis

Grigorios Tsoumakas

223

02 Mar 2025

Personalize Your LLM: Fake it then Align itNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

519

02 Mar 2025

Autoencoder-Based Framework to Capture Vocabulary Quality in NLP

Vu Minh Hoang Dang

Rakesh M. Verma

145

28 Feb 2025

Unsupervised Parameter Efficient Source-free Post-pretraining

272

28 Feb 2025

Neuroplasticity and Corruption in Model Mechanisms: A Case Study Of Indirect Object IdentificationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

Vishnu Kabir Chhabra

Ding Zhu

Mohammad Mahdi Khalili

326

27 Feb 2025

NaijaNLP: A Survey of Nigerian Low-Resource Languages

Isa Inuwa-Dutse

356

27 Feb 2025

A Survey of Model Architectures in Information Retrieval

584

20 Feb 2025

Why does my medical AI look at pictures of birds? Exploring the efficacy of transfer learning across domain boundaries

...

333

17 Feb 2025

FinMTEB: Finance Massive Text Embedding Benchmark

Yixuan Tang

Yi Yang

AIFin

391

16 Feb 2025

Primus: A Pioneering Collection of Open-Source Datasets for Cybersecurity LLM Training

367

16 Feb 2025

Assessing the Impact of the Quality of Textual Data on Feature Representation and Machine Learning Models

Tabinda Sarwar

Antonio Jose Jimeno Yepes

Lawrence Cavedon

301

12 Feb 2025

RideKE: Leveraging Low-Resource, User-Generated Twitter Content for Sentiment and Emotion Detection in Kenyan Code-Switched DatasetWorkshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), 2025

Naome A. Etori

Maria Gini

668

10 Feb 2025

Privacy-Preserving Dataset Combination

Keren Fuentes

Mimee Xu

Irene Chen

357

09 Feb 2025

BTS: Harmonizing Specialized Experts into a Generalist LLM

...

155

31 Jan 2025

Detecting harassment and defamation in cyberbullying with emotion-adaptive trainingInternational Conference on Web and Social Media (ICWSM), 2025

Peiling Yi

A. Zubiaga

Yunfei Long

391

28 Jan 2025

Speech Translation Refinement using Large Language Models

962

28 Jan 2025

A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and EthicsInformation Fusion (Inf. Fusion), 2023

726

269

28 Jan 2025

Distributional Surgery for Language Model ActivationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2025

317

27 Jan 2025

Addressing Bias in Generative AI: Challenges and Research Opportunities in Information ManagementInformation Manager (The) (TIM), 2025

Xiahua Wei

Naveen Kumar

Han Zhang

323

22 Jan 2025

The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs BetterNeural Information Processing Systems (NeurIPS), 2024

787

03 Jan 2025

INSIGHTBUDDY-AI: Medication Extraction and Entity Linking using Large Language Models and Ensemble Learning

188

31 Dec 2024

Multimodal Fusion and Coherence Modeling for Video Topic SegmentationAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

435

31 Dec 2024

On Adversarial Robustness of Language Models in Transfer Learning

370

29 Dec 2024

SILC-EFSA: Self-aware In-context Learning Correction for Entity-level Financial Sentiment AnalysisInternational Conference on Computational Linguistics (COLING), 2024

180

26 Dec 2024

Evaluating Self-Supervised Learning in Medical Imaging: A Benchmark for Robustness, Generalizability, and Multi-Domain Impact

232

26 Dec 2024

Reversed Attention: On The Gradient Descent Of Attention Layers In GPT

Shahar Katz

Lior Wolf

150

22 Dec 2024

Enriching Social Science Research via Survey Item Linking

Tornike Tsereteli

Daniel Ruffinelli

Simone Paolo Ponzetto

LRM

310

20 Dec 2024

HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language EmbeddingComputer Vision and Pattern Recognition (CVPR), 2024

...

513

20 Dec 2024

ORBIT: Cost-Effective Dataset Curation for Large Language Model Domain Adaptation with an Astronomy Case Study

Volodymyr Kindratenko

168

19 Dec 2024

Domain-adaptative Continual Learning for Low-resource Tasks: Evaluation on Nepali

309

18 Dec 2024

A recent evaluation on the performance of LLMs on radiation oncology physics using questions of randomly shuffled optionsFrontiers in Oncology (Front Oncol), 2024

456

14 Dec 2024

Adaptive Two-Phase Finetuning LLMs for Japanese Legal Text Retrieval

Quang Hoang Trung

Nguyen Van Hoang Phuc

298

03 Dec 2024

CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial SearchNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

...

628

02 Dec 2024

Adapting Large Language Models to Log Analysis with Interpretable Domain Knowledge

...

405

02 Dec 2024