v1v2 (latest)

A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

15 May 2025

Jean-Philippe Corbeil

ArXiv (abs)PDF HTML HuggingFace (2 upvotes)

Papers citing "A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment"

50 / 52 papers shown

Additive Large Language Models for Semi-Structured Text

Karthikeyan K

Raghuveer Thirukovalluru

David Edwin Carlson

108

14 Nov 2025

Hearing Health in Home Healthcare: Leveraging LLMs for Illness Scoring and ALMs for Vocal Biomarker Extraction

...

153

20 Oct 2025

Understanding the Effects of Domain Finetuning on LLMs

130

10 Oct 2025

H-DDx: A Hierarchical Evaluation Framework for Differential Diagnosis

144

04 Oct 2025

Understanding Post-Training Structural Changes in Large Language Models

Xinyu He

Xianghui Cao

158

22 Sep 2025

Large language models surpass domain-specific architectures for antepartum electronic fetal monitoring analysis

143

09 Sep 2025

MedRiskEval: Medical Risk Evaluation Benchmark of Language Models, On the Importance of User Perspectives in Healthcare Settings

Jean-Philippe Corbeil

Francois Beaulieu

Paul Vozila

LM&MA

177

09 Jul 2025

MeDiSumQA: Patient-Oriented Question-Answer Generation from Discharge Letters

144

05 Feb 2025

From Medprompt to o1: Exploration of Run-Time Strategies for Medical Challenge Problems and Beyond

281

06 Nov 2024

Scalable Data Ablation Approximations for Language Models through Modular Training and MergingConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Jesse Dodge

158

21 Oct 2024

What Matters for Model Merging at Scale?

Prateek Yadav

Tu Vu

Jonathan Lai

Alexandra Chronopoulou

Manaal Faruqui

Joey Tianyi Zhou

Tsendsuren Munkhdalai

MoMe

269

04 Oct 2024

Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining for Clinical LLMsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Clément Christophe

Tathagata Raha

Svetlana Maslenkova

Muhammad Umar Salman

Praveen K Kanithi

Marco AF Pimentel

Shadab Khan

LM&MA

161

23 Sep 2024

Med42-v2: A Suite of Clinical LLMs

Clément Christophe

Praveen K Kanithi

Tathagata Raha

Shadab Khan

Marco AF Pimentel

ELM LM&MA AI4MH

233

12 Aug 2024

Consent in Crisis: The Rapid Decline of the AI Data Commons

...

346

20 Jul 2024

AgentInstruct: Toward Generative Teaching with Agentic Flows

...

Ahmed Awadallah

440

03 Jul 2024

WARP: On the Benefits of Weight Averaged Rewarded Policies

311

24 Jun 2024

Model Merging and Safety Alignment: One Bad Model Spoils the Bunch

Mete Ozay

274

20 Jun 2024

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Yejin Choi

Bill Yuchen Lin

SyDa

354

257

12 Jun 2024

Aloe: A Family of Fine-tuned Open Healthcare LLMs

Ashwin Kumar Gururajan

...

Lucia Urcelay-Ganzabal

Marta Gonzalez-Mallo

Sergio Alvarez-Napagao

Eduard Ayguadé-Parra

Ulises Cortés Dario Garcia-Gasulla

ELM LM&MA

311

03 May 2024

Hippocrates: An Open-Source Framework for Advancing Large Language Models in Healthcare

234

25 Apr 2024

IryoNLP at MEDIQA-CORR 2024: Tackling the Medical Error Detection & Correction Task On the Shoulders of Medical Agents

Jean-Philippe Corbeil

165

23 Apr 2024

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Ahmed Hassan Awadallah

...

Yue Zhang

593

1,887

22 Apr 2024

No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model PerformanceNeural Information Processing Systems (NeurIPS), 2024

Vishaal Udandarao

Christian Schroeder de Witt

705

04 Apr 2024

Arcee's MergeKit: A Toolkit for Merging Large Language Models

704

171

20 Mar 2024

Instruction-tuned Language Models are Better Knowledge Learners

Weijia Shi

Graham Neubig

293

20 Feb 2024

BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains

479

364

15 Feb 2024

LongHealth: A Question Answering Benchmark with Long Clinical Documents

Lisa Christine Adams

Felix Busch

T. Han

Jean-Baptiste Excoffier

Matthieu Ortala

Alexander Loser

Hugo J. W. L. Aerts

Jakob Nikolas Kather

Daniel Truhn

Keno Bressem

ELM LM&MA AI4MH

231

25 Jan 2024

Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling LawsInternational Conference on Machine Learning (ICML), 2023

989

122

31 Dec 2023

Model Breadcrumbs: Scaling Multi-Task Model Merging with Sparse Masks

Mohammad-Javad Davari

Eugene Belilovsky

MoMe

262

11 Dec 2023

Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine

...

LM&MA AI4MH MedIm ELM

245

438

28 Nov 2023

Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free LunchInternational Conference on Machine Learning (ICML), 2023

555

492

06 Nov 2023

AlpaCare:Instruction-tuned Large Language Models for Medical Application

460

23 Oct 2023

Textbooks Are All You Need II: phi-1.5 technical report

473

587

11 Sep 2023

Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical NotesAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Junu Kim

...

381

01 Sep 2023

Instruction Tuning for Large Language Models: A Survey

...

Jiwei Li

920

765

21 Aug 2023

ACI-BENCH: a Novel Ambient Clinical Intelligence Dataset for Benchmarking Automatic Visit Note GenerationScientific Data (Sci Data), 2023

213

127

03 Jun 2023

TIES-Merging: Resolving Interference When Merging ModelsNeural Information Processing Systems (NeurIPS), 2023

378

520

02 Jun 2023

Enhancing Chat Language Models by Scaling High-quality Instructional ConversationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Zhiyuan Liu

Maosong Sun

Bowen Zhou

ALM

365

747

23 May 2023

The Flan Collection: Designing Data and Methods for Effective Instruction TuningInternational Conference on Machine Learning (ICML), 2023

...

409

849

31 Jan 2023

Large Language Models Encode Clinical KnowledgeNature (Nature), 2022

...

Alan Karthikesalingam

Vivek Natarajan

LM&MA ELM AI4MH

602

3,407

26 Dec 2022

Editing Models with Task ArithmeticInternational Conference on Learning Representations (ICLR), 2022

1.2K

740

08 Dec 2022

Will we run out of data? Limits of LLM scaling based on human-generated data

308

198

26 Oct 2022

Fine-tuned Language Models are Continual LearnersConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Thomas Scialom

Tuhin Chakrabarty

Smaranda Muresan

CLL LRM

492

152

24 May 2022

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference timeInternational Conference on Machine Learning (ICML), 2022

Raphael Gontijo-Lopes

...

728

1,281

10 Mar 2022

Linear Mode Connectivity in Multitask and Continual LearningInternational Conference on Learning Representations (ICLR), 2020

289

169

09 Oct 2020

What Disease does this Patient Have? A Large-scale Open Domain Question Answering Dataset from Medical ExamsApplied Sciences (Appl. Sci.), 2020

420

1,263

28 Sep 2020

Measuring Massive Multitask Language UnderstandingInternational Conference on Learning Representations (ICLR), 2020

2.3K

6,566

07 Sep 2020

Don't Stop Pretraining: Adapt Language Models to Domains and Tasks

Kyle Lo

573

2,740

23 Apr 2020

Linear Mode Connectivity and the Lottery Ticket HypothesisInternational Conference on Machine Learning (ICML), 2019

Jonathan Frankle

Gintare Karolina Dziugaite

Daniel M. Roy

Michael Carbin

MoMe

728

702

11 Dec 2019

PubMedQA: A Dataset for Biomedical Research Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2019

763

1,293

13 Sep 2019