Knowledge Fusion of Large Language Models

19 January 2024

Wei Bi

ArXiv (abs) · PDF · HTML · HuggingFace (5 upvotes) · GitHub (569★)

Papers citing "Knowledge Fusion of Large Language Models"

50 / 69 papers shown

ColMate: Contrastive Late Interaction and Masked Text for Multimodal Document Retrieval

Ahmed Masry

Megh Thakkar

Patrice Bechard

Sathwik Tejaswi Madhusudhan

...

191

02 Nov 2025

AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis

164

28 Oct 2025

Teaming LLMs to Detect and Mitigate Hallucinations

320

22 Oct 2025

Beyond Single Models: Mitigating Multimodal Hallucinations via Adaptive Token Ensemble Decoding

151

21 Oct 2025

Lossless Vocabulary Reduction for Auto-Regressive Language Models

104

09 Oct 2025

TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA

Chanjoo Jung

Jaehyung Kim

149

06 Oct 2025

Making, not Taking, the Best of N

143

01 Oct 2025

Mixture of Thoughts: Learning to Aggregate What Experts Think, Not Just What They Say

177

25 Sep 2025

Probabilistic Token Alignment for Large Language Model Fusion

...

164

21 Sep 2025

World Model Implanting for Test-time Adaptation of Embodied Agents

119

04 Sep 2025

IAENet: An Importance-Aware Ensemble Model for 3D Point Cloud-Based Anomaly Detection

117

28 Aug 2025

A Taxonomy of Transcendence

133

25 Aug 2025

Industrial LLM-based Code Optimization under Regulation: A Mixture-of-Agents Approach

144

05 Aug 2025

Bohdi: Heterogeneous LLM Fusion with Automatic Data Exploration

399

04 Jun 2025

Linear Representation Transferability Hypothesis: Leveraging Small Models to Steer Large Models

286

31 May 2025

LightRouter: Towards Efficient LLM Collaboration with Minimal Overhead

226

22 May 2025

InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models

294

20 May 2025

A Survey on Collaborative Mechanisms Between Large and Small Language Models

Yi Chen

JiaHao Zhao

HaoHao Han

380

12 May 2025

A Weighted Byzantine Fault Tolerance Consensus Driven Trusted Multiple Large Language Models Network

IEEE Transactions on Cognitive Communications and Networking (TCCN), 2025

228

08 May 2025

Towards Harnessing the Collaborative Power of Large and Small Models for Domain Tasks

...

1.0K

24 Apr 2025

DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging

340

16 Apr 2025

Leveraging Submodule Linearity Enhances Task Arithmetic Performance in LLMs

International Conference on Learning Representations (ICLR), 2025

314

15 Apr 2025

A Dual-Space Framework for General Knowledge Distillation of Large Language Models

374

15 Apr 2025

FuseRL: Dense Preference Optimization for Heterogeneous Model Fusion

321

09 Apr 2025

LeForecast: Enterprise Hybrid Forecast by Time Series Intelligence

...

245

27 Mar 2025

Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling

338

24 Mar 2025

Ensemble Learning for Large Language Models in Text and Code Generation: A Survey

332

13 Mar 2025

Collaborative Speculative Inference for Efficient LLM Inference Serving

329

13 Mar 2025

System 0/1/2/3: Quad-process theory for multi-timescale embodied collective cognitive systems

345

08 Mar 2025

Rethinking Data: Towards Better Performing Domain-Specific Small Language Models

248

03 Mar 2025

Scalable Model Merging with Progressive Layer-wise Distillation

645

18 Feb 2025

Speculate, then Collaborate: Fusing Knowledge of Language Models during Decoding

427

11 Feb 2025

TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

International Conference on Learning Representations (ICLR), 2025

582

28 Jan 2025

Multi-Task Model Merging via Adaptive Weight Disentanglement

582

10 Jan 2025

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

931

571

03 Jan 2025

Copyright-Protected Language Generation via Adaptive Model Fusion

International Conference on Learning Representations (ICLR), 2024

342

09 Dec 2024

Enhancing Perception Capabilities of Multimodal LLMs with Training-Free Fusion

369

02 Dec 2024

H3Fusion: Helpful, Harmless, Honest Fusion of Aligned LLMs

363

26 Nov 2024

Exploring Model Kinship for Merging Large Language Models

473

16 Oct 2024

Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence

...

309

15 Oct 2024

Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model Ensembles

International Conference on Learning Representations (ICLR), 2024

Itai Gat

272

11 Oct 2024

Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild

Neural Information Processing Systems (NeurIPS), 2024

...

Binhang Yuan

Hongyi Wang

Ang Li

Zhangyang Wang

Tianlong Chen

MoMe ALM

331

07 Oct 2024

What Matters for Model Merging at Scale?

Prateek Yadav

Tu Vu

Jonathan Lai

Alexandra Chronopoulou

Manaal Faruqui

Joey Tianyi Zhou

Tsendsuren Munkhdalai

MoMe

269

04 Oct 2024

Parameter Competition Balancing for Model Merging

Neural Information Processing Systems (NeurIPS), 2024

Jing Li

...

Min Zhang

244

03 Oct 2024

Determine-Then-Ensemble: Necessity of Top-k Union for Large Language Model Ensembling

International Conference on Learning Representations (ICLR), 2024

235

03 Oct 2024

Disentangling Latent Shifts of In-Context Learning with Weak Supervision

Josip Jukić

Jan Snajder

297

02 Oct 2024

SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models

Anke Tang

Li Shen

Yong Luo

Shuai Xie

Han Hu

Lefei Zhang

Di Lin

Dacheng Tao

MoMe

310

19 Aug 2024

FuseChat: Knowledge Fusion of Chat Models

Xiaojun Quan

357

15 Aug 2024

Computer Audition: From Task-Specific Machine Learning to Foundation Models

Andreas Triantafyllopoulos

403

22 Jul 2024

Recent Advances in Generative AI and Large Language Models: Current Status, Challenges, and Perspectives

D. Hagos

Rick Battle

Danda B. Rawat

LM&MA OffRL

485

20 Jul 2024