v1v2 (latest)

Knowledge Fusion of Large Language Models

19 January 2024

Wei Bi

ArXiv (abs)PDF HTML HuggingFace (5 upvotes)Github (569★)

Papers citing "Knowledge Fusion of Large Language Models"

50 / 69 papers shown

ColMate: Contrastive Late Interaction and Masked Text for Multimodal Document Retrieval

Ahmed Masry

Megh Thakkar

Patrice Bechard

Sathwik Tejaswi Madhusudhan

...

228

02 Nov 2025

AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis

203

28 Oct 2025

Teaming LLMs to Detect and Mitigate Hallucinations

380

22 Oct 2025

Beyond Single Models: Mitigating Multimodal Hallucinations via Adaptive Token Ensemble Decoding

176

21 Oct 2025

Lossless Vocabulary Reduction for Auto-Regressive Language Models

142

09 Oct 2025

TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA

Chanjoo Jung

Jaehyung Kim

199

06 Oct 2025

Making, not Taking, the Best of N

240

01 Oct 2025

Mixture of Thoughts: Learning to Aggregate What Experts Think, Not Just What They Say

216

25 Sep 2025

Probabilistic Token Alignment for Large Language Model Fusion

...

207

21 Sep 2025

World Model Implanting for Test-time Adaptation of Embodied Agents

149

04 Sep 2025

IAENet: An Importance-Aware Ensemble Model for 3D Point Cloud-Based Anomaly Detection

131

28 Aug 2025

A Taxonomy of Transcendence

158

25 Aug 2025

Industrial LLM-based Code Optimization under Regulation: A Mixture-of-Agents Approach

240

05 Aug 2025

Bohdi: Heterogeneous LLM Fusion with Automatic Data Exploration

448

04 Jun 2025

Linear Representation Transferability Hypothesis: Leveraging Small Models to Steer Large Models

345

31 May 2025

LightRouter: Towards Efficient LLM Collaboration with Minimal Overhead

262

22 May 2025

InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models

378

20 May 2025

A Survey on Collaborative Mechanisms Between Large and Small Language Models

Yi Chen

JiaHao Zhao

HaoHao Han

467

12 May 2025

A Weighted Byzantine Fault Tolerance Consensus Driven Trusted Multiple Large Language Models NetworkIEEE Transactions on Cognitive Communications and Networking (TCCN), 2025

271

08 May 2025

Towards Harnessing the Collaborative Power of Large and Small Models for Domain Tasks

...

1.1K

24 Apr 2025

DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging

370

16 Apr 2025

A Dual-Space Framework for General Knowledge Distillation of Large Language Models

411

15 Apr 2025

Leveraging Submodule Linearity Enhances Task Arithmetic Performance in LLMsInternational Conference on Learning Representations (ICLR), 2025

349

15 Apr 2025

FuseRL: Dense Preference Optimization for Heterogeneous Model Fusion

391

09 Apr 2025

LeForecast: Enterprise Hybrid Forecast by Time Series Intelligence

...

303

27 Mar 2025

Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling

389

24 Mar 2025

Ensemble Learning for Large Language Models in Text and Code Generation: A Survey

430

13 Mar 2025

Collaborative Speculative Inference for Efficient LLM Inference Serving

422

13 Mar 2025

System 0/1/2/3: Quad-process theory for multi-timescale embodied collective cognitive systems

380

08 Mar 2025

Rethinking Data: Towards Better Performing Domain-Specific Small Language Models

312

03 Mar 2025

Scalable Model Merging with Progressive Layer-wise Distillation

713

18 Feb 2025

Speculate, then Collaborate: Fusing Knowledge of Language Models during Decoding

476

11 Feb 2025

TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language ModelsInternational Conference on Learning Representations (ICLR), 2025

650

28 Jan 2025

Multi-Task Model Merging via Adaptive Weight Disentanglement

670

10 Jan 2025

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

1.1K

675

03 Jan 2025

Copyright-Protected Language Generation via Adaptive Model FusionInternational Conference on Learning Representations (ICLR), 2024

374

09 Dec 2024

Enhancing Perception Capabilities of Multimodal LLMs with Training-Free Fusion

463

02 Dec 2024

H3Fusion: Helpful, Harmless, Honest Fusion of Aligned LLMs

447

26 Nov 2024

Exploring Model Kinship for Merging Large Language Models

503

16 Oct 2024

Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence

...

339

15 Oct 2024

Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model EnsemblesInternational Conference on Learning Representations (ICLR), 2024

Itai Gat

306

11 Oct 2024

Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the WildNeural Information Processing Systems (NeurIPS), 2024

...

Binhang Yuan

Hongyi Wang

Ang Li

Zhangyang Wang

Tianlong Chen

MoMe ALM

407

07 Oct 2024

What Matters for Model Merging at Scale?

Prateek Yadav

Tu Vu

Jonathan Lai

Alexandra Chronopoulou

Manaal Faruqui

Joey Tianyi Zhou

Tsendsuren Munkhdalai

MoMe

296

04 Oct 2024

Parameter Competition Balancing for Model MergingNeural Information Processing Systems (NeurIPS), 2024

Jing Li

...

Min Zhang

275

03 Oct 2024

Determine-Then-Ensemble: Necessity of Top-k Union for Large Language Model EnsemblingInternational Conference on Learning Representations (ICLR), 2024

277

03 Oct 2024

Disentangling Latent Shifts of In-Context Learning with Weak Supervision

Josip Jukić

Jan Snajder

365

02 Oct 2024

SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models

Anke Tang

Li Shen

Yong Luo

Shuai Xie

Han Hu

Lefei Zhang

Di Lin

Dacheng Tao

MoMe

396

19 Aug 2024

FuseChat: Knowledge Fusion of Chat Models

Xiaojun Quan

402

15 Aug 2024

Computer Audition: From Task-Specific Machine Learning to Foundation Models

Andreas Triantafyllopoulos

455

22 Jul 2024

Recent Advances in Generative AI and Large Language Models: Current Status, Challenges, and Perspectives

D. Hagos

Rick Battle

Danda B. Rawat

LM&MA OffRL

568

20 Jul 2024