Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.02412
Cited By
LLM Augmented LLMs: Expanding Capabilities through Composition
4 January 2024
Rachit Bansal
Bidisha Samanta
Siddharth Dalmia
Nitish Gupta
Shikhar Vashishth
Sriram Ganapathy
Abhishek Bapna
Prateek Jain
Partha P. Talukdar
CLL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LLM Augmented LLMs: Expanding Capabilities through Composition"
26 / 26 papers shown
Title
Refine Knowledge of Large Language Models via Adaptive Contrastive Learning
Yinghui Li
Haojing Huang
Jiayi Kuang
Yangning Li
Shu Guo
C. Qu
Xiaoyu Tan
Hai-Tao Zheng
Ying Shen
Philip S. Yu
CLL
66
5
0
11 Feb 2025
LUSIFER: Language Universal Space Integration for Enhanced Multilingual Embeddings with Large Language Models
Hieu Man
Nghia Trung Ngo
Viet Dac Lai
Ryan Rossi
Franck Dernoncourt
T. Nguyen
76
0
0
01 Jan 2025
Bridge-Coder: Unlocking LLMs' Potential to Overcome Language Gaps in Low-Resource Code
Jipeng Zhang
Jianshu Zhang
Yuanzhe Li
Renjie Pi
Rui Pan
Runtao Liu
Ziqiang Zheng
Tong Zhang
36
0
0
24 Oct 2024
Unconstrained Model Merging for Enhanced LLM Reasoning
Yiming Zhang
Baoyi He
Shengyu Zhang
Yuhao Fu
Qi Zhou
...
Guanghan Ning
Linyi Li
Chunlin Ji
Fei Wu
Hongxia Yang
MoMe
27
0
0
17 Oct 2024
Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence
Shangbin Feng
Zifeng Wang
Yike Wang
Sayna Ebrahimi
Hamid Palangi
...
Nathalie Rauschmayr
Yejin Choi
Yulia Tsvetkov
Chen-Yu Lee
Tomas Pfister
MoMe
30
3
0
15 Oct 2024
Realistic Evaluation of Model Merging for Compositional Generalization
Derek Tam
Yash Kant
Brian Lester
Igor Gilitschenski
Colin Raffel
MoMe
16
5
0
26 Sep 2024
Flexible and Effective Mixing of Large Language Models into a Mixture of Domain Experts
Rhui Dih Lee
L. Wynter
R. Ganti
MoE
34
1
0
30 Aug 2024
Cool-Fusion: Fuse Large Language Models without Training
Cong Liu
Xiaojun Quan
Yan Pan
Liangzhi Li
Weigang Wu
Xu Chen
VLM
MoMe
46
3
0
29 Jul 2024
Computer Audition: From Task-Specific Machine Learning to Foundation Models
Andreas Triantafyllopoulos
Iosif Tsangko
Alexander Gebhard
A. Mesaros
Tuomas Virtanen
Björn Schuller
39
4
0
22 Jul 2024
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces
Zehan Wang
Ziang Zhang
Hang Zhang
Luping Liu
Rongjie Huang
Xize Cheng
Hengshuang Zhao
Zhou Zhao
30
7
0
16 Jul 2024
AspirinSum: an Aspect-based utility-preserved de-identification Summarization framework
Ya-Lun Li
35
0
0
20 Jun 2024
Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities
Vicky Zayats
Peter Chen
Melissa Ferrari
Dirk Padfield
AI4CE
30
0
0
29 May 2024
MindMerger: Efficient Boosting LLM Reasoning in non-English Languages
Zixian Huang
Wenhao Zhu
Gong Cheng
Lei Li
Fei Yuan
LRM
24
8
0
27 May 2024
Why Not Transform Chat Large Language Models to Non-English?
Xiang Geng
Ming Zhu
Jiahuan Li
Zhejian Lai
Wei Zou
...
Xinglin Lyu
Min Zhang
Jiajun Chen
Hao Yang
Shujian Huang
32
2
0
22 May 2024
Position: Leverage Foundational Models for Black-Box Optimization
Xingyou Song
Yingtao Tian
Robert Tjarko Lange
Chansoo Lee
Yujin Tang
Yutian Chen
38
5
0
06 May 2024
Relay Decoding: Concatenating Large Language Models for Machine Translation
Chengpeng Fu
Xiaocheng Feng
Yi-Chong Huang
Wenshuai Huo
Baohang Li
Hui Wang
Bing Qin
Ting Liu
24
0
0
05 May 2024
Creative Problem Solving in Large Language and Vision Models -- What Would it Take?
Lakshmi Nair
Evana Gizzi
Jivko Sinapov
MLLM
48
2
0
02 May 2024
Arcee's MergeKit: A Toolkit for Merging Large Language Models
Charles Goddard
Shamane Siriwardhana
Malikeh Ehghaghi
Luke Meyers
Vladimir Karpukhin
Brian Benedict
Mark McQuade
Jacob Solawetz
MoMe
KELM
75
75
0
20 Mar 2024
Semiparametric Token-Sequence Co-Supervision
Hyunji Lee
Doyoung Kim
Jihoon Jun
Se June Joo
Joel Jang
Kyoung-Woon On
Minjoon Seo
25
0
0
14 Mar 2024
DAM: Dynamic Adapter Merging for Continual Video QA Learning
Feng Cheng
Ziyang Wang
Yi-Lin Sung
Yan-Bo Lin
Mohit Bansal
Gedas Bertasius
CLL
MoMe
26
10
0
13 Mar 2024
Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models
Ning Ding
Yulin Chen
Ganqu Cui
Xingtai Lv
Weilin Zhao
Ruobing Xie
Bowen Zhou
Zhiyuan Liu
Maosong Sun
ALM
MoMe
AI4CE
33
7
0
13 Mar 2024
Large Multi-Modal Models (LMMs) as Universal Foundation Models for AI-Native Wireless Systems
Shengzhe Xu
Christo Kurisummoottil Thomas
Omar Hashash
Nikhil Muralidhar
Walid Saad
Naren Ramakrishnan
21
21
0
30 Jan 2024
LangBridge: Multilingual Reasoning Without Multilingual Supervision
Dongkeun Yoon
Joel Jang
Sungdong Kim
Seungone Kim
Sheikh Shafayat
Minjoon Seo
LRM
22
14
0
19 Jan 2024
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Sébastien Bubeck
Varun Chandrasekaran
Ronen Eldan
J. Gehrke
Eric Horvitz
...
Scott M. Lundberg
Harsha Nori
Hamid Palangi
Marco Tulio Ribeiro
Yi Zhang
ELM
AI4MH
AI4CE
ALM
215
2,232
0
22 Mar 2023
Language Models are Multilingual Chain-of-Thought Reasoners
Freda Shi
Mirac Suzgun
Markus Freitag
Xuezhi Wang
Suraj Srivats
...
Yi Tay
Sebastian Ruder
Denny Zhou
Dipanjan Das
Jason W. Wei
ReLM
LRM
167
320
0
06 Oct 2022
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation
Shuai Lu
Daya Guo
Shuo Ren
Junjie Huang
Alexey Svyatkovskiy
...
Nan Duan
Neel Sundaresan
Shao Kun Deng
Shengyu Fu
Shujie Liu
ELM
196
853
0
09 Feb 2021
1