Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2404.06910
Cited By
Superposition Prompting: Improving and Accelerating Retrieval-Augmented Generation
10 April 2024
Thomas Merth
Qichen Fu
Mohammad Rastegari
Mahyar Najibi
LRM
RALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Superposition Prompting: Improving and Accelerating Retrieval-Augmented Generation"
9 / 9 papers shown
Title
Accelerating Causal Network Discovery of Alzheimer Disease Biomarkers via Scientific Literature-based Retrieval Augmented Generation
Xiaofan Zhou
Liangjie Huang
Pinyang Cheng
Wenpen Yin
Rui Zhang
Wenrui Hao
Lu Cheng
16
0
0
01 Apr 2025
RAG-Modulo: Solving Sequential Tasks using Experience, Critics, and Language Models
Abhinav Jain
Chris Jermaine
Vaibhav Unhelkar
KELM
LLMAG
21
1
0
18 Sep 2024
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference
Qichen Fu
Minsik Cho
Thomas Merth
Sachin Mehta
Mohammad Rastegari
Mahyar Najibi
28
25
0
19 Jul 2024
OpenELM: An Efficient Language Model Family with Open Training and Inference Framework
Sachin Mehta
Mohammad Hossein Sekhavat
Qingqing Cao
Maxwell Horton
Yanzi Jin
...
Iman Mirzadeh
Mahyar Najibi
Dmitry Belenko
Peter Zatloukal
Mohammad Rastegari
OSLM
AIFin
38
49
0
22 Apr 2024
Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers
Kunal Sawarkar
Abhilasha Mangal
Shivam Raj Solanki
44
40
0
22 Mar 2024
LEMMA: Towards LVLM-Enhanced Multimodal Misinformation Detection with External Knowledge Augmentation
Keyang Xuan
Li Yi
Fan Yang
Ruochen Wu
Yi Ren Fung
Heng Ji
21
11
0
19 Feb 2024
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Akari Asai
Zeqiu Wu
Yizhong Wang
Avirup Sil
Hannaneh Hajishirzi
RALM
138
600
0
17 Oct 2023
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
Ying Sheng
Lianmin Zheng
Binhang Yuan
Zhuohan Li
Max Ryabinin
...
Joseph E. Gonzalez
Percy Liang
Christopher Ré
Ion Stoica
Ce Zhang
138
208
0
13 Mar 2023
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
Ofir Press
Noah A. Smith
M. Lewis
234
690
0
27 Aug 2021
1