arXiv: 2310.09949
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models


15 October 2023
Wenqi Jiang
Marco Zeller
R. Waleffe
Torsten Hoefler
Gustavo Alonso

Papers citing "Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models"

14 / 14 papers shown
Understanding and Optimizing Multi-Stage AI Inference Pipelines
A. Bambhaniya
Hanjiang Wu
Suvinay Subramanian
S. Srinivasan
Souvik Kundu
Amir Yazdanbakhsh
Midhilesh Elavazhagan
Madhu Kumar
Tushar Krishna
24
0
0
14 Apr 2025
An Adaptive Vector Index Partitioning Scheme for Low-Latency RAG Pipeline
J. Kim
Divya Mahajan
VLM
29
0
0
11 Apr 2025
TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval
Chien-Yu Lin
Keisuke Kamahori
Yiyu Liu
Xiaoxiang Shi
Madhav Kashyap
...
Stephanie Wang
Arvind Krishnamurthy
Rohan Kadekodi
Luis Ceze
Baris Kasikci
3DV
VLM
42
0
0
28 Feb 2025
CANDY: A Benchmark for Continuous Approximate Nearest Neighbor Search with Dynamic Data Ingestion
Xianzhi Zeng
Zhuoyan Wu
Xinjing Hu
Xuanhua Shi
Shixuan Sun
Shuhao Zhang
19
1
0
28 Jun 2024
A Survey on Retrieval-Augmented Text Generation for Large Language Models
Yizheng Huang
Jimmy X. Huang
3DV
RALM
30
33
0
17 Apr 2024
PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design
Wenqi Jiang
Shuai Zhang
Boran Han
Jie Wang
Bernie Wang
Tim Kraska
3DV
61
23
0
08 Mar 2024
Large Language Models for Information Retrieval: A Survey
Yutao Zhu
Huaying Yuan
Shuting Wang
Jiongnan Liu
Wenhan Liu
Chenlong Deng
Haonan Chen
Zhicheng Dou
Ji-Rong Wen
KELM
18
258
0
14 Aug 2023
Co-design Hardware and Algorithm for Vector Search
Wenqi Jiang
Shigang Li
Yu Zhu
Johannes de Fine Licht
Zhenhao He
...
Cédric Renggli
Shuai Zhang
Theodoros Rekatsinas
Torsten Hoefler
Gustavo Alonso
55
8
0
19 Jun 2023
The Dark Side of ChatGPT: Legal and Ethical Challenges from Stochastic Parrots and Hallucination
Z. Li
AILaw
SILM
23
23
0
21 Apr 2023
Efficient Quantized Sparse Matrix Operations on Tensor Cores
Shigang Li
Kazuki Osawa
Torsten Hoefler
61
26
0
14 Sep 2022
TPU-KNN: K Nearest Neighbor Search at Peak FLOP/s
Felix Chern
Blake A. Hechtman
Andy Davis
Ruiqi Guo
David Majnemer
Surinder Kumar
64
16
0
28 Jun 2022
Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval
Uri Alon
Frank F. Xu
Junxian He
Sudipta Sengupta
Dan Roth
Graham Neubig
RALM
67
62
0
28 Jan 2022
Internet-Augmented Dialogue Generation
M. Komeili
Kurt Shuster
Jason Weston
RALM
226
278
0
15 Jul 2021
Extracting Training Data from Large Language Models
Nicholas Carlini
Florian Tramèr
Eric Wallace
Matthew Jagielski
Ariel Herbert-Voss
...
Tom B. Brown
D. Song
Ulfar Erlingsson
Alina Oprea
Colin Raffel
MLAU
SILM
261
1,386
0
14 Dec 2020