Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.13035
Cited By
D2O: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models
18 June 2024
Zhongwei Wan
Xinjian Wu
Yu Zhang
Yi Xin
Chaofan Tao
Z. Zhu
Xin Wang
Siqi Luo
Jing Xiong
Mi Zhang
Mi Zhang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"D2O: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models"
9 / 9 papers shown
Title
RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference
Y. Chen
J. Zhang
Baotong Lu
Qianxi Zhang
Chengruidong Zhang
...
Chen Chen
Mingxing Zhang
Yuqing Yang
Fan Yang
Mao Yang
32
0
0
05 May 2025
Cognitive Memory in Large Language Models
Lianlei Shan
Shixian Luo
Zezhou Zhu
Yu Yuan
Yong Wu
LLMAG
KELM
58
1
0
03 Apr 2025
SnapKV: LLM Knows What You are Looking for Before Generation
Yuhong Li
Yingbing Huang
Bowen Yang
Bharat Venkitesh
Acyr F. Locatelli
Hanchen Ye
Tianle Cai
Patrick Lewis
Deming Chen
VLM
73
148
0
22 Apr 2024
SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression
Xin Wang
Yu Zheng
Zhongwei Wan
Mi Zhang
MQ
53
43
0
12 Mar 2024
Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement
Che Liu
Zhongwei Wan
Ouyang Cheng
Anand Shah
Wenjia Bai
Rossella Arcucci
28
26
0
11 Mar 2024
No Token Left Behind: Reliable KV Cache Compression via Importance-Aware Mixed Precision Quantization
J. Yang
Byeongwook Kim
Jeongin Bae
Beomseok Kwon
Gunho Park
Eunho Yang
S. Kwon
Dongsoo Lee
MQ
31
12
0
28 Feb 2024
Towards Uncovering How Large Language Model Works: An Explainability Perspective
Haiyan Zhao
Fan Yang
Bo Shen
Himabindu Lakkaraju
Mengnan Du
30
10
0
16 Feb 2024
The Falcon Series of Open Language Models
Ebtesam Almazrouei
Hamza Alobeidli
Abdulaziz Alshamsi
Alessandro Cappelli
Ruxandra-Aimée Cojocaru
...
Quentin Malartic
Daniele Mazzotta
Badreddine Noune
B. Pannier
Guilherme Penedo
AI4TS
ALM
104
389
0
28 Nov 2023
Self-consistent Reasoning For Solving Math Word Problems
Jing Xiong
Zhongwei Wan
Xiping Hu
Min Yang
Chengming Li
ReLM
LRM
40
10
0
27 Oct 2022
1