arXiv: 2112.04426
Improving language models by retrieving from trillions of tokens

8 December 2021
Sebastian Borgeaud
A. Mensch
Jordan Hoffmann
Trevor Cai
Eliza Rutherford
Katie Millican
George van den Driessche
Jean-Baptiste Lespiau
Bogdan Damoc
Aidan Clark
Diego de Las Casas
Aurelia Guy
Jacob Menick
Roman Ring
Tom Hennigan
Saffron Huang
Lorenzo Maggiore
Chris Jones
Albin Cassirer
Andy Brock
Michela Paganini
G. Irving
Oriol Vinyals
Simon Osindero
Karen Simonyan
Jack W. Rae
Erich Elsen
Laurent Sifre
KELM, RALM

Papers citing "Improving language models by retrieving from trillions of tokens"

50 / 893 papers shown
Re-Search for The Truth: Multi-round Retrieval-augmented Large Language Models are Strong Fake News Detectors
Guanghua Li
Wensheng Lu
Wei Zhang
Defu Lian
Kezhong Lu
Rui Mao
Kai Shu
Hao Liao
HILM
192
15
0
14 Mar 2024
Development of a Reliable and Accessible Caregiving Language Model (CaLM)
B. Parmanto
Bayu Aryoyudanta
Wilbert Soekinto
Agus Setiawan
Yuhan Wang
Haomin Hu
Andi Saptono
Yong K Choi
111
5
0
11 Mar 2024
PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design
Wenqi Jiang
Shuai Zhang
Boran Han
Jie Wang
Bernie Wang
Tim Kraska
3DV
227
46
0
08 Mar 2024
LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Boshi Wang
Hao Fang
Jason Eisner
Benjamin Van Durme
Yu-Chuan Su
CLL
225
18
0
07 Mar 2024
RATSF: Empowering Customer Service Volume Management through Retrieval-Augmented Time-Series Forecasting
Tianfeng Wang
Gaojie Cui
AI4TS
246
1
0
07 Mar 2024
MeaCap: Memory-Augmented Zero-shot Image Captioning
Zequn Zeng
Yan Xie
Hao Zhang
Chiyu Chen
Zhengjue Wang
Boli Chen
VLM
303
46
0
06 Mar 2024
Reliable, Adaptable, and Attributable Language Models with Retrieval
Akari Asai
Zexuan Zhong
Danqi Chen
Pang Wei Koh
Luke Zettlemoyer
Hanna Hajishirzi
Anuj Kumar
KELM, RALM
322
82
0
05 Mar 2024
FakeNewsGPT4: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs
Xuannan Liu
Peipei Li
Huaibo Huang
Zekun Li
Xing Cui
Jiahao Liang
Lixiong Qin
Weihong Deng
Zhaofeng He
183
3
0
04 Mar 2024
Retrieval-Augmented Generation for AI-Generated Content: A Survey
Penghao Zhao
Hailin Zhang
Qinhan Yu
Zhengren Wang
Yunteng Geng
Fangcheng Fu
Ling Yang
Wentao Zhang
Jie Jiang
Tengjiao Wang
3DV
942
454
0
29 Feb 2024
RNNs are not Transformers (Yet): The Key Bottleneck on In-context Retrieval
Kaiyue Wen
Xingyu Dang
Kaifeng Lyu
417
48
0
28 Feb 2024
VerifiNER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models
Seoyeon Kim
Kwangwook Seo
Hyungjoo Chae
Jinyoung Yeo
Dongha Lee
161
8
0
28 Feb 2024
A Survey on Recent Advances in LLM-Based Multi-turn Dialogue Systems
Zihao Yi
Jiarui Ouyang
Yuwen Liu
Tianhao Liao
Haohao Luo
Ying Shen
LRM, LLMAG
396
152
0
28 Feb 2024
Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents
Corby Rosset
Ho-Lam Chung
Guanghui Qin
Ethan C. Chau
Zhuo Feng
Ahmed Hassan Awadallah
Jennifer Neville
Nikhil Rao
235
23
0
27 Feb 2024
JMLR: Joint Medical LLM and Retrieval Training for Enhancing Reasoning and Professional Question Answering Capability
Junda Wang
Zhichao Yang
Zonghai Yao
Hong-ye Yu
BDL, AI4MH, LRM
419
52
0
27 Feb 2024
Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems
Zhenting Qi
Hanlin Zhang
Eric Xing
Sham Kakade
Hima Lakkaraju
SILM
269
44
0
27 Feb 2024
Retrieval is Accurate Generation
Bowen Cao
Deng Cai
Leyang Cui
Xuxin Cheng
Wei Bi
Yuexian Zou
Shuming Shi
400
11
0
27 Feb 2024
Long-Context Language Modeling with Parallel Context Encoding
Howard Yen
Tianyu Gao
Danqi Chen
326
79
0
26 Feb 2024
LLM Inference Unveiled: Survey and Roofline Model Insights
Zhihang Yuan
Yuzhang Shang
Yang Zhou
Zhen Dong
Zhe Zhou
...
Yong Jae Lee
Yan Yan
Beidi Chen
Guangyu Sun
Kurt Keutzer
623
149
0
26 Feb 2024
RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records
Ran Xu
Wenqi Shi
Yue Yu
Yuchen Zhuang
Bowen Jin
Hang Wu
Joyce C. Ho
Carl Yang
265
22
0
25 Feb 2024
The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG)
Shenglai Zeng
Jiankun Zhang
Pengfei He
Yue Xing
Yiding Liu
...
Jie Ren
Shuaiqiang Wang
D. Yin
Yi Chang
Shucheng Zhou
SILM
348
138
0
23 Feb 2024
DEEM: Dynamic Experienced Expert Modeling for Stance Detection
Xiaolong Wang
Yile Wang
Sijie Cheng
Peng Li
Yang Liu
176
18
0
23 Feb 2024
Tug-of-War Between Knowledge: Exploring and Resolving Knowledge Conflicts in Retrieval-Augmented Language Models
Zhuoran Jin
Pengfei Cao
Yubo Chen
Kang Liu
Xiaojian Jiang
Jiexin Xu
Qiuxia Li
Jun Zhao
688
94
0
22 Feb 2024
OpenTab: Advancing Large Language Models as Open-domain Table Reasoners
Kezhi Kong
Jiani Zhang
Zhengyuan Shen
Ninad Kulkarni
Chuan Lei
Christos Faloutsos
Huzefa Rangwala
George Karypis
LMTD, ReLM, RALM, LRM
328
35
0
22 Feb 2024
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
Yiran Ding
Li Lyna Zhang
Chengruidong Zhang
Yuanyuan Xu
Ning Shang
Jiahang Xu
Fan Yang
Mao Yang
RALM
225
260
0
21 Feb 2024
ARL2: Aligning Retrievers for Black-box Large Language Models via Self-guided Adaptive Relevance Labeling
Lingxi Zhang
Yue Yu
Kuan-Chieh Wang
Chao Zhang
VLM, RALM
222
12
0
21 Feb 2024
RoCode: A Dataset for Measuring Code Intelligence from Problem Definitions in Romanian
Adrian Cosma
Ioan-Bogdan Iordache
Paolo Rosso
OffRL
136
4
0
20 Feb 2024
Instruction-tuned Language Models are Better Knowledge Learners
Zhengbao Jiang
Zhiqing Sun
Weijia Shi
Pedro Rodriguez
Chunting Zhou
Graham Neubig
Xi Lin
Anuj Kumar
Srinivasan Iyer
KELM
294
54
0
20 Feb 2024
Integrating kNN with Foundation Models for Adaptable and Privacy-Aware Image Classification
Sebastian Doerrich
Tobias Archut
Francesco Di Salvo
Christian Ledig
155
6
0
19 Feb 2024
BIDER: Bridging Knowledge Inconsistency for Efficient Retrieval-Augmented LLMs via Key Supporting Evidence
Jiajie Jin
Yutao Zhu
Yujia Zhou
Zhicheng Dou
RALM
306
32
0
19 Feb 2024
EVEDIT: Event-based Knowledge Editing with Deductive Editing Boundaries
Jiateng Liu
Pengfei Yu
Yuji Zhang
Sha Li
Zixuan Zhang
Heng Ji
KELM
212
20
0
17 Feb 2024
In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss
Yuri Kuratov
Aydar Bulatov
Petr Anokhin
Dmitry Sorokin
Artyom Sorokin
Andrey Kravchenko
RALM
378
41
0
16 Feb 2024
OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language Models
Ali AhmadiTeshnizi
Wenzhi Gao
Madeleine Udell
LLMAG
211
57
0
15 Feb 2024
Context Composing for Full Line Code Completion
Anton Semenkin
Yaroslav Sokolov
Evgeniia Vu
58
6
0
14 Feb 2024
Towards Faithful and Robust LLM Specialists for Evidence-Based Question-Answering
Tobias Schimanski
Jingwei Ni
Mathias Kraus
Elliott Ash
Markus Leippold
235
11
0
13 Feb 2024
Nearest Neighbour Score Estimators for Diffusion Generative Models
Matthew Niedoba
Dylan Green
Saeid Naderiparizi
Vasileios Lioutas
J. Lavington
...
Ke Zhang
Setareh Dabiri
Adam Scibior
Berend Zwartsenberg
Frank Wood
DiffM
182
6
0
12 Feb 2024
PoisonedRAG: Knowledge Poisoning Attacks to Retrieval-Augmented Generation of Large Language Models
Wei Zou
Runpeng Geng
Binghui Wang
Jinyuan Jia
SILM
426
45
1
12 Feb 2024
Retrieval-Augmented Thought Process as Sequential Decision Making
T. Pouplin
Hao Sun
Samuel Holt
M. Schaar
KELM, RALM, LRM
114
2
0
12 Feb 2024
Prompt Perturbation in Retrieval-Augmented Generation based Large Language Models
Knowledge Discovery and Data Mining (KDD), 2024
Zhibo Hu
Chen Wang
Yanfeng Shu
Helen Paik
Liming Zhu
SILM, RALM
213
27
0
11 Feb 2024
ProtIR: Iterative Refinement between Retrievers and Predictors for Protein Function Annotation
Zuobai Zhang
Jiarui Lu
Vijil Chenthamarakshan
Aurélie C. Lozano
Payel Das
Jian Tang
163
1
0
10 Feb 2024
Large Language Models: A Survey
Shervin Minaee
Tomas Mikolov
Narjes Nikzad
M. Asgari-Chenaghlu
R. Socher
Xavier Amatriain
Jianfeng Gao
ALM, LM&MA, ELM
839
762
0
09 Feb 2024
Memory Consolidation Enables Long-Context Video Understanding
Ivana Balažević
Yuge Shi
Pinelopi Papalampidi
Rahma Chaabouni
Skanda Koppula
Olivier J. Hénaff
461
46
0
08 Feb 2024
DFA-RAG: Conversational Semantic Router for Large Language Model with Definite Finite Automaton
Yiyou Sun
Junjie Hu
Wei Cheng
Haifeng Chen
RALM, AI4CE
394
2
0
06 Feb 2024
Retrieve to Explain: Evidence-driven Predictions for Explainable Drug Target Identification
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Ravi Patel
Angus Brayne
Rogier E Hintzen
Daniel Jaroslawicz
Georgiana Neculae
Dane S. Corneil
260
2
0
06 Feb 2024
LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K
Tao Yuan
Xuefei Ning
Dong Zhou
Zhijie Yang
Shiyao Li
...
Dahua Lin
Boxun Li
Guohao Dai
Shengen Yan
Yu Wang
ALM
347
58
0
06 Feb 2024
Retrieval-Augmented Score Distillation for Text-to-3D Generation
International Conference on Machine Learning (ICML), 2024
Junyoung Seo
Susung Hong
Wooseok Jang
Ines Hyeonsu Kim
Minseop Kwak
Doyup Lee
Seungryong Kim
277
13
0
05 Feb 2024
IllusionX: An LLM-powered mixed reality personal companion
Ramez Yousri
Zeyad Essam
Yehia Kareem
Youstina Sherief
Sherry Gamil
Soha Safwat
197
10
0
04 Feb 2024
Factuality of Large Language Models in the Year 2024
Yuxia Wang
Minghan Wang
Muhammad Arslan Manzoor
Fei Liu
Georgi Georgiev
Rocktim Jyoti Das
Preslav Nakov
LRM, HILM
218
7
0
04 Feb 2024
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities
Zhifeng Kong
Arushi Goel
Rohan Badlani
Ming-Yu Liu
Rafael Valle
Bryan Catanzaro
AuLLM, LM&MA, MLLM
515
162
0
02 Feb 2024
Retrieval Augmented End-to-End Spoken Dialog Models
Mingqiu Wang
Izhak Shafran
H. Soltau
Wei Han
Yuan Cao
Dian Yu
Laurent El Shafey
RALM, AuLLM
214
22
0
02 Feb 2024
CorpusLM: Towards a Unified Language Model on Corpus for Knowledge-Intensive Tasks
Xiaoxi Li
Zhicheng Dou
Yujia Zhou
Fangchao Liu
RALM
204
25
0
02 Feb 2024