ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.04426
  4. Cited By
Improving language models by retrieving from trillions of tokens
v1v2v3 (latest)

Improving language models by retrieving from trillions of tokens

8 December 2021
Sebastian Borgeaud
A. Mensch
Jordan Hoffmann
Trevor Cai
Eliza Rutherford
Katie Millican
George van den Driessche
Jean-Baptiste Lespiau
Bogdan Damoc
Aidan Clark
Diego de Las Casas
Aurelia Guy
Jacob Menick
Roman Ring
Tom Hennigan
Saffron Huang
Lorenzo Maggiore
Chris Jones
Albin Cassirer
Andy Brock
Michela Paganini
G. Irving
Oriol Vinyals
Simon Osindero
Karen Simonyan
Jack W. Rae
Erich Elsen
Laurent Sifre
    KELMRALM
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "Improving language models by retrieving from trillions of tokens"

50 / 893 papers shown
Efficient Prompt Caching via Embedding Similarity
Efficient Prompt Caching via Embedding Similarity
Hanlin Zhu
Banghua Zhu
Jiantao Jiao
RALM
206
10
0
02 Feb 2024
Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM
  Collaboration
Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration
Shangbin Feng
Weijia Shi
Yike Wang
Wenxuan Ding
Vidhisha Balachandran
Yulia Tsvetkov
325
164
0
01 Feb 2024
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Parth Sarthi
Salman Abdullah
Aditi Tuli
Shubh Khanna
Anna Goldie
Christopher D. Manning
RALM
465
296
0
31 Jan 2024
LOCOST: State-Space Models for Long Document Abstractive Summarization
LOCOST: State-Space Models for Long Document Abstractive Summarization
Florian Le Bronnec
Song Duong
Mathieu Ravaut
Alexandre Allauzen
Nancy F. Chen
Vincent Guigue
Alberto Lumbreras
Laure Soulier
Patrick Gallinari
421
15
0
31 Jan 2024
Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens
Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens
Hamish Ivison
Sewon Min
Luke Zettlemoyer
Yejin Choi
Hannaneh Hajishirzi
469
102
0
30 Jan 2024
MultiHop-RAG: Benchmarking Retrieval-Augmented Generation for Multi-Hop
  Queries
MultiHop-RAG: Benchmarking Retrieval-Augmented Generation for Multi-Hop Queries
Yixuan Tang
Yi Yang
RALM
316
190
0
27 Jan 2024
Equipping Language Models with Tool Use Capability for Tabular Data
  Analysis in Finance
Equipping Language Models with Tool Use Capability for Tabular Data Analysis in FinanceConference of the European Chapter of the Association for Computational Linguistics (EACL), 2024
Adrian Theuma
Ehsan Shareghi
118
17
0
27 Jan 2024
A RAG-based Question Answering System Proposal for Understanding Islam: MufassirQAS LLM
A RAG-based Question Answering System Proposal for Understanding Islam: MufassirQAS LLM
Ahmet Yusuf Alan
Enis Karaarslan
Ömer Aydin
321
1
0
27 Jan 2024
The Power of Noise: Redefining Retrieval for RAG Systems
The Power of Noise: Redefining Retrieval for RAG SystemsAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2024
Florin Cuconasu
Giovanni Trappolini
F. Siciliano
Simone Filice
Cesare Campagnano
Y. Maarek
Nicola Tonellotto
Fabrizio Silvestri
RALM
613
305
0
26 Jan 2024
Accelerating Retrieval-Augmented Language Model Serving with Speculation
Accelerating Retrieval-Augmented Language Model Serving with Speculation
Zhihao Zhang
Alan Zhu
Lijie Yang
Yihua Xu
Lanting Li
P. Phothilimthana
Zhihao Jia
RALMKELM
260
21
0
25 Jan 2024
Automated Root Causing of Cloud Incidents using In-Context Learning with
  GPT-4
Automated Root Causing of Cloud Incidents using In-Context Learning with GPT-4
Xuchao Zhang
Supriyo Ghosh
Chetan Bansal
Rujia Wang
Ming-Jie Ma
Yu Kang
Saravan Rajmohan
151
47
0
24 Jan 2024
JustiLM: Few-shot Justification Generation for Explainable Fact-Checking
  of Real-world Claims
JustiLM: Few-shot Justification Generation for Explainable Fact-Checking of Real-world ClaimsTransactions of the Association for Computational Linguistics (TACL), 2024
Fengzhu Zeng
Wei Gao
319
26
0
16 Jan 2024
Attendre: Wait To Attend By Retrieval With Evicted Queries in
  Memory-Based Transformers for Long Context Processing
Attendre: Wait To Attend By Retrieval With Evicted Queries in Memory-Based Transformers for Long Context Processing
Zi Yang
Nan Hua
RALM
225
4
0
10 Jan 2024
CaMML: Context-Aware Multimodal Learner for Large Models
CaMML: Context-Aware Multimodal Learner for Large Models
Yixin Chen
Shuai Zhang
Boran Han
Tong He
Bo Li
VLM
276
6
0
06 Jan 2024
Large Language Models for Social Networks: Applications, Challenges, and
  Solutions
Large Language Models for Social Networks: Applications, Challenges, and Solutions
Jingying Zeng
Richard Huang
Waleed Malik
Langxuan Yin
Bojan Babic
Danny Shacham
Xiao Yan
Jaewon Yang
Qi He
207
11
0
04 Jan 2024
ReFusion: Improving Natural Language Understanding with
  Computation-Efficient Retrieval Representation Fusion
ReFusion: Improving Natural Language Understanding with Computation-Efficient Retrieval Representation FusionInternational Conference on Learning Representations (ICLR), 2024
Shangyu Wu
Ying Xiong
Yufei Cui
Xue Liu
Buzhou Tang
Tei-Wei Kuo
Chun Jason Xue
206
4
0
04 Jan 2024
Navigating Uncertainty: Optimizing API Dependency for Hallucination
  Reduction in Closed-Book Question Answering
Navigating Uncertainty: Optimizing API Dependency for Hallucination Reduction in Closed-Book Question Answering
Pierre Erbacher
Louis Falissard
Vincent Guigue
Laure Soulier
HILMRALM
124
4
0
03 Jan 2024
If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code
  Empowers Large Language Models to Serve as Intelligent Agents
If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents
Ke Yang
Jiateng Liu
John Wu
Chaoqi Yang
Yi R. Fung
...
Xu Cao
Xingyao Wang
Yiquan Wang
Chenhui Xu
Chengxiang Zhai
LLMAGELM
473
114
0
01 Jan 2024
Retrieval-Augmented Egocentric Video Captioning
Retrieval-Augmented Egocentric Video CaptioningComputer Vision and Pattern Recognition (CVPR), 2024
Jilan Xu
Yifei Huang
Junlin Hou
Guo Chen
Yue Zhang
Rui Feng
Weidi Xie
EgoV
409
50
0
01 Jan 2024
Structured Packing in LLM Training Improves Long Context Utilization
Structured Packing in LLM Training Improves Long Context Utilization
Konrad Staniszewski
Szymon Tworkowski
Sebastian Jaszczur
Yu Zhao
Henryk Michalewski
Lukasz Kuciñski
Piotr Milo's
371
16
0
28 Dec 2023
Adapting Large Language Models for Education: Foundational Capabilities,
  Potentials, and Challenges
Adapting Large Language Models for Education: Foundational Capabilities, Potentials, and Challenges
Qingyao Li
Lingyue Fu
Weiming Zhang
Xianyu Chen
Jingwei Yu
Wei Xia
Weinan Zhang
Ruiming Tang
Yong Yu
AI4EdELM
359
40
0
27 Dec 2023
LeanVec: Searching vectors faster by making them fit
LeanVec: Searching vectors faster by making them fit
Mariano Tepper
Ishwar Bhati
Cecilia Aguerrebere
Mark Hildebrand
Ted Willke
VLMOODD
257
5
0
26 Dec 2023
Supervised Knowledge Makes Large Language Models Better In-context
  Learners
Supervised Knowledge Makes Large Language Models Better In-context Learners
Linyi Yang
Shuibai Zhang
Zhuohao Yu
Guangsheng Bao
Yidong Wang
...
Ruochen Xu
Weirong Ye
Xing Xie
Weizhu Chen
Yue Zhang
389
25
0
26 Dec 2023
Align on the Fly: Adapting Chatbot Behavior to Established Norms
Align on the Fly: Adapting Chatbot Behavior to Established Norms
Chunpu Xu
Steffi Chern
Ethan Chern
Ge Zhang
Zekun Wang
Ruibo Liu
Jing Li
Jie Fu
Pengfei Liu
181
23
0
26 Dec 2023
Towards Consistent Language Models Using Declarative Constraints
Towards Consistent Language Models Using Declarative Constraints
Jasmin Mousavi
Arash Termehchy
HILMALM
203
2
0
24 Dec 2023
Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems
Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems
Xupeng Miao
Xupeng Miao
Zhihao Zhang
Xinhao Cheng
Hongyi Jin
Tianqi Chen
Zhihao Jia
419
121
0
23 Dec 2023
RealGen: Retrieval Augmented Generation for Controllable Traffic
  Scenarios
RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios
Wenhao Ding
Yulong Cao
Ding Zhao
Chaowei Xiao
Marco Pavone
172
42
0
19 Dec 2023
Jack of All Tasks, Master of Many: Designing General-purpose
  Coarse-to-Fine Vision-Language Model
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model
Shraman Pramanick
Guangxing Han
Rui Hou
Sayan Nag
Ser-Nam Lim
Nicolas Ballas
Qifan Wang
Rama Chellappa
Amjad Almahairi
VLMMLLM
386
50
0
19 Dec 2023
Retrieval-Augmented Generation for Large Language Models: A Survey
Retrieval-Augmented Generation for Large Language Models: A Survey
Yunfan Gao
Yun Xiong
Xinyu Gao
Kangxiang Jia
Jinliu Pan
Yuxi Bi
Yi Dai
Jiawei Sun
Meng Wang
Haofen Wang
3DVRALM
1.2K
2,702
1
18 Dec 2023
kNN-ICL: Compositional Task-Oriented Parsing Generalization with Nearest
  Neighbor In-Context Learning
kNN-ICL: Compositional Task-Oriented Parsing Generalization with Nearest Neighbor In-Context Learning
Wenting Zhao
Ye Liu
Yao Wan
Yibo Wang
Qingyang Wu
Zhongfen Deng
Jiangshu Du
Shuaiqi Liu
Yunlong Xu
Philip S. Yu
200
11
0
17 Dec 2023
AI capabilities can be significantly improved without expensive
  retraining
AI capabilities can be significantly improved without expensive retraining
Tom Davidson
Jean-Stanislas Denain
Pablo Villalobos
Guillem Bas
OffRLVLM
236
31
0
12 Dec 2023
PaperQA: Retrieval-Augmented Generative Agent for Scientific Research
PaperQA: Retrieval-Augmented Generative Agent for Scientific Research
Jakub Lála
Odhran O'Donoghue
Aleksandar Shtedritski
Sam Cox
Samuel G. Rodriques
Andrew D. White
RALM
433
147
0
08 Dec 2023
SparQ Attention: Bandwidth-Efficient LLM Inference
SparQ Attention: Bandwidth-Efficient LLM InferenceInternational Conference on Machine Learning (ICML), 2023
Luka Ribar
Ivan Chelombiev
Luke Hudlass-Galley
Charlie Blake
Carlo Luschi
Douglas Orr
437
85
0
08 Dec 2023
LLM as OS, Agents as Apps: Envisioning AIOS, Agents and the AIOS-Agent
  Ecosystem
LLM as OS, Agents as Apps: Envisioning AIOS, Agents and the AIOS-Agent Ecosystem
Yingqiang Ge
Yujie Ren
Qingfeng Lan
Shuyuan Xu
Juntao Tan
Zelong Li
LLMAG
248
39
0
06 Dec 2023
Scaling Laws for Adversarial Attacks on Language Model Activations
Scaling Laws for Adversarial Attacks on Language Model Activations
Stanislav Fort
140
21
0
05 Dec 2023
PEFA: Parameter-Free Adapters for Large-scale Embedding-based Retrieval
  Models
PEFA: Parameter-Free Adapters for Large-scale Embedding-based Retrieval ModelsWeb Search and Data Mining (WSDM), 2023
Wei-Cheng Chang
Jyun-Yu Jiang
Jiong Zhang
Mutasem Al-Darabsah
C. Teo
Cho-Jui Hsieh
Hsiang-Fu Yu
S. Vishwanathan
RALM
262
4
0
05 Dec 2023
A Glitch in the Matrix? Locating and Detecting Language Model Grounding with Fakepedia
A Glitch in the Matrix? Locating and Detecting Language Model Grounding with FakepediaAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Giovanni Monea
Maxime Peyrard
Martin Josifoski
Vishrav Chaudhary
Jason Eisner
Emre Kiciman
Hamid Palangi
Barun Patra
Robert West
KELM
494
24
0
04 Dec 2023
UniIR: Training and Benchmarking Universal Multimodal Information
  Retrievers
UniIR: Training and Benchmarking Universal Multimodal Information Retrievers
Cong Wei
Yang Chen
Haonan Chen
Hexiang Hu
Ge Zhang
Jie Fu
Alan Ritter
Lei Ma
267
126
0
28 Nov 2023
Rethinking Privacy in Machine Learning Pipelines from an Information
  Flow Control Perspective
Rethinking Privacy in Machine Learning Pipelines from an Information Flow Control Perspective
Lukas Wutschitz
Boris Köpf
Andrew Paverd
Saravan Rajmohan
Ahmed Salem
Shruti Tople
Santiago Zanella Béguelin
Menglin Xia
Victor Rühle
205
17
0
27 Nov 2023
Transforming organic chemistry research paradigms: moving from manual
  efforts to the intersection of automation and artificial intelligence
Transforming organic chemistry research paradigms: moving from manual efforts to the intersection of automation and artificial intelligenceNational Science Open (NSO), 2023
Chengchun Liu
Yuntian Chen
Fanyang Mo
154
2
0
26 Nov 2023
Walking a Tightrope -- Evaluating Large Language Models in High-Risk
  Domains
Walking a Tightrope -- Evaluating Large Language Models in High-Risk Domains
Chia-Chien Hung
Wiem Ben-Rim
Lindsay Frost
Lars Bruckner
Carolin (Haas) Lawrence
AILawALMELM
273
12
0
25 Nov 2023
Calibrated Language Models Must Hallucinate
Calibrated Language Models Must HallucinateSymposium on the Theory of Computing (STOC), 2023
Adam Tauman Kalai
Santosh Vempala
HILM
415
132
0
24 Nov 2023
Probabilistic Tree-of-thought Reasoning for Answering
  Knowledge-intensive Complex Questions
Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex QuestionsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
S. Cao
Jiajie Zhang
Jiaxin Shi
Xin Lv
Zijun Yao
Qingwen Tian
Juanzi Li
Lei Hou
LRM
167
31
0
23 Nov 2023
Minimizing Factual Inconsistency and Hallucination in Large Language
  Models
Minimizing Factual Inconsistency and Hallucination in Large Language Models
Muneeswaran Irulandi
Shreya Saxena
Siva Prasad
M. V. Sai Prakash
Advaith Shankar
V. Varun
Vishal Vaddina
Saisubramaniam Gopalakrishnan
HILM
156
7
0
23 Nov 2023
Retrieval-Augmented Layout Transformer for Content-Aware Layout
  Generation
Retrieval-Augmented Layout Transformer for Content-Aware Layout GenerationComputer Vision and Pattern Recognition (CVPR), 2023
Daichi Horita
Naoto Inoue
Kotaro Kikuchi
Kota Yamaguchi
Kiyoharu Aizawa
3DV
440
40
0
22 Nov 2023
Advancing Transformer Architecture in Long-Context Large Language
  Models: A Comprehensive Survey
Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey
Yunpeng Huang
Jingwei Xu
Junyu Lai
Zixu Jiang
Taolue Chen
...
Xiaoxing Ma
Lijuan Yang
Zhou Xin
Shupeng Li
Penghao Zhao
LLMAGKELM
367
99
0
21 Nov 2023
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language
  Model-based Agents in Real-world Systems
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems
Yilun Kong
Jingqing Ruan
Yihong Chen
Bin Zhang
Tianpeng Bao
...
Xiaoru Hu
Hangyu Mao
Ziyue Li
Xingyu Zeng
Rui Zhao
LLMAG
292
51
0
19 Nov 2023
Augmenting Unsupervised Reinforcement Learning with Self-Reference
Augmenting Unsupervised Reinforcement Learning with Self-Reference
Andrew Zhao
Erle Zhu
Rui Lu
Matthieu Lin
Yong-Jin Liu
Gao Huang
SSL
216
1
0
16 Nov 2023
Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language
  Models
Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Wenhao Yu
Hongming Zhang
Xiaoman Pan
Kaixin Ma
Hongwei Wang
Dong Yu
KELMRALMLRM
264
166
0
15 Nov 2023
How Well Do Large Language Models Truly Ground?
How Well Do Large Language Models Truly Ground?North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Hyunji Lee
Se June Joo
Chaeeun Kim
Joel Jang
Doyoung Kim
Kyoung-Woon On
Minjoon Seo
HILM
257
14
0
15 Nov 2023
Previous
123...101112...161718
Next