ResearchTrend.AI
Improving language models by retrieving from trillions of tokens

8 December 2021
Sebastian Borgeaud
A. Mensch
Jordan Hoffmann
Trevor Cai
Eliza Rutherford
Katie Millican
George van den Driessche
Jean-Baptiste Lespiau
Bogdan Damoc
Aidan Clark
Diego de Las Casas
Aurelia Guy
Jacob Menick
Roman Ring
Tom Hennigan
Saffron Huang
Lorenzo Maggiore
Chris Jones
Albin Cassirer
Andy Brock
Michela Paganini
G. Irving
Oriol Vinyals
Simon Osindero
Karen Simonyan
Jack W. Rae
Erich Elsen
Laurent Sifre
    KELM
    RALM

Papers citing "Improving language models by retrieving from trillions of tokens"

50 / 722 papers shown
Title
RoCode: A Dataset for Measuring Code Intelligence from Problem Definitions in Romanian
Adrian Cosma
Ioan-Bogdan Iordache
Paolo Rosso
OffRL
36
2
0
20 Feb 2024
Instruction-tuned Language Models are Better Knowledge Learners
Zhengbao Jiang
Zhiqing Sun
Weijia Shi
Pedro Rodriguez
Chunting Zhou
Graham Neubig
Xi Victoria Lin
Wen-tau Yih
Srinivasan Iyer
KELM
38
33
0
20 Feb 2024
Integrating kNN with Foundation Models for Adaptable and Privacy-Aware Image Classification
Sebastian Doerrich
Tobias Archut
Francesco Di Salvo
Christian Ledig
19
4
0
19 Feb 2024
BIDER: Bridging Knowledge Inconsistency for Efficient Retrieval-Augmented LLMs via Key Supporting Evidence
Jiajie Jin
Yutao Zhu
Yujia Zhou
Zhicheng Dou
RALM
49
20
0
19 Feb 2024
EVEDIT: Event-based Knowledge Editing with Deductive Editing Boundaries
Jiateng Liu
Pengfei Yu
Yuji Zhang
Sha Li
Zixuan Zhang
Heng Ji
KELM
24
16
0
17 Feb 2024
In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss
Yuri Kuratov
Aydar Bulatov
Petr Anokhin
Dmitry Sorokin
Artyom Sorokin
Mikhail Burtsev
RALM
117
33
0
16 Feb 2024
OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language Models
Ali AhmadiTeshnizi
Wenzhi Gao
Madeleine Udell
LLMAG
11
22
0
15 Feb 2024
Context Composing for Full Line Code Completion
Anton Semenkin
Yaroslav Sokolov
Evgeniia Vu
18
4
0
14 Feb 2024
Towards Faithful and Robust LLM Specialists for Evidence-Based Question-Answering
Tobias Schimanski
Jingwei Ni
Mathias Kraus
Elliott Ash
Markus Leippold
21
4
0
13 Feb 2024
Nearest Neighbour Score Estimators for Diffusion Generative Models
Matthew Niedoba
Dylan Green
Saeid Naderiparizi
Vasileios Lioutas
J. Lavington
...
Ke Zhang
Setareh Dabiri
Adam Scibior
Berend Zwartsenberg
Frank D. Wood
DiffM
27
0
0
12 Feb 2024
PoisonedRAG: Knowledge Poisoning Attacks to Retrieval-Augmented Generation of Large Language Models
Wei Zou
Runpeng Geng
Binghui Wang
Jinyuan Jia
SILM
28
45
1
12 Feb 2024
Retrieval-Augmented Thought Process as Sequential Decision Making
T. Pouplin
Hao Sun
Samuel Holt
M. Schaar
KELM
RALM
LRM
11
2
0
12 Feb 2024
Prompt Perturbation in Retrieval-Augmented Generation based Large Language Models
Zhibo Hu
Chen Wang
Yanfeng Shu
Helen Paik
Liming Zhu
SILM
RALM
37
7
0
11 Feb 2024
ProtIR: Iterative Refinement between Retrievers and Predictors for Protein Function Annotation
Zuobai Zhang
Jiarui Lu
Vijil Chenthamarakshan
Aurélie C. Lozano
Payel Das
Jian Tang
21
1
0
10 Feb 2024
Large Language Models: A Survey
Shervin Minaee
Tomáš Mikolov
Narjes Nikzad
M. Asgari-Chenaghlu
R. Socher
Xavier Amatriain
Jianfeng Gao
ALM
LM&MA
ELM
120
364
0
09 Feb 2024
Memory Consolidation Enables Long-Context Video Understanding
Ivana Balažević
Yuge Shi
Pinelopi Papalampidi
Rahma Chaabouni
Skanda Koppula
Olivier J. Hénaff
97
22
0
08 Feb 2024
DFA-RAG: Conversational Semantic Router for Large Language Model with Definite Finite Automaton
Yiyou Sun
Junjie Hu
Wei Cheng
Haifeng Chen
RALM
AI4CE
26
1
0
06 Feb 2024
Retrieve to Explain: Evidence-driven Predictions with Language Models
Ravi Patel
Angus Brayne
Rogier E Hintzen
Daniel Jaroslawicz
Georgiana Neculae
Dane S. Corneil
19
2
0
06 Feb 2024
LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K
Tao Yuan
Xuefei Ning
Dong Zhou
Zhijie Yang
Shiyao Li
...
Dahua Lin
Boxun Li
Guohao Dai
Shengen Yan
Yu-Xiang Wang
ALM
36
34
0
06 Feb 2024
Retrieval-Augmented Score Distillation for Text-to-3D Generation
Junyoung Seo
Susung Hong
Wooseok Jang
Ines Hyeonsu Kim
Minseop Kwak
Doyup Lee
Seungryong Kim
57
9
0
05 Feb 2024
IllusionX: An LLM-powered mixed reality personal companion
Ramez Yousri
Zeyad Essam
Yehia Kareem
Youstina Sherief
Sherry Gamil
Soha Safwat
22
3
0
04 Feb 2024
Factuality of Large Language Models in the Year 2024
Yuxia Wang
Minghan Wang
Muhammad Arslan Manzoor
Fei Liu
Georgi Georgiev
Rocktim Jyoti Das
Preslav Nakov
LRM
HILM
30
7
0
04 Feb 2024
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities
Zhifeng Kong
Arushi Goel
Rohan Badlani
Wei Ping
Rafael Valle
Bryan Catanzaro
AuLLM
LM&MA
MLLM
66
73
0
02 Feb 2024
Retrieval Augmented End-to-End Spoken Dialog Models
Mingqiu Wang
Izhak Shafran
H. Soltau
Wei Han
Yuan Cao
Dian Yu
Laurent El Shafey
RALM
AuLLM
22
11
0
02 Feb 2024
CorpusLM: Towards a Unified Language Model on Corpus for Knowledge-Intensive Tasks
Xiaoxi Li
Zhicheng Dou
Yujia Zhou
Fangchao Liu
RALM
38
14
0
02 Feb 2024
Efficient Prompt Caching via Embedding Similarity
Hanlin Zhu
Banghua Zhu
Jiantao Jiao
RALM
21
9
0
02 Feb 2024
Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration
Shangbin Feng
Weijia Shi
Yike Wang
Wenxuan Ding
Vidhisha Balachandran
Yulia Tsvetkov
25
77
0
01 Feb 2024
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Parth Sarthi
Salman Abdullah
Aditi Tuli
Shubh Khanna
Anna Goldie
Christopher D. Manning
RALM
19
122
0
31 Jan 2024
LOCOST: State-Space Models for Long Document Abstractive Summarization
Florian Le Bronnec
Song Duong
Mathieu Ravaut
Alexandre Allauzen
Nancy F. Chen
Vincent Guigue
Alberto Lumbreras
Laure Soulier
Patrick Gallinari
40
8
0
31 Jan 2024
Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens
Jiacheng Liu
Sewon Min
Luke Zettlemoyer
Yejin Choi
Hannaneh Hajishirzi
43
50
0
30 Jan 2024
MultiHop-RAG: Benchmarking Retrieval-Augmented Generation for Multi-Hop Queries
Yixuan Tang
Yi Yang
RALM
39
78
0
27 Jan 2024
Equipping Language Models with Tool Use Capability for Tabular Data Analysis in Finance
Adrian Theuma
Ehsan Shareghi
24
4
0
27 Jan 2024
A RAG-based Question Answering System Proposal for Understanding Islam: MufassirQAS LLM
Ahmet Yusuf Alan
Enis Karaarslan
Ömer Aydin
14
1
0
27 Jan 2024
The Power of Noise: Redefining Retrieval for RAG Systems
Florin Cuconasu
Giovanni Trappolini
F. Siciliano
Simone Filice
Cesare Campagnano
Y. Maarek
Nicola Tonellotto
Fabrizio Silvestri
RALM
37
143
0
26 Jan 2024
Accelerating Retrieval-Augmented Language Model Serving with Speculation
Zhihao Zhang
Alan Zhu
Lijie Yang
Yihua Xu
Lanting Li
P. Phothilimthana
Zhihao Jia
RALM
KELM
40
16
0
25 Jan 2024
Automated Root Causing of Cloud Incidents using In-Context Learning with GPT-4
Xuchao Zhang
Supriyo Ghosh
Chetan Bansal
Rujia Wang
Ming-Jie Ma
Yu Kang
Saravan Rajmohan
38
23
0
24 Jan 2024
JustiLM: Few-shot Justification Generation for Explainable Fact-Checking of Real-world Claims
Fengzhu Zeng
Wei Gao
39
15
0
16 Jan 2024
Attendre: Wait To Attend By Retrieval With Evicted Queries in Memory-Based Transformers for Long Context Processing
Zi Yang
Nan Hua
RALM
34
4
0
10 Jan 2024
CaMML: Context-Aware Multimodal Learner for Large Models
Yixin Chen
Shuai Zhang
Boran Han
Tong He
Bo Li
VLM
24
4
0
06 Jan 2024
Large Language Models for Social Networks: Applications, Challenges, and Solutions
Jingying Zeng
Richard Huang
Waleed Malik
Langxuan Yin
Bojan Babic
Danny Shacham
Xiao Yan
Jaewon Yang
Qi He
22
3
0
04 Jan 2024
ReFusion: Improving Natural Language Understanding with Computation-Efficient Retrieval Representation Fusion
Shangyu Wu
Ying Xiong
Yufei Cui
Xue Liu
Buzhou Tang
Tei-Wei Kuo
Chun Jason Xue
23
2
0
04 Jan 2024
Navigating Uncertainty: Optimizing API Dependency for Hallucination Reduction in Closed-Book Question Answering
Pierre Erbacher
Louis Falissard
Vincent Guigue
Laure Soulier
HILM
RALM
19
4
0
03 Jan 2024
If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents
Ke Yang
Jiateng Liu
John Wu
Chaoqi Yang
Yi Ren Fung
...
Xu Cao
Xingyao Wang
Yiquan Wang
Heng Ji
Chengxiang Zhai
LLMAG
ELM
18
73
0
01 Jan 2024
Retrieval-Augmented Egocentric Video Captioning
Jilan Xu
Yifei Huang
Junlin Hou
Guo Chen
Yue Zhang
Rui Feng
Weidi Xie
EgoV
43
29
0
01 Jan 2024
Structured Packing in LLM Training Improves Long Context Utilization
Konrad Staniszewski
Szymon Tworkowski
Sebastian Jaszczur
Yu Zhao
Henryk Michalewski
Łukasz Kuciński
Piotr Miłoś
41
13
0
28 Dec 2023
Adapting Large Language Models for Education: Foundational Capabilities, Potentials, and Challenges
Qingyao Li
Lingyue Fu
Weiming Zhang
Xianyu Chen
Jingwei Yu
Wei Xia
Weinan Zhang
Ruiming Tang
Yong Yu
AI4Ed
ELM
33
18
0
27 Dec 2023
LeanVec: Searching vectors faster by making them fit
Mariano Tepper
Ishwar Bhati
Cecilia Aguerrebere
Mark Hildebrand
Ted Willke
VLM
OODD
21
1
0
26 Dec 2023
Supervised Knowledge Makes Large Language Models Better In-context Learners
Linyi Yang
Shuibai Zhang
Zhuohao Yu
Guangsheng Bao
Yidong Wang
...
Ruochen Xu
Weirong Ye
Xing Xie
Weizhu Chen
Yue Zhang
16
14
0
26 Dec 2023
Align on the Fly: Adapting Chatbot Behavior to Established Norms
Chunpu Xu
Steffi Chern
Ethan Chern
Ge Zhang
Zekun Wang
Ruibo Liu
Jing Li
Jie Fu
Pengfei Liu
21
20
0
26 Dec 2023
Towards Consistent Language Models Using Declarative Constraints
Jasmin Mousavi
Arash Termehchy
HILM
ALM
25
2
0
24 Dec 2023