ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.04426
  4. Cited By
Improving language models by retrieving from trillions of tokens

Improving language models by retrieving from trillions of tokens

8 December 2021
Sebastian Borgeaud
A. Mensch
Jordan Hoffmann
Trevor Cai
Eliza Rutherford
Katie Millican
George van den Driessche
Jean-Baptiste Lespiau
Bogdan Damoc
Aidan Clark
Diego de Las Casas
Aurelia Guy
Jacob Menick
Roman Ring
Tom Hennigan
Saffron Huang
Lorenzo Maggiore
Chris Jones
Albin Cassirer
Andy Brock
Michela Paganini
G. Irving
Oriol Vinyals
Simon Osindero
Karen Simonyan
Jack W. Rae
Erich Elsen
Laurent Sifre
    KELM
    RALM
ArXivPDFHTML

Papers citing "Improving language models by retrieving from trillions of tokens"

50 / 722 papers shown
Title
Anonymity at Risk? Assessing Re-Identification Capabilities of Large
  Language Models
Anonymity at Risk? Assessing Re-Identification Capabilities of Large Language Models
Alex Nyffenegger
Matthias Sturmer
Joel Niklaus
34
5
0
22 Aug 2023
Head-to-Tail: How Knowledgeable are Large Language Models (LLMs)? A.K.A.
  Will LLMs Replace Knowledge Graphs?
Head-to-Tail: How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs?
Kai Sun
Y. Xu
Hanwen Zha
Yue Liu
Xinhsuai Dong
AI4MH
28
130
0
20 Aug 2023
Evaluating the Instruction-Following Robustness of Large Language Models
  to Prompt Injection
Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection
Zekun Li
Baolin Peng
Pengcheng He
Xifeng Yan
ELM
SILM
AAML
41
23
0
17 Aug 2023
End-to-End Open Vocabulary Keyword Search With Multilingual Neural
  Representations
End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations
Bolaji Yusuf
J. Černocký
Murat Saraclar
38
2
0
15 Aug 2023
RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder
  Language Models
RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models
Jie Huang
Wei Ping
Peng-Tao Xu
M. Shoeybi
Kevin Chen-Chuan Chang
Bryan Catanzaro
RALM
32
33
0
15 Aug 2023
EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language
  Models
EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language Models
Peng Wang
Ningyu Zhang
Bo Tian
Zekun Xi
Yunzhi Yao
...
Shuyang Cheng
Kangwei Liu
Yuansheng Ni
Guozhou Zheng
Huajun Chen
KELM
29
41
0
14 Aug 2023
Large Language Models for Information Retrieval: A Survey
Large Language Models for Information Retrieval: A Survey
Yutao Zhu
Huaying Yuan
Shuting Wang
Jiongnan Liu
Wenhan Liu
Chenlong Deng
Haonan Chen
Zhicheng Dou
Ji-Rong Wen
KELM
51
284
0
14 Aug 2023
WeaverBird: Empowering Financial Decision-Making with Large Language
  Model, Knowledge Base, and Search Engine
WeaverBird: Empowering Financial Decision-Making with Large Language Model, Knowledge Base, and Search Engine
Siqiao Xue
Fan Zhou
Y. Xu
Ming Jin
Qingsong Wen
...
Jun Zhou
Shuo Xie
D. Xiu
James Y. Zhang
Hongyuan Mei
RALM
AIFin
31
15
0
10 Aug 2023
SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore
SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore
Sewon Min
Suchin Gururangan
Eric Wallace
Hannaneh Hajishirzi
Noah A. Smith
Luke Zettlemoyer
AILaw
22
63
0
08 Aug 2023
SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative
  AI Tool
SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI Tool
Youyang Ng
Daisuke Miyashita
Yasuto Hoshi
Yasuhiro Morioka
Osamu Torii
Tomoya Kodama
J. Deguchi
RALM
15
9
0
08 Aug 2023
TPTU: Large Language Model-based AI Agents for Task Planning and Tool
  Usage
TPTU: Large Language Model-based AI Agents for Task Planning and Tool Usage
Jingqing Ruan
Yihong Chen
Bin Zhang
Zhiwei Xu
Tianpeng Bao
...
Shiwei Shi
Hangyu Mao
Ziyue Li
Xingyu Zeng
Rui Zhao
LLMAG
LM&Ro
41
32
0
07 Aug 2023
DeDrift: Robust Similarity Search under Content Drift
DeDrift: Robust Similarity Search under Content Drift
Dmitry Baranchuk
Matthijs Douze
Yash Upadhyay
I. Z. Yalniz
22
8
0
05 Aug 2023
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language
  Models
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
Cheng-Yu Hsieh
Sibei Chen
Chun-Liang Li
Yasuhisa Fujii
Alexander Ratner
Chen-Yu Lee
Ranjay Krishna
Tomas Pfister
LLMAG
SyDa
38
41
0
01 Aug 2023
HAGRID: A Human-LLM Collaborative Dataset for Generative
  Information-Seeking with Attribution
HAGRID: A Human-LLM Collaborative Dataset for Generative Information-Seeking with Attribution
Ehsan Kamalloo
A. Jafari
Xinyu Crystina Zhang
Nandan Thakur
Jimmy J. Lin
24
41
0
31 Jul 2023
On the Trustworthiness Landscape of State-of-the-art Generative Models:
  A Survey and Outlook
On the Trustworthiness Landscape of State-of-the-art Generative Models: A Survey and Outlook
Mingyuan Fan
Chengyu Wang
Cen Chen
Yang Liu
Jun Huang
HILM
31
3
0
31 Jul 2023
TabR: Tabular Deep Learning Meets Nearest Neighbors in 2023
TabR: Tabular Deep Learning Meets Nearest Neighbors in 2023
Yu. V. Gorishniy
Ivan Rubachev
Nikolay Kartashev
Daniil Shlenskii
Akim Kotelnikov
Artem Babenko
OOD
LMTD
19
13
0
26 Jul 2023
Benchmarking and Analyzing Generative Data for Visual Recognition
Benchmarking and Analyzing Generative Data for Visual Recognition
Bo-wen Li
Haotian Liu
Liangyu Chen
Yong Jae Lee
C. Li
Ziwei Liu
EGVM
VLM
18
4
0
25 Jul 2023
Towards green AI-based software systems: an architecture-centric
  approach (GAISSA)
Towards green AI-based software systems: an architecture-centric approach (GAISSA)
Silverio Martínez-Fernández
Xavier Franch
Francisco Durán
30
7
0
19 Jul 2023
Information Retrieval Meets Large Language Models: A Strategic Report
  from Chinese IR Community
Information Retrieval Meets Large Language Models: A Strategic Report from Chinese IR Community
Qingyao Ai
Ting Bai
Zhao Cao
Yi-Ju Chang
Jiawei Chen
...
Peng-Zhen Zhang
Fan Zhang
Wei-na Zhang
M. Zhang
Xiaofei Zhu
52
58
0
19 Jul 2023
Thrust: Adaptively Propels Large Language Models with External Knowledge
Thrust: Adaptively Propels Large Language Models with External Knowledge
Xinran Zhao
Hongming Zhang
Xiaoman Pan
Wenlin Yao
Dong Yu
Jianshu Chen
KELM
48
4
0
19 Jul 2023
Learning to Retrieve In-Context Examples for Large Language Models
Learning to Retrieve In-Context Examples for Large Language Models
Liang Wang
Nan Yang
Furu Wei
RALM
33
38
0
14 Jul 2023
Generating Benchmarks for Factuality Evaluation of Language Models
Generating Benchmarks for Factuality Evaluation of Language Models
Dor Muhlgay
Ori Ram
Inbal Magar
Yoav Levine
Nir Ratner
Yonatan Belinkov
Omri Abend
Kevin Leyton-Brown
Amnon Shashua
Y. Shoham
HILM
25
91
0
13 Jul 2023
Personalization for BERT-based Discriminative Speech Recognition
  Rescoring
Personalization for BERT-based Discriminative Speech Recognition Rescoring
J. Kolehmainen
Yile Gu
Aditya Gourav
Prashanth Gurunath Shivakumar
Ankur Gandhe
Ariya Rastrow
I. Bulyko
30
2
0
13 Jul 2023
Copy Is All You Need
Copy Is All You Need
Tian Lan
Deng Cai
Yan Wang
Heyan Huang
Xian-Ling Mao
27
26
0
13 Jul 2023
A Comprehensive Overview of Large Language Models
A Comprehensive Overview of Large Language Models
Humza Naveed
Asad Ullah Khan
Shi Qiu
Muhammad Saqib
Saeed Anwar
Muhammad Usman
Naveed Akhtar
Nick Barnes
Ajmal Saeed Mian
OffRL
64
523
0
12 Jul 2023
Pluggable Neural Machine Translation Models via Memory-augmented
  Adapters
Pluggable Neural Machine Translation Models via Memory-augmented Adapters
Yuzhuang Xu
Shuo Wang
Peng Li
Xuebo Liu
Xiaolong Wang
Weidong Liu
Yang Liu
27
1
0
12 Jul 2023
ReLoRA: High-Rank Training Through Low-Rank Updates
ReLoRA: High-Rank Training Through Low-Rank Updates
Vladislav Lialin
Namrata Shivagunde
Sherin Muckatira
Anna Rumshisky
BDL
29
93
0
11 Jul 2023
Linear Alignment of Vision-language Models for Image Captioning
Linear Alignment of Vision-language Models for Image Captioning
Fabian Paischer
M. Hofmarcher
Sepp Hochreiter
Thomas Adler
CLIP
VLM
47
0
0
10 Jul 2023
Focused Transformer: Contrastive Training for Context Scaling
Focused Transformer: Contrastive Training for Context Scaling
Szymon Tworkowski
Konrad Staniszewski
Mikolaj Pacek
Yuhuai Wu
Henryk Michalewski
Piotr Milo's
21
135
0
06 Jul 2023
VerifAI: Verified Generative AI
VerifAI: Verified Generative AI
Nan Tang
Chenyu Yang
Ju Fan
Lei Cao
Yuyu Luo
Alon Halevy
19
15
0
06 Jul 2023
Citation: A Key to Building Responsible and Accountable Large Language
  Models
Citation: A Key to Building Responsible and Accountable Large Language Models
Jie Huang
Kevin Chen-Chuan Chang
HILM
46
17
0
05 Jul 2023
Trainable Transformer in Transformer
Trainable Transformer in Transformer
A. Panigrahi
Sadhika Malladi
Mengzhou Xia
Sanjeev Arora
VLM
27
12
0
03 Jul 2023
Meta-training with Demonstration Retrieval for Efficient Few-shot
  Learning
Meta-training with Demonstration Retrieval for Efficient Few-shot Learning
Aaron Mueller
Kanika Narang
Lambert Mathias
Qifan Wang
Hamed Firooz
RALM
11
3
0
30 Jun 2023
Query Understanding in the Age of Large Language Models
Query Understanding in the Age of Large Language Models
Avishek Anand
Venktesh V
Abhijit Anand
Vinay Setty
LRM
43
4
0
28 Jun 2023
LeanDojo: Theorem Proving with Retrieval-Augmented Language Models
LeanDojo: Theorem Proving with Retrieval-Augmented Language Models
Kaiyu Yang
Aidan M. Swope
Alex Gu
Rahul Chalamala
Peiyang Song
Shixing Yu
Saad Godil
R. Prenger
Anima Anandkumar
RALM
19
208
0
27 Jun 2023
Long-range Language Modeling with Self-retrieval
Long-range Language Modeling with Self-retrieval
Ohad Rubin
Jonathan Berant
RALM
KELM
19
18
0
23 Jun 2023
ToolQA: A Dataset for LLM Question Answering with External Tools
ToolQA: A Dataset for LLM Question Answering with External Tools
Yuchen Zhuang
Yue Yu
Kuan-Chieh Jackson Wang
Haotian Sun
Chao Zhang
ELM
LLMAG
20
212
0
23 Jun 2023
Resources and Evaluations for Multi-Distribution Dense Information
  Retrieval
Resources and Evaluations for Multi-Distribution Dense Information Retrieval
Soumya Chatterjee
Omar Khattab
Simran Arora
22
0
0
21 Jun 2023
Co-design Hardware and Algorithm for Vector Search
Co-design Hardware and Algorithm for Vector Search
Wenqi Jiang
Shigang Li
Yu Zhu
Johannes de Fine Licht
Zhenhao He
...
Cédric Renggli
Shuai Zhang
Theodoros Rekatsinas
Torsten Hoefler
Gustavo Alonso
84
19
0
19 Jun 2023
Large Language Models are Fixated by Red Herrings: Exploring Creative
  Problem Solving and Einstellung Effect using the Only Connect Wall Dataset
Large Language Models are Fixated by Red Herrings: Exploring Creative Problem Solving and Einstellung Effect using the Only Connect Wall Dataset
S. Naeini
Raeid Saqur
M. Saeidi
John Giorgi
Babak Taati
37
8
0
19 Jun 2023
RepoFusion: Training Code Models to Understand Your Repository
RepoFusion: Training Code Models to Understand Your Repository
Disha Shrivastava
Denis Kocetkov
H. D. Vries
Dzmitry Bahdanau
Torsten Scholak
81
27
0
19 Jun 2023
GLIMMER: generalized late-interaction memory reranker
GLIMMER: generalized late-interaction memory reranker
Michiel de Jong
Yury Zemlyanskiy
Nicholas FitzGerald
Sumit Sanghai
William W. Cohen
Joshua Ainslie
RALM
41
4
0
17 Jun 2023
Neural Priming for Sample-Efficient Adaptation
Neural Priming for Sample-Efficient Adaptation
Matthew Wallingford
Vivek Ramanujan
Alex Fang
Aditya Kusupati
Roozbeh Mottaghi
Aniruddha Kembhavi
Ludwig Schmidt
Ali Farhadi
VLM
108
13
0
16 Jun 2023
Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen
  Large Language Models
Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models
Junting Pan
Ziyi Lin
Yuying Ge
Xiatian Zhu
Renrui Zhang
Yi Wang
Yu Qiao
Hongsheng Li
MLLM
24
26
0
15 Jun 2023
Encyclopedic VQA: Visual questions about detailed properties of
  fine-grained categories
Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories
Thomas Mensink
J. Uijlings
Lluis Castrejon
A. Goel
Felipe Cadar
Howard Zhou
Fei Sha
A. Araújo
V. Ferrari
34
37
0
15 Jun 2023
Retrieval-Enhanced Contrastive Vision-Text Models
Retrieval-Enhanced Contrastive Vision-Text Models
Ahmet Iscen
Mathilde Caron
Alireza Fathi
Cordelia Schmid
CLIP
VLM
25
26
0
12 Jun 2023
Augmenting Language Models with Long-Term Memory
Augmenting Language Models with Long-Term Memory
Weizhi Wang
Li Dong
Hao Cheng
Xiaodong Liu
Xifeng Yan
Jianfeng Gao
Furu Wei
KELM
RALM
28
83
0
12 Jun 2023
PoET: A generative model of protein families as sequences-of-sequences
PoET: A generative model of protein families as sequences-of-sequences
Timothy F. Truong
Tristan Bepler
SLR
17
38
0
09 Jun 2023
Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for
  Speech Understanding
Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding
Mingqiu Wang
Izhak Shafran
H. Soltau
Wei Han
Yuan Cao
Dian Yu
Laurent El Shafey
RALM
AuLLM
13
9
0
08 Jun 2023
Information Flow Control in Machine Learning through Modular Model
  Architecture
Information Flow Control in Machine Learning through Modular Model Architecture
Trishita Tiwari
Suchin Gururangan
Chuan Guo
Weizhe Hua
Sanjay Kariyappa
Udit Gupta
Wenjie Xiong
Kiwan Maeng
Hsien-Hsin S. Lee
G. E. Suh
19
6
0
05 Jun 2023
Previous
123...91011...131415
Next