2112.04426
Cited By
v1
v2
v3 (latest)
Improving language models by retrieving from trillions of tokens
8 December 2021
Sebastian Borgeaud
A. Mensch
Jordan Hoffmann
Trevor Cai
Eliza Rutherford
Katie Millican
George van den Driessche
Jean-Baptiste Lespiau
Bogdan Damoc
Aidan Clark
Diego de Las Casas
Aurelia Guy
Jacob Menick
Roman Ring
Tom Hennigan
Saffron Huang
Lorenzo Maggiore
Chris Jones
Albin Cassirer
Andy Brock
Michela Paganini
G. Irving
Oriol Vinyals
Simon Osindero
Karen Simonyan
Jack W. Rae
Erich Elsen
Laurent Sifre
KELM
RALM
Papers citing
"Improving language models by retrieving from trillions of tokens"
50 / 893 papers shown
Resolving Knowledge Conflicts in Large Language Models
Yike Wang
Shangbin Feng
Heng Wang
Weijia Shi
Vidhisha Balachandran
Tianxing He
Yulia Tsvetkov
229
34
0
02 Oct 2023
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
International Conference on Learning Representations (ICLR), 2023
Zhibin Gou
Zhihong Shao
Yeyun Gong
Haoran Pan
Yujiu Yang
Shiyu Huang
Nan Duan
Weizhu Chen
LRM
AI4CE
LLMAG
363
256
0
29 Sep 2023
Augmenting LLMs with Knowledge: A survey on hallucination prevention
Shaocong Long
Lizhuang Ma
KELM
RALM
HILM
141
25
0
28 Sep 2023
Ragas: Automated Evaluation of Retrieval Augmented Generation
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
ES Shahul
Jithin James
Luis Espinosa-Anke
Steven Schockaert
472
385
0
26 Sep 2023
Cognitive Mirage: A Review of Hallucinations in Large Language Models
Hongbin Ye
Tong Liu
Aijia Zhang
Wei Hua
Weiqiang Jia
HILM
357
111
0
13 Sep 2023
Retrieval-Augmented Meta Learning for Low-Resource Text Classification
IEEE International Joint Conference on Neural Networks (IJCNN), 2023
Rongsheng Li
Yongqian Li
Hai-Tao Zheng
Chaiyut Luoyiching
Nannan Zhou
Hanjing Su
RALM
219
2
0
10 Sep 2023
The Emergence of Chunking Structures with Hierarchical RNN
Zijun Wu
Anup Anand Deshmukh
Yongkang Wu
Jimmy Lin
Lili Mou
267
3
0
10 Sep 2023
Towards Reliable and Fluent Large Language Models: Incorporating Feedback Learning Loops in QA Systems
Dongyub Lee
Taesun Whang
Chanhee Lee
Heuiseok Lim
KELM
112
12
0
08 Sep 2023
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
International Conference on Learning Representations (ICLR), 2023
Yung-Sung Chuang
Yujia Xie
Hongyin Luo
Yoon Kim
James R. Glass
Pengcheng He
HILM
273
269
0
07 Sep 2023
Knowledge Solver: Teaching LLMs to Search for Domain Knowledge from Knowledge Graphs
Chao Feng
Xinyu Zhang
Zichu Fei
KELM
198
62
0
06 Sep 2023
Cognitive Architectures for Language Agents
T. Sumers
Shunyu Yao
Karthik Narasimhan
Thomas Griffiths
LLMAG
LM&Ro
571
270
0
05 Sep 2023
NICE: CVPR 2023 Challenge on Zero-shot Image Captioning
Taehoon Kim
Pyunghwan Ahn
Sangyun Kim
Sihaeng Lee
Mark A Marsden
...
Yujin Wang
Yimu Wang
Tiancheng Gu
Xingchang Lv
Mingmao Sun
VLM
192
8
0
05 Sep 2023
Augmenting Black-box LLMs with Medical Textbooks for Biomedical Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yubo Wang
Xueguang Ma
Wenhu Chen
LM&MA
AI4MH
269
37
0
05 Sep 2023
Benchmarking Large Language Models in Retrieval-Augmented Generation
AAAI Conference on Artificial Intelligence (AAAI), 2023
Jiawei Chen
Hongyu Lin
Xianpei Han
Le Sun
3DV
RALM
361
444
0
04 Sep 2023
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
Computational Linguistics (CL), 2023
Yue Zhang
Yafu Li
Leyang Cui
Deng Cai
Lemao Liu
...
Longyue Wang
Anh Tuan Luu
Freda Shi
Shuming Shi
LRM
RALM
HILM
654
793
0
03 Sep 2023
RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Model
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Fengxiang Bie
Jianlong Wu
Zhongzhu Zhou
Adam Ghanem
Minjia Zhang
...
Pareesa Ameneh Golnari
David A. Clifton
Yuxiong He
Dacheng Tao
Shuaiwen Leon Song
EGVM
245
50
0
02 Sep 2023
LM-Infinite: Zero-Shot Extreme Length Generalization for Large Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Chi Han
Qifan Wang
Yuan Yao
Wenhan Xiong
Yu Chen
Heng Ji
Sinong Wang
476
94
0
30 Aug 2023
Optimizing Factual Accuracy in Text Generation through Dynamic Knowledge Selection
Hongjin Qian
Zhicheng Dou
Jiejun Tan
Haonan Chen
Haoqi Gu
Ruofei Lai
Xinyu Zhang
Bo Zhao
Ji-Rong Wen
170
3
0
30 Aug 2023
MEMORY-VQ: Compression for Tractable Internet-Scale Memory
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Yury Zemlyanskiy
Michiel de Jong
Luke Vilnis
Santiago Ontañón
William W. Cohen
Sumit Sanghai
Joshua Ainslie
RALM
MQ
164
2
0
28 Aug 2023
LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yushi Bai
Xin Lv
Jiajie Zhang
Hong Lyu
Jiankai Tang
...
Aohan Zeng
Lei Hou
Yuxiao Dong
Jie Tang
Juanzi Li
LLMAG
RALM
287
891
0
28 Aug 2023
Spoken Language Intelligence of Large Language Models for Language Learning
Linkai Peng
Baorian Nuchged
Yingming Gao
ELM
265
5
0
28 Aug 2023
Generations of Knowledge Graphs: The Crazy Ideas and the Business Impact
Proceedings of the VLDB Endowment (PVLDB), 2023
Xin Luna Dong
172
28
0
27 Aug 2023
Learning to Intervene on Concept Bottlenecks
International Conference on Machine Learning (ICML), 2023
David Steinmann
Wolfgang Stammer
Felix Friedrich
Kristian Kersting
323
28
0
25 Aug 2023
Modeling Uncertainty and Using Post-fusion as Fallback Improves Retrieval Augmented Generation with LLMs
Ye Liu
Semih Yavuz
Rui Meng
Meghana Moorthy
Shafiq Joty
Caiming Xiong
Yingbo Zhou
RALM
187
1
0
24 Aug 2023
Towards CausalGPT: A Multi-Agent Approach for Faithful Knowledge Reasoning via Promoting Causal Consistency in LLMs
Ziyi Tang
Ruilin Wang
Weixing Chen
Keze Wang
Zehua Wang
Tianshui Chen
Liang Lin
LRM
212
12
0
23 Aug 2023
Exploring the Effectiveness of GPT Models in Test-Taking: A Case Study of the Driver's License Knowledge Test
Saba Rahimi
T. Balch
Manuela Veloso
ELM
154
2
0
22 Aug 2023
Anonymity at Risk? Assessing Re-Identification Capabilities of Large Language Models
Alex Nyffenegger
Matthias Sturmer
Joel Niklaus
180
10
0
22 Aug 2023
Head-to-Tail: How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs?
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Kai Sun
Yongjun Xu
Hanwen Zha
Yue Liu
Xinhsuai Dong
AI4MH
380
190
0
20 Aug 2023
Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zekun Li
Baolin Peng
Pengcheng He
Xifeng Yan
ELM
SILM
AAML
252
44
0
17 Aug 2023
End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Bolaji Yusuf
J. Černocký
Murat Saraclar
162
2
0
15 Aug 2023
RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models
Jie Huang
Ming-Yu Liu
Peng Xu
Mohammad Shoeybi
Kevin Chen-Chuan Chang
Bryan Catanzaro
RALM
228
44
0
15 Aug 2023
EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language Models
Peng Wang
Ningyu Zhang
Bo Tian
Zekun Xi
Yunzhi Yao
...
Shuyang Cheng
Kangwei Liu
Yuansheng Ni
Guozhou Zheng
Huajun Chen
KELM
217
79
0
14 Aug 2023
Large Language Models for Information Retrieval: A Survey
Yutao Zhu
Huaying Yuan
Shuting Wang
Jiongnan Liu
Wenhan Liu
Chenlong Deng
Haonan Chen
Zheng Liu
Zhicheng Dou
Ji-Rong Wen
KELM
589
442
0
14 Aug 2023
WeaverBird: Empowering Financial Decision-Making with Large Language Model, Knowledge Base, and Search Engine
Siqiao Xue
Fan Zhou
Y. Xu
Ming Jin
Qingsong Wen
...
Jun Zhou
Shuo Xie
D. Xiu
James Y. Zhang
Hongyuan Mei
RALM
AIFin
180
18
0
10 Aug 2023
SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore
International Conference on Learning Representations (ICLR), 2023
Sewon Min
Suchin Gururangan
Eric Wallace
Hannaneh Hajishirzi
Noah A. Smith
Luke Zettlemoyer
AILaw
260
85
0
08 Aug 2023
Hybrid Retrieval-Augmented Generation for Real-time Composition Assistance
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Menglin Xia
Xuchao Zhang
Camille Couturier
Guoqing Zheng
Saravan Rajmohan
Victor Rühle
RALM
132
4
0
08 Aug 2023
SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI Tool
Youyang Ng
Daisuke Miyashita
Yasuto Hoshi
Yasuhiro Morioka
Osamu Torii
Tomoya Kodama
J. Deguchi
RALM
117
10
0
08 Aug 2023
TPTU: Large Language Model-based AI Agents for Task Planning and Tool Usage
Jingqing Ruan
Yihong Chen
Bin Zhang
Zhiwei Xu
Tianpeng Bao
...
Shiwei Shi
Hangyu Mao
Ziyue Li
Xingyu Zeng
Rui Zhao
LLMAG
LM&Ro
289
50
0
07 Aug 2023
DeDrift: Robust Similarity Search under Content Drift
IEEE International Conference on Computer Vision (ICCV), 2023
Dmitry Baranchuk
Matthijs Douze
Yash Upadhyay
I. Z. Yalniz
149
13
0
05 Aug 2023
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
Cheng-Yu Hsieh
Sibei Chen
Chun-Liang Li
Yasuhisa Fujii
Alexander Ratner
Chen-Yu Lee
Ranjay Krishna
Tomas Pfister
LLMAG
SyDa
284
58
0
01 Aug 2023
HAGRID: A Human-LLM Collaborative Dataset for Generative Information-Seeking with Attribution
Ehsan Kamalloo
A. Jafari
Xinyu Crystina Zhang
Nandan Thakur
Jimmy J. Lin
230
61
0
31 Jul 2023
On the Trustworthiness Landscape of State-of-the-art Generative Models: A Survey and Outlook
International Journal of Computer Vision (IJCV), 2023
Mingyuan Fan
Chengyu Wang
Cen Chen
Yang Liu
Jun Huang
HILM
259
12
0
31 Jul 2023
TabR: Tabular Deep Learning Meets Nearest Neighbors in 2023
International Conference on Learning Representations (ICLR), 2023
Yu. V. Gorishniy
Ivan Rubachev
Nikolay Kartashev
Daniil Shlenskii
Akim Kotelnikov
Artem Babenko
OOD
LMTD
191
15
0
26 Jul 2023
Benchmarking and Analyzing Generative Data for Visual Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yue Liu
Haotian Liu
Liangyu Chen
Yong Jae Lee
Xuefei Liu
Yu Qiao
VLM
EGVM
195
5
0
25 Jul 2023
Towards green AI-based software systems: an architecture-centric approach (GAISSA)
EUROMICRO Conference on Software Engineering and Advanced Applications (EUROMICRO SEAA), 2023
Luís Cruz
Xavier Franch
Francisco Durán
132
9
0
19 Jul 2023
Information Retrieval Meets Large Language Models: A Strategic Report from Chinese IR Community
AI Open (AO), 2023
Jiaxin Mao
Ting Bai
Bo Zhao
Yi-Ju Chang
Jiawei Chen
...
Peng Zhang
Fan Zhang
Wei-na Zhang
Hao Fei
Xiaofei Zhu
166
82
0
19 Jul 2023
Thrust: Adaptively Propels Large Language Models with External Knowledge
Neural Information Processing Systems (NeurIPS), 2023
Xinran Zhao
Hongming Zhang
Xiaoman Pan
Wenlin Yao
Dong Yu
Jianshu Chen
KELM
380
5
0
19 Jul 2023
Learning to Retrieve In-Context Examples for Large Language Models
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Liang Wang
Nan Yang
Furu Wei
RALM
194
58
0
14 Jul 2023
Generating Benchmarks for Factuality Evaluation of Language Models
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Dor Muhlgay
Ori Ram
Inbal Magar
Yoav Levine
Nir Ratner
Yonatan Belinkov
Omri Abend
Kevin Leyton-Brown
Amnon Shashua
Y. Shoham
HILM
187
122
0
13 Jul 2023
Personalization for BERT-based Discriminative Speech Recognition Rescoring
Interspeech (Interspeech), 2023
J. Kolehmainen
Yile Gu
Aditya Gourav
Prashanth Gurunath Shivakumar
Ankur Gandhe
Ariya Rastrow
I. Bulyko
154
5
0
13 Jul 2023