Co-design Hardware and Algorithm for Vector Search

International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2023

Wenqi Jiang

Shigang Li

Yu Zhu

Johannes de Fine Licht

...

340

19 Jun 2023

Large Language Models are Fixated by Red Herrings: Exploring Creative Problem Solving and Einstellung Effect using the Only Connect Wall Dataset

Neural Information Processing Systems (NeurIPS), 2023

346

19 Jun 2023

RepoFusion: Training Code Models to Understand Your Repository

306

19 Jun 2023

GLIMMER: generalized late-interaction memory reranker

Sumit Sanghai

Joshua Ainslie

232

17 Jun 2023

Neural Priming for Sample-Efficient Adaptation

Neural Information Processing Systems (NeurIPS), 2023

486

16 Jun 2023

Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models

Yi Wang

Yu Qiao

Jiaming Song

MLLM

180

15 Jun 2023

Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories

IEEE International Conference on Computer Vision (ICCV), 2023

281

15 Jun 2023

Retrieval-Enhanced Contrastive Vision-Text Models

International Conference on Learning Representations (ICLR), 2023

292

12 Jun 2023

Augmenting Language Models with Long-Term Memory

Neural Information Processing Systems (NeurIPS), 2023

Xiaodong Liu

241

142

12 Jun 2023

PoET: A generative model of protein families as sequences-of-sequences

Neural Information Processing Systems (NeurIPS), 2023

Timothy F. Truong

Tristan Bepler

SLR

211

09 Jun 2023

Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding

Dian Yu

Laurent El Shafey

RALM AuLLM

204

08 Jun 2023

Information Flow Control in Machine Learning through Modular Model Architecture

USENIX Security Symposium (USENIX Security), 2023

198

05 Jun 2023

SelfEvolve: A Code Evolution Framework via Large Language Models

Shuyang Jiang

Yuhao Wang

Yu Wang

264

05 Jun 2023

Taught by the Internet, Exploring Bias in OpenAIs GPT3

Ali Ayaz

Aditya Nawalgaria

Ruilian Yin

117

04 Jun 2023

Retrieval-Enhanced Visual Prompt Learning for Few-shot Classification

Hao Chen

194

04 Jun 2023

AI Transparency in the Age of LLMs: A Human-Centered Research Roadmap

Q. V. Liao

J. Vaughan

318

222

02 Jun 2023

KL-Divergence Guided Temperature Sampling

192

02 Jun 2023

Faster Causal Attention Over Large Sequences Through Sparse Flash Attention

176

01 Jun 2023

Reimagining Retrieval Augmented Language Models for Answering Queries

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

308

01 Jun 2023

Vocabulary-free Image Classification

Neural Information Processing Systems (NeurIPS), 2023

462

01 Jun 2023

Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey

ACM Computing Surveys (ACM Comput. Surv.), 2023

...

Quanquan Gu

411

214

30 May 2023

Information Association for Language Model Updating by Mitigating LM-Logical Discrepancy

Conference on Computational Natural Language Learning (CoNLL), 2023

Pengfei Yu

Heng Ji

KELM

205

29 May 2023

Test-Time Training on Nearest Neighbors for Large Language Models

International Conference on Learning Representations (ICLR), 2023

Moritz Hardt

Yu Sun

VLM RALM

421

29 May 2023

Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive Tasks

Neural Information Processing Systems (NeurIPS), 2023

291

28 May 2023

Prompt-Guided Retrieval Augmentation for Non-Knowledge-Intensive Tasks

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Zhicheng Guo

Sijie Cheng

Yile Wang

Peng Li

Yang Liu

RALM

147

28 May 2023

Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Zhiyuan Liu

292

27 May 2023

On the Tool Manipulation Capability of Open-source Large Language Models

256

25 May 2023

Surface-Based Retrieval Reduces Perplexity of Retrieval-Augmented Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

190

25 May 2023

SAIL: Search-Augmented Instruction Learning

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

237

24 May 2023

Privacy Implications of Retrieval-Based Language Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

181

24 May 2023

Adapting Language Models to Compress Contexts

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Alexander Wettig

277

257

24 May 2023

Allies: Prompting Large Language Model with Beam Search

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

233

24 May 2023

Enabling Large Language Models to Generate Text with Citations

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

435

493

24 May 2023

KNN-LM Does Not Improve Open-ended Text Generation

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

197

24 May 2023

Think Before You Act: Decision Transformers with Working Memory

International Conference on Machine Learning (ICML), 2023

281

24 May 2023