Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval

AAAI Conference on Artificial Intelligence (AAAI), 2024

20 August 2024

Papers citing "Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval"

From Ranking to Selection: A Simple but Efficient Dynamic Passage Selector for Retrieval Augmented Generation

221

13 Aug 2025

Distributionally Robust Optimization with Adversarial Data Contamination

Shuyao Li

Ilias Diakonikolas

Jelena Diakonikolas

325

14 Jul 2025

LightRetriever: A LLM-based Text Retrieval Architecture with Extremely Faster Query Inference

531

18 May 2025

Don't Retrieve, Generate: Prompting LLMs for Synthetic Training Data in Dense Retrieval

Aarush Sinha

RALM

385

20 Apr 2025

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

Xu Han

...

Zhiyuan Liu

Maosong Sun

MoE

658

646

09 Apr 2024

Gemma: Open Models Based on Gemini Research and Technology

Gemma Team

Thomas Mesnard

...

731

969

13 Mar 2024

Multilingual E5 Text Embeddings: A Technical Report

Liang Wang

303

389

08 Feb 2024

M3-Embedding: Multi-Linguality, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

Jianlv Chen

Shitao Xiao

Peitian Zhang

Kun Luo

Defu Lian

Zheng Liu

1.2K

893

05 Feb 2024

Drop your Decoder: Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval

Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2024

399

20 Jan 2024

FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning

International Conference on Learning Representations (ICLR), 2023

Tri Dao

LRM

614

2,426

17 Jul 2023

DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining

Neural Information Processing Systems (NeurIPS), 2023

730

320

17 May 2023

LLaMA: Open and Efficient Foundation Language Models

...

20.1K

19,109

27 Feb 2023

ConTextual Masked Auto-Encoder for Dense Passage Retrieval

AAAI Conference on Artificial Intelligence (AAAI), 2022

487

16 Aug 2022

Training language models to follow instructions with human feedback

Neural Information Processing Systems (NeurIPS), 2022

Carroll L. Wainwright

...

2.3K

19,487

04 Mar 2022

Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models

Jianmo Ni

Gustavo Hernández Ábrego

671

763

19 Aug 2021

Mr. TyDi: A Multi-lingual Benchmark for Dense Retrieval

356

157

19 Aug 2021

Unsupervised Corpus Aware Language Model Pre-training for Dense Passage Retrieval

Annual Meeting of the Association for Computational Linguistics (ACL), 2021

Luyu Gao

Jamie Callan

RALM

764

383

12 Aug 2021

SimCSE: Simple Contrastive Learning of Sentence Embeddings

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

1.1K

4,272

18 Apr 2021

GooAQ: Open Question Answering with Diverse Answer Types

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Daniel Khashabi

291

18 Apr 2021

BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models

1.6K

1,542

17 Apr 2021

Scaling Deep Contrastive Learning Batch Size under Memory Limited Setup

Workshop on Representation Learning for NLP (RepL4NLP), 2021

319

147

18 Jan 2021

Dense Passage Retrieval for Open-Domain Question Answering

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020

Patrick Lewis

Sergey Edunov

860

5,400

10 Apr 2020

Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-Case Generalization

Shiori Sagawa

Pang Wei Koh

Tatsunori B. Hashimoto

Abigail Z. Jacobs

OOD

422

1,531

20 Nov 2019

CodeSearchNet Challenge: Evaluating the State of Semantic Code Search

545

1,338

20 Sep 2019

ELI5: Long Form Question Answering

Annual Meeting of the Association for Computational Linguistics (ACL), 2019

Angela Fan

Jason Weston

573

771

22 Jul 2019

HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018

Christopher D. Manning

RALM

1.0K

4,056

25 Sep 2018

Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization

715

1,971

27 Aug 2018

Representation Learning with Contrastive Predictive Coding

2.0K

12,894

10 Jul 2018

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension

Luke Zettlemoyer

2.8K

3,599

09 May 2017

A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference

Adina Williams

Nikita Nangia

Samuel R. Bowman

1.5K

4,948

18 Apr 2017

SearchQA: A New Q&A Dataset Augmented with Context from a Search Engine

508

479

18 Apr 2017

Get To The Point: Summarization with Pointer-Generator Networks

A. See

Peter J. Liu

Christopher D. Manning

3DPC

1.1K

4,360

14 Apr 2017

SQuAD: 100,000+ Questions for Machine Comprehension of Text

878

9,183

16 Jun 2016

Training Deep Nets with Sublinear Memory Cost

683

1,412

21 Apr 2016

A large annotated corpus for learning natural language inference

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2015

Samuel R. Bowman

Gabor Angeli

Christopher Potts

Christopher D. Manning

1.0K

4,621

21 Aug 2015