Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval

AAAI Conference on Artificial Intelligence (AAAI), 2024
20 August 2024
Guangyuan Ma
Yongliang Ma
Xing Wu
Zhenpeng Su
Ming Zhou
Songlin Hu
    OOD

Papers citing "Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval"

35 / 35 papers shown
From Ranking to Selection: A Simple but Efficient Dynamic Passage Selector for Retrieval Augmented Generation
Siyuan Meng
Junming Liu
Yirong Chen
Song Mao
Pinlong Cai
Guohang Yan
Botian Shi
221
3
0
13 Aug 2025
Distributionally Robust Optimization with Adversarial Data Contamination
Shuyao Li
Ilias Diakonikolas
Jelena Diakonikolas
325
2
0
14 Jul 2025
LightRetriever: A LLM-based Text Retrieval Architecture with Extremely Faster Query Inference
Guangyuan Ma
Yongliang Ma
Xuanrui Gou
Zhenpeng Su
Ming Zhou
Songlin Hu
RALM
531
1
0
18 May 2025
Don't Retrieve, Generate: Prompting LLMs for Synthetic Training Data in Dense Retrieval
Aarush Sinha
RALM
385
3
0
20 Apr 2025
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Shengding Hu
Yuge Tu
Xu Han
Chaoqun He
Ganqu Cui
...
Chaochao Jia
Guoyang Zeng
Dahai Li
Zhiyuan Liu
Maosong Sun
MoE
658
646
0
09 Apr 2024
Gemma: Open Models Based on Gemini Research and Technology
Gemma Team
Thomas Mesnard
Cassidy Hardin
Robert Dadashi
Surya Bhupatiraju
...
Armand Joulin
Noah Fiedel
Evan Senter
Alek Andreev
Kathleen Kenealy
VLM, LLMAG
731
969
0
13 Mar 2024
Multilingual E5 Text Embeddings: A Technical Report
Liang Wang
Nan Yang
Xiaolong Huang
Linjun Yang
Rangan Majumder
Furu Wei
303
389
0
08 Feb 2024
M3-Embedding: Multi-Linguality, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Jianlv Chen
Shitao Xiao
Peitian Zhang
Kun Luo
Defu Lian
Zheng Liu
1.2K
893
0
05 Feb 2024
Drop your Decoder: Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2024
Guangyuan Ma
Xing Wu
Zijia Lin
Songlin Hu
399
8
0
20 Jan 2024
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
International Conference on Learning Representations (ICLR), 2023
Tri Dao
LRM
614
2,426
0
17 Jul 2023
DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining
Neural Information Processing Systems (NeurIPS), 2023
Sang Michael Xie
Hieu H. Pham
Xuanyi Dong
Nan Du
Hanxiao Liu
Yifeng Lu
Percy Liang
Quoc V. Le
Tengyu Ma
Adams Wei Yu
MoMe, MoE
730
320
0
17 May 2023
LLaMA: Open and Efficient Foundation Language Models
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
...
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM, PILM
20.1K
19,109
0
27 Feb 2023
ConTextual Masked Auto-Encoder for Dense Passage Retrieval
AAAI Conference on Artificial Intelligence (AAAI), 2022
Xing Wu
Guangyuan Ma
Meng Lin
Zijia Lin
Zhongyuan Wang
Songlin Hu
RALM
487
32
0
16 Aug 2022
Training language models to follow instructions with human feedback
Neural Information Processing Systems (NeurIPS), 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM, ALM
2.3K
19,487
0
04 Mar 2022
Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models
Jianmo Ni
Gustavo Hernández Ábrego
Noah Constant
Ji Ma
Keith B. Hall
Daniel Cer
Yinfei Yang
671
763
0
19 Aug 2021
Mr. TyDi: A Multi-lingual Benchmark for Dense Retrieval
Xinyu Crystina Zhang
Xueguang Ma
Peng Shi
Jimmy J. Lin
356
157
0
19 Aug 2021
Unsupervised Corpus Aware Language Model Pre-training for Dense Passage Retrieval
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Luyu Gao
Jamie Callan
RALM
764
383
0
12 Aug 2021
SimCSE: Simple Contrastive Learning of Sentence Embeddings
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Tianyu Gao
Xingcheng Yao
Danqi Chen
AILaw, SSL
1.1K
4,272
0
18 Apr 2021
GooAQ: Open Question Answering with Diverse Answer Types
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Daniel Khashabi
Amos Ng
Tushar Khot
Ashish Sabharwal
Hannaneh Hajishirzi
Chris Callison-Burch
291
67
0
18 Apr 2021
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models
Nandan Thakur
Nils Reimers
Andreas Rucklé
Abhishek Srivastava
Iryna Gurevych
VLM
1.6K
1,542
0
17 Apr 2021
Scaling Deep Contrastive Learning Batch Size under Memory Limited Setup
Workshop on Representation Learning for NLP (RepL4NLP), 2021
Luyu Gao
Yunyi Zhang
Jiawei Han
Jamie Callan
319
147
0
18 Jan 2021
Dense Passage Retrieval for Open-Domain Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Vladimir Karpukhin
Barlas Oğuz
Sewon Min
Patrick Lewis
Ledell Yu Wu
Sergey Edunov
Danqi Chen
Anuj Kumar
RALM
860
5,400
0
10 Apr 2020
Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-Case Generalization
Shiori Sagawa
Pang Wei Koh
Tatsunori B. Hashimoto
Percy Liang
OOD
422
1,531
0
20 Nov 2019
CodeSearchNet Challenge: Evaluating the State of Semantic Code Search
Hamel Husain
Ho-Hsiang Wu
Tiferet Gazit
Miltiadis Allamanis
Marc Brockschmidt
ELM
545
1,338
0
20 Sep 2019
ELI5: Long Form Question Answering
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Angela Fan
Yacine Jernite
Ethan Perez
David Grangier
Jason Weston
Michael Auli
AI4MH, ELM
573
771
0
22 Jul 2019
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018
Zhilin Yang
Peng Qi
Saizheng Zhang
Yoshua Bengio
William W. Cohen
Ruslan Salakhutdinov
Christopher D. Manning
RALM
1.0K
4,056
0
25 Sep 2018
Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization
Shashi Narayan
Shay B. Cohen
Mirella Lapata
AILaw
715
1,971
0
27 Aug 2018
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRL, SSL
2.0K
12,894
0
10 Jul 2018
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
RALM
2.8K
3,599
0
09 May 2017
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
Adina Williams
Nikita Nangia
Samuel R. Bowman
1.5K
4,948
0
18 Apr 2017
SearchQA: A New Q&A Dataset Augmented with Context from a Search Engine
Matthew Dunn
Levent Sagun
Mike Higgins
V. U. Güney
Volkan Cirik
Dong Wang
RALM
508
479
0
18 Apr 2017
Get To The Point: Summarization with Pointer-Generator Networks
Abigail See
Peter J. Liu
Christopher D. Manning
3DPC
1.1K
4,360
0
14 Apr 2017
SQuAD: 100,000+ Questions for Machine Comprehension of Text
Pranav Rajpurkar
Jian Zhang
Konstantin Lopyrev
Percy Liang
RALM
878
9,183
0
16 Jun 2016
Training Deep Nets with Sublinear Memory Cost
Tianqi Chen
Bing Xu
Chiyuan Zhang
Carlos Guestrin
683
1,412
0
21 Apr 2016
A large annotated corpus for learning natural language inference
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2015
Samuel R. Bowman
Gabor Angeli
Christopher Potts
Christopher D. Manning
1.0K
4,621
0
21 Aug 2015