1606.05250
SQuAD: 100,000+ Questions for Machine Comprehension of Text
16 June 2016
Pranav Rajpurkar
Jian Zhang
Konstantin Lopyrev
Percy Liang
RALM
Papers citing
"SQuAD: 100,000+ Questions for Machine Comprehension of Text"
50 / 141 papers shown
Title
GainRAG: Preference Alignment in Retrieval-Augmented Generation through Gain Signal Synthesis
Yi Jiang
Sendong Zhao
Jianbo Li
Haochun Wang
Bing Qin
RALM
98
0
0
24 May 2025
KerZOO: Kernel Function Informed Zeroth-Order Optimization for Accurate and Accelerated LLM Fine-Tuning
Zhendong Mi
Qitao Tan
Xiaodong Yu
Zining Zhu
Geng Yuan
Shaoyi Huang
120
0
0
24 May 2025
Social Bias in Popular Question-Answering Benchmarks
Angelie Kraft
Judith Simon
Sonja Schimmler
59
0
0
21 May 2025
Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors
Hao Fang
Jiawei Kong
Tianqu Zhuang
Yixiang Qiu
Kuofeng Gao
Bin Chen
Shu-Tao Xia
Yaowei Wang
Min Zhang
43
0
0
21 May 2025
YESciEval: Robust LLM-as-a-Judge for Scientific Question Answering
Jennifer D'Souza
Hamed Babaei Giglou
Quentin Münch
ELM
60
0
0
20 May 2025
MorphMark: Flexible Adaptive Watermarking for Large Language Models
Zongqi Wang
Tianle Gu
Baoyuan Wu
Yujiu Yang
WaLM
78
0
0
14 May 2025
A Split-then-Join Approach to Abstractive Summarization for Very Long Documents in a Low Resource Setting
Lhuqita Fazry
VLM
130
0
0
11 May 2025
Reliably Bounding False Positives: A Zero-Shot Machine-Generated Text Detection Framework via Multiscaled Conformal Prediction
Xiaowei Zhu
Yubing Ren
Yanan Cao
Xixun Lin
Fang Fang
Yangxi Li
110
0
0
08 May 2025
Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation
D. Sculley
Will Cukierski
Phil Culliton
Sohier Dane
Maggie Demkin
...
Addison Howard
Paul Mooney
Walter Reade
Megan Risdal
Nate Keating
59
1
0
01 May 2025
ViQA-COVID: COVID-19 Machine Reading Comprehension Dataset for Vietnamese
H. Phung
Ngoc C. Lê
Van-Chien Nguyen
Hang Thi Nguyen
Thuy Phuong Thi Nguyen
143
1
0
21 Apr 2025
aiXamine: Simplified LLM Safety and Security
Fatih Deniz
Dorde Popovic
Yazan Boshmaf
Euisuh Jeong
M. Ahmad
Sanjay Chawla
Issa M. Khalil
ELM
184
0
0
21 Apr 2025
LLM-as-a-Judge: Reassessing the Performance of LLMs in Extractive QA
Xanh Ho
Jiahao Huang
Florian Boudin
Akiko Aizawa
ELM
63
0
0
16 Apr 2025
Confidence Regularized Masked Language Modeling using Text Length
Seunghyun Ji
Soowon Lee
122
0
0
08 Apr 2025
VideoComp: Advancing Fine-Grained Compositional and Temporal Alignment in Video-Text Models
Dahun Kim
A. Piergiovanni
Ganesh Mallya
A. Angelova
CoGe
86
0
0
04 Apr 2025
Large (Vision) Language Models are Unsupervised In-Context Learners
Artyom Gadetsky
Andrei Atanov
Yulun Jiang
Zhitong Gao
Ghazal Hosseini Mighan
Amir Zamir
Maria Brbić
VLM
MLLM
LRM
168
0
0
03 Apr 2025
On the Consistency of Multilingual Context Utilization in Retrieval-Augmented Generation
Jirui Qi
Raquel Fernández
Arianna Bisazza
RALM
91
0
0
01 Apr 2025
The Challenge of Achieving Attributability in Multilingual Table-to-Text Generation with Question-Answer Blueprints
Aden Haussmann
LMTD
116
0
0
29 Mar 2025
Uncertainty Distillation: Teaching Language Models to Express Semantic Confidence
Sophia Hager
David Mueller
Kevin Duh
Nicholas Andrews
89
1
0
18 Mar 2025
SuperBPE: Space Travel for Language Models
Alisa Liu
J. Hayase
Valentin Hofmann
Sewoong Oh
Noah A. Smith
Yejin Choi
81
6
0
17 Mar 2025
SePer: Measure Retrieval Utility Through The Lens Of Semantic Perplexity Reduction
Lu Dai
Yijie Xu
Jinhui Ye
Hao Liu
Hui Xiong
3DV
RALM
134
2
0
03 Mar 2025
Sanity Checking Causal Representation Learning on a Simple Real-World System
Juan L. Gamella
Simon Bing
Jakob Runge
CML
124
1
0
27 Feb 2025
Trustworthy Answers, Messier Data: Bridging the Gap in Low-Resource Retrieval-Augmented Generation for Domain Expert Systems
Nayoung Choi
Grace Byun
Andrew Chung
Ellie S. Paek
S. Lee
Jinho D. Choi
RALM
142
1
0
26 Feb 2025
Predicting Through Generation: Why Generation Is Better for Prediction
Md. Kowsher
Nusrat Jahan Prottasha
Prakash Bhat
Chun-Nam Yu
Mojtaba Soltanalian
Ivan Garibay
O. Garibay
Chen Chen
Niloofar Yousefi
AI4TS
140
0
0
25 Feb 2025
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment
Chenghao Fan
Zhenyi Lu
Sichen Liu
Xiaoye Qu
Wei Wei
Yu Cheng
MoE
409
0
0
24 Feb 2025
MultiOCR-QA: Dataset for Evaluating Robustness of LLMs in Question Answering on Multilingual OCR Texts
Bhawna Piryani
Jamshid Mozafari
Abdelrahman Abdallah
Antoine Doucet
Adam Jatowt
49
1
0
24 Feb 2025
Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home
Viktor Moskvoretskii
M. Lysyuk
Mikhail Salnikov
Nikolay Ivanov
Sergey Pletenev
Daria Galimzianova
Nikita Krayko
Vasily Konovalov
Irina Nikishina
Alexander Panchenko
RALM
115
6
0
24 Feb 2025
Worse than Zero-shot? A Fact-Checking Dataset for Evaluating the Robustness of RAG Against Misleading Retrievals
Linda Zeng
Rithwik Gupta
Divij Motwani
Diji Yang
Yi Zhang
AAML
99
2
0
22 Feb 2025
Wrong Answers Can Also Be Useful: PlausibleQA -- A Large-Scale QA Dataset with Answer Plausibility Scores
Jamshid Mozafari
Abdelrahman Abdallah
Bhawna Piryani
Adam Jatowt
64
0
0
22 Feb 2025
Large Language Model Confidence Estimation via Black-Box Access
Tejaswini Pedapati
Amit Dhurandhar
Soumya Ghosh
Soham Dan
P. Sattigeri
148
5
0
21 Feb 2025
Question Answering with Texts and Tables through Deep Reinforcement Learning
M. M. José
Flávio Nakasato Cação
Maria F. Ribeiro
Rafael M. Cheang
Paulo Pirozelli
Fabio Gagliardi Cozman
LMTD
RALM
202
0
0
21 Feb 2025
Scalable Model Merging with Progressive Layer-wise Distillation
Jing Xu
Jiazheng Li
J.N. Zhang
MoMe
FedML
210
2
0
18 Feb 2025
Savaal: Scalable Concept-Driven Question Generation to Enhance Human Learning
Kimia Noorbakhsh
Joseph Chandler
Pantea Karimi
M. Alizadeh
H. Balakrishnan
LRM
77
1
0
18 Feb 2025
MassSpecGym: A benchmark for the discovery and identification of molecules
Roman Bushuiev
Anton Bushuiev
Niek F. de Jonge
A. Young
Fleming Kretschmer
...
Justin J. J. van der Hooft
Michael A. Stravs
Sebastian Böcker
Josef Sivic
Tomáš Pluskal
67
4
0
17 Feb 2025
QuOTE: Question-Oriented Text Embeddings
Andrew Neeser
Kaylen Latimer
Aadyant Khatri
Chris Latimer
Naren Ramakrishnan
RALM
63
0
0
16 Feb 2025
LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation
Zican Dong
Junyi Li
Jinhao Jiang
Mingyu Xu
Wayne Xin Zhao
Bin Wang
Xin Wu
VLM
268
4
0
11 Feb 2025
MultiQ&A: An Analysis in Measuring Robustness via Automated Crowdsourcing of Question Perturbations and Answers
Nicole Cho
William Watson
AAML
HILM
212
0
0
06 Feb 2025
The Cake that is Intelligence and Who Gets to Bake it: An AI Analogy and its Implications for Participation
Martin Mundt
Anaelia Ovalle
Felix Friedrich
A Pranav
Subarnaduti Paul
Manuel Brack
Kristian Kersting
William Agnew
519
0
0
05 Feb 2025
SecPE: Secure Prompt Ensembling for Private and Robust Large Language Models
Jiawen Zhang
Kejia Chen
Zunlei Feng
Jian Lou
Mingli Song
Qingbin Liu
Xiaoyu Yang
AAML
SILM
FedML
84
1
0
02 Feb 2025
Multilingual State Space Models for Structured Question Answering in Indic Languages
A. Vats
Rahul Raja
Mrinal Mathur
Vinija Jain
Aman Chadha
119
1
0
01 Feb 2025
Adversarial Attacks on AI-Generated Text Detection Models: A Token Probability-Based Approach Using Embeddings
Ahmed K. Kadhim
Lei Jiao
Rishad Shafik
Ole-Christoffer Granmo
DeLMO
128
0
0
31 Jan 2025
A linguistically-motivated evaluation methodology for unraveling model's abilities in reading comprehension tasks
Elie Antoine
Frédéric Béchet
Géraldine Damnati
Philippe Langlais
103
1
0
29 Jan 2025
SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains
Ran Xu
Hui Liu
Sreyashi Nag
Zhenwei Dai
Yaochen Xie
...
Chen Luo
Yang Li
Joyce C. Ho
Carl Yang
Qi He
RALM
131
11
0
28 Jan 2025
Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs
Nicolas Boizard
Kevin El Haddad
Céline Hudelot
Pierre Colombo
104
16
0
28 Jan 2025
SLoPe: Double-Pruned Sparse Plus Lazy Low-Rank Adapter Pretraining of LLMs
Mohammad Mozaffari
Amir Yazdanbakhsh
Zhao Zhang
M. Dehnavi
113
6
0
28 Jan 2025
Chain-of-Retrieval Augmented Generation
Liang Wang
Haonan Chen
Nan Yang
Xiaolong Huang
Zhicheng Dou
Furu Wei
RALM
LRM
ReLM
3DV
106
7
0
24 Jan 2025
TrueReason: An Exemplar Personalised Learning System Integrating Reasoning with Foundational Models
Sahan Bulathwela
Daniel Van Niekerk
Jarrod Shipton
Maria Perez-Ortiz
Benjamin Rosman
John Shawe-Taylor
LRM
93
0
0
23 Jan 2025
YouLeQD: Decoding the Cognitive Complexity of Questions and Engagement in Online Educational Videos from Learners' Perspectives
Nong Ming
Sachin Sharma
Jiho Noh
AI4Ed
72
0
0
20 Jan 2025
Can AI-Generated Text be Reliably Detected?
Vinu Sankar Sadasivan
Aounon Kumar
S. Balasubramanian
Wenxiao Wang
Soheil Feizi
DeLMO
156
381
0
20 Jan 2025
Explain-Query-Test: Self-Evaluating LLMs Via Explanation and Comprehension Discrepancy
Saeid Asgari Taghanaki
Joao Monteiro
ELM
LRM
66
2
0
20 Jan 2025
AIMA at SemEval-2024 Task 3: Simple Yet Powerful Emotion Cause Pair Analysis
Alireza Ghahramani Kure
Mahshid Dehghani
Mohammad Mahdi Abootorabi
Nona Ghazizadeh
Seyed Arshan Dalili
Ehsaneddin Asgari
73
1
0
19 Jan 2025