ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1806.03822
  4. Cited By
Know What You Don't Know: Unanswerable Questions for SQuAD

Know What You Don't Know: Unanswerable Questions for SQuAD

11 June 2018
Pranav Rajpurkar
Robin Jia
Percy Liang
    RALM
    ELM
ArXivPDFHTML

Papers citing "Know What You Don't Know: Unanswerable Questions for SQuAD"

50 / 530 papers shown
Title
Contextual Breach: Assessing the Robustness of Transformer-based QA
  Models
Contextual Breach: Assessing the Robustness of Transformer-based QA Models
Asir Saadat
Nahian Ibn Asad
Md Farhan Ishmam
AAML
43
0
0
17 Sep 2024
KodeXv0.1: A Family of State-of-the-Art Financial Large Language Models
KodeXv0.1: A Family of State-of-the-Art Financial Large Language Models
Neel Rajani
Lilli Kiessling
Aleksandr Ogaltsov
Claus Lang
ALM
28
0
0
13 Sep 2024
SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic
  CheckLists
SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists
Raoyuan Zhao
Abdullatif Köksal
Yihong Liu
Leonie Weissweiler
Anna Korhonen
Hinrich Schütze
SyDa
36
1
0
30 Aug 2024
Concise Thoughts: Impact of Output Length on LLM Reasoning and Cost
Concise Thoughts: Impact of Output Length on LLM Reasoning and Cost
Sania Nayab
Giulio Rossolini
Giorgio Buttazzo
Nicolamaria Manes
F. Giacomelli
Nicolamaria Manes
Fabrizio Giacomelli
LRM
49
24
0
29 Jul 2024
NV-Retriever: Improving text embedding models with effective hard-negative mining
NV-Retriever: Improving text embedding models with effective hard-negative mining
G. D. S. P. Moreira
Radek Osmulski
Mengyao Xu
Ronay Ak
Benedikt D. Schifferer
Even Oldridge
RALM
49
31
0
22 Jul 2024
INDIC QA BENCHMARK: A Multilingual Benchmark to Evaluate Question Answering capability of LLMs for Indic Languages
INDIC QA BENCHMARK: A Multilingual Benchmark to Evaluate Question Answering capability of LLMs for Indic Languages
A. Singh
Rudra Murthy
Vishwajeet Kumar
Jaydeep Sen
Ashish Mittal
Ganesh Ramakrishnan
35
6
0
18 Jul 2024
BiGym: A Demo-Driven Mobile Bi-Manual Manipulation Benchmark
BiGym: A Demo-Driven Mobile Bi-Manual Manipulation Benchmark
Nikita Chernyadev
Nicholas Backshall
Xiao Ma
Yunfan Lu
Younggyo Seo
Stephen James
22
11
0
10 Jul 2024
Prompting Techniques for Secure Code Generation: A Systematic Investigation
Prompting Techniques for Secure Code Generation: A Systematic Investigation
Catherine Tony
Nicolás E. Díaz Ferreyra
Markus Mutas
Salem Dhiff
Riccardo Scandariato
SILM
76
9
0
09 Jul 2024
An Empirical Comparison of Vocabulary Expansion and Initialization
  Approaches for Language Models
An Empirical Comparison of Vocabulary Expansion and Initialization Approaches for Language Models
Nandini Mundra
Aditya Nanda Kishore
Raj Dabre
Ratish Puduppully
Anoop Kunchukuttan
Mitesh Khapra
30
3
0
08 Jul 2024
A Systematic Survey and Critical Review on Evaluating Large Language
  Models: Challenges, Limitations, and Recommendations
A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations
Md Tahmid Rahman Laskar
Sawsan Alqahtani
M Saiful Bari
Mizanur Rahman
Mohammad Abdullah Matin Khan
...
Chee Wei Tan
Md. Rizwan Parvez
Enamul Hoque
Shafiq R. Joty
Jimmy Huang
ELM
ALM
29
28
0
04 Jul 2024
LLM Internal States Reveal Hallucination Risk Faced With a Query
LLM Internal States Reveal Hallucination Risk Faced With a Query
Ziwei Ji
Delong Chen
Etsuko Ishii
Samuel Cahyawijaya
Yejin Bang
Bryan Wilie
Pascale Fung
HILM
LRM
39
20
0
03 Jul 2024
Preserving Multilingual Quality While Tuning Query Encoder on English Only
Preserving Multilingual Quality While Tuning Query Encoder on English Only
Oleg V. Vasilyev
Randy Sawaya
John Bohannon
35
1
0
01 Jul 2024
Paraphrase and Aggregate with Large Language Models for Minimizing
  Intent Classification Errors
Paraphrase and Aggregate with Large Language Models for Minimizing Intent Classification Errors
Vikas Yadav
Zheng Tang
Vijay Srinivasan
32
8
0
24 Jun 2024
Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing
  Backpropagation
Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation
Yuchen Yang
Yingdong Shi
Cheems Wang
Xiantong Zhen
Yuxuan Shi
Jun Xu
37
1
0
24 Jun 2024
Semantic Entropy Probes: Robust and Cheap Hallucination Detection in
  LLMs
Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs
Jannik Kossen
Jiatong Han
Muhammed Razzak
Lisa Schut
Shreshth A. Malik
Yarin Gal
HILM
58
34
0
22 Jun 2024
DeciMamba: Exploring the Length Extrapolation Potential of Mamba
DeciMamba: Exploring the Length Extrapolation Potential of Mamba
Assaf Ben-Kish
Itamar Zimerman
Shady Abu Hussein
Nadav Cohen
Amir Globerson
Lior Wolf
Raja Giryes
Mamba
77
13
0
20 Jun 2024
Datasets for Multilingual Answer Sentence Selection
Datasets for Multilingual Answer Sentence Selection
Matteo Gabburo
S. Campese
Federico Agostini
Alessandro Moschitti
46
0
0
14 Jun 2024
An Empirical Study of Mamba-based Language Models
An Empirical Study of Mamba-based Language Models
R. Waleffe
Wonmin Byeon
Duncan Riach
Brandon Norick
V. Korthikanti
...
Vartika Singh
Jared Casper
Jan Kautz
M. Shoeybi
Bryan Catanzaro
61
65
0
12 Jun 2024
Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL
Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL
Zijin Hong
Zheng Yuan
Qinggang Zhang
Hao Chen
Junnan Dong
Feiran Huang
Xiao Huang
69
50
0
12 Jun 2024
Paraphrasing in Affirmative Terms Improves Negation Understanding
Paraphrasing in Affirmative Terms Improves Negation Understanding
MohammadHossein Rezaei
Eduardo Blanco
42
1
0
11 Jun 2024
Symmetric Dot-Product Attention for Efficient Training of BERT Language
  Models
Symmetric Dot-Product Attention for Efficient Training of BERT Language Models
Martin Courtois
Malte Ostendorff
Leonhard Hennig
Georg Rehm
31
2
0
10 Jun 2024
Improved Out-of-Scope Intent Classification with Dual Encoding and
  Threshold-based Re-Classification
Improved Out-of-Scope Intent Classification with Dual Encoding and Threshold-based Re-Classification
Hossam Zawbaa
Wael Rashwan
Sourav Dutta
H. Assem
OODD
41
0
0
30 May 2024
Evaluating the External and Parametric Knowledge Fusion of Large
  Language Models
Evaluating the External and Parametric Knowledge Fusion of Large Language Models
Hao Zhang
Yuyang Zhang
Xiaoguang Li
Wenxuan Shi
Haonan Xu
...
Yasheng Wang
Lifeng Shang
Qun Liu
Yong-jin Liu
Ruiming Tang
KELM
38
4
0
29 May 2024
T-curator: a trust based curation tool for LOD logs
T-curator: a trust based curation tool for LOD logs
Dihia Lanasri
18
0
0
11 May 2024
Hire Me or Not? Examining Language Model's Behavior with Occupation Attributes
Hire Me or Not? Examining Language Model's Behavior with Occupation Attributes
Damin Zhang
Yi Zhang
Geetanjali Bihani
Julia Taylor Rayz
53
2
0
06 May 2024
Explainability for Transparent Conversational Information-Seeking
Explainability for Transparent Conversational Information-Seeking
Weronika Lajewska
Damiano Spina
Johanne Trippas
K. Balog
34
7
0
06 May 2024
Lifelong Knowledge Editing for LLMs with Retrieval-Augmented Continuous Prompt Learning
Lifelong Knowledge Editing for LLMs with Retrieval-Augmented Continuous Prompt Learning
Qizhou Chen
Taolin Zhang
Xiaofeng He
Dongyang Li
Chengyu Wang
Longtao Huang
Hui Xue
CLL
KELM
43
10
0
06 May 2024
Towards Unbiased Evaluation of Detecting Unanswerable Questions in
  EHRSQL
Towards Unbiased Evaluation of Detecting Unanswerable Questions in EHRSQL
Yongjin Yang
Sihyeon Kim
Sangmook Kim
Gyubok Lee
Se-Young Yun
Edward Choi
33
2
0
29 Apr 2024
From Matching to Generation: A Survey on Generative Information Retrieval
From Matching to Generation: A Survey on Generative Information Retrieval
Xiaoxi Li
Jiajie Jin
Yujia Zhou
Yuyao Zhang
Peitian Zhang
Yutao Zhu
Zhicheng Dou
3DV
78
46
0
23 Apr 2024
MergeNet: Knowledge Migration across Heterogeneous Models, Tasks, and
  Modalities
MergeNet: Knowledge Migration across Heterogeneous Models, Tasks, and Modalities
Kunxi Li
Tianyu Zhan
Kairui Fu
Shengyu Zhang
Kun Kuang
Jiwei Li
Zhou Zhao
Fei Wu
MoMe
24
0
0
20 Apr 2024
LoRA Dropout as a Sparsity Regularizer for Overfitting Control
LoRA Dropout as a Sparsity Regularizer for Overfitting Control
Yang Lin
Xinyu Ma
Xu Chu
Yujie Jin
Zhibang Yang
Yasha Wang
Hong-yan Mei
49
19
0
15 Apr 2024
Unveiling LLM Evaluation Focused on Metrics: Challenges and Solutions
Unveiling LLM Evaluation Focused on Metrics: Challenges and Solutions
Taojun Hu
Xiao-Hua Zhou
ELM
41
12
0
14 Apr 2024
Your Finetuned Large Language Model is Already a Powerful Out-of-distribution Detector
Your Finetuned Large Language Model is Already a Powerful Out-of-distribution Detector
Andi Zhang
Tim Z. Xiao
Weiyang Liu
Robert Bamler
Damon J. Wischik
OODD
46
4
0
07 Apr 2024
CMAT: A Multi-Agent Collaboration Tuning Framework for Enhancing Small Language Models
CMAT: A Multi-Agent Collaboration Tuning Framework for Enhancing Small Language Models
Xuechen Liang
Meiling Tao
Yinghui Xia
Yiting Xie
Jun Wang
JingSong Yang
LLMAG
33
12
0
02 Apr 2024
ChroniclingAmericaQA: A Large-scale Question Answering Dataset based on
  Historical American Newspaper Pages
ChroniclingAmericaQA: A Large-scale Question Answering Dataset based on Historical American Newspaper Pages
Bhawna Piryani
Jamshid Mozafari
Adam Jatowt
RALM
30
8
0
26 Mar 2024
Universal Model in Online Customer Service
Universal Model in Online Customer Service
S. Pi
Cheng-Ping Hsieh
Qun Liu
Yuying Zhu
25
4
0
24 Feb 2024
ConceptMath: A Bilingual Concept-wise Benchmark for Measuring
  Mathematical Reasoning of Large Language Models
ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models
Yanan Wu
Jie Liu
Xingyuan Bu
Jiaheng Liu
Zhanhui Zhou
...
Haibin Chen
Tiezheng Ge
Wanli Ouyang
Wenbo Su
Bo Zheng
LRM
29
6
0
22 Feb 2024
Novi jezički modeli za srpski jezik
Novi jezički modeli za srpski jezik
Mihailo vSkorić
15
0
0
22 Feb 2024
Qsnail: A Questionnaire Dataset for Sequential Question Generation
Qsnail: A Questionnaire Dataset for Sequential Question Generation
Yan Lei
Liang Pang
Yuanzhuo Wang
Huawei Shen
Xueqi Cheng
27
0
0
22 Feb 2024
$Se^2$: Sequential Example Selection for In-Context Learning
Se2Se^2Se2: Sequential Example Selection for In-Context Learning
Haoyu Liu
Jianfeng Liu
Shaohan Huang
Yuefeng Zhan
Hao Sun
Weiwei Deng
Furu Wei
Qi Zhang
33
3
0
21 Feb 2024
Contrastive Instruction Tuning
Contrastive Instruction Tuning
Tianyi Yan
Fei Wang
James Y. Huang
Wenxuan Zhou
Fan Yin
Aram Galstyan
Wenpeng Yin
Muhao Chen
ALM
23
5
0
17 Feb 2024
A Dataset of Open-Domain Question Answering with Multiple-Span Answers
A Dataset of Open-Domain Question Answering with Multiple-Span Answers
Zhiyi Luo
Yingying Zhang
Shuyun Luo
Ying Zhao
Wentao Lyu
RALM
16
0
0
15 Feb 2024
How the Advent of Ubiquitous Large Language Models both Stymie and
  Turbocharge Dynamic Adversarial Question Generation
How the Advent of Ubiquitous Large Language Models both Stymie and Turbocharge Dynamic Adversarial Question Generation
Yoo Yeon Sung
Ishani Mondal
Jordan L. Boyd-Graber
30
0
0
20 Jan 2024
Power in Numbers: Robust reading comprehension by finetuning with four
  adversarial sentences per example
Power in Numbers: Robust reading comprehension by finetuning with four adversarial sentences per example
Ariel Marcus
AAML
17
0
0
18 Jan 2024
Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding
Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding
Yuu Jinnai
Ukyo Honda
Tetsuro Morimura
Peinan Zhang
31
6
0
10 Jan 2024
AQUALLM: Audio Question Answering Data Generation Using Large Language
  Models
AQUALLM: Audio Question Answering Data Generation Using Large Language Models
Swarup Ranjan Behera
Krishna Mohan Injeti
Jaya Sai Kiran Patibandla
P. Pokala
Pailla Balakrishna Reddy
AuLLM
13
4
0
28 Dec 2023
DSFormer: Effective Compression of Text-Transformers by Dense-Sparse
  Weight Factorization
DSFormer: Effective Compression of Text-Transformers by Dense-Sparse Weight Factorization
Rahul Chand
Yashoteja Prabhu
Pratyush Kumar
20
3
0
20 Dec 2023
GSQA: An End-to-End Model for Generative Spoken Question Answering
GSQA: An End-to-End Model for Generative Spoken Question Answering
Min-Han Shih
Ho-Lam Chung
Yu-Chi Pai
Ming-Hao Hsu
Guan-Ting Lin
Shang-Wen Li
Hung-yi Lee
ELM
AuLLM
33
2
0
15 Dec 2023
Evaluating ChatGPT as a Question Answering System: A Comprehensive
  Analysis and Comparison with Existing Models
Evaluating ChatGPT as a Question Answering System: A Comprehensive Analysis and Comparison with Existing Models
Hossein Bahak
Farzaneh Taheri
Zahra Zojaji
Arefeh Kazemi
ELM
AI4MH
34
17
0
11 Dec 2023
PCoQA: Persian Conversational Question Answering Dataset
PCoQA: Persian Conversational Question Answering Dataset
Hamed Hematian Hemati
Atousa Toghyani
Atena Souri
Sayed Hesam Alavian
Hossein Sameti
Hamid Beigy
22
3
0
07 Dec 2023
Previous
12345...91011
Next