K-QA: A Real-World Medical Q&A Benchmark

Workshop on Biomedical Natural Language Processing (BioNLP), 2024
25 January 2024
Itay Manes
Naama Ronn
David Cohen
Ran Ilan Ber
Zehavi Horowitz-Kugler
Gabriel Stanovsky
LM&MA, HILM, AI4MH

Papers citing "K-QA: A Real-World Medical Q&A Benchmark"

14 / 14 papers shown
LONGQAEVAL: Designing Reliable Evaluations of Long-Form Clinical QA under Resource Constraints
Federica Bologna
Tiffany Pan
Matthew Wilkens
Yue Guo
Lucy Lu Wang
ELM
8
0
0
12 Oct 2025
KnowMT-Bench: Benchmarking Knowledge-Intensive Long-Form Question Answering in Multi-Turn Dialogues
Junhao Chen
Yu Huang
Siyuan Li
Rui Yao
Hanqian Li
H. Zhang
Jungang Li
Jian Chen
Bowen Wang
Xuming Hu
ELM
52
1
0
26 Sep 2025
Geometric Uncertainty for Detecting and Correcting Hallucinations in LLMs
Edward Phillips
Sean Wu
Soheila Molaei
Danielle Belgrave
A. Thakur
David Clifton
HILM
68
0
0
17 Sep 2025
PerMedCQA: Benchmarking Large Language Models on Medical Consumer Question Answering in Persian Language
Naghmeh Jamali
Milad Mohammadi
Danial Baledi
Zahra Rezvani
Hesham Faili
LM&MA, ELM
134
0
0
23 May 2025
Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language Model
Mehrdad Ghassabi
Pedram Rostami
Hamidreza Baradaran Kashani
Amirhossein Poursina
Zahra Kazemi
Milad Tavakoli
LM&MA
319
1
0
21 May 2025
Large Language Models for Cancer Communication: Evaluating Linguistic Quality, Safety, and Accessibility in Generative AI
Agnik Saha
Victoria Churchill
Anny D. Rodriguez
Ugur Kursuncu
Muhammed Y. Idris
LM&MA, ELM
130
1
0
15 May 2025
3MDBench: Medical Multimodal Multi-agent Dialogue Benchmark
Ivan Sviridov
Amina Miftakhova
Artemiy Tereshchenko
Galina Zubkova
Pavel Blinov
Andrey Savchenko
LM&MA
201
3
0
26 Mar 2025
A Benchmark for Long-Form Medical Question Answering
Pedram Hosseini
Jessica M. Sin
Bing Ren
Bryceton G. Thomas
Elnaz Nouri
Ali Farahanchi
Saeed Hassanpour
ELM, LM&MA, AI4MH
141
14
0
14 Nov 2024
LLMs are not Zero-Shot Reasoners for Biomedical Information Extraction
Aishik Nagar
Viktor Schlegel
Thanh-Tung Nguyen
Hao Li
Yuping Wu
Kuluhan Binici
Stefan Winkler
LRM
284
7
0
22 Aug 2024
Putting People in LLMs' Shoes: Generating Better Answers via Question Rewriter
AAAI Conference on Artificial Intelligence (AAAI), 2024
Junhao Chen
Bowen Wang
Zhouqiang Jiang
Yuta Nakashima
162
3
0
20 Aug 2024
Large language model validity via enhanced conformal prediction methods
Neural Information Processing Systems (NeurIPS), 2024
John J. Cherian
Isaac Gibbs
Emmanuel J. Candès
129
57
0
14 Jun 2024
OLAPH: Improving Factuality in Biomedical Long-form Question Answering
Minbyul Jeong
Hyeon Hwang
Chanwoong Yoon
Taewhoo Lee
Jaewoo Kang
MedIm, HILM, LM&MA
226
16
0
21 May 2024
Small Language Models Learn Enhanced Reasoning Skills from Medical Textbooks
Hyunjae Kim
Hyeon Hwang
Jiwoo Lee
Sihyeon Park
Dain Kim
Taewhoo Lee
Chanwoong Yoon
Jiwoong Sohn
Donghee Choi
Jaewoo Kang
ELM, AI4MH, LRM
216
35
0
30 Mar 2024
Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions
Hanjie Chen
Zhouxiang Fang
Yash Singla
Mark Dredze
ELM, AI4MH
276
71
0
28 Feb 2024