2104.06486
Cited By
MS2: Multi-Document Summarization of Medical Studies
13 April 2021
Jay DeYoung
Iz Beltagy
Madeleine van Zuylen
Bailey Kuehl
Lucy Lu Wang
Github (67★)
Papers citing
"MS2: Multi-Document Summarization of Medical Studies"
50 / 72 papers shown
Paper2Video: Automatic Video Generation from Scientific Papers
Zeyu Zhu
Kevin Qinghong Lin
Mike Zheng Shou
VGen
8
0
0
06 Oct 2025
MSRS: Evaluating Multi-Source Retrieval-Augmented Generation
Rohan Phanse
Yijie Zhou
Kejian Shi
Wencai Zhang
Yixin Liu
Yilun Zhao
Arman Cohan
RALM
28
0
0
28 Aug 2025
Benchmarking GPT-5 for biomedical natural language processing
Yu Hou
Zaifu Zhan
Rui Zhang
LM&MA
AI4MH
ELM
32
1
0
28 Aug 2025
SurveyGen: Quality-Aware Scientific Survey Generation with Large Language Models
Tong Bao
Mir Tafseer Nayeem
Davood Rafiei
Chengzhi Zhang
LM&MA
17
0
0
25 Aug 2025
PediatricsMQA: a Multi-modal Pediatrics Question Answering Benchmark
Adil Bahaj
Oumaima Fadi
Mohamed Chetouani
Mounir Ghogho
LM&MA
60
0
0
22 Aug 2025
RAG for Geoscience: What We Expect, Gaps and Opportunities
Runlong Yu
Shiyuan Luo
Rahul Ghosh
Jinkui Chi
Yiqun Xie
Xiaowei Jia
32
1
0
15 Aug 2025
When AIs Judge AIs: The Rise of Agent-as-a-Judge Evaluation for LLMs
Fangyi Yu
ELM
55
1
0
05 Aug 2025
Multi-Agent-as-Judge: Aligning LLM-Agent-Based Automated Evaluation with Multi-Dimensional Human Evaluation
Jiaju Chen
Yuxuan Lu
Xiaojie Wang
Huimin Zeng
Jing Huang
Jiri Gesi
Ying Xu
Bingsheng Yao
Dakuo Wang
LLMAG
ELM
61
2
0
28 Jul 2025
Evolutionary Perspectives on the Evaluation of LLM-Based AI Agents: A Comprehensive Survey
Jiachen Zhu
Menghui Zhu
Renting Rui
Rong Shan
Congmin Zheng
...
Jianghao Lin
Weiwen Liu
Ruiming Tang
Yong Yu
Weinan Zhang
LLMAG
ELM
138
0
0
06 Jun 2025
From Chat Logs to Collective Insights: Aggregative Question Answering
Wentao Zhang
Woojeong Kim
Yuntian Deng
LMTD
106
0
0
29 May 2025
Natural Language Processing in Support of Evidence-based Medicine: A Scoping Review
Zihan Xu
Haotian Ma
Gongbo Zhang
Yihao Ding
Chunhua Weng
Yifan Peng
75
0
0
28 May 2025
Multimodal Large Language Models for Medicine: A Comprehensive Survey
Jiarui Ye
Hao Tang
LM&MA
309
6
0
29 Apr 2025
Can LLMs Generate Tabular Summaries of Science Papers? Rethinking the Evaluation Protocol
Weiqi Wang
Jiefu Ou
Yangqiu Song
Benjamin Van Durme
Daniel Khashabi
LMTD
203
5
0
14 Apr 2025
Survey on Evaluation of LLM-based Agents
Asaf Yehudai
Lilach Eden
Alan Li
Guy Uziel
Yilun Zhao
Roy Bar-Haim
Arman Cohan
Michal Shmueli-Scheuer
LLMAG
ELM
279
39
0
20 Mar 2025
CS-PaperSum: A Large-Scale Dataset of AI-Generated Summaries for Scientific Papers
Javin Liu
Aryan Vats
Zihao He
157
2
0
27 Feb 2025
Efficient Scientific Full Text Classification: The Case of EICAT Impact Assessments
Marc Felix Brinner
Sina Zarrieß
195
1
0
10 Feb 2025
A foundation model for human-AI collaboration in medical literature mining
Zifeng Wang
Lang Cao
Qiao Jin
Joey Chan
Nicholas Wan
...
Christopher M. Zallek
Kyungsang Kim
Yifan Peng
Zhiyong Lu
Jimeng Sun
100
1
0
28 Jan 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Johan Sulaeman
LM&MA
AILaw
367
212
0
28 Jan 2025
ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models
Benjamin Newman
Yoonjoo Lee
Aakanksha Naik
Pao Siangliulue
Raymond Fok
Juho Kim
Daniel S. Weld
Joseph Chee Chang
Kyle Lo
LMTD
201
4
0
25 Oct 2024
ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information Coverage
Taewhoo Lee
Chanwoong Yoon
Kyochul Jang
Donghyeon Lee
Minju Song
Hyunjae Kim
Jaewoo Kang
ELM
165
7
0
22 Oct 2024
From Single to Multi: How LLMs Hallucinate in Multi-Document Summarization
Catarina G. Belem
Pouya Pezeshkpour
Hayate Iso
Seiji Maekawa
Nikita Bhutani
Estevam R. Hruschka
HILM
204
9
0
17 Oct 2024
Summarization of Investment Reports Using Pre-trained Model
Hiroki Sakaji
Ryotaro Kobayashi
Kiyoshi Izumi
Hiroyuki Mitsugi
Wataru Kuramoto
76
0
0
03 Aug 2024
CHIME: LLM-Assisted Hierarchical Organization of Scientific Studies for Literature Review Support
Chao-Chun Hsu
Erin Bransom
Jenna Sparks
Bailey Kuehl
Chenhao Tan
David Wadden
Lucy Lu Wang
Aakanksha Naik
109
17
0
23 Jul 2024
M2DS: Multilingual Dataset for Multi-document Summarisation
Kushan Hewapathirana
Nisansa de Silva
123
3
0
17 Jul 2024
Panacea: A foundation model for clinical trial search, summarization, design, and recruitment
J. Lin
H. Xu
Zifeng Wang
Sheng Wang
Jimeng Sun
ELM
LM&MA
138
15
0
25 Jun 2024
SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature
David Wadden
Kejian Shi
Jacob Morrison
Aakanksha Naik
Shruti Singh
...
Luca Soldaini
Shannon Zejiang Shen
Doug Downey
Hannaneh Hajishirzi
Arman Cohan
197
18
0
10 Jun 2024
Aspect-oriented Consumer Health Answer Summarization
Rochana Chaturvedi
Abari Bhattacharya
S. Yadav
104
4
0
10 May 2024
ReflectSumm: A Benchmark for Course Reflection Summarization
Yang Zhong
Mohamed S. Elaraby
Diane Litman
A. Butt
Muhsin Menekse
83
1
0
27 Mar 2024
From Paper to Card: Transforming Design Implications with Generative AI
Donghoon Shin
Lucy Lu Wang
Gary Hsieh
137
17
0
12 Mar 2024
Rethinking Scientific Summarization Evaluation: Grounding Explainable Metrics on Facet-aware Benchmark
Preslav Nakov
Tairan Wang
Qingqing Zhu
Taicheng Guo
Shen Gao
Zhiyong Lu
Xin Gao
Xiangliang Zhang
269
4
0
22 Feb 2024
CSMeD: Bridging the Dataset Gap in Automated Citation Screening for Systematic Literature Reviews
Wojciech Kusa
Óscar E. Mendoza
Matthias Samwald
Petr Knoth
Allan Hanbury
117
5
0
21 Nov 2023
Responsible AI Considerations in Text Summarization Research: A Review of Current Practices
Yu Lu Liu
Meng Cao
Su Lin Blodgett
Jackie Chi Kit Cheung
Alexandra Olteanu
Adam Trischler
115
2
0
18 Nov 2023
CARE: Extracting Experimental Findings From Clinical Literature
Aakanksha Naik
Bailey Kuehl
Erin Bransom
Doug Downey
Kyle Lo
175
4
0
16 Nov 2023
FaMeSumm: Investigating and Improving Faithfulness of Medical Summarization
Nan Zhang
Yusen Zhang
Wu Guo
P. Mitra
Rui Zhang
HILM
129
9
0
03 Nov 2023
Improving Biomedical Abstractive Summarisation with Knowledge Aggregation from Citation Papers
Chen Tang
Shunyu Wang
Tomas Goldsack
Chenghua Lin
99
19
0
24 Oct 2023
Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles
Kung-Hsiang Huang
Philippe Laban
Alexander R. Fabbri
Prafulla Kumar Choubey
Shafiq Joty
Caiming Xiong
Chien-Sheng Wu
140
37
0
17 Sep 2023
ODSum: New Benchmarks for Open Domain Multi-Document Summarization
Yijie Zhou
Kejian Shi
Wencai Zhang
Yixin Liu
Yilun Zhao
Arman Cohan
RALM
112
3
0
16 Sep 2023
Multi-document Summarization: A Comparative Evaluation
Kushan Hewapathirana
Nisansa de Silva
ELM
124
6
0
10 Sep 2023
Towards Argument-Aware Abstractive Summarization of Long Legal Opinions with Summary Reranking
Mohamed S. Elaraby
Yang Zhong
Diane Litman
AILaw
ELM
89
13
0
01 Jun 2023
Generating EDU Extracts for Plan-Guided Summary Re-Ranking
Griffin Adams
Alexander R. Fabbri
Faisal Ladhak
Kathleen McKeown
Noémie Elhadad
100
12
0
28 May 2023
SciReviewGen: A Large-scale Dataset for Automatic Literature Review Generation
Tetsu Kasanishi
Masaru Isonuma
Junichiro Mori
Ichiro Sakata
103
16
0
24 May 2023
Automated Metrics for Medical Multi-Document Summarization Disagree with Human Evaluations
Lucy Lu Wang
Yulia Otmakhova
Jay DeYoung
Thinh Hung Truong
Bailey Kuehl
Erin Bransom
Byron C. Wallace
212
22
0
23 May 2023
Appraising the Potential Uses and Harms of LLMs for Medical Systematic Reviews
Hye Sun Yun
Iain J. Marshall
T. Trikalinos
Byron C. Wallace
108
23
0
19 May 2023
What are the Desired Characteristics of Calibration Sets? Identifying Correlates on Long Form Scientific Summarization
Griffin Adams
Bichlien H. Nguyen
Jake A. Smith
Ziheng Lu
Shufang Xie
Anna Ostropolets
Budhaditya Deb
Yuan Chen
Tristan Naumann
Noémie Elhadad
144
10
0
12 May 2023
A Survey for Biomedical Text Summarization: From Pre-trained to Large Language Models
Qianqian Xie
Zheheng Luo
Benyou Wang
Sophia Ananiadou
LM&MA
VLM
98
11
0
18 Apr 2023
Towards Interpretable Mental Health Analysis with Large Language Models
Kailai Yang
Shaoxiong Ji
Tianlin Zhang
Qianqian Xie
Zi-Zhou Kuang
Sophia Ananiadou
ELM
AI4MH
LRM
165
71
0
06 Apr 2023
Automatically Summarizing Evidence from Clinical Trials: A Prototype Highlighting Current Challenges
S. Ramprasad
Denis Jered McInerney
Iain J. Marshall
Byron C. Wallace
130
12
0
07 Mar 2023
Relatedly: Scaffolding Literature Reviews with Existing Related Work Sections
Srishti Palani
Aakanksha Naik
Doug Downey
Amy X. Zhang
Jonathan Bragg
Joseph Chee Chang
90
44
0
13 Feb 2023
Do Multi-Document Summarization Models Synthesize?
Jay DeYoung
Stephanie C. Martinez
Iain J. Marshall
Byron C. Wallace
151
8
0
31 Jan 2023
LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization
Kalpesh Krishna
Erin Bransom
Bailey Kuehl
Mohit Iyyer
Pradeep Dasigi
Arman Cohan
Kyle Lo
120
104
0
30 Jan 2023