Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2305.16739
Cited By
AlignScore: Evaluating Factual Consistency with a Unified Alignment Function
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
26 May 2023
Yuheng Zha
Yichi Yang
Ruichen Li
Zhiting Hu
HILM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"AlignScore: Evaluating Factual Consistency with a Unified Alignment Function"
50 / 182 papers shown
Title
SEER: Self-Aligned Evidence Extraction for Retrieval-Augmented Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Xinping Zhao
Dongfang Li
Yan Zhong
Boren Hu
Yibin Chen
Baotian Hu
Min Zhang
240
5
0
15 Oct 2024
ECon: On the Detection and Resolution of Evidence Conflicts
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Cheng Jiayang
Chunkit Chan
Qianqian Zhuang
Lin Qiu
Tianhang Zhang
Tengxiao Liu
Yangqiu Song
Yue Zhang
Pengfei Liu
Zheng Zhang
239
13
0
05 Oct 2024
Auto-GDA: Automatic Domain Adaptation for Efficient Grounding Verification in Retrieval-Augmented Generation
Tobias Leemann
Periklis Petridis
G. Vietri
Dionysis Manousakas
Aaron Roth
Sergul Aydore
363
3
0
04 Oct 2024
Salient Information Prompting to Steer Content in Prompt-based Abstractive Summarization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Lei Xu
Mohammed Asad Karim
Saket Dingliwal
Aparna Elangovan
126
2
0
03 Oct 2024
Analysing Zero-Shot Readability-Controlled Sentence Simplification
International Conference on Computational Linguistics (COLING), 2024
Abdullah Barayan
Jose Camacho-Collados
Fernando Alva-Manchego
213
15
0
30 Sep 2024
Model-based Preference Optimization in Abstractive Summarization without Human Feedback
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jaepill Choi
Kyubyung Chae
Jiwoo Song
Yohan Jo
Taesup Kim
394
6
0
27 Sep 2024
AXCEL: Automated eXplainable Consistency Evaluation using LLMs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
P Aditya Sreekar
Sahil Verma
Suransh Chopra
Sarik Ghazarian
Abhishek Persad
Narayanan Sadagopan
LRM
156
2
0
25 Sep 2024
Overview of the First Shared Task on Clinical Text Generation: RRG24 and "Discharge Me!"
Workshop on Biomedical Natural Language Processing (BioNLP), 2024
Justin Xu
Zhihong Chen
Andrew Johnston
Louis Blankemeier
Maya Varma
...
Ankit Modi
Robert Lloyd
Benjamin Hopkins
Curtis Langlotz
Jean-Benoit Delbrouck
LM&MA
272
35
0
25 Sep 2024
A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?
Yunfei Xie
Juncheng Wu
Haoqin Tu
Siwei Yang
Bingchen Zhao
Yongshuo Zong
Qiao Jin
Cihang Xie
Yuyin Zhou
LM&MA
ELM
LRM
294
39
0
23 Sep 2024
NovAScore: A New Automated Metric for Evaluating Document Level Novelty
International Conference on Computational Linguistics (COLING), 2024
Lin Ai
Ziwei Gong
Harshsaiprasad Deshpande
Alexander Johnson
Emmy Phung
Ahmad Emami
Julia Hirschberg
130
2
0
14 Sep 2024
AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Han Wang
Archiki Prasad
Elias Stengel-Eskin
Joey Tianyi Zhou
413
15
0
11 Sep 2024
RAG based Question-Answering for Contextual Response Prediction System
Sriram Veturi
Saurabh Vaichal
Reshma Lal Jagadheesh
Nafis Irtiza Tripto
Nian Yan
RALM
148
17
0
05 Sep 2024
Claim Verification in the Age of Large Language Models: A Survey
A. Dmonte
Roland Oruche
Marcos Zampieri
Prasad Calyam
Isabelle Augenstein
525
18
0
26 Aug 2024
SLM Meets LLM: Balancing Latency, Interpretability and Consistency in Hallucination Detection
Mengya Hu
Rui Xu
Deren Lei
Yaxi Li
Mingyu Wang
Emily Ching
Eslam Kamal
Alex Deng
142
6
0
22 Aug 2024
A Comparative Analysis of Faithfulness Metrics and Humans in Citation Evaluation
Weijia Zhang
Mohammad Aliannejadi
Jiahuan Pei
Yifei Yuan
Jia-Hong Huang
Evangelos Kanoulas
HILM
180
5
0
22 Aug 2024
Unconditional Truthfulness: Learning Unconditional Uncertainty of Large Language Models
Artem Vazhentsev
Ekaterina Fadeeva
Daniil Vasilev
Sergey Petrakov
Ivan Lazichny
Ilseyar Alimova
Preslav Nakov
Timothy Baldwin
Maxim Panov
Artem Shelmanov
HILM
185
3
0
20 Aug 2024
Automatic Metrics in Natural Language Generation: A Survey of Current Evaluation Practices
International Conference on Natural Language Generation (INLG), 2024
Patrícia Schmidtová
Saad Mahamood
Simone Balloccu
Ondřej Dušek
Albert Gatt
Dimitra Gkatzia
David M. Howcroft
Ondřej Plátek
Adarsa Sivaprasad
184
18
0
17 Aug 2024
Overview of the BioLaySumm 2024 Shared Task on the Lay Summarization of Biomedical Research Articles
Workshop on Biomedical Natural Language Processing (BioNLP), 2024
Tomas Goldsack
Carolina Scarton
Matthew Shardlow
Chenghua Lin
98
45
0
16 Aug 2024
Zero-shot Factual Consistency Evaluation Across Domains
Raunak Agarwal
HILM
276
1
0
07 Aug 2024
Trust or Escalate: LLM Judges with Provable Guarantees for Human Agreement
Jaehun Jung
Faeze Brahman
Yejin Choi
ALM
183
37
0
25 Jul 2024
UF-HOBI at "Discharge Me!": A Hybrid Solution for Discharge Summary Generation Through Prompt-based Tuning of GatorTronGPT Models
Mengxian Lyu
C.A.I. Peng
Daniel Paredes
Ziyi Chen
Aokun Chen
Jiang Bian
Yonghui Wu
144
3
0
22 Jul 2024
Beyond Aesthetics: Cultural Competence in Text-to-Image Models
Nithish Kannen
Arif Ahmad
Marco Andreetto
Vinodkumar Prabhakaran
Utsav Prabhu
Adji Bousso Dieng
Pushpak Bhattacharyya
Shachi Dave
374
30
0
09 Jul 2024
STORYSUMM: Evaluating Faithfulness in Story Summarization
Melanie Subbiah
Faisal Ladhak
Akankshya Mishra
Griffin Adams
Lydia B. Chilton
Kathleen McKeown
410
8
0
09 Jul 2024
A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations
Md Tahmid Rahman Laskar
Sawsan Alqahtani
M Saiful Bari
Mizanur Rahman
Mohammad Abdullah Matin Khan
...
Chee Wei Tan
Md. Rizwan Parvez
Enamul Hoque
Shafiq Joty
Jimmy Huang
ELM
ALM
269
82
0
04 Jul 2024
e-Health CSIRO at "Discharge Me!" 2024: Generating Discharge Summary Sections with Fine-tuned Language Models
Jinghui Liu
Aaron Nicolson
Jason Dowling
Bevan Koopman
Anthony N. Nguyen
214
5
0
03 Jul 2024
Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness
Khyathi Chandu
Linjie Li
Anas Awadalla
Ximing Lu
Jae Sung Park
Jack Hessel
Lijuan Wang
Yejin Choi
293
6
0
02 Jul 2024
Learning to Refine with Fine-Grained Natural Language Feedback
Manya Wadhwa
Xinyu Zhao
Junyi Jessy Li
Greg Durrett
488
25
0
02 Jul 2024
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems
Philippe Laban
Alexander R. Fabbri
Caiming Xiong
Chien-Sheng Wu
RALM
315
82
0
01 Jul 2024
Face4RAG: Factual Consistency Evaluation for Retrieval Augmented Generation in Chinese
Yunqi Xu
Tianchi Cai
Jiyan Jiang
Xierui Song
279
9
0
01 Jul 2024
Molecular Facts: Desiderata for Decontextualization in LLM Fact Verification
Anisha Gunjal
Greg Durrett
HILM
235
34
0
28 Jun 2024
Shimo Lab at "Discharge Me!": Discharge Summarization by Prompt-Driven Concatenation of Electronic Health Record Sections
Yunzhen He
Hiroaki Yamagiwa
Hidetoshi Shimodaira
242
3
0
26 Jun 2024
PlagBench: Exploring the Duality of Large Language Models in Plagiarism Generation and Detection
Jooyoung Lee
Toshini Agrawal
Adaku Uchendu
Thai V. Le
Jinghui Chen
Dongwon Lee
367
7
0
24 Jun 2024
Towards Fine-Grained Citation Evaluation in Generated Text: A Comparative Analysis of Faithfulness Metrics
Weijia Zhang
Mohammad Aliannejadi
Yifei Yuan
Jiahuan Pei
Jia-Hong Huang
Evangelos Kanoulas
HILM
298
16
0
21 Jun 2024
Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-Polygraph
Roman Vashurin
Ekaterina Fadeeva
Artem Vazhentsev
Akim Tsvigun
Daniil Vasilev
...
Timothy Baldwin
Timothy Baldwin
Preslav Nakov
Maxim Panov
Artem Shelmanov
HILM
605
59
0
21 Jun 2024
Factual Dialogue Summarization via Learning from Large Language Models
Rongxin Zhu
Jey Han Lau
Jianzhong Qi
HILM
235
6
0
20 Jun 2024
FoRAG: Factuality-optimized Retrieval Augmented Generation for Web-enhanced Long-form Question Answering
Tianchi Cai
Zhiwen Tan
Xierui Song
Tao Sun
Jiyan Jiang
Yunqi Xu
Yinger Zhang
Jinjie Gu
221
16
0
19 Jun 2024
Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Di Wu
Jia-Chen Gu
Fan Yin
Nanyun Peng
Kai-Wei Chang
HILM
118
5
0
19 Jun 2024
Learning to Generate Answers with Citations via Factual Consistency Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Rami Aly
Zhiqiang Tang
Samson Tan
George Karypis
HILM
226
8
0
19 Jun 2024
Detecting Response Generation Not Requiring Factual Judgment
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Ryohei Kamei
Daiki Shiono
Reina Akama
Jun Suzuki
HILM
127
0
0
14 Jun 2024
Unlearning Climate Misinformation in Large Language Models
Michael Fore
Simranjit Singh
Chaehong Lee
Amritanshu Pandey
Antonios Anastasopoulos
Dimitrios Stamoulis
MU
257
5
0
29 May 2024
TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models
Jaewoo Ahn
Taehyun Lee
Junyoung Lim
Jin-Hwa Kim
Sangdoo Yun
Hwaran Lee
Gunhee Kim
LLMAG
HILM
210
19
0
28 May 2024
QUB-Cirdan at "Discharge Me!": Zero shot discharge letter generation by open-source LLM
Rui Guo
Greg Farnan
Niall McLaughlin
Barry Devereux
167
5
0
27 May 2024
RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language Models
Xiangkun Hu
Dongyu Ru
Lin Qiu
Qipeng Guo
Tianhang Zhang
Yang Xu
Yun Luo
Pengfei Liu
Yue Zhang
Zheng Zhang
HILM
LRM
243
17
0
23 May 2024
RAG-RLRC-LaySum at BioLaySumm: Integrating Retrieval-Augmented Generation and Readability Control for Layman Summarization of Biomedical Texts
Yuelyu Ji
Zhuochun Li
Rui Meng
Sonish Sivarajkumar
Yanshan Wang
Zeshui Yu
Hui Ji
Yushui Han
Hanyu Zeng
Daqing He
204
32
0
21 May 2024
WisPerMed at BioLaySumm: Adapting Autoregressive Large Language Models for Lay Summarization of Scientific Articles
T. M. G. Pakull
Hendrik Damm
Ahmad Idrissi-Yaghir
Henning Schafer
Peter A. Horn
Christoph M. Friedrich
142
2
0
20 May 2024
WisPerMed at "Discharge Me!": Advancing Text Generation in Healthcare with Large Language Models, Dynamic Expert Selection, and Priming Techniques on MIMIC-IV
Workshop on Biomedical Natural Language Processing (BioNLP), 2024
Hendrik Damm
T. M. G. Pakull
Bahadir Eryilmaz
Helmut Becker
Ahmad Idrissi-Yaghir
Henning Schafer
Sergej Schultenkämper
Christoph M. Friedrich
165
3
0
18 May 2024
Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities
IEEE Communications Surveys and Tutorials (COMST), 2024
Hao Zhou
Chengming Hu
Ye Yuan
Yufei Cui
Yili Jin
...
Di Wu
Xue Liu
Charlie Zhang
Xianbin Wang
Jiangchuan Liu
232
164
0
17 May 2024
Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of Theories, Detection Methods, and Opportunities
Xiaomin Yu
Yezhaohui Wang
Yanfang Chen
Zhen Tao
Dinghao Xi
Shichao Song
Pengnian Qi
Zhiyu Li
285
17
0
25 Apr 2024
A Survey of Automatic Hallucination Evaluation on Natural Language Generation
Siya Qi
Petr Slovak
Yulan He
Zheng Yuan
LRM
HILM
309
1
0
18 Apr 2024
FIZZ: Factual Inconsistency Detection by Zoom-in Summary and Zoom-out Document
Joonho Yang
Seunghyun Yoon
Byeongjeong Kim
Hwanhee Lee
HILM
277
13
0
17 Apr 2024
Previous
1
2
3
4
Next