Title
FACTS&EVIDENCE: An Interactive Tool for Transparent Fine-Grained Factual Verification of Machine-Generated Text Varich Boonsanong Vidhisha Balachandran Xiaochuang Han Shangbin Feng Lucy Lu Wang Yulia Tsvetkov 296 3 0 19 Mar 2025
Optimizing Decomposition for Optimal Claim VerificationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 Yining Lu Noah Ziems Hy Dang Meng Jiang 319 2 0 19 Mar 2025
GraphEval: A Lightweight Graph-Based LLM Framework for Idea EvaluationInternational Conference on Learning Representations (ICLR), 2025 Tao Feng Yihang Sun Jiaxuan You 347 14 0 16 Mar 2025
AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation Fengyu Li Yilin Li Junhao Zhu Lu Chen Yanfei Zhang Jia Zhou Hui Zu Jingwen Zhao Yunjun Gao LLMAG 180 0 0 14 Mar 2025
Odysseus Navigates the Sirens' Song: Dynamic Focus Decoding for Factual and Diverse Open-Ended Text GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 Wen Luo Feifan Song Wei Li Guangyue Peng Shaohang Wei Houfeng Wang AI4CE 184 1 0 11 Mar 2025
Evaluating open-source Large Language Models for automated fact-checking Nicoló Fontana Francesco Corso Enrico Zuccolotto Francesco Pierri HILM 208 5 0 07 Mar 2025
Benchmarking Large Language Models on Multiple Tasks in Bioinformatics NLP with Prompting Jiyue Jiang Pengan Chen Jinqiao Wang Dongchen He Ziqin Wei ... Yimin Fan Xiangyu Shi Jimeng Sun Chuan Wu Yuan Li LM&MA 185 6 0 06 Mar 2025
Uncovering Gaps in How Humans and LLMs Interpret Subjective LanguageInternational Conference on Learning Representations (ICLR), 2025 Erik Jones Arjun Patrawala Jacob Steinhardt 176 2 0 06 Mar 2025
DSVD: Dynamic Self-Verify Decoding for Faithful Generation in Large Language Models Y. Guo Yuchen Yang Zhe Chen Pingjie Wang Yusheng Liao Yujiao Shi Yanfeng Wang Yu Wang HILM 229 2 0 05 Mar 2025
AILS-NTUA at SemEval-2025 Task 3: Leveraging Large Language Models and Translation Strategies for Multilingual Hallucination Detection Dimitra Karkani Maria Lymperaiou Giorgos Filandrianos Nikolaos Spanos Athanasios Voulodimos Giorgos Stamou HILM LRM 218 0 0 04 Mar 2025
Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMsInternational Conference on Learning Representations (ICLR), 2025 Yuzhe Gu Feiyu Xiong Chengqi Lyu Dahua Lin Kai Chen 241 12 0 04 Mar 2025
LLM as a Broken Telephone: Iterative Generation Distorts InformationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 Amr Mohamed Mingmeng Geng Michalis Vazirgiannis Guokan Shang 350 3 0 27 Feb 2025
Winning Big with Small Models: Knowledge Distillation vs. Self-Training for Reducing Hallucination in Product QA Agents A. Lewis Michael White Jing Liu T. Koike-Akino K. Parsons Yanjie Wang HILM 265 1 0 26 Feb 2025
Conformal Linguistic Calibration: Trading-off between Factuality and Specificity Zhengping Jiang Anqi Liu Benjamin Van Durme 452 7 0 26 Feb 2025
Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit ProfilesAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 Kuang Wang Xianrui Li Steve Yang Li Zhou Feng Jiang Haoyang Li 308 6 0 26 Feb 2025
Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward SystemsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 Hao Peng Yunjia Qi Xiaozhi Wang Zijun Yao Bin Xu Lei Hou Juanzi Li ALM LRM 186 25 0 26 Feb 2025
FactReasoner: A Probabilistic Approach to Long-Form Factuality Assessment for Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2025 Radu Marinescu D. Bhattacharjya Junkyu Lee T. Tchrakian Javier Carnerero-Cano Yufang Hou Elizabeth M. Daly Alessandra Pascale HILM LRM 242 2 0 25 Feb 2025
Uncertainty Quantification in Retrieval Augmented Question Answering Laura Perez-Beltrachini Mirella Lapata RALM 459 4 0 25 Feb 2025
Faster, Cheaper, Better: Multi-Objective Hyperparameter Optimization for LLM and RAG Systems Matthew Barker Andrew Bell Evan Thomas James Carr Thomas Andrews Umang Bhatt 337 4 0 25 Feb 2025
Is Free Self-Alignment Possible? Dyah Adila Changho Shin Yijing Zhang Frederic Sala MoMe 370 2 0 24 Feb 2025
AI Realtor: Towards Grounded Persuasive Language Generation for Automated Copywriting Jibang Wu Chenghao Yang Simon Mahns Simon Mahns Hao Zhu Hao Zhu Haifeng Xu Haifeng Xu 328 5 0 24 Feb 2025
Beyond Translation: LLM-Based Data Generation for Multilingual Fact-Checking Yi-Ling Chung Aurora Cobo Pablo Serna SyDa HILM 194 6 0 24 Feb 2025
PosterSum: A Multimodal Benchmark for Scientific Poster Summarization Rohit Saxena Pasquale Minervini Frank Keller VLM 192 8 0 24 Feb 2025
Is Relevance Propagated from Retriever to Generator in RAG?European Conference on Information Retrieval (ECIR), 2025 Fangzheng Tian Debasis Ganguly Craig Macdonald RALM 202 6 0 24 Feb 2025
GraphCheck: Breaking Long-Term Text Barriers with Extracted Knowledge Graph-Powered Fact-CheckingAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 Yingjian Chen Haoran Liu Yinhong Liu Rui Yang Han Yuan ... Pengyuan Zhou Peng Yuan Zhou Qingyu Chen James Caverlee Irene Li HILM 527 5 0 23 Feb 2025
Think Together and Work Better: Combining Humans' and LLMs' Think-Aloud Outcomes for Effective Text EvaluationInternational Conference on Human Factors in Computing Systems (CHI), 2024 SeongYeub Chu JongWoo Kim MunYong Yi 306 12 0 21 Feb 2025
Hallucination Detection in Large Language Models with Metamorphic Relations Borui Yang Md Afif Al Mamun Jie M. Zhang Gias Uddin HILM 396 17 0 20 Feb 2025
Rare Disease Differential Diagnosis with Large Language Models at Scale: From Abdominal Actinomycosis to Wilson's Disease Elliot Schumacher Dhruv Naik Anitha Kannan LM&MA 160 4 0 20 Feb 2025
How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the Wild Saad Obaid ul Islam Anne Lauscher Goran Glavaš HILM LRM 406 8 0 18 Feb 2025
STRIVE: Structured Reasoning for Self-Improvement in Claim Verification Haisong Gong Jing Li Junfei Wu Sihan Yang Shu Wu Shu Wu LRM 233 2 0 17 Feb 2025
HalluEntity: Benchmarking and Understanding Entity-Level Hallucination Detection Min-Hsuan Yeh Max Kamachee Seongheon Park Yixuan Li HILM 357 3 0 17 Feb 2025
Injecting Domain-Specific Knowledge into Large Language Models: A Comprehensive Survey Zirui Song Bin Yan Yuhan Liu Miao Fang Mingzhe Li Rui Yan Preslav Nakov KELM LM&MA 356 40 0 15 Feb 2025
Optimizing Knowledge Integration in Retrieval-Augmented Generation with Self-Selection Yan Weng Fengbin Zhu Tong Ye Haoyan Liu Fuli Feng Tat-Seng Chua RALM 337 3 0 10 Feb 2025
OverThink: Slowdown Attacks on Reasoning LLMs A. Kumar Jaechul Roh A. Naseh Marzena Karpinska Mohit Iyyer Amir Houmansadr Eugene Bagdasarian LRM 520 50 0 04 Feb 2025
Context-Aware Hierarchical Merging for Long Document SummarizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 Litu Ou Mirella Lapata MoMe 975 2 0 03 Feb 2025
Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented GenerationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 Takyoung Kim Kyungjae Lee Y. Jang Ji Yong Cho Gangwoo Kim Minseok Cho Moontae Lee 382 1 0 28 Jan 2025
OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models Chongren Sun Jian Wang Di Wu Benoit Boulet HILM LRM 217 5 0 22 Jan 2025
Iterative Tree Analysis for Medical Critics Zenan Huang Mingwei Li Zheng Zhou Youxin Jiang 759 0 0 18 Jan 2025
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking Zekun Xi Wenbiao Yin Jizhan Fang Jialong Wu Runnan Fang Ningyu Zhang Jiang Yong Pengjun Xie Fei Huang Ningyu Zhang SyDa LRM 386 12 0 16 Jan 2025
Enhancing Retrieval-Augmented Generation: A Study of Best PracticesInternational Conference on Computational Linguistics (COLING), 2025 Siran Li Linus Stenzel Carsten Eickhoff Seyed Ali Bahrainian RALM 3DV 193 22 0 13 Jan 2025
Lived Experience Not Found: LLMs Struggle to Align with Experts on Addressing Adverse Drug Reactions from Psychiatric Medication UseNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 Mohit Chandra Siddharth Sriraman Gaurav Verma Harneet Singh Khanuja Jose Suarez Campayo Zihang Li Michael L. Birnbaum M. D. Choudhury AI4MH 314 12 0 08 Jan 2025
The FACTS Grounding Leaderboard: Benchmarking LLMs' Ability to Ground Responses to Long-Form Input Alon Jacovi Andrew Wang Chris Alberti Connie Tao Jon Lipovetz ... Rachana Fellinger Rui Wang Zizhao Zhang Sasha Goldshtein Dipanjan Das HILM ALM 419 30 0 06 Jan 2025
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language TextsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024 Helia Hashemi J. Eisner Corby Rosset Benjamin Van Durme Chris Kedzie 365 33 0 03 Jan 2025
A review of faithfulness metrics for hallucination assessment in Large Language ModelsIEEE Journal on Selected Topics in Signal Processing (JSTSP), 2024 Ben Malin Tatiana Kalganova Nikoloas Boulgouris HILM 363 7 0 03 Jan 2025
Evaluate Summarization in Fine-Granularity: Auto Evaluation with LLM Dong Yuan Eti Rastogi Fen Zhao Sagar Goyal Gautam Naik Sree Prasanna Rajagopal 154 2 0 31 Dec 2024
ComparisonQA: Evaluating Factuality Robustness of LLMs Through Knowledge Frequency Control and UncertaintyAnnual Meeting of the Association for Computational Linguistics (ACL), 2024 Qing Zong Zhaoxiang Wang Tianshi Zheng Xiyu Ren Yangqiu Song 345 11 0 28 Dec 2024
A Survey of Calibration Process for Black-Box LLMs Liangru Xie Hui Liu Jingying Zeng Xianfeng Tang Yan Han Chen Luo Jing Huang Zhen Li Suhang Wang Qi He 266 8 0 17 Dec 2024
Attention with Dependency Parsing Augmentation for Fine-Grained AttributionAnnual Meeting of the Association for Computational Linguistics (ACL), 2024 Qiang Ding Lvzhou Luo Yixuan Cao Ping Luo 240 4 0 16 Dec 2024
Coverage-based Fairness in Multi-document SummarizationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 Haoyuan Li Yusen Zhang Rui Zhang Snigdha Chaturvedi 319 2 0 11 Dec 2024
HalluCana: Fixing LLM Hallucination with A Canary LookaheadNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 Tianyi Li Erenay Dayanik Shubhi Tyagi Andrea Pierleoni HILM 257 1 0 10 Dec 2024