ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.05922
  4. Cited By
A Survey of Hallucination in Large Foundation Models

A Survey of Hallucination in Large Foundation Models

12 September 2023
Vipula Rawte
A. Sheth
Amitava Das
    HILMLRM
ArXiv (abs)PDFHTML

Papers citing "A Survey of Hallucination in Large Foundation Models"

50 / 290 papers shown
Title
On the Workflows and Smells of Leaderboard Operations (LBOps): An Exploratory Study of Foundation Model Leaderboards
On the Workflows and Smells of Leaderboard Operations (LBOps): An Exploratory Study of Foundation Model Leaderboards
Zhimin Zhao
A. A. Bangash
F. Côgo
Bram Adams
Ahmed E. Hassan
682
3
0
04 Jul 2024
LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking
LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking
Amy Xin
Yunjia Qi
Zijun Yao
Fangwei Zhu
Kaisheng Zeng
Xu Bin
Lei Hou
Juanzi Li
431
15
0
04 Jul 2024
Investigating and Mitigating the Multimodal Hallucination Snowballing in
  Large Vision-Language Models
Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models
Weihong Zhong
Xiaocheng Feng
Liang Zhao
Qiming Li
Lei Huang
Yuxuan Gu
Weitao Ma
Yuan Xu
Bing Qin
MLLM
398
19
0
30 Jun 2024
Resource Allocation and Secure Wireless Communication in the Large
  Model-based Mobile Edge Computing System
Resource Allocation and Secure Wireless Communication in the Large Model-based Mobile Edge Computing System
Zefan Wang
Yitong Wang
Jun Zhao
169
1
0
29 Jun 2024
VERISCORE: Evaluating the factuality of verifiable claims in long-form
  text generation
VERISCORE: Evaluating the factuality of verifiable claims in long-form text generation
Yixiao Song
Yekyung Kim
Mohit Iyyer
HILM
213
70
0
27 Jun 2024
CogMG: Collaborative Augmentation Between Large Language Model and
  Knowledge Graph
CogMG: Collaborative Augmentation Between Large Language Model and Knowledge Graph
Tong Zhou
Yubo Chen
Kang Liu
Jun Zhao
HILMRALM
195
9
0
25 Jun 2024
Evaluating the Quality of Hallucination Benchmarks for Large
  Vision-Language Models
Evaluating the Quality of Hallucination Benchmarks for Large Vision-Language Models
Bei Yan
Jie Zhang
Zheng Yuan
Shiguang Shan
Xilin Chen
VLM
129
13
0
24 Jun 2024
INDICT: Code Generation with Internal Dialogues of Critiques for Both
  Security and Helpfulness
INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness
Hung Le
Yingbo Zhou
Caiming Xiong
Silvio Savarese
Doyen Sahoo
230
7
0
23 Jun 2024
Semantic Entropy Probes: Robust and Cheap Hallucination Detection in
  LLMs
Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs
Jannik Kossen
Jiatong Han
Muhammed Razzak
Lisa Schut
Shreshth A. Malik
Yarin Gal
HILM
271
107
0
22 Jun 2024
Scaling Laws for Fact Memorization of Large Language Models
Scaling Laws for Fact Memorization of Large Language Models
Xingyu Lu
Xiaonan Li
Qinyuan Cheng
Kai Ding
Xuanjing Huang
Xipeng Qiu
264
24
0
22 Jun 2024
Does GPT Really Get It? A Hierarchical Scale to Quantify Human vs AI's Understanding of Algorithms
Does GPT Really Get It? A Hierarchical Scale to Quantify Human vs AI's Understanding of Algorithms
Mirabel Reid
Santosh Vempala
ELM
201
0
0
20 Jun 2024
Large Language Models are Skeptics: False Negative Problem of
  Input-conflicting Hallucination
Large Language Models are Skeptics: False Negative Problem of Input-conflicting Hallucination
Jongyoon Song
Sangwon Yu
Sungroh Yoon
HILM
97
7
0
20 Jun 2024
IoT-Based Preventive Mental Health Using Knowledge Graphs and Standards
  for Better Well-Being
IoT-Based Preventive Mental Health Using Knowledge Graphs and Standards for Better Well-Being
A. Gyrard
Seyedali Mohammadi
Manas Gaur
Antonio Kung
AI4MH
228
1
0
19 Jun 2024
Jailbreaking Large Language Models Through Alignment Vulnerabilities in Out-of-Distribution Settings
Jailbreaking Large Language Models Through Alignment Vulnerabilities in Out-of-Distribution Settings
Yue Huang
Jingyu Tang
Dongping Chen
Bingda Tang
Yao Wan
Lichao Sun
Philip S. Yu
Xiangliang Zhang
AAML
139
3
0
19 Jun 2024
Automating IRAC Analysis in Malaysian Contract Law using a Semi-Structured Knowledge Base
Automating IRAC Analysis in Malaysian Contract Law using a Semi-Structured Knowledge Base
Xiaoxi Kang
Zhuang Li
Lay-Ki Soon
Zhuang Li
Adnan Trakic
AILaw
274
2
0
19 Jun 2024
Small Agent Can Also Rock! Empowering Small Language Models as
  Hallucination Detector
Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector
Xiaoxue Cheng
Junyi Li
Wayne Xin Zhao
Hongzhi Zhang
Fuzheng Zhang
Di Zhang
Kun Gai
Ji-Rong Wen
HILMLLMAG
200
18
0
17 Jun 2024
DefAn: Definitive Answer Dataset for LLMs Hallucination Evaluation
DefAn: Definitive Answer Dataset for LLMs Hallucination Evaluation
A. B. M. A. Rahman
Saeed Anwar
Muhammad Usman
Ajmal Mian
HILM
173
7
0
13 Jun 2024
Understanding Sounds, Missing the Questions: The Challenge of Object
  Hallucination in Large Audio-Language Models
Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models
Chun-Yi Kuan
Wei-Ping Huang
Hung-yi Lee
AuLLM
153
17
0
12 Jun 2024
Survey for Landing Generative AI in Social and E-commerce Recsys -- the
  Industry Perspectives
Survey for Landing Generative AI in Social and E-commerce Recsys -- the Industry Perspectives
Da Xu
Danqing Zhang
Guangyu Yang
Bo Yang
Shuyuan Xu
Lingling Zheng
Cindy Liang
118
4
0
10 Jun 2024
LinkQ: An LLM-Assisted Visual Interface for Knowledge Graph Question-Answering
LinkQ: An LLM-Assisted Visual Interface for Knowledge Graph Question-AnsweringVisual .. (VISUAL), 2024
Harry Li
G. Appleby
Ashley Suh
217
5
0
07 Jun 2024
ComplexTempQA:A 100m Dataset for Complex Temporal Question Answering
ComplexTempQA:A 100m Dataset for Complex Temporal Question Answering
Raphael Gruber
Abdelrahman Abdallah
Michael Färber
Adam Jatowt
343
9
0
07 Jun 2024
A Survey of Language-Based Communication in Robotics
A Survey of Language-Based Communication in Robotics
William Hunt
Sarvapali D. Ramchurn
Mohammad D. Soorati
LM&Ro
644
17
0
06 Jun 2024
RATT: A Thought Structure for Coherent and Correct LLM Reasoning
RATT: A Thought Structure for Coherent and Correct LLM Reasoning
Jinghan Zhang
Xiting Wang
Weijieying Ren
Lu Jiang
Dongjie Wang
Kunpeng Liu
LRM
381
34
0
04 Jun 2024
Diver: Large Language Model Decoding with Span-Level Mutual Information
  Verification
Diver: Large Language Model Decoding with Span-Level Mutual Information Verification
Jinliang Lu
Chen Wang
Jiajun Zhang
210
4
0
04 Jun 2024
CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks
CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks
Maciej Besta
Lorenzo Paleari
Marcin Copik
Robert Gerstenberger
Aleš Kubíček
...
Eric Schreiber
Torsten Hoefler
Tomasz Lehmann
H. Niewiadomski
Torsten Hoefler
587
9
0
04 Jun 2024
LACIE: Listener-Aware Finetuning for Confidence Calibration in Large
  Language Models
LACIE: Listener-Aware Finetuning for Confidence Calibration in Large Language Models
Elias Stengel-Eskin
Peter Hase
Mohit Bansal
246
13
0
31 May 2024
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective
  Rationales
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales
Tianyang Xu
Shujin Wu
Shizhe Diao
Xiaoze Liu
Xingyao Wang
Yangyi Chen
Jing Gao
LRM
332
71
0
31 May 2024
Similarity is Not All You Need: Endowing Retrieval Augmented Generation
  with Multi Layered Thoughts
Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts
Chunjing Gan
Dan Yang
Binbin Hu
Hanxiao Zhang
Siyuan Li
...
Lin Ju
Qing Cui
Jinjie Gu
Lei Liang
Jun Zhou
244
16
0
30 May 2024
Transfer Attack for Bad and Good: Explain and Boost Adversarial Transferability across Multimodal Large Language Models
Transfer Attack for Bad and Good: Explain and Boost Adversarial Transferability across Multimodal Large Language Models
Hao-Ran Cheng
Erjia Xiao
Jiayan Yang
Jinhao Duan
Yichi Wang
...
Qiang Zhang
Le Yang
Kaidi Xu
Jindong Gu
Zhanchen Zhu
AAML
513
10
0
30 May 2024
Enhancing Zero-Shot Facial Expression Recognition by LLM Knowledge
  Transfer
Enhancing Zero-Shot Facial Expression Recognition by LLM Knowledge Transfer
Zengqun Zhao
Yu Cao
Shaogang Gong
Ioannis Patras
319
16
0
29 May 2024
Leveraging Logical Rules in Knowledge Editing: A Cherry on the Top
Leveraging Logical Rules in Knowledge Editing: A Cherry on the Top
Keyuan Cheng
Muhammad Asif Ali
Shu Yang
Gang Lin
Yuxuan Zhai
Haoyang Fei
Ke Xu
Lu Yu
Lijie Hu
Haiyan Zhao
KELM
289
11
0
24 May 2024
Tiny Refinements Elicit Resilience: Toward Efficient Prefix-Model
  Against LLM Red-Teaming
Tiny Refinements Elicit Resilience: Toward Efficient Prefix-Model Against LLM Red-Teaming
Jiaxu Liu
Xiangyu Yin
Sihao Wu
Jianhong Wang
Meng Fang
Xinping Yi
Xiaowei Huang
290
6
0
21 May 2024
From Generalist to Specialist: Improving Large Language Models for
  Medical Physics Using ARCoT
From Generalist to Specialist: Improving Large Language Models for Medical Physics Using ARCoT
Jace Grandinetti
R. Mcbeth
AI4CELRMLM&MA
164
1
0
17 May 2024
Navigating the Future of Federated Recommendation Systems with Foundation Models
Navigating the Future of Federated Recommendation Systems with Foundation Models
Zhiwei Li
Guodong Long
Chunxu Zhang
Honglei Zhang
Jing Jiang
Chengqi Zhang
725
0
0
12 May 2024
ACORN: Aspect-wise Commonsense Reasoning Explanation Evaluation
ACORN: Aspect-wise Commonsense Reasoning Explanation Evaluation
Ana Brassard
Benjamin Heinzerling
Keito Kudo
Keisuke Sakaguchi
Kentaro Inui
LRM
123
3
0
08 May 2024
Question Suggestion for Conversational Shopping Assistants Using Product
  Metadata
Question Suggestion for Conversational Shopping Assistants Using Product MetadataAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2024
Nikhita Vedula
Oleg Rokhlenko
S. Malmasi
198
10
0
02 May 2024
Evaluating Consistency and Reasoning Capabilities of Large Language
  Models
Evaluating Consistency and Reasoning Capabilities of Large Language Models
Yash Saxena
Sarthak Chopra
Arunendra Mani Tripathi
ELMLRM
151
9
0
25 Apr 2024
Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of
  Theories, Detection Methods, and Opportunities
Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of Theories, Detection Methods, and Opportunities
Xiaomin Yu
Yezhaohui Wang
Yanfang Chen
Zhen Tao
Dinghao Xi
Shichao Song
Pengnian Qi
Zhiyu Li
285
17
0
25 Apr 2024
Uncertainty Estimation and Quantification for LLMs: A Simple Supervised
  Approach
Uncertainty Estimation and Quantification for LLMs: A Simple Supervised Approach
Linyu Liu
Yu Pan
Xiaocheng Li
Guanting Chen
276
71
0
24 Apr 2024
KS-LLM: Knowledge Selection of Large Language Models with Evidence
  Document for Question Answering
KS-LLM: Knowledge Selection of Large Language Models with Evidence Document for Question Answering
Xinxin Zheng
Feihu Che
Jinyang Wu
Shuai Zhang
Shuai Nie
Kang Liu
Jianhua Tao
RALMHILM
159
5
0
24 Apr 2024
Vision Beyond Boundaries: An Initial Design Space of Domain-specific
  Large Vision Models in Human-robot Interaction
Vision Beyond Boundaries: An Initial Design Space of Domain-specific Large Vision Models in Human-robot Interaction
Yuchong Zhang
Yong Ma
Danica Kragic
VLM
186
9
0
23 Apr 2024
FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection
  and Correction
FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction
Hang Hua
Jing Shi
Kushal Kafle
Simon Jenni
Daoan Zhang
John Collomosse
Scott D. Cohen
Jiebo Luo
CoGeVLM
190
14
0
23 Apr 2024
A Survey on the Memory Mechanism of Large Language Model based Agents
A Survey on the Memory Mechanism of Large Language Model based Agents
Zeyu Zhang
Xiaohe Bo
Chen Ma
Rui Li
Xu Chen
Quanyu Dai
Jieming Zhu
Zhenhua Dong
Ji-Rong Wen
LLMAGKELM
244
285
0
21 Apr 2024
Data Authenticity, Consent, & Provenance for AI are all broken: what
  will it take to fix them?
Data Authenticity, Consent, & Provenance for AI are all broken: what will it take to fix them?
Shayne Longpre
Robert Mahari
Naana Obeng-Marnu
William Brannon
Tobin South
Katy Gero
Sandy Pentland
Jad Kabbara
246
19
0
19 Apr 2024
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents
Liyan Tang
Philippe Laban
Greg Durrett
HILMSyDa
311
166
0
16 Apr 2024
TV100: A TV Series Dataset that Pre-Trained CLIP Has Not Seen
TV100: A TV Series Dataset that Pre-Trained CLIP Has Not Seen
Da-Wei Zhou
Zhi-Hong Qi
Han-Jia Ye
De-Chuan Zhan
CLIPVLM
61
2
0
16 Apr 2024
Zero-shot Building Age Classification from Facade Image Using GPT-4
Zero-shot Building Age Classification from Facade Image Using GPT-4
Zichao Zeng
June Moh Goo
Xinglei Wang
Bin Chi
Meihui Wang
Jan Boehm
VLM
102
9
0
15 Apr 2024
RiskLabs: Predicting Financial Risk Using Large Language Model based on Multimodal and Multi-Sources Data
RiskLabs: Predicting Financial Risk Using Large Language Model based on Multimodal and Multi-Sources Data
Yun Feng
Zhi Chen
Prashant Kumar
Qingyun Pei
Yangyang Yu
Haohang Li
Fabrizio Dimino
Lorenzo Ausiello
K. P. Subbalakshmi
Papa Momar Ndiaye
155
16
0
11 Apr 2024
Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on
  Graphs
Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on GraphsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Sara Szymkuć
Chulin Xie
Jiawei Zhang
Kashob Kumar Roy
Yu Zhang
...
Ruirui Li
Xianfeng Tang
Suhang Wang
Yu Meng
Jiawei Han
LRMRALM
188
90
0
10 Apr 2024
GoEX: Perspectives and Designs Towards a Runtime for Autonomous LLM
  Applications
GoEX: Perspectives and Designs Towards a Runtime for Autonomous LLM Applications
Shishir G. Patil
Tianjun Zhang
Vivian Fang
Noppapon C Roy Huang
Uc Berkeley
Aaron Hao
Martin Casado
Joseph E. Gonzalez Raluca
Ada Popa
Ion Stoica
ALM
216
18
0
10 Apr 2024
Previous
123456
Next