ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2404.00942
  4. Cited By
Evaluating the Factuality of Large Language Models using Large-Scale
  Knowledge Graphs

Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs

1 April 2024
Xiaoze Liu
Feijie Wu
Tianyang Xu
Zhuo Chen
Yichi Zhang
Xiaoqian Wang
Jing Gao
    HILM
ArXivPDFHTML

Papers citing "Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs"

16 / 16 papers shown
Title
SymPlanner: Deliberate Planning in Language Models with Symbolic Representation
SymPlanner: Deliberate Planning in Language Models with Symbolic Representation
Siheng Xiong
Jieyu Zhou
Zhangding Liu
Yusen Su
LLMAG
LM&Ro
40
0
0
02 May 2025
ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph
ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph
Langming Liu
Haibin Chen
Yuhao Wang
Yujin Yuan
Shilei Liu
Wenbo Su
Xiangyu Zhao
Bo Zheng
RALM
53
0
0
20 Mar 2025
GraphCheck: Breaking Long-Term Text Barriers with Extracted Knowledge Graph-Powered Fact-Checking
GraphCheck: Breaking Long-Term Text Barriers with Extracted Knowledge Graph-Powered Fact-Checking
Yingjian Chen
Haoran Liu
Yinhong Liu
Rui Yang
Han Yuan
Yanran Fu
Pengyuan Zhou
Qingyu Chen
James Caverlee
Irene Z Li
HILM
38
0
0
23 Feb 2025
Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language Models
Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language Models
Fushuo Huo
Wenchao Xu
Zhong Zhang
Haozhao Wang
Zhicheng Chen
Peilin Zhao
VLM
MLLM
41
18
0
04 Aug 2024
FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for
  LLM-based Agents
FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents
Ruixuan Xiao
Wentao Ma
Ke Wang
Yuchuan Wu
Junbo Zhao
Haobo Wang
Fei Huang
Yongbin Li
24
8
0
21 Jun 2024
BlendFilter: Advancing Retrieval-Augmented Large Language Models via
  Query Generation Blending and Knowledge Filtering
BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering
Haoyu Wang
Ruirui Li
Haoming Jiang
Jinjin Tian
Zhengyang Wang
Chen Luo
Xianfeng Tang
Monica Cheng
Tuo Zhao
Jing Gao
RALM
KELM
38
16
0
16 Feb 2024
Large Language Models Can Learn Temporal Reasoning
Large Language Models Can Learn Temporal Reasoning
Siheng Xiong
Ali Payani
Ramana Rao Kompella
Faramarz Fekri
LRM
22
73
0
12 Jan 2024
Don't Make Your LLM an Evaluation Benchmark Cheater
Don't Make Your LLM an Evaluation Benchmark Cheater
Kun Zhou
Yutao Zhu
Zhipeng Chen
Wentong Chen
Wayne Xin Zhao
Xu Chen
Yankai Lin
Ji-Rong Wen
Jiawei Han
ELM
99
136
0
03 Nov 2023
Investigating the Catastrophic Forgetting in Multimodal Large Language
  Models
Investigating the Catastrophic Forgetting in Multimodal Large Language Models
Yuexiang Zhai
Shengbang Tong
Xiao Li
Mu Cai
Qing Qu
Yong Jae Lee
Y. Ma
VLM
MLLM
CLL
66
75
0
19 Sep 2023
"According to ...": Prompting Language Models Improves Quoting from
  Pre-Training Data
"According to ...": Prompting Language Models Improves Quoting from Pre-Training Data
Orion Weller
Marc Marone
Nathaniel Weir
Dawn J Lawrie
Daniel Khashabi
Benjamin Van Durme
HILM
61
44
0
22 May 2023
We're Afraid Language Models Aren't Modeling Ambiguity
We're Afraid Language Models Aren't Modeling Ambiguity
Alisa Liu
Zhaofeng Wu
Julian Michael
Alane Suhr
Peter West
Alexander Koller
Swabha Swayamdipta
Noah A. Smith
Yejin Choi
51
87
0
27 Apr 2023
The Internal State of an LLM Knows When It's Lying
The Internal State of an LLM Knows When It's Lying
A. Azaria
Tom Michael Mitchell
HILM
210
297
0
26 Apr 2023
Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the
  Question Answering Performance of the GPT LLM Family
Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the Question Answering Performance of the GPT LLM Family
Yiming Tan
Dehai Min
Y. Li
Wenbo Li
Nan Hu
Yongrui Chen
Guilin Qi
AI4MH
ELM
47
51
0
14 Mar 2023
Learn to Explain: Multimodal Reasoning via Thought Chains for Science
  Question Answering
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering
Pan Lu
Swaroop Mishra
Tony Xia
Liang Qiu
Kai-Wei Chang
Song-Chun Zhu
Oyvind Tafjord
Peter Clark
A. Kalyan
ELM
ReLM
LRM
198
1,089
0
20 Sep 2022
ClusterEA: Scalable Entity Alignment with Stochastic Training and
  Normalized Mini-batch Similarities
ClusterEA: Scalable Entity Alignment with Stochastic Training and Normalized Mini-batch Similarities
Yunjun Gao
Xiaoze Liu
Junyang Wu
Tianyi Li
Pengfei Wang
Lu Chen
38
37
0
20 May 2022
P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally
  Across Scales and Tasks
P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks
Xiao Liu
Kaixuan Ji
Yicheng Fu
Weng Lam Tam
Zhengxiao Du
Zhilin Yang
Jie Tang
VLM
228
780
0
14 Oct 2021
1