ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.11493
  4. Cited By
Benchmarking Knowledge Boundary for Large Language Models: A Different
  Perspective on Model Evaluation

Benchmarking Knowledge Boundary for Large Language Models: A Different Perspective on Model Evaluation

18 February 2024
Xunjian Yin
Xu Zhang
Jie Ruan
Xiaojun Wan
    ELM
ArXivPDFHTML

Papers citing "Benchmarking Knowledge Boundary for Large Language Models: A Different Perspective on Model Evaluation"

10 / 10 papers shown
Title
LLM-Independent Adaptive RAG: Let the Question Speak for Itself
LLM-Independent Adaptive RAG: Let the Question Speak for Itself
Maria Marina
Nikolay Ivanov
Sergey Pletenev
Mikhail Salnikov
Daria Galimzianova
Nikita Krayko
Vasily Konovalov
Alexander Panchenko
Viktor Moskvoretskii
RALM
40
0
0
07 May 2025
Enhancing LLM Reliability via Explicit Knowledge Boundary Modeling
Hang Zheng
Hongshen Xu
Yuncong Liu
Lu Chen
Pascale Fung
Kai Yu
83
2
0
04 Mar 2025
Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home
Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home
Viktor Moskvoretskii
M. Lysyuk
Mikhail Salnikov
Nikolay Ivanov
Sergey Pletenev
Daria Galimzianova
Nikita Krayko
Vasily Konovalov
Irina Nikishina
Alexander Panchenko
RALM
74
4
0
24 Feb 2025
SMART: Self-Aware Agent for Tool Overuse Mitigation
SMART: Self-Aware Agent for Tool Overuse Mitigation
Cheng Qian
Emre Can Acikgoz
H. Wang
X. Chen
Avirup Sil
Dilek Hakkani-Tür
Gökhan Tür
Heng Ji
LLMAG
KELM
LRM
63
4
0
17 Feb 2025
Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up
Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up
Jiahao Yuan
Dehui Du
Hao Zhang
Zixiang Di
Usman Naseem
LRM
27
2
0
16 Oct 2024
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Sébastien Bubeck
Varun Chandrasekaran
Ronen Eldan
J. Gehrke
Eric Horvitz
...
Scott M. Lundberg
Harsha Nori
Hamid Palangi
Marco Tulio Ribeiro
Yi Zhang
ELM
AI4MH
AI4CE
ALM
239
2,232
0
22 Mar 2023
Prompt-based Conservation Learning for Multi-hop Question Answering
Prompt-based Conservation Learning for Multi-hop Question Answering
Zhenyun Deng
Yonghua Zhu
Yang Chen
Qianqian Qi
Michael Witbrock
Patricia J. Riddle
RALM
LRM
27
4
0
14 Sep 2022
Gradient-based Adversarial Attacks against Text Transformers
Gradient-based Adversarial Attacks against Text Transformers
Chuan Guo
Alexandre Sablayrolles
Hervé Jégou
Douwe Kiela
SILM
98
227
0
15 Apr 2021
Measuring and Improving Consistency in Pretrained Language Models
Measuring and Improving Consistency in Pretrained Language Models
Yanai Elazar
Nora Kassner
Shauli Ravfogel
Abhilasha Ravichander
Eduard H. Hovy
Hinrich Schütze
Yoav Goldberg
HILM
258
343
0
01 Feb 2021
Language Models as Knowledge Bases?
Language Models as Knowledge Bases?
Fabio Petroni
Tim Rocktaschel
Patrick Lewis
A. Bakhtin
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
AI4MH
406
2,576
0
03 Sep 2019
1