All Papers
0 / 0 papers shown
Title |
|---|
Title |
|---|

Title |
|---|
![]() SelfPrompt: Autonomously Evaluating LLM Robustness via
Domain-Constrained Knowledge Guidelines and Refined Adversarial PromptsInternational Conference on Computational Linguistics (COLING), 2024 |
![]() LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete
Information from Lateral Thinking PuzzlesInternational Conference on Language Resources and Evaluation (LREC), 2023 |
![]() A Survey on Evaluation of Large Language ModelsACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023 Yu-Chu Chang Xu Wang Yongfeng Zhang Yuanyi Wu Linyi Yang ...Yue Zhang Yi-Ju Chang Philip S. Yu Qian Yang Xingxu Xie |