Title |
---|
E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models. Jinchang Hou, Chang Ao, Haihong Wu, Xiangtao Kong, Zhigang Zheng, ..., Chengming Li, Xiping Hu, Ruifeng Xu, Shiwen Ni, Min Yang |
Can AI Assistants Know What They Don't Know? Qinyuan Cheng, Tianxiang Sun, Xiangyang Liu, Wenwei Zhang, Zhangyue Yin, Shimin Li, Linyang Li, Zhengfu He, Kai Chen, Xipeng Qiu |