Title | Authors |
---|---|
SysBench: Can Large Language Models Follow System Messages? | Yanzhao Qin, Tao Zhang, Tao Zhang, Yanjun Shen, Wenjing Luo, ..., Yujing Qiao, Weipeng Chen, Zenan Zhou, Wentao Zhang, Bin Cui |
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models | Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, Chaeeun Kim, ..., Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee, Minjoon Seo |