
![]() Qwen Technical Report Jinze Bai Shuai Bai Yunfei Chu Zeyu Cui Kai Dang ...Zhenru Zhang Chang Zhou Jingren Zhou Xiaohuan Zhou Tianhang Zhu |
![]() AGIBench: A Multi-granularity, Multimodal, Human-referenced,
Auto-scoring Benchmark for Large Language ModelsBenchCouncil International Symposium (ISB), 2023 |
![]() CLEVA: Chinese Language Models EVAluation PlatformConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 Yanyang Li Jianqiao Zhao Duo Zheng Zi-Yuan Hu Zhi Chen ...Yongfeng Huang Shijia Huang Dahua Lin Michael R. Lyu Liwei Wang |
![]() Model Spider: Learning to Rank Pre-Trained Models EfficientlyNeural Information Processing Systems (NeurIPS), 2023 |
![]() ExplainCPE: A Free-text Explanation Benchmark of Chinese Pharmacist
ExaminationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 |