Rubric Is All You Need: Enhancing LLM-based Code Evaluation With Question-Specific Rubrics | International Computing Education Research Workshop (ICER), 2025 |
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes | Annual Meeting of the Association for Computational Linguistics (ACL), 2023 | Lokesh Nagalapatti, Chun-Liang Li, Chih-Kuan Yeh, Hootan Nakhost, Yasuhisa Fujii, Alexander Ratner, Ranjay Krishna, Chen-Yu Lee, Tomas Pfister |
G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment | Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023 |
GPTScore: Evaluate as You Desire | North American Chapter of the Association for Computational Linguistics (NAACL), 2023 |
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models | Neural Information Processing Systems (NeurIPS), 2022 |
Representation Learning: A Review and New Perspectives | IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2012 |