Applying Large Language Models and Chain-of-Thought for Automatic Scoring

30 November 2023

Ninghao Liu

Papers citing "Applying Large Language Models and Chain-of-Thought for Automatic Scoring"

41 / 41 papers shown

Title
Evolution of AI in Education: Agentic Workflows Firuz Kamalov David Santandreu Calonge Linda Smail Dilshod Azizov Dimple R. Thadani Theresa Kwong Amara Atif 43 0 0 25 Apr 2025
Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation Yucheng Chu Peng He Hang Li Haoyu Han Kaiqi Yang Yu Xue Tingting Li Joseph Krajcik Jiliang Tang AI4Ed 33 0 0 07 Apr 2025
CoTAL: Human-in-the-Loop Prompt Engineering, Chain-of-Thought Reasoning, and Active Learning for Generalizable Formative Assessment Scoring Clayton Cohn Nicole M. Hutchins Ashwin T S Gautam Biswas LRM 31 0 0 03 Apr 2025
Artificial Conversations, Real Results: Fostering Language Detection with Synthetic Data Fatemeh Mohammadi Tommaso Romano S. Maghool Paolo Ceravolo SyDa 42 0 0 31 Mar 2025
Efficient Multi-Task Inferencing: Model Merging with Gromov-Wasserstein Feature Alignment Luyang Fang Ehsan Latif Haoran Lu Y. Zhou Ping Ma Xiaoming Zhai MoMe 81 0 0 12 Mar 2025
Improving LLM-as-a-Judge Inference with the Judgment Distribution Victor Wang Michael J.Q. Zhang Eunsol Choi 53 0 0 04 Mar 2025
Unveiling Scoring Processes: Dissecting the Differences between LLMs and Human Graders in Automatic Scoring Xuansheng Wu Padmaja Pravin Saraf Gyeong-Geon Lee Ehsan Latif Ninghao Liu Xiaoming Zhai 55 4 0 24 Feb 2025
Navigation-GPT: A Robust and Adaptive Framework Utilizing Large Language Models for Navigation Applications Feng Ma X. Wang Chen Chen Xiao-bin Xu Xin-ping Yan 42 0 0 23 Feb 2025
Validity Arguments For Constructed Response Scoring Using Generative Artificial Intelligence Applications Jodi M. Casabianca Daniel F. McCaffrey Matthew S. Johnson Naim Alper Vladimir Zubenko 21 0 0 04 Jan 2025
Chain-of-MetaWriting: Linguistic and Textual Analysis of How Small Language Models Write Young Students Texts Ioana Buhnila Georgeta Cislaru Amalia Todirascu 80 1 0 19 Dec 2024
Does Multiple Choice Have a Future in the Age of Generative AI? A Posttest-only RCT Danielle R. Thomas Conrad Borchers Sanjit Kakarla Jionghao Lin Shambhavi Bhushan Boyuan Guo Erin Gatz Kenneth R. Koedinger ELM AI4Ed 77 3 0 13 Dec 2024
Can AI grade your essays? A comparative analysis of large language models and teacher ratings in multidimensional essay scoring Kathrin Seßler Maurice Fürstenberg B. Bühler Enkelejda Kasneci AI4Ed ELM 66 3 0 25 Nov 2024
Uncovering Autoregressive LLM Knowledge of Thematic Fit in Event Representation Safeyah Khaled Alshemali Daniel Bauer Yuval Marton BDL 35 0 0 19 Oct 2024
Automated Genre-Aware Article Scoring and Feedback Using Large Language Models Chihang Wang Yuxin Dong Zhenhong Zhang Ruotong Wang Shuo Wang Jiajing Chen 19 6 0 18 Oct 2024
A Systematic Review on Prompt Engineering in Large Language Models for K-12 STEM Education Eason Chen Danyang Wang Luyi Xu Chen Cao Xiao Fang Jionghao Lin AI4CE 32 5 0 14 Oct 2024
Transforming Teachers' Roles and Agencies in the Era of Generative AI: Perceptions, Acceptance, Knowledge, and Practices Xiaoming Zhai AI4CE 23 15 0 03 Oct 2024
A LLM-Powered Automatic Grading Framework with Human-Level Guidelines Optimization Yucheng Chu Hang Li Kaiqi Yang Harry Shomer Hui Liu Yasemin Copur-Gencturk Jiliang Tang LLMAG 26 2 0 03 Oct 2024
Beyond Scalar Reward Model: Learning Generative Judge from Preference Data Ziyi Ye Xiangsheng Li Qiuchi Li Qingyao Ai Yujia Zhou Wei Shen Dong Yan Yiqun Liu 45 10 0 01 Oct 2024
Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course Cheng-Han Chiang Wei-Chih Chen Chun-Yi Kuan Chienchou Yang Hung-yi Lee ELM AI4Ed 28 5 0 07 Jul 2024
Automatic Essay Multi-dimensional Scoring with Fine-tuning and Multiple Regression Kun Sun Rong Wang 31 2 0 03 Jun 2024
Realizing Visual Question Answering for Education: GPT-4V as a Multimodal AI Gyeong-Geon Lee Xiaoming Zhai 27 4 0 12 May 2024
Evaluating Students' Open-ended Written Responses with LLMs: Using the RAG Framework for GPT-3.5, GPT-4, Claude-3, and Mistral-Large Jussi S. Jauhiainen Agustín Garagorry Guerra 27 5 0 08 May 2024
Leveraging Prompts in LLMs to Overcome Imbalances in Complex Educational Text Data Jeanne McClure Machi Shimmei Noboru Matsuda Shiyan Jiang 16 1 0 28 Apr 2024
CodecLM: Aligning Language Models with Tailored Synthetic Data Zifeng Wang Chun-Liang Li Vincent Perot Long T. Le Jin Miao Zizhao Zhang Chen-Yu Lee Tomas Pfister SyDa ALM 16 17 0 08 Apr 2024
3P-LLM: Probabilistic Path Planning using Large Language Model for Autonomous Robot Navigation Ehsan Latif LLMAG LM&Ro 32 6 0 27 Mar 2024
PhysicsAssistant: An LLM-Powered Interactive Learning Robot for Physics Lab Investigations Ehsan Latif Ramviyas Parasuraman Xiaoming Zhai 33 13 0 27 Mar 2024
G-SciEdBERT: A Contextualized LLM for Science Assessment Tasks in German Ehsan Latif Gyeong-Geon Lee Knut Neuman Tamara Kastorff Xiaoming Zhai 20 3 0 09 Feb 2024
Gemini Pro Defeated by GPT-4V: Evidence from Education Gyeong-Geon Lee Ehsan Latif Lehong Shi Xiaoming Zhai 16 21 0 27 Dec 2023
Knowledge Distillation of LLM for Automatic Scoring of Science Education Assessments Ehsan Latif Luyang Fang Ping Ma Xiaoming Zhai 16 4 0 26 Dec 2023
Automatic Scoring of Students' Science Writing Using Hybrid Neural Network Ehsan Latif Xiaoming Zhai 19 1 0 02 Dec 2023
Using GPT-4 to Augment Unbalanced Data for Automatic Scoring Luyang Fang Gyeong-Geon Lee Xiaoming Zhai 13 17 0 25 Oct 2023
Fine-tuning ChatGPT for Automatic Scoring Ehsan Latif Xiaoming Zhai AI4MH 41 86 0 16 Oct 2023
AGI: Artificial General Intelligence for Education Ehsan Latif Gengchen Mai Matthew Nyaaba Xuansheng Wu Ninghao Liu Guoyu Lu Sheng R. Li Tianming Liu Xiaoming Zhai ELM AI4CE 16 21 0 24 Apr 2023
Unpacking the "Black Box" of AI in Education Nabeel Gillani R. Eynon Catherine Chiabaut Kelsey Finkel 19 55 0 31 Dec 2022
Binding Language Models in Symbolic Languages Zhoujun Cheng Tianbao Xie Peng Shi Chengzu Li Rahul Nadkarni ... Dragomir R. Radev Mari Ostendorf Luke Zettlemoyer Noah A. Smith Tao Yu LMTD 109 195 0 06 Oct 2022
Large Language Models are Zero-Shot Reasoners Takeshi Kojima S. Gu Machel Reid Yutaka Matsuo Yusuke Iwasawa ReLM LRM 291 2,712 0 24 May 2022
Maieutic Prompting: Logically Consistent Reasoning with Recursive Explanations Jaehun Jung Lianhui Qin Sean Welleck Faeze Brahman Chandra Bhagavatula Ronan Le Bras Yejin Choi ReLM LRM 206 189 0 24 May 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models Xuezhi Wang Jason W. Wei Dale Schuurmans Quoc Le Ed H. Chi Sharan Narang Aakanksha Chowdhery Denny Zhou ReLM BDL LRM AI4CE 297 3,163 0 21 Mar 2022
Locally Typical Sampling Clara Meister Tiago Pimentel Gian Wiher Ryan Cotterell 138 85 0 01 Feb 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models Jason W. Wei Xuezhi Wang Dale Schuurmans Maarten Bosma Brian Ichter F. Xia Ed H. Chi Quoc Le Denny Zhou LM&Ro LRM AI4CE ReLM 315 8,261 0 28 Jan 2022
A Theoretical Analysis of the Repetition Problem in Text Generation Z. Fu Wai Lam Anthony Man-Cho So Bei Shi 67 89 0 29 Dec 2020