ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.03748
  4. Cited By
Applying Large Language Models and Chain-of-Thought for Automatic
  Scoring

Applying Large Language Models and Chain-of-Thought for Automatic Scoring

30 November 2023
Gyeong-Geon Lee
Ehsan Latif
Xuansheng Wu
Ninghao Liu
Xiaoming Zhai
ArXivPDFHTML

Papers citing "Applying Large Language Models and Chain-of-Thought for Automatic Scoring"

41 / 41 papers shown
Title
Evolution of AI in Education: Agentic Workflows
Evolution of AI in Education: Agentic Workflows
Firuz Kamalov
David Santandreu Calonge
Linda Smail
Dilshod Azizov
Dimple R. Thadani
Theresa Kwong
Amara Atif
43
0
0
25 Apr 2025
Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation
Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation
Yucheng Chu
Peng He
Hang Li
Haoyu Han
Kaiqi Yang
Yu Xue
Tingting Li
Joseph Krajcik
Jiliang Tang
AI4Ed
33
0
0
07 Apr 2025
CoTAL: Human-in-the-Loop Prompt Engineering, Chain-of-Thought Reasoning, and Active Learning for Generalizable Formative Assessment Scoring
CoTAL: Human-in-the-Loop Prompt Engineering, Chain-of-Thought Reasoning, and Active Learning for Generalizable Formative Assessment Scoring
Clayton Cohn
Nicole M. Hutchins
Ashwin T S
Gautam Biswas
LRM
31
0
0
03 Apr 2025
Artificial Conversations, Real Results: Fostering Language Detection with Synthetic Data
Artificial Conversations, Real Results: Fostering Language Detection with Synthetic Data
Fatemeh Mohammadi
Tommaso Romano
S. Maghool
Paolo Ceravolo
SyDa
42
0
0
31 Mar 2025
Efficient Multi-Task Inferencing: Model Merging with Gromov-Wasserstein Feature Alignment
Luyang Fang
Ehsan Latif
Haoran Lu
Y. Zhou
Ping Ma
Xiaoming Zhai
MoMe
81
0
0
12 Mar 2025
Improving LLM-as-a-Judge Inference with the Judgment Distribution
Victor Wang
Michael J.Q. Zhang
Eunsol Choi
53
0
0
04 Mar 2025
Unveiling Scoring Processes: Dissecting the Differences between LLMs and Human Graders in Automatic Scoring
Unveiling Scoring Processes: Dissecting the Differences between LLMs and Human Graders in Automatic Scoring
Xuansheng Wu
Padmaja Pravin Saraf
Gyeong-Geon Lee
Ehsan Latif
Ninghao Liu
Xiaoming Zhai
55
4
0
24 Feb 2025
Navigation-GPT: A Robust and Adaptive Framework Utilizing Large Language Models for Navigation Applications
Navigation-GPT: A Robust and Adaptive Framework Utilizing Large Language Models for Navigation Applications
Feng Ma
X. Wang
Chen Chen
Xiao-bin Xu
Xin-ping Yan
42
0
0
23 Feb 2025
Validity Arguments For Constructed Response Scoring Using Generative Artificial Intelligence Applications
Validity Arguments For Constructed Response Scoring Using Generative Artificial Intelligence Applications
Jodi M. Casabianca
Daniel F. McCaffrey
Matthew S. Johnson
Naim Alper
Vladimir Zubenko
21
0
0
04 Jan 2025
Chain-of-MetaWriting: Linguistic and Textual Analysis of How Small
  Language Models Write Young Students Texts
Chain-of-MetaWriting: Linguistic and Textual Analysis of How Small Language Models Write Young Students Texts
Ioana Buhnila
Georgeta Cislaru
Amalia Todirascu
80
1
0
19 Dec 2024
Does Multiple Choice Have a Future in the Age of Generative AI? A
  Posttest-only RCT
Does Multiple Choice Have a Future in the Age of Generative AI? A Posttest-only RCT
Danielle R. Thomas
Conrad Borchers
Sanjit Kakarla
Jionghao Lin
Shambhavi Bhushan
Boyuan Guo
Erin Gatz
Kenneth R. Koedinger
ELM
AI4Ed
77
3
0
13 Dec 2024
Can AI grade your essays? A comparative analysis of large language
  models and teacher ratings in multidimensional essay scoring
Can AI grade your essays? A comparative analysis of large language models and teacher ratings in multidimensional essay scoring
Kathrin Seßler
Maurice Fürstenberg
B. Bühler
Enkelejda Kasneci
AI4Ed
ELM
66
3
0
25 Nov 2024
Uncovering Autoregressive LLM Knowledge of Thematic Fit in Event
  Representation
Uncovering Autoregressive LLM Knowledge of Thematic Fit in Event Representation
Safeyah Khaled Alshemali
Daniel Bauer
Yuval Marton
BDL
35
0
0
19 Oct 2024
Automated Genre-Aware Article Scoring and Feedback Using Large Language
  Models
Automated Genre-Aware Article Scoring and Feedback Using Large Language Models
Chihang Wang
Yuxin Dong
Zhenhong Zhang
Ruotong Wang
Shuo Wang
Jiajing Chen
19
6
0
18 Oct 2024
A Systematic Review on Prompt Engineering in Large Language Models for
  K-12 STEM Education
A Systematic Review on Prompt Engineering in Large Language Models for K-12 STEM Education
Eason Chen
Danyang Wang
Luyi Xu
Chen Cao
Xiao Fang
Jionghao Lin
AI4CE
32
5
0
14 Oct 2024
Transforming Teachers' Roles and Agencies in the Era of Generative AI:
  Perceptions, Acceptance, Knowledge, and Practices
Transforming Teachers' Roles and Agencies in the Era of Generative AI: Perceptions, Acceptance, Knowledge, and Practices
Xiaoming Zhai
AI4CE
23
15
0
03 Oct 2024
A LLM-Powered Automatic Grading Framework with Human-Level Guidelines
  Optimization
A LLM-Powered Automatic Grading Framework with Human-Level Guidelines Optimization
Yucheng Chu
Hang Li
Kaiqi Yang
Harry Shomer
Hui Liu
Yasemin Copur-Gencturk
Jiliang Tang
LLMAG
26
2
0
03 Oct 2024
Beyond Scalar Reward Model: Learning Generative Judge from Preference
  Data
Beyond Scalar Reward Model: Learning Generative Judge from Preference Data
Ziyi Ye
Xiangsheng Li
Qiuchi Li
Qingyao Ai
Yujia Zhou
Wei Shen
Dong Yan
Yiqun Liu
45
10
0
01 Oct 2024
Large Language Model as an Assignment Evaluator: Insights, Feedback, and
  Challenges in a 1000+ Student Course
Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course
Cheng-Han Chiang
Wei-Chih Chen
Chun-Yi Kuan
Chienchou Yang
Hung-yi Lee
ELM
AI4Ed
28
5
0
07 Jul 2024
Automatic Essay Multi-dimensional Scoring with Fine-tuning and Multiple
  Regression
Automatic Essay Multi-dimensional Scoring with Fine-tuning and Multiple Regression
Kun Sun
Rong Wang
31
2
0
03 Jun 2024
Realizing Visual Question Answering for Education: GPT-4V as a
  Multimodal AI
Realizing Visual Question Answering for Education: GPT-4V as a Multimodal AI
Gyeong-Geon Lee
Xiaoming Zhai
27
4
0
12 May 2024
Evaluating Students' Open-ended Written Responses with LLMs: Using the
  RAG Framework for GPT-3.5, GPT-4, Claude-3, and Mistral-Large
Evaluating Students' Open-ended Written Responses with LLMs: Using the RAG Framework for GPT-3.5, GPT-4, Claude-3, and Mistral-Large
Jussi S. Jauhiainen
Agustín Garagorry Guerra
27
5
0
08 May 2024
Leveraging Prompts in LLMs to Overcome Imbalances in Complex Educational
  Text Data
Leveraging Prompts in LLMs to Overcome Imbalances in Complex Educational Text Data
Jeanne McClure
Machi Shimmei
Noboru Matsuda
Shiyan Jiang
16
1
0
28 Apr 2024
CodecLM: Aligning Language Models with Tailored Synthetic Data
CodecLM: Aligning Language Models with Tailored Synthetic Data
Zifeng Wang
Chun-Liang Li
Vincent Perot
Long T. Le
Jin Miao
Zizhao Zhang
Chen-Yu Lee
Tomas Pfister
SyDa
ALM
16
17
0
08 Apr 2024
3P-LLM: Probabilistic Path Planning using Large Language Model for
  Autonomous Robot Navigation
3P-LLM: Probabilistic Path Planning using Large Language Model for Autonomous Robot Navigation
Ehsan Latif
LLMAG
LM&Ro
32
6
0
27 Mar 2024
PhysicsAssistant: An LLM-Powered Interactive Learning Robot for Physics
  Lab Investigations
PhysicsAssistant: An LLM-Powered Interactive Learning Robot for Physics Lab Investigations
Ehsan Latif
Ramviyas Parasuraman
Xiaoming Zhai
33
13
0
27 Mar 2024
G-SciEdBERT: A Contextualized LLM for Science Assessment Tasks in German
G-SciEdBERT: A Contextualized LLM for Science Assessment Tasks in German
Ehsan Latif
Gyeong-Geon Lee
Knut Neuman
Tamara Kastorff
Xiaoming Zhai
20
3
0
09 Feb 2024
Gemini Pro Defeated by GPT-4V: Evidence from Education
Gemini Pro Defeated by GPT-4V: Evidence from Education
Gyeong-Geon Lee
Ehsan Latif
Lehong Shi
Xiaoming Zhai
16
21
0
27 Dec 2023
Knowledge Distillation of LLM for Automatic Scoring of Science Education
  Assessments
Knowledge Distillation of LLM for Automatic Scoring of Science Education Assessments
Ehsan Latif
Luyang Fang
Ping Ma
Xiaoming Zhai
16
4
0
26 Dec 2023
Automatic Scoring of Students' Science Writing Using Hybrid Neural
  Network
Automatic Scoring of Students' Science Writing Using Hybrid Neural Network
Ehsan Latif
Xiaoming Zhai
19
1
0
02 Dec 2023
Using GPT-4 to Augment Unbalanced Data for Automatic Scoring
Using GPT-4 to Augment Unbalanced Data for Automatic Scoring
Luyang Fang
Gyeong-Geon Lee
Xiaoming Zhai
13
17
0
25 Oct 2023
Fine-tuning ChatGPT for Automatic Scoring
Fine-tuning ChatGPT for Automatic Scoring
Ehsan Latif
Xiaoming Zhai
AI4MH
41
86
0
16 Oct 2023
AGI: Artificial General Intelligence for Education
AGI: Artificial General Intelligence for Education
Ehsan Latif
Gengchen Mai
Matthew Nyaaba
Xuansheng Wu
Ninghao Liu
Guoyu Lu
Sheng R. Li
Tianming Liu
Xiaoming Zhai
ELM
AI4CE
16
21
0
24 Apr 2023
Unpacking the "Black Box" of AI in Education
Unpacking the "Black Box" of AI in Education
Nabeel Gillani
R. Eynon
Catherine Chiabaut
Kelsey Finkel
19
55
0
31 Dec 2022
Binding Language Models in Symbolic Languages
Binding Language Models in Symbolic Languages
Zhoujun Cheng
Tianbao Xie
Peng Shi
Chengzu Li
Rahul Nadkarni
...
Dragomir R. Radev
Mari Ostendorf
Luke Zettlemoyer
Noah A. Smith
Tao Yu
LMTD
109
195
0
06 Oct 2022
Large Language Models are Zero-Shot Reasoners
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
291
2,712
0
24 May 2022
Maieutic Prompting: Logically Consistent Reasoning with Recursive
  Explanations
Maieutic Prompting: Logically Consistent Reasoning with Recursive Explanations
Jaehun Jung
Lianhui Qin
Sean Welleck
Faeze Brahman
Chandra Bhagavatula
Ronan Le Bras
Yejin Choi
ReLM
LRM
206
189
0
24 May 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
297
3,163
0
21 Mar 2022
Locally Typical Sampling
Locally Typical Sampling
Clara Meister
Tiago Pimentel
Gian Wiher
Ryan Cotterell
138
85
0
01 Feb 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,261
0
28 Jan 2022
A Theoretical Analysis of the Repetition Problem in Text Generation
A Theoretical Analysis of the Repetition Problem in Text Generation
Z. Fu
Wai Lam
Anthony Man-Cho So
Bei Shi
67
89
0
29 Dec 2020
1