Automated Semantic Grading of Programs

ACM-SIGPLAN Symposium on Programming Language Design and Implementation (PLDI), 2012

8 April 2012

Abstract

We present a new method for automatically grading introductory programming assignments. In order to use this method, instructors provide a reference implementation of the assignment, and an error model consisting of potential corrections to errors that students might make. Using this information, the system automatically derives minimal corrections to student's incorrect solutions, providing them with a quantifiable measure of exactly how incorrect a given solution was, as well as feedback about what they did wrong. We introduce a simple language for describing error models in terms of correction rules, and formally define a rule-directed translation strategy that reduces the problem of finding minimal corrections in an incorrect program to the problem of synthesizing a correct program from a sketch. We have evaluated our system on over 1000 solution attempts by real beginner programmers. Our results show that relatively simple error models can correct on average over 70% of submissions with non-trivial errors. We also show that the error models generalize across different problems from the same category and that our technique scales even for complex error models and larger programming assignments such as those found in AP level computer science final examinations.

View on arXiv

Comments on this paper