Cope et. al: Can AI Exam-Grading Replace Law Professors?

Kevin L. Cope (Virginia), Jes Frankenreiter (Washington University in St. Louis), Scott Hirst (Boston University), Eric A. Posner (Chicago), Daniel Schwarcz (Minnesota), Dane Thorley (BYU), Grading Machines: Can AI Exam-Grading Replace Law Professors?

Abstract

In the past few years, large language models (LLMs) have achieved significant technical advances, such that legal-advocacy organizations are increasingly adopting them as complements to—or substitutes for—lawyers and other human experts. Several studies have examined LLMs’ performance in taking law school exams, finding mixed results. Yet there have been no published studies systematically analyzing LLMs’ competence at one of law professors’ chief responsibilities: grading law school exams. This paper presents results of an analysis of how LLMs perform in evaluating student responses to legal analysis questions of the kind typically administered in law school exams. The underlying data come from exams in four subjects administered at top-30 U.S. law schools. Unlike some projects in computer or data science, our goal is not to design a new LLM that minimizes error or maximizes agreement with human graders. Rather, we seek to determine whether existing models—which can be straightforwardly applied by most professors and students—are already suitable for the task of law exam evaluation. We find that, when provided with a detailed rubric, the LLM grades correlate with the human grader at Pearson correlation coefficients of up to 0.93. Our findings suggest that, even if they do not fully replace humans in the near future, LLMs could soon be put to valuable tasks by law school professors, such as reviewing and validating professor grading, providing substantive feedback on ungraded midterms, and providing students feedback on self-administered practice exams.

About the Author

Michael Hunter Schwartz

Michael Hunter Schwartz is the author of seven books, twelve law review papers, five book chapters addressing a wide variety of relating to teaching and learning in law school. Schwartz’s books include What the Best Law Teachers Do (Harvard University Press 2013) and a contracts textbook, Contracts: A Context and Practice Casebook (4th ed. 2025), which was the first book in a textbook series he designed and now edits. Schwartz has delivered more than 230 professional presentations about teaching and learning in law school. In January 2024, National Jurist Magazine named Professor Schwartz the 9th Most Influential Person in Legal Education.

TaxProf Blog

Abstract

About the Author

ABA: Tax Professionals Across Disciplines: Expanding the Table Panel

Teaching Tidbit of the Week: Set Great Teaching Goals This Summer

Tax Advisor: IRS launches online tool to help taxpayers manage tax debt

Cope et. al: Can AI Exam-Grading Replace Law Professors?

Abstract

About the Author

Discover more from TaxProf Blog