Ad: BlueJ Better Tax Answers. -Accomplish hours of research in seconds -Instantly draft high-quality communications -Verify answers using a library of trusted tax content. Learn more

Lunar et al.: GradeLegal: Automated Grading for German Legal Cases

Abdullah Al Zubaer (U. Passau), Lorenz Wendlinger (Deggendorf Inst. Tech.), Simon Alexander Nonn (U. Passau), Michael Granitzer (U. Passau) & Jelena Mitrović (U. Passau), GradeLegal: Automated Grading for German Legal Cases, arXiv:2605.21076 (cs) (May 20, 2026):

Grading German legal exam solutions faces growing volumes and a shortage of qualified graders, delaying feedback and creating a bottleneck. At the same time, it is a high-stakes expert task, since state exam grades strongly influence career outcomes in Germany. Despite this practical relevance, literature lacks systematic studies on effective methods for grading legal exams. To address this gap, we investigate whether large language models (LLMs) can support the automated grading of German legal case solutions in criminal and public law, thereby enabling scalable feedback and student self-testing. We present a systematic evaluation of 27 proprietary and open-source LLMs, benchmarking prompting strategies that incrementally add task-related information, such as a sample solution and a grading rubric. Using quadratic weighted kappa (QWK), reasoning-oriented LLMs can approximate expert grading in public law when given a sample solution and a grading rubric (up to 0.91), compared to 0.60 in criminal law, suggesting a harder grading task in criminal law. Beyond single-model grading, ensembling improves agreement by up to 0.15 over its best member and can offer an alternative to stronger closed-source single models. In addition, our findings suggest that effective prompt design and model selection are necessary for reliable LLM-based grading of legal exams.


About the Author

Ad: BlueJ Better Tax Answers. Blue J's generative AI tax research solution is transforming how tax experts work. Learn more.
Information and rates on advertising on TaxProf Blog

Discover more from TaxProf Blog

Subscribe now to keep reading and get access to the full archive.

Continue reading