This is an overview page with metadata for this scholarly work. The full article is available from the publisher.

Lost in the middle? examining scoring reliability and position bias in LLM-based automated essay scoring

2026 · 0 citations · Education and Information Technologies · Open Access

Abstract

This study investigates position bias in ChatGPT’s scoring patterns for automated essay scoring, with a focus on primacy and recency effects. Position bias, originating from the serial position effect in cognitive psychology, refers to the tendency of Large Language Models (LLMs) to emphasize the introduction and conclusion of a text while potentially neglecting content in the middle sections. Using 192 synthetic essays across varying lengths and section qualities, this research explores whether ChatGPT disproportionately weighs the quality of introductions and conclusions compared to body paragraphs. Statistical analyses reveal that while ChatGPT successfully differentiates between strong and weak sections, no consistent evidence supports the presence of systematic primacy or recency effects in overall scoring. Domain-specific analyses further indicate that rubric categories such as grammar and mechanics are sensitive to errors throughout essays, while content and organization are more heavily influenced by body quality. The findings suggest that ChatGPT’s scoring patterns are largely balanced, with minimal signs of position bias, thereby enhancing the validity of its use for automated scoring. This research highlights the need for continued evaluation of AI-based grading systems to ensure fairness and reliability while proposing avenues for future exploration in LLM-driven assessments.

Authors

Özgür Çelik

Institutions

Balıkesir University (TR)

Topics

Artificial Intelligence in Healthcare and Education · Topic Modeling · Explainable Artificial Intelligence (XAI)
