Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
LCB-Bench v1.1: Benchmarking Large Language Models for Cognitive Bias Detection
0
Zitationen
1
Autoren
2026
Jahr
Abstract
LCB (LLM Cognitive Bias Benchmark) v1.1: Updated evaluation results with complete GPT-4o coverage (1500/1500 cases scored, 0 errors). 1,500 human-authored test cases measuring 30 cognitive biases across 7 categories. Includes evaluation harness and results for 4 frontier models (GPT-4o, GPT-4o-mini, Gemini 2.5 Flash, Claude Sonnet 4.6). LCB Scores range from 69.0 to 83.6/100. Related to DOI 10.5281/zenodo.19185736 (v1.0).
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.774 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.685 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.244 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.898 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.