Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
From Output to Process: A Case Study of Reasoning Patterns in LLMs for AI Risk Scenario Generation
0
Zitationen
2
Autoren
2025
Jahr
Abstract
Large Language Models (LLMs) are increasingly used for the critical task of generating AI risk scenarios, yet practitioners lack empirical guidance on model selection. This study addresses that gap through a case study benchmarking 23 LLMs against a real-world AI system to analyze their underlying reasoning patterns. We introduce a novel ”Hit Rate” metric based on actual incidents to quantitatively measure performance. The results suggest significant, statistically-verified performance disparities among models and show that this gap is uncorrelated with superficial linguistic fluency. Instead, we indicate that the performance gap appears to be strongly linked to the model's underlying reasoning pattern, which leaves an unmistakable qualitative signature on the final outputs. A ”Systematic Top-Down” approach, which mirrors expert human analysis, consistently produces specific and actionable scenarios, while less structured methods yield generic or contextually flawed warnings. These findings serve as a strong caution against model-agnosticism, establishing that an LLM's reasoning process—suggested by the specificity and actionability of its outputs—is a critical factor for its efficacy in safety-critical tasks.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.740 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.649 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.202 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.886 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.