Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
A Cross-Disciplinary Academic Evaluation of Generative AI Models in HR, Accounting, and Economics: ChatGPT-5 vs. DeepSeek
2
Zitationen
3
Autoren
2025
Jahr
Abstract
As generative AI is being further integrated into academic and professional contexts, there is a demonstrable need to determine the performance of generative AI within specific, applied domains. This research compares the performances of ChatGPT-5 and DeepSeek on tasks in the domains of accounting, economics, and human resources. The models were provided two prompts per domain, and outputs were evaluated by academics across five criteria: accuracy, clarity, conciseness, systematic reasoning, and indicators of potential bias. The inter-rater reliability was reported using Cohen’s Kappa. From the findings, both models display differences in performance. ChatGPT-5 outperformed DeepSeek in accounting and human resources, while DeepSeek outperformed ChatGPT-5 on epistemic economics tasks. Since results have shown that ChatGPT-5 outperformed DeepSeek in two out of three domains, the research recommends a reliability-based framework to compare generative AI outputs within business disciplines and offers practical suggestions on when and how to use the models within academic and professional contexts.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.693 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.598 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.124 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.871 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.