Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Comparative evaluation of ChatGPT, Claude, DeepSeek, and Gemini for text-to-style vector generation using RAG-enhanced BERT pipelines
0
Zitationen
4
Autoren
2026
Jahr
Abstract
This study presents a comprehensive comparative analysis of four major Large Language Models (LLMs) - ChatGPT-4, Claude Sonnet 4, DeepSeek-7B, and Gemini 2.0 Flash - for text-to-style vector generation in design applications. The research evaluates these models within a unified RAG-enhanced pipeline incorporating BERT for semantic vector extraction. Using ROUGE-L F1, Cosine Similarity, BERT Token Count, and Generation Time as evaluation metrics, we assess each model’s capability to process design-oriented prompts and generate semantically coherent style vectors. Our findings reveal that DeepSeek-7B achieved superior performance in both structural alignment (0.27 ROUGE-L F1) and semantic preservation (0.83 cosine similarity), while ChatGPT-4 demonstrated optimal efficiency with the fastest processing time (1.94 seconds) and excellent token utilization (69 tokens). The study emphasizes the critical importance of complete pipeline validation and provides practical guidance for selecting appropriate LLMs based on specific application requirements in creative AI systems.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.700 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.605 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.133 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.873 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.