Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

Few-Shot Named Entity Recognition Approach Based on Large Language Models

2026·0 ZitationenOpen Access

Volltext beim Verlag öffnen

Zitationen

Autoren

2026

Jahr

Abstract

Named Entity Recognition (NER), as a pivotal task in Natural Language Processing (NLP), holds significant research value. However, current NER approaches typically rely on large amounts of annotated data and exhibit severe overfitting in low-resource scenarios. In 2022, ChatGPT based on GPT-3.5 was introduced, marking the rapid advancement of large language models. Their robust semantic understanding capabilities demonstrate strong potential for few-shot learning. Consequently, this paper investigates few-shot named entity recognition based on large language models. Given that large models are generative and prone to hallucinations, this study proposes a novel framework—CoT-LoRA-NER—that integrates Chain-of-Thought (CoT) reasoning guidance with Low-Rank Adaptation (LoRA) fine-tuning. This approach enables large models to perform named entity recognition in a reasoning-enhanced manner, implemented through three stages: CoT-guided data augmentation: Chain-of-Thought samples with explicit entity-level reasoning are automatically generated via the Deepseek API, with quality controlled through prompt template design, label schema specification, and limited random inspection, without per-instance manual correction; prompt construction: a structured question-reasoning-entity input format is designed to enhance contextual awareness of entity boundaries; parameter-efficient fine-tuning: LoRA is employed to optimize attention weights for the NER task, thereby reducing the risk of overfitting.

Autoren

Institutionen

Xi'an University of Science and Technology(CN)

Themen

Topic ModelingAdvanced Graph Neural NetworksArtificial Intelligence in Healthcare and Education

Volltext beim Verlag öffnen

Few-Shot Named Entity Recognition Approach Based on Large Language Models

Abstract

Ähnliche Arbeiten

Autoren

Institutionen

Themen