Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

Kaldi Speech Recognition Toolkit

2024·4.896 Zitationen·Infoscience (Ecole Polytechnique Fédérale de Lausanne)Open Access

Volltext beim Verlag öffnen

4.896

Zitationen

Autoren

2024

Jahr

Abstract

Abstract—We describe the design of Kaldi, a free, open-source toolkit for speech recognition research. Kaldi provides a speech recognition system based on finite-state transducers (using the freely available OpenFst), together with detailed documentation and scripts for building complete recognition systems. Kaldi is written is C++, and the core library supports modeling of arbitrary phonetic-context sizes, acoustic modeling with subspace Gaussian mixture models (SGMM) as well as standard Gaussian mixture models, together with all commonly used linear and affine transforms. Kaldi is released under the Apache License v2.0, which is highly nonrestrictive, making it suitable for a wide community of users. I.

Autoren

Daniel Povey

Institutionen

Microsoft (United States)(US)

Themen

Speech Recognition and SynthesisSpeech and Audio ProcessingMusic and Audio Processing

Volltext beim Verlag öffnen

Kaldi Speech Recognition Toolkit

Abstract

Ähnliche Arbeiten

Autoren

Institutionen

Themen