Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Kaldi Speech Recognition Toolkit
4.896
Zitationen
1
Autoren
2024
Jahr
Abstract
Abstract—We describe the design of Kaldi, a free, open-source toolkit for speech recognition research. Kaldi provides a speech recognition system based on finite-state transducers (using the freely available OpenFst), together with detailed documentation and scripts for building complete recognition systems. Kaldi is written is C++, and the core library supports modeling of arbitrary phonetic-context sizes, acoustic modeling with subspace Gaussian mixture models (SGMM) as well as standard Gaussian mixture models, together with all commonly used linear and affine transforms. Kaldi is released under the Apache License v2.0, which is highly nonrestrictive, making it suitable for a wide community of users. I.
Ähnliche Arbeiten
AI-Assisted Pipeline for Dynamic Generation of Trustworthy Health Supplement Content at Scale
2018 · 45.506 Zit.
Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation
2014 · 24.111 Zit.
A tutorial on hidden Markov models and selected applications in speech recognition
1989 · 22.698 Zit.
Efficient Estimation of Word Representations in Vector Space
2013 · 18.105 Zit.
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
2001 · 12.995 Zit.