metadata
license: apache-2.0
tags:
- icefall
- phoneme-recognition
- automatic-speech-recognition
datasets:
- bookbot/common_voice_16_1_es
- bookbot/slr72_dataset
Pruned Stateless Zipformer RNN-T Streaming Robust ES v0
Pruned Stateless Zipformer RNN-T Streaming Robust ES v0 is a Spanish automatic speech recognition model trained on the following datasets:
Instead of being trained to predict sequences of words, this model was trained to predict sequence of phonemes, e.g. ["w", "ɑ", "ʃ", "i", "ɑ"]. Therefore, the model's vocabulary contains the different IPA phonemes found in gruut.
This model was converted from the TorchScript version of Pruned Stateless Zipformer RNN-T Streaming Robust ES to ONNX format.
Converting from TorchScript
Refer to the official instructions for conversion to ONNX.