File size: 1,600 Bytes
84292e7 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 |
---
license: apache-2.0
tags:
- icefall
- phoneme-recognition
- automatic-speech-recognition
datasets:
- bookbot/common_voice_16_1_es
- bookbot/slr72_dataset
---
# Pruned Stateless Zipformer RNN-T Streaming Robust ES v0
Pruned Stateless Zipformer RNN-T Streaming Robust ES v0 is a Spanish automatic speech recognition model trained on the following datasets:
- [Common Voice 23.0 Spanish](https://datacollective.mozillafoundation.org/datasets/cmflnuzw51ddgmwjkxpm9z1lw)
- [SLR72 dataset](https://www.openslr.org/72/)
Instead of being trained to predict sequences of words, this model was trained to predict sequence of phonemes, e.g. `["w", "ɑ", "ʃ", "i", "ɑ"]`. Therefore, the model's [vocabulary](https://huggingface.co/bookbot/zipformer-streaming-robust-es-v0/blob/main/data/lang_phone/tokens.txt) contains the different IPA phonemes found in [gruut](https://github.com/rhasspy/gruut).
This model was converted from the TorchScript version of [Pruned Stateless Zipformer RNN-T Streaming Robust ES](https://huggingface.co/bookbot/zipformer-streaming-robust-es-v0) to ONNX format.
## Converting from TorchScript
Refer to the [official instructions](https://icefall.readthedocs.io/en/latest/model-export/export-onnx.html#export-the-model-to-onnx) for conversion to ONNX.
## Frameworks
- [k2](https://github.com/k2-fsa/k2)
- [icefall](https://github.com/bookbot-hive/icefall)
- [lhotse](https://github.com/bookbot-hive/lhotse)
- [sherpa-onnx](https://github.com/k2-fsa/sherpa-onnx)
- [onnx](https://github.com/onnx/onnx)
- [onnxruntime](https://github.com/microsoft/onnxruntime) |