|
|
--- |
|
|
license: apache-2.0 |
|
|
tags: |
|
|
- icefall |
|
|
- phoneme-recognition |
|
|
- automatic-speech-recognition |
|
|
datasets: |
|
|
- bookbot/common_voice_16_1_es |
|
|
- bookbot/slr72_dataset |
|
|
--- |
|
|
|
|
|
# Pruned Stateless Zipformer RNN-T Streaming Robust ES v0 |
|
|
|
|
|
Pruned Stateless Zipformer RNN-T Streaming Robust ES v0 is a Spanish automatic speech recognition model trained on the following datasets: |
|
|
|
|
|
- [Common Voice 23.0 Spanish](https://datacollective.mozillafoundation.org/datasets/cmflnuzw51ddgmwjkxpm9z1lw) |
|
|
- [SLR72 dataset](https://www.openslr.org/72/) |
|
|
|
|
|
Instead of being trained to predict sequences of words, this model was trained to predict sequence of phonemes, e.g. `["w", "ɑ", "ʃ", "i", "ɑ"]`. Therefore, the model's [vocabulary](https://huggingface.co/bookbot/zipformer-streaming-robust-es-v0/blob/main/data/lang_phone/tokens.txt) contains the different IPA phonemes found in [gruut](https://github.com/rhasspy/gruut). |
|
|
|
|
|
This model was converted from the TorchScript version of [Pruned Stateless Zipformer RNN-T Streaming Robust ES](https://huggingface.co/bookbot/zipformer-streaming-robust-es-v0) to ONNX format. |
|
|
|
|
|
|
|
|
## Converting from TorchScript |
|
|
|
|
|
Refer to the [official instructions](https://icefall.readthedocs.io/en/latest/model-export/export-onnx.html#export-the-model-to-onnx) for conversion to ONNX. |
|
|
|
|
|
## Frameworks |
|
|
|
|
|
- [k2](https://github.com/k2-fsa/k2) |
|
|
- [icefall](https://github.com/bookbot-hive/icefall) |
|
|
- [lhotse](https://github.com/bookbot-hive/lhotse) |
|
|
- [sherpa-onnx](https://github.com/k2-fsa/sherpa-onnx) |
|
|
- [onnx](https://github.com/onnx/onnx) |
|
|
- [onnxruntime](https://github.com/microsoft/onnxruntime) |