Davidsamuel101's picture
Update README.md
84292e7 verified
metadata
license: apache-2.0
tags:
  - icefall
  - phoneme-recognition
  - automatic-speech-recognition
datasets:
  - bookbot/common_voice_16_1_es
  - bookbot/slr72_dataset

Pruned Stateless Zipformer RNN-T Streaming Robust ES v0

Pruned Stateless Zipformer RNN-T Streaming Robust ES v0 is a Spanish automatic speech recognition model trained on the following datasets:

Instead of being trained to predict sequences of words, this model was trained to predict sequence of phonemes, e.g. ["w", "ɑ", "ʃ", "i", "ɑ"]. Therefore, the model's vocabulary contains the different IPA phonemes found in gruut.

This model was converted from the TorchScript version of Pruned Stateless Zipformer RNN-T Streaming Robust ES to ONNX format.

Converting from TorchScript

Refer to the official instructions for conversion to ONNX.

Frameworks