	---
	license: apache-2.0
	tags:
	- icefall
	- phoneme-recognition
	- automatic-speech-recognition
	datasets:
	- bookbot/common_voice_16_1_es
	- bookbot/slr72_dataset
	---

	# Pruned Stateless Zipformer RNN-T Streaming Robust ES v0

	Pruned Stateless Zipformer RNN-T Streaming Robust ES v0 is a Spanish automatic speech recognition model trained on the following datasets:

	- [Common Voice 23.0 Spanish](https://datacollective.mozillafoundation.org/datasets/cmflnuzw51ddgmwjkxpm9z1lw)
	- [SLR72 dataset](https://www.openslr.org/72/)

	Instead of predicting sequences of words, this model was trained to predict sequences of phonemes, e.g. `["w", "ɑ", "ʃ", "i", "ɑ"]`. The model's [vocabulary](https://huggingface.co/bookbot/zipformer-streaming-robust-es-v0/blob/main/data/lang_phone/tokens.txt) therefore contains the IPA phonemes found in [gruut](https://github.com/rhasspy/gruut).
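
To illustrate how a phoneme vocabulary of this kind can be used, here is a minimal sketch that maps token IDs back to phoneme symbols. It assumes the usual icefall `tokens.txt` layout of one `<symbol> <id>` pair per line with `<blk>` as the blank symbol. The sample entries and the helper names `load_id_to_token` and `ids_to_phonemes` are hypothetical and are not taken from this model's actual vocabulary file.

```python
from typing import Dict, List

# Hypothetical excerpt in the assumed `tokens.txt` layout.
# The real vocabulary for this model is the file linked above.
SAMPLE_TOKENS = """\
<blk> 0
a 1
o 2
l 3
"""


def load_id_to_token(text: str) -> Dict[int, str]:
    """Parse '<symbol> <id>' lines into an id -> symbol table."""
    table: Dict[int, str] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Split on the last run of whitespace so the symbol is kept whole.
        symbol, idx = line.rsplit(maxsplit=1)
        table[int(idx)] = symbol
    return table


def ids_to_phonemes(ids: List[int], table: Dict[int, str], blank: str = "<blk>") -> List[str]:
    """Map token IDs to phoneme symbols, dropping the blank symbol."""
    return [table[i] for i in ids if table[i] != blank]


id_to_token = load_id_to_token(SAMPLE_TOKENS)
print(ids_to_phonemes([0, 2, 3, 1], id_to_token))  # ['o', 'l', 'a']
```

With the real file, the text passed to `load_id_to_token` would be the contents of the model's `tokens.txt` read as UTF-8.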

	This model was converted from the TorchScript version of [Pruned Stateless Zipformer RNN-T Streaming Robust ES](https://huggingface.co/bookbot/zipformer-streaming-robust-es-v0) to ONNX format.
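
As a rough sketch of how the ONNX files might be loaded, the snippet below uses the sherpa-onnx Python package to run a streaming transducer over one utterance. The file names in `model_files` are hypothetical placeholders, and the 16 kHz sample rate and 80-dimensional features are assumptions rather than values stated in this card; check the repository's file listing and the training configuration before relying on them.

```python
from pathlib import Path
from typing import Dict


def model_files(model_dir: str) -> Dict[str, str]:
    """Collect the paths the recognizer needs.

    The file names are hypothetical; replace them with the names
    actually present in the downloaded repository.
    """
    root = Path(model_dir)
    return {
        "tokens": str(root / "tokens.txt"),
        "encoder": str(root / "encoder.onnx"),
        "decoder": str(root / "decoder.onnx"),
        "joiner": str(root / "joiner.onnx"),
    }


def transcribe(model_dir: str, samples, sample_rate: int = 16000) -> str:
    """Decode one utterance and return the recognized phoneme string.

    `samples` is expected to be a 1-D float32 array scaled to [-1, 1].
    """
    import sherpa_onnx  # imported lazily; install with `pip install sherpa-onnx`

    recognizer = sherpa_onnx.OnlineRecognizer.from_transducer(
        num_threads=1,
        sample_rate=16000,  # assumed model sample rate
        feature_dim=80,     # assumed feature dimension
        decoding_method="greedy_search",
        **model_files(model_dir),
    )
    stream = recognizer.create_stream()
    stream.accept_waveform(sample_rate, samples)
    stream.input_finished()  # no more audio will arrive
    while recognizer.is_ready(stream):
        recognizer.decode_stream(stream)
    return recognizer.get_result(stream)
```

For live audio, the same stream would instead be fed in small chunks, with `accept_waveform` and the decode loop run after each chunk.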


	## Converting from TorchScript

	Refer to the [official instructions](https://icefall.readthedocs.io/en/latest/model-export/export-onnx.html#export-the-model-to-onnx) for conversion to ONNX.

	## Frameworks

	- [k2](https://github.com/k2-fsa/k2)
	- [icefall](https://github.com/bookbot-hive/icefall)
	- [lhotse](https://github.com/bookbot-hive/lhotse)
	- [sherpa-onnx](https://github.com/k2-fsa/sherpa-onnx)
	- [onnx](https://github.com/onnx/onnx)
	- [onnxruntime](https://github.com/microsoft/onnxruntime)