OWSM-CTC: Ultra-Fast Speech Foundation Models

updated Mar 8, 2025

CTC-based models from the OWSM project, designed for fast non-autoregressive inference: https://www.wavlab.org/activities/2024/owsm/

  • espnet/owsm_ctc_v3.2_ft_1B

    Automatic Speech Recognition • Updated Aug 30, 2025 • 26 • 4

  • espnet/owsm_ctc_v3.1_1B

    Automatic Speech Recognition • Updated Aug 30, 2025 • 24 • 14
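
The collection description mentions CTC-based, non-autoregressive inference. As a rough illustration of why that can be fast, here is a minimal sketch of generic greedy CTC decoding: each frame's label is chosen independently, then repeated labels are merged and blanks are dropped. This is a textbook toy example, not the OWSM-CTC implementation; the function name `ctc_greedy_decode`, the `blank_id` value, and the score matrix are all assumptions made up for illustration.

```python
from typing import List, Sequence


def ctc_greedy_decode(
    frame_scores: Sequence[Sequence[float]], blank_id: int = 0
) -> List[int]:
    """Greedy CTC decoding: per-frame argmax, merge repeats, drop blanks.

    Hypothetical helper for illustration only; not taken from ESPnet.
    """
    # Each frame is labeled independently of the others, which is what
    # makes this non-autoregressive (no feedback from earlier outputs).
    best_path = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]

    decoded: List[int] = []
    previous = None
    for label in best_path:
        # Keep a label only if it differs from the previous frame's label
        # and is not the blank symbol.
        if label != previous and label != blank_id:
            decoded.append(label)
        previous = label
    return decoded


# Toy scores over a 3-symbol vocabulary (index 0 is assumed to be blank).
frame_scores = [
    [0.10, 0.80, 0.10],  # argmax 1
    [0.20, 0.70, 0.10],  # argmax 1 (repeat, merged)
    [0.90, 0.05, 0.05],  # argmax 0 (blank)
    [0.10, 0.80, 0.10],  # argmax 1 (kept: separated by a blank)
    [0.10, 0.10, 0.80],  # argmax 2
    [0.60, 0.20, 0.20],  # argmax 0 (blank)
]

tokens = ctc_greedy_decode(frame_scores)
print(tokens)  # [1, 1, 2]
```

Because no frame's label depends on previously emitted tokens, the per-frame argmax can be computed for the whole utterance in one pass, in contrast to token-by-token autoregressive decoding.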