gokashi/qwen3.5-2b-lite-jax
This model is a LiteRT (TFLite) export of a Qwen 3.5 Hybrid model.
Architecture
- Hybrid Layout: 3x Gated DeltaNet layers + 1x Gated Attention layer.
- Vocab Size: 248,320
- Target: Mobile/Edge Inference via LiteRT.
Usage
Run via TensorFlow Lite Interpreter with SELECT_TF_OPS enabled.
- Downloads last month
- 8
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support