Hello! Instead of using Whisper, is it possible to take the audio encoder from Gemma 3n and attach it here (Gemma 3 4B)?
Why not use 3n in the first place?
'Cause I hate the matryoshka architecture!