Devstral 2 BF16 weights

#1
by cpatonn - opened

Hi, thank you for publishing Devstral 2. Will its BF16 weights be publicly available?

Mistral AI_ org

Hey @cpatonn ,

The model was trained natively in FP8, so these are the "original weights".
You can recover BF16 weights quite easily by doing the same as shown here:
https://huggingface.co/mistralai/Ministral-3-14B-Instruct-2512#transformers-bf16

You could then run model.save_pretrained(...) and you should have correct BF16 weights :-)
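
For reference, a minimal sketch of that conversion. The repo id and output directory below are placeholders, and it assumes a plain BF16 load is enough to dequantize the FP8 checkpoint; the linked model card has the authoritative loading snippet.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "mistralai/<devstral-2-repo>"  # placeholder: the actual Devstral 2 repo id
out_dir = "devstral-2-bf16"              # placeholder: where to write the BF16 export

# Load the FP8 checkpoint, materializing the weights in BF16
# (assumption: transformers dequantizes on load as the linked card shows).
model = AutoModelForCausalLM.from_pretrained(repo_id, torch_dtype=torch.bfloat16)
tokenizer = AutoTokenizer.from_pretrained(repo_id)

# Write a standard Transformers-format BF16 checkpoint
# (safetensors shards plus model.safetensors.index.json).
model.save_pretrained(out_dir)
tokenizer.save_pretrained(out_dir)
```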

Thank you for sharing with me :)

cpatonn changed discussion status to closed

After converting it to BF16 with transformers and model.save_pretrained(...), the output has a model.safetensors.index.json rather than a consolidated.safetensors.index.json, and vLLM can't serve it anymore, even with a nightly build.

Related: https://github.com/vllm-project/vllm/issues/19953

Mistral AI_ org

@YourFriendSky in this case you should load with

--config-format hf --load-format hf --tokenizer_mode hf

so that it loads from Transformers format and not ours.
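
An untested sketch of the same overrides via vLLM's offline Python API, assuming the path below points at the BF16 export from the conversion step and that these values are accepted as keyword arguments the same way they are as CLI flags:

```python
from vllm import LLM, SamplingParams

# Point vLLM at the Transformers-format checkpoint written by
# save_pretrained, forcing the HF config/load/tokenizer code paths
# instead of Mistral's consolidated format.
llm = LLM(
    model="devstral-2-bf16",  # placeholder: path to the converted checkpoint
    config_format="hf",
    load_format="hf",
    tokenizer_mode="hf",
)

outputs = llm.generate(
    ["Write a function that reverses a string."],
    SamplingParams(max_tokens=64),
)
print(outputs[0].outputs[0].text)
```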

Thanks for bringing this up. We plan to improve the integration with Transformers and make it easier to change the format of the checkpoints.
