Just out of curiosity

by prudant - opened Mar 24, 2024

Discussion

prudant

Mar 24, 2024

How long did the final training process take?

hiyouga

Owner Mar 25, 2024

@prudant The training speed of AQLM fine-tuning is around 26.25s/example for a 70B model, so it requires ~15h to fine-tune the model over 2000 examples.
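For anyone wanting to redo this estimate with their own numbers, the arithmetic can be sketched as below. The 26.25 s/example and 2000 examples come from the reply above. The function and variable names are hypothetical, and the sketch assumes a single pass over the data with no evaluation or checkpointing overhead.

```python
# Rough wall-clock estimate for a fine-tuning run.
# Figures taken from the reply above; names are illustrative only.
SECONDS_PER_EXAMPLE = 26.25  # reported AQLM fine-tuning speed for a 70B model
NUM_EXAMPLES = 2000          # size of the fine-tuning set


def estimate_hours(seconds_per_example: float, num_examples: int) -> float:
    """Return total training time in hours, assuming one pass over the data."""
    return seconds_per_example * num_examples / 3600


total_hours = estimate_hours(SECONDS_PER_EXAMPLE, NUM_EXAMPLES)
print(f"{total_hours:.2f} h")  # prints "14.58 h"
```

This gives 52,500 seconds, or about 14.6 hours, which matches the ~15h figure quoted above.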

prudant

Mar 25, 2024

Thanks!

etemiz

Mar 26, 2024

Can you share the command line and config files?

hiyouga

Owner Mar 26, 2024

https://github.com/hiyouga/LLaMA-Factory/blob/main/examples/qlora_single_gpu/aqlm.sh

etemiz

Mar 27, 2024

Why does this take so few resources compared to accelerate/fsdp_config.yaml?
I am working with AlexWortega/miqu-1-70b-AQLM-2Bit-1x16-hf.

(Sorry, I am a beginner.)

Vasanth

Apr 14, 2024

@hiyouga Can you please explain how to format the data for training?

etemiz

Apr 14, 2024

Can I convert Mixtral 8x22B to AQLM and then train it using this method on 2x3090?

prudant

Apr 15, 2024

@bittamer I think the AQLM quantization process requires a lot of GPU compute (more than 4 GPUs running for a couple of days).

hiyouga

Owner Apr 15, 2024

@bittamer I think FSDP+QLoRA would be more suitable for your case.
