May we please get a lightweight DeepSeek model?
My 16GB GPU and 64GB RAM cannot load something like this. I'd love to see a model within the 16b-32b range in the future, if that is possible. Thanks in advance.
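For context, a rough back-of-envelope shows why a 16GB card can't hold the full model but could hold a 16B one. Weight memory is roughly parameter count × bytes per parameter (this sketch uses 4-bit quantization, i.e. 0.5 bytes/param, as an illustrative assumption and ignores KV cache and activations):

```python
def weight_memory_gb(params_billions: float, bytes_per_param: float) -> float:
    """Rough VRAM needed just to hold the weights (no KV cache or activations)."""
    return params_billions * 1e9 * bytes_per_param / 1024**3

# A ~16B model at 4-bit fits on a 16 GB card with headroom:
print(round(weight_memory_gb(16, 0.5), 1))   # ~7.5 GB
# A ~671B model even at 4-bit is far beyond consumer hardware:
print(round(weight_memory_gb(671, 0.5), 1))  # ~312.5 GB
```

The exact numbers shift with quantization format and overhead, but the gap between the two sizes is the point.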
For real, it would be great to see smaller models from them instead of these huge TB-scale models that require a data center just to load, let alone run at any usable speed.
I think Deepseek is more interested in trying to push the frontier rather than catering to edge devices.
Fair, but it would be nice to have a lightweight DeepSeek model that CAN run on smaller devices.
I'm testing a distributed cluster that runs the full weights on consumer cards (pooling 4090s) to get around the per-card VRAM limit. Let me know if you want to run a test job.
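The core idea behind pooling cards like that is pipeline-style partitioning: assign contiguous blocks of transformer layers to each GPU so no single card needs the whole model. A minimal sketch of the even-split math (layer and GPU counts here are hypothetical, not from the post above):

```python
def split_layers(n_layers: int, n_gpus: int) -> list[range]:
    """Evenly assign contiguous layer ranges to each GPU, pipeline-parallel style.
    Earlier GPUs absorb the remainder when layers don't divide evenly."""
    base, extra = divmod(n_layers, n_gpus)
    ranges, start = [], 0
    for i in range(n_gpus):
        size = base + (1 if i < extra else 0)
        ranges.append(range(start, start + size))
        start += size
    return ranges

# e.g. 61 layers spread over 4 pooled GPUs:
for i, r in enumerate(split_layers(61, 4)):
    print(f"GPU {i}: layers {r.start}-{r.stop - 1}")
```

Real frameworks also have to move activations between cards at each boundary, which is where interconnect bandwidth (rather than raw VRAM) usually becomes the bottleneck.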