view article Article Making LLMs lighter with AutoGPTQ and transformers +4 marcsun13, fxmarty, PanEa, qwopqwop, ybelkada, TheBloke • Aug 23, 2023 • 64
view article Article TGI Multi-LoRA: Deploy Once, Serve 30 Models +1 derek-thomas, dmaniloff, drbh • Jul 18, 2024 • 63