Safetensors please?
#2 · opened by aimeri
For best performance with vLLM, safetensors would be ideal. Would it be possible to release the model as safetensors, or as a vLLM-compatible quantization (preferably at a high bpw)? Thank you!
+1. It would also be very easy to run mlx_lm.convert on the original safetensors.
Safetensors would be the best option. I would like to quantize the model to various formats, especially AWQ.
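For context on why safetensors is the convenient starting point for all of these conversions: the format is just an 8-byte little-endian header size, a JSON index of tensor dtypes/shapes/offsets, and the raw tensor bytes, so loaders can memory-map weights directly. A minimal stdlib-only sketch of that layout (the helper names `save_safetensors` and `read_header` are illustrative, not a real API; in practice you would use `safetensors.torch.save_file` or `model.save_pretrained(out_dir, safe_serialization=True)` in transformers):

```python
import json
import struct
import tempfile

def save_safetensors(path, tensors):
    """Sketch writer. tensors: name -> (dtype_str, shape, raw LE bytes)."""
    header, payload, offset = {}, b"", 0
    for name, (dtype, shape, raw) in tensors.items():
        header[name] = {"dtype": dtype, "shape": shape,
                        "data_offsets": [offset, offset + len(raw)]}
        offset += len(raw)
        payload += raw
    hbytes = json.dumps(header).encode("utf-8")
    with open(path, "wb") as f:
        f.write(struct.pack("<Q", len(hbytes)))  # 8-byte LE header size
        f.write(hbytes)                          # JSON tensor index
        f.write(payload)                         # raw tensor data

def read_header(path):
    """Sketch reader: parse just the JSON index, as a loader would."""
    with open(path, "rb") as f:
        (n,) = struct.unpack("<Q", f.read(8))
        return json.loads(f.read(n))

# Two tiny F32 tensors (4 bytes per element, little-endian).
weights = {
    "w": ("F32", [2, 2], struct.pack("<4f", 1, 2, 3, 4)),
    "b": ("F32", [2], struct.pack("<2f", 0.5, -0.5)),
}
path = tempfile.mktemp(suffix=".safetensors")
save_safetensors(path, weights)
hdr = read_header(path)
print(hdr["w"]["data_offsets"])  # [0, 16]
```

Because the header alone describes every tensor, downstream tools (vLLM, mlx_lm.convert, AWQ quantizers) can read shapes and dtypes without deserializing a pickle, which is the safety and speed argument for releasing safetensors in the first place.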