The model is great, clearly superior to the others, and I'm very excited about it, but its VRAM consumption is quite high. Could you publish an FP8 or similarly quantized version?
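In the meantime, here's a minimal sketch of a stopgap: loading the released weights with 8-bit (INT8) quantization via bitsandbytes, assuming this is a standard transformers causal LM. The `model_id` below is a placeholder, not the actual repo name.

```python
# Stopgap sketch: load the full-precision checkpoint with INT8 quantization
# via bitsandbytes until an official FP8 release is available.
# Assumes a standard transformers causal LM; "org/model-name" is a placeholder.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "org/model-name"  # placeholder for the actual model repo

quant_config = BitsAndBytesConfig(load_in_8bit=True)  # roughly halves VRAM vs. FP16

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=quant_config,
    device_map="auto",          # spread layers across available GPUs
    torch_dtype=torch.float16,  # keep non-quantized ops in half precision
)
```

That said, an official FP8 checkpoint would still be preferable, since on-the-fly INT8 quantization adds load time and can cost some quality compared to a properly calibrated release.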
Can we get an FP8 version or similar?