Feature Request: GGUF and EXL as options.

#3
by Clevyby - opened

Hello, I would like to request the option to specify an output format when requesting a model to be quantized, especially EXL, since for me it's the only format that can run higher-tier models on subpar specs with good generation speed.
