Now that this is quantized, what are the memory requirements?
#6
by
salamanders
- opened
With this being quantized to int4, how much GPU memory is required to run the demo? (I tried and failed on a 1080ti with 11GB so I assume it is more than that)