1 article tagged with this topic
A community researcher achieved usable quality from a 397B parameter model using 35% REAP quantization on a 96GB GPU.