找到 1 篇关于此标签的文章
A community researcher achieved usable quality from a 397B parameter model using 35% REAP quantization on a 96GB GPU.