1500 Bytes for Llama 2 Inference: Framework Bloat is a Choice, Not Inevitable
sectorllm runs Llama 2 inference in under 1,500 bytes of x86 assembly. The core LLM logic is minimal; framework bloat is an engineering choice, not inevitable.