1 article tagged with this topic
MegaTrain enables full-precision training of 100B+ parameter LLMs on a single GPU, potentially democratizing large model training.