Tag

#nemo-framework

Articles tagged "nemo-framework" — 1 entry.

Article №44 · fine-tuning · NeMo · ~16 hours wall (7h 34m Unsloth + 5h 38m NeMo + conversion + merge + probe)
Looking Beyond Spark

Two Trainers, One LoRA: NeMo Framework Beats Unsloth by 26% on a Patent-Strategist Fine-Tune

Same recipe, same R1-distilled base, same 5,000-row patent corpus, run once via Unsloth and once via NeMo Framework + Megatron-Bridge. NeMo finishes 26% faster and produces 44% longer patent-strategic chains. The cost is one YARN-defaults landmine and a stdout that lied for four hours.