Tag
#eval
Articles tagged "eval" — 1 entry.
Three-Mode Bracket: Baselining a Reasoning Model Before Fine-Tuning, On One Spark
Before you fine-tune a small reasoning model on a domain bench you need to know where it stands. Three context modes — closed, retrieval, oracle — triangulate the model's ceiling on one Spark, no Judge backend or cluster required.