Tag
#distillation
Articles tagged "distillation" — 2 entries.
Frontier Scout
Test-Time Distilling on Spark — Same Compute Envelope, Wider Semantic Reach
ESamp adds a tiny test-time-trained probe to vLLM that converts decoding from lexical resampling into semantic exploration. The runtime is vLLM-native — and that is a Spark catalog-gap story before it is a benchmark.
uses fieldkit.evalfieldkit.capabilities
Machine that Builds Machines
Distilling the Architect — A 3B LoRA Trained on the Agent's Own Trajectory
A4's 50-iter trajectory becomes training data for a Qwen2.5-3B LoRA proposer. Holding out 8 iters, the 3B mode-collapses onto d_model=768 (the trajectory's most-frequent keep) and matches 0 / 8 exact; the 8B at T=0.5 matches 4 / 8 of its own past picks.