Tag
#pass-at-k
Articles tagged "pass-at-k" — 1 entry.
Frontier Scout
Pass@k After the Seventh Patch — Three Shapes ESamp Takes on Spark
There were six patches. The Pass@k harness surfaced a seventh: a one-line slice in the residual tap that only fires once batches shrink mid-run. With that cleared, ESamp takes three shapes: flat on saturated cells, a lift in both rates on instruct headroom, and a +6.67pp pass@8 gain on the unsaturated reasoning cell.
uses fieldkit.eval, fieldkit.capabilities