Name: patent-strategist-notebooks
Published: 2026-05-23T16:45:00Z
License: open

What this notebook does

The artifact → card → article loop sells the outcome but offers no runnable on-ramp: a researcher who wants to reproduce the fine-tune, or a developer who wants to call the model, has to reconstruct the journey from prose. These two notebooks close that gap. The builder notebook walks the full baseline → corpus → train → quantize → publish journey as typed fieldkit API calls; the user notebook calls the model on real patent-prosecution tasks and surfaces its reasoning chains. Both are one-click via Open in Colab / Open in Kaggle and run offline on a DGX Spark.

Use cases

Builder: reproduce the build — baseline, corpus gates, backend decision, train, probe, quantize, publish — as fieldkit calls
User: claim construction, MPEP-grounded office-action drafting, prior-art relevance, and licensing-scenario analysis with reasoning chains surfaced
User: ground answers in MPEP text with fieldkit.rag and gate quality with fieldkit.eval scorers
Both: run the whole workflow offline on a DGX Spark or on a free Colab / Kaggle GPU (dual-path, runtime-detected)
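
The "dual-path, runtime-detected" behaviour above could be sketched roughly as follows. This is a minimal illustration, not the notebooks' actual detection code: the function name detect_runtime is hypothetical, the checks rely on signals commonly present on those platforms (the google.colab module, the COLAB_RELEASE_TAG and KAGGLE_KERNEL_RUN_TYPE environment variables), and treating everything else as the local Spark path is an assumption.

```python
import os
import sys
from typing import Iterable, Mapping, Optional


def detect_runtime(env: Optional[Mapping[str, str]] = None,
                   modules: Optional[Iterable[str]] = None) -> str:
    """Return "colab", "kaggle", or "local" (hypothetical helper).

    env and modules are injectable so the logic can be exercised
    without actually running on Colab or Kaggle.
    """
    env = os.environ if env is None else env
    modules = sys.modules if modules is None else modules

    # Colab preloads the google.colab package and sets a release tag.
    if "google.colab" in modules or "COLAB_RELEASE_TAG" in env:
        return "colab"
    # Kaggle kernels export this variable for interactive and batch runs.
    if "KAGGLE_KERNEL_RUN_TYPE" in env:
        return "kaggle"
    # Assumption: anything else is a local machine such as a DGX Spark.
    return "local"


if __name__ == "__main__":
    print(f"runtime: {detect_runtime()}")
```

A notebook could branch on the returned string once, near the top, to choose between the offline Spark path and the cloud GPU path.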

Audience: AI researchers and engineers who want to reproduce the build, and app developers who want to call the model, working on Spark-class hardware (GB10, 128 GB unified memory) or a free cloud GPU.

Choosing the variant

Two facets of the same notebook — pick by your goal.

builder: Walks the build journey on Spark — fieldkit API calls replacing ad-hoc scripts; surfaces speed, feasibility, and viability.
user: Demonstrates the published model on realistic domain tasks — runtime-detected, runs on Spark or on a free Colab/Kaggle GPU.

Methods

Read the field note: Two Trainers, One LoRA: NeMo Framework Beats Unsloth by 26% on a Patent-Strategist Fine-Tune

Same recipe, same R1-distilled base, same 5000-row patent corpus, run once via Unsloth and once via NeMo Framework + Megatron-Bridge. NeMo finishes 26% faster and produces 44% longer patent-strategic chains. The cost is one YARN-defaults landmine and a stdout that lied for four hours.

Known drift

Bounded limitations — Colab/Kaggle runs use the published quant; reasoning quality may differ from the BF16 weights on Spark. Each entry carries an explicit bound.

The user notebook pins the Q5_K_M quant on both the Spark and cloud paths: Q5_K_M is the fast+accurate sweet spot — 10.04 wikitext perplexity at 35 tok/s on a GB10, within 0.8% of Q6_K's accuracy. Heavier/lighter variants are one keyword away; see the sibling GGUF card for the full matrix.
The builder notebook's heavy steps (baseline, corpus, train, probe, quantize) render the recorded Spark run rather than re-executing live; these are the 5 recorded, Spark-only cells. The remaining cells (feasibility envelope, backend decision, the four viz figures, publish dry-run) run live on any runtime.
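
The recorded-versus-live split in the builder notebook can be pictured as a simple lookup. The sketch below is illustrative only: the step names come from the entry above, while cell_mode, RECORDED_STEPS, and LIVE_STEPS are hypothetical names and do not reflect how the notebook is actually implemented.

```python
from typing import Dict, List

# Heavy steps that replay the recorded Spark run (5 cells, per the entry above).
RECORDED_STEPS = ("baseline", "corpus", "train", "probe", "quantize")

# Steps described as running live on any runtime.
LIVE_STEPS = (
    "feasibility envelope",
    "backend decision",
    "viz figures",
    "publish dry-run",
)


def cell_mode(step: str) -> str:
    """Return "recorded" or "live" for a builder step (hypothetical helper)."""
    if step in RECORDED_STEPS:
        return "recorded"
    if step in LIVE_STEPS:
        return "live"
    raise KeyError(f"unknown builder step: {step!r}")


def plan() -> Dict[str, List[str]]:
    """Group every known step by how it executes."""
    return {"recorded": list(RECORDED_STEPS), "live": list(LIVE_STEPS)}


if __name__ == "__main__":
    for mode, steps in plan().items():
        print(f"{mode}: {', '.join(steps)}")
```

Keeping the split in one table makes the bound explicit: only the listed heavy steps depend on the recorded run, and any other step is expected to execute wherever the notebook is opened.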

Sibling artifacts

The model this notebook targets, plus other variants in the same family.

patent-strategist-v3-nemo-gguf: target model (quantization)