← Notebooks
Notebook · IPYNB ·builder · user

patent-strategist-notebooks

Run the patent-strategist build — and use the model — on a Spark or a free cloud GPU

Notebook patent-strategist-notebooks — builder · user
Notebookbuilder · useron DeepSeek-R1-0528-Qwen3-8B
Build it
Use it

What this notebook does

The artifact → card → article loop sells the outcome but offers no runnable on-ramp: a researcher who wants to reproduce the fine-tune, or a developer who wants to call the model, has to reconstruct the journey from prose. These two notebooks close that gap. The builder notebook walks the full baseline → corpus → train → quantize → publish journey as typed fieldkit API calls; the user notebook calls the model on real patent-prosecution tasks and surfaces its reasoning chains. Both are one-click via Open in Colab / Open in Kaggle and run offline on a DGX Spark.

Use cases

Audience — AI researchers and engineers who want to reproduce the build, and app developers who want to call the model — on Spark-class hardware (GB10, 128 GB unified memory) or a free cloud GPU.

Choosing the variant

Two facets of the same notebook — pick by your goal.

builder
Walks the build journey on Spark — fieldkit API calls replacing ad-hoc scripts; surfaces speed, feasibility, and viability.
user
Demonstrates the published model on realistic domain tasks — runtime-detected, runs on Spark or on a free Colab/Kaggle GPU.

Methods

Read the field note Two Trainers, One LoRA: NeMo Framework Beats Unsloth by 26% on a Patent-Strategist Fine-Tune Same recipe, same R1-distilled base, same 5000-row patent corpus — once via Unsloth, once via NeMo Framework + Megatron-Bridge. NeMo finishes 26% faster and produces 44% longer patent-strategic chains. The cost is one YARN-defaults landmine and a stdout that lied for four hours. Open article

Known drift

Bounded limitations — Colab/Kaggle runs use the published quant; reasoning quality may differ from the BF16 weights on Spark. Each entry carries an explicit bound.

The user notebook pins the Q5_K_M quant on both the Spark and cloud paths
Q5_K_M is the fast+accurate sweet spot — 10.04 wikitext perplexity at 35 tok/s on a GB10, within 0.8% of Q6_K's accuracy. Heavier/lighter variants are one keyword away; see the sibling GGUF card for the full matrix.
The builder notebook's heavy steps (baseline, corpus, train, probe, quantize) render the recorded Spark run, not a live re-execution
5 recorded Spark-only cells; the remaining cells (feasibility envelope, backend decision, the four viz figures, publish dry-run) run live on any runtime.

Sibling artifacts

The model this notebook targets, plus other variants in the same family.