← LoRA fine-tunes
LoRA fine-tune · BF16 ·trained with nemo

patent-strategist-v3-nemo

Offline patent-prosecution reasoning on Spark-class hardware

LoRA fine-tune patent-strategist-v3-nemo, trained with nemo
nemoon DeepSeek-R1-0528-Qwen3-8B
HF Orionfold/patent-strategist-v3-nemo
License free apache-2.0
Published

What this model does

Patent prosecution work — claim construction, MPEP-grounded office-action responses, Markush analysis, doctrine-of-equivalents reasoning — happens inside firms that can't ship privileged client text to a hosted frontier API. This release distills DeepSeek-R1's chain-of-thought reasoning onto a 5,000-row synthetic patent-reasoning corpus so a single Spark-class box can run the workflow offline, with full IRAC-shaped reasoning chains.

Use cases

Audience — Patent attorneys, prosecution-team engineers, and IP-strategy teams running privileged workflows offline on Spark-class hardware (GB10, 128 GB unified memory) or comparable edge devices.

Choosing this lane

This fine-tune is the nemo lane of DeepSeek-R1-0528-Qwen3-8B. Compare to the 2 sibling lanes below.

How to use

Load with Transformers, or see the linked HF repo for full inference snippets.

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("Orionfold/patent-strategist-v3-nemo", torch_dtype="bfloat16", device_map="auto")
tok = AutoTokenizer.from_pretrained("Orionfold/patent-strategist-v3-nemo")

Full inference walkthrough on the HuggingFace model card.

Methods

Read the field note Two Trainers, One LoRA: NeMo Framework Beats Unsloth by 26% on a Patent-Strategist Fine-Tune Same recipe, same R1-distilled base, same 5000-row patent corpus — once via Unsloth, once via NeMo Framework + Megatron-Bridge. NeMo finishes 26% faster and produces 44% longer patent-strategic chains. The cost is one YARN-defaults landmine and a stdout that lied for four hours. Open article

Known drift

Disclosed limitations from the v3 corpus. Each entry carries an explicit bound — the scope is named, not implied.

"metes-and-times" terminology
Two known terminology drifts inherited from the v3 synthetic corpus; balance of probe answers (~99%) cite real MPEP sections.
Fabricated MPEP §2163.05(s) citation
Same scope as above — corpus-generator artifact, not a model-wide hallucination pattern. Real §2163.05 has subsections (a)–(f) on written-description support.