notebook · cyber

Build the SecurityLLM quant — and call the model — on a Spark or a free cloud GPU

The artifact → card → article loop sells the outcome but offers no runnable on-ramp: a researcher who wants to reproduce the five-variant quant, or a developer who wants to call the model, has to reconstruct the journey from prose. These two notebooks close that gap. The builder notebook walks the feasibility → quantize → measure → publish journey as typed fieldkit API calls; the user notebook calls SecurityLLM on real security-MCQ and threat- reasoning tasks. Both are one-click via Open in Colab / Open in Kaggle and run offline on a DGX Spark — no incident detail leaves the network.

base ZySec-AI/SecurityLLM · license apache-2.0 ·recommended builder

▶ Try in chat ＋ Send to compare

What it's for

Builder: reproduce the release — feasibility envelope, quantize sweep, the Spark-tested quad + variants table, publish — as fieldkit calls
User: security knowledge MCQ across CyberMetric's domains, threat reasoning from a log line, and governance/compliance questions
User: ground answers in CVE / MITRE ATT&CK / RFC text with fieldkit.rag and gate MCQ answers with fieldkit.eval.mcq_letter
Both: run offline on a DGX Spark or on a free Colab / Kaggle GPU (dual-path, runtime-detected)

Audience — AI researchers and engineers who want to reproduce the quant, and security practitioners, threat analysts, and developers building security tooling who want a private offline assistant — on Spark-class hardware (GB10, 128 GB unified memory) or a free cloud GPU.

Quant economics quality × speed per build

Variant
builder sweet spot
user

Known drift bounded · honest

Cloud (Colab / Kaggle) path serves the Q4_K_M quant; the Spark path serves Q5_K_M One quant level apart, and on this model the cloud quant is no worse — Q4_K_M tops the CyberMetric n=50 mini-eval at 40% vs Q5_K_M's 38% (2 points, inside the noise floor); both run the identical code path. See the sibling GGUF card.
The builder notebook's quantize + publish steps render the recorded Spark run, not a live re-execution 2 recorded Spark-only cells (the quantize sweep and the publish dry-run); the remaining cells — feasibility envelope, the spark_quad panel, and the variants table — run live on any runtime from the manifest.
The user notebook's live model-chat cells are not captured in the published marketing snapshot 4 use-case cells call the model live on any runtime; the snapshot captures the deterministic charts + banners and describes the chat output rather than screenshotting it.

Get it

Open on HuggingFace ↗ Read the build article Build it — Colab ↗Build it — Kaggle ↗Use it — Colab ↗Use it — Kaggle ↗

Run it local

Yours, offline, on the Spark:

pip install fieldkit[arena]
fieldkit arena up

then drive this model from the cockpit — prompts and telemetry never leave the box.