Quantizations.

Open GGUF quants of vertically fine-tuned base models. Each ships with a Spark-measured four-axis card, so a downstream reader can pick the right variant without re-running the eval.

04 published.

Quant GGUF 5 variants

ii-medical-8b-gguf

Quantization of Intelligent-Internet/II-Medical-8B. Best on MedMCQA (n=50, mcq_letter): Q5_K_M (0.52).

free apache-2.0

Quant GGUF 5 variants

securityllm-gguf

Quantization of ZySec-AI/SecurityLLM. Best on CyberMetric (n=50, mcq_letter): Q4_K_M (0.40).

free apache-2.0

Quant GGUF 5 variants

saul-7b-instruct-v1-gguf

Quantization of Equall/Saul-7B-Instruct-v1. Best on LegalBench (n=50, contains): Q5_K_M (0.72).

free mit

Quant GGUF 5 variants

finance-chat-gguf

Quantization of AdaptLLM/finance-chat. Best on FinanceBench (n=50, numeric_match): Q6_K (0.16).

free
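
The cards above label each score with a scorer name (mcq_letter, contains, numeric_match) but do not say how those scorers work. The following is a minimal Python sketch of one plausible reading, not the actual evaluation code. The function names, the A-E letter range, and the 1% numeric tolerance are all assumptions made for illustration.

```python
import math
import re


def score_mcq_letter(response: str, gold: str) -> bool:
    """Assumed behavior: the first standalone capital letter A-E must equal the gold letter."""
    match = re.search(r"\b([A-E])\b", response)
    return match is not None and match.group(1) == gold.strip().upper()


def score_contains(response: str, gold: str) -> bool:
    """Assumed behavior: case-insensitive substring check for the gold answer."""
    return gold.strip().lower() in response.lower()


def score_numeric_match(response: str, gold: float, rel_tol: float = 1e-2) -> bool:
    """Assumed behavior: any number in the response is within rel_tol of the gold value."""
    # Drop thousands separators so "1,234.5" parses as a single number.
    numbers = re.findall(r"-?\d+(?:\.\d+)?", response.replace(",", ""))
    return any(math.isclose(float(n), gold, rel_tol=rel_tol) for n in numbers)


def accuracy(flags: list[bool]) -> float:
    """Fraction of items scored correct."""
    return sum(flags) / len(flags)
```

Under this reading, a score of 0.52 at n=50 corresponds to 26 of 50 items scored correct.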
More artifacts in preparation. End of May 2026.