Quantizations.
Open GGUF quants of vertical-finetuned base models — each shipped with a Spark-measured four-axis card so a downstream reader picks the right variant without re-running the eval. 04 published.
ii-medical-8b-gguf
Quantization of Intelligent-Internet/II-Medical-8B. Best on MedMCQA (n=50, mcq_letter): Q5_K_M (0.52).
free apache-2.0
securityllm-gguf
Quantization of ZySec-AI/SecurityLLM. Best on CyberMetric (n=50, mcq_letter): Q4_K_M (0.40).
free apache-2.0
saul-7b-instruct-v1-gguf
Quantization of Equall/Saul-7B-Instruct-v1. Best on LegalBench (n=50, contains): Q5_K_M (0.72).
free mit
finance-chat-gguf
Quantization of AdaptLLM/finance-chat. Best on FinanceBench (n=50, numeric_match): Q6_K (0.16).
free
※ More artifacts in preparation End of May 2026