Gemma 3 4B
Gemma visionGoogle's small instruct model with vision input.
- Params
- 4B
- Quant
- Q4_K_M
- Size
- 2.5 GB
Gemma 3 1B
GemmaTiny Gemma variant β runs anywhere.
- Params
- 1B
- Quant
- Q4_K_M
- Size
- 806 MB
Llama 3.2 3B Instruct
LlamaMeta's compact instruct model.
- Params
- 3B
- Quant
- Q4_K_M
- Size
- 2.0 GB
Qwen 2.5 7B Instruct
QwenStrong general-purpose mid-size model.
- Params
- 7B
- Quant
- Q4_K_M
- Size
- 4.7 GB
Qwen 2.5 Coder 7B
QwenCode-tuned Qwen β good for editor integrations.
- Params
- 7B
- Quant
- Q4_K_M
- Size
- 4.7 GB
Qwen 3 30B A3B
QwenMixture-of-experts β only 3B params active per token.
- Params
- 30B (MoE)
- Quant
- Q4_K_M
- Size
- 18 GB
Mistral Nemo Instruct
MistralMistral + NVIDIA 12B with a 128k context.
- Params
- 12B
- Quant
- Q4_K_M
- Size
- 7.5 GB
SmolLM3 3B
SmolLMHuggingFace's small, fast, multilingual model.
- Params
- 3B
- Quant
- Q4_K_M
- Size
- 1.9 GB