Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

1,042

Full-text search

Active filters: llama.cpp

jacqueasd/Mantrika-Gemma3-4B-GGUF

4B • Updated 6 days ago • 45

jacobbista/llama3-3b-finetome

3B • Updated 6 days ago • 69

astegaras/merged_kaggle

3B • Updated 6 days ago • 240

darrellxcheng/shaderWrap-Qwen2.5CoderGGUF

15B • Updated 2 days ago • 30

Melaraby/qwen_vlm3_detect2_gguf

8B • Updated 6 days ago • 81

darrellxcheng/tinyllama

1B • Updated 2 days ago • 124

kenny4225/gemma-finetune-gguf

1.0B • Updated 5 days ago • 40

raphaeltito/luvia-sales-gguf

21B • Updated 5 days ago • 225

Kezovic/iris-q4gguf

1B • Updated 5 days ago • 59

franco334578/doric-12b-it-gguf

12B • Updated 5 days ago • 53

solarwt/qwen2.5-medical-q8

15B • Updated 5 days ago • 40

Kezovic/iris-q4gguf-v2

1B • Updated 4 days ago • 93

wannaio/llama3.2-3b-finetome-gguf

3B • Updated 5 days ago • 65

wannaio/llama3.2-3b-finetome-25k

3B • Updated 5 days ago • 39

astegaras/lora_merged

3B • Updated 4 days ago • 66

mradermacher/Qwen3-VisionCaption-2B-it-REDACTED-GGUF

2B • Updated 4 days ago • 364

mradermacher/Qwen3-VisionCaption-2B-it-REDACTED-i1-GGUF

2B • Updated 1 day ago • 2.6k

bao2015/gemma3_miaomiao

0.3B • Updated 4 days ago • 52

Sr-Carlos/model_SFT_enron_XL_B4

4B • Updated 4 days ago • 53

Kezovic/iris-q4gguf-lora-test

1B • Updated 4 days ago • 58

Kezovic/iris-f16gguf-test

1B • Updated 4 days ago • 36

kalai4390/Qwen3_GRPO_Quantized

4B • Updated 4 days ago • 58

Kezovic/iris-q4gguf-cosine-test

1B • Updated 4 days ago • 37

Kezovic/iris-q4gguf-hermes-test

1B • Updated 4 days ago • 33

Sr-Carlos/Class_SFT_Fixed_Categories_M_No_Think

2B • Updated 2 days ago • 85

ViktorMardskog/lora_model_test

3B • Updated 4 days ago • 47

rgraceffa/llama-3-8b-Instruct-bnb-4bit-eraigra

8B • Updated 4 days ago • 227

ViktorMardskog/lora_model_base_10k_1b

1B • Updated 4 days ago • 109

lumen-models/spanish-tinyllama-v1-gguf

1B • Updated 2 days ago • 22

Capibara-LLM/gemma-2-9b-it-SimPO-Jopara-GGUF

9B • Updated 4 days ago • 56