Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

369

Full-text search

Active filters: int8

lelloman/xlm-roberta-base-onnx-int8

Updated 9 days ago

cstr/awesome-align-onnx-int8

Feature Extraction • Updated 9 days ago • 10

TevunahAi/Granite-3.3-8B-Instruct-GPTQ

Text Generation • 8B • Updated 5 days ago • 8

broadfield-dev/Qwen3-0.6B-20260105-055554-onnx

Text Generation • Updated 5 days ago • 12

broadfield-dev/Qwen3-0.6B-20260105-060935-onnx

Text Generation • Updated 5 days ago • 10

marksverdhai/vibevoice-7b-bnb-8bit

Text-to-Speech • 9B • Updated 4 days ago • 44

autolane/rfdetr-alpr

Object Detection • Updated 1 day ago • 14

AmdGoose/FLUX.2-dev-transformer-int8wo

Text-to-Image • Updated 3 days ago

tokenlabsdotrun/Llama-3.1-8B-Quanto-Int8

Text Generation • 8B • Updated 2 days ago • 124