Hugging Face

Models

369 models
Active filters: int8

lelloman/xlm-roberta-base-onnx-int8

Updated 9 days ago

cstr/awesome-align-onnx-int8

Feature Extraction • Updated 9 days ago • 10

TevunahAi/Granite-3.3-8B-Instruct-GPTQ

Text Generation • 8B • Updated 5 days ago • 8

broadfield-dev/Qwen3-0.6B-20260105-055554-onnx

Text Generation • Updated 5 days ago • 12

broadfield-dev/Qwen3-0.6B-20260105-060935-onnx

Text Generation • Updated 5 days ago • 10

marksverdhai/vibevoice-7b-bnb-8bit

Text-to-Speech • 9B • Updated 4 days ago • 44

autolane/rfdetr-alpr

Object Detection • Updated 1 day ago • 14

AmdGoose/FLUX.2-dev-transformer-int8wo

Text-to-Image • Updated 3 days ago

tokenlabsdotrun/Llama-3.1-8B-Quanto-Int8

Text Generation • 8B • Updated 2 days ago • 124