Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
cerebras
/
GLM-4.6-REAP-218B-A32B-FP8
like
39
Follow
Cerebras
1.09k
Text Generation
Transformers
Safetensors
English
glm4_moe
glm
MOE
pruning
compression
conversational
compressed-tensors
arxiv:
2510.13999
License:
mit
Model card
Files
Files and versions
xet
Community
2
Deploy
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (0)
Sort: Recently created
Definitely does NOT run on dual RTX PRO 6000 96GB
👍
1
3
#2 opened about 1 month ago by
aaron-newsome
BF16 checkpoint
#1 opened about 1 month ago by
lazarevich