JMV
jamiem90
AI & ML interests
None yet
Recent Activity
reacted to danielhanchen's post 12 days ago
Google releases Gemma 4 QAT. ✨
You can now run Gemma 4 at 3x less memory with near original performance.
QAT makes it possible to run Gemma 4 26B-A4B on 16GB RAM.
GGUFs: https://huggingface.co/collections/unsloth/gemma-4-qat
QAT Guide: https://unsloth.ai/docs/models/gemma-4/qat
reacted to danielhanchen's post with 🔥 28 days ago
Qwen3.6 MTP is here! Run locally on 20GB RAM. ⚡️
MTP enables Qwen3.6 to generate ~1.4–2.2× faster with no accuracy change.
Qwen3.6-27B: https://huggingface.co/unsloth/Qwen3.6-27B-MTP-GGUF
Qwen3.6-35B-A3B: https://huggingface.co/unsloth/Qwen3.6-35B-A3B-MTP-GGUF
Guide: https://unsloth.ai/docs/models/qwen3.6#mtp-guide
liked a model about 1 month ago
unsloth/Qwen3.6-27B-MTP-GGUF
Organizations
None yet