Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs. wenhuach, Haihao, weiweiz1, n1ck-guo, isaacmac, kding1, IlyasMoutawwakil, marcsun13, medmekk • Apr 29, 2025 • 44
LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone! medmekk, marcsun13 • Mar 7, 2025 • 98
Fine-tuning LLMs to 1.58bit: extreme quantization made easy. medmekk, marcsun13, lvwerra, pcuenq, osanseviero, thomwolf • Sep 18, 2024 • 280