RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference Paper • 2603.17891 • Published Mar 18 • 7
The Energy of Falsehood: Detecting Hallucinations via Diffusion Model Likelihoods Paper • 2602.11364 • Published Feb 11
QEIL v2: Heterogeneous Computing for Edge Intelligence via Roofline-Derived Pareto-Optimal Energy Modeling and Multi-Objective Orchestration Paper • 2602.06057 • Published Apr 5 • 5
Running on CPU Upgrade Featured 3.16k The Smol Training Playbook 📚 3.16k The secrets to building world-class LLMs
RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference Paper • 2603.17891 • Published Mar 18 • 7
RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference Paper • 2603.17891 • Published Mar 18 • 7
CogniSQL-R1-Zero: Lightweight Reinforced Reasoning for Efficient SQL Generation Paper • 2507.06013 • Published Jul 8, 2025
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 189
Running 3.83k The Ultra-Scale Playbook 🌌 3.83k The ultimate guide to training LLM on large GPU Clusters