Compressing visual tokens in vision-language models: 3x more requests per GPU on Qwen2-VL datas3nt • 32 minutes ago
Intel XPU Kernel Skill: LLM-driven Triton kernel optimization for the Hugging Face Kernel Hub danf • about 8 hours ago • 7
Don’t Waste Tokens on Data Entry: Tag Customer Reviews Overnight with ZeroGPU Batch API its-maddy-a • 1 day ago • 1
curbcheck: can a small VLM tell you if you can legally park in San Francisco? shubhamgoel27 • 1 day ago
Yui Home Assistant — teaching a 3B model to write Home Assistant automations that actually *work* build-small-hackathon • 1 day ago
Household Food Waste Forensics: Smart Tracking for Sustainable Kitchens abhishekkataria16 • 1 day ago
My First MO§ES™ SigRank — the diagnostic x-ray of the token economy build-small-hackathon • 1 day ago
Keep the model on one axis: building *Penny Stock Persuasion* on a small Nemotron Han-Solo • 1 day ago • 1
Elder Care Copilot: Bringing Order to the Chaos of Caregiving Paperwork build-small-hackathon • 1 day ago
I built a tool that tells you exactly why you'll be rejected — before you apply build-small-hackathon • 1 day ago • 1