view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) +2 Dec 9, 2022 • 390
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models Paper • 2512.24618 • Published 11 days ago • 128
Is There a Better Source Distribution than Gaussian? Exploring Source Distributions for Image Flow Matching Paper • 2512.18184 • Published 23 days ago • 20
VL-JEPA: Joint Embedding Predictive Architecture for Vision-language Paper • 2512.10942 • Published about 1 month ago • 42
Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2512.20848 • Published 19 days ago • 32
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 May 21, 2025 • 247
view article Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 Sep 11, 2025 • 177