HuggingFaceTB/SmolVLM2-256M-Video-Instruct Image-Text-to-Text • 0.3B • Updated Apr 8, 2025 • 112k • 96
configint/SmolVLM2-256M-Video-Instruct-ActionTokens-Dim Image-Text-to-Text • 0.3B • Updated Jul 29, 2025 • 1