FaceLLM Collection A multimodal large language model trained specifically for facial image understanding. Project page: https://www.idiap.ch/paper/facellm • 3 items • Updated Jul 23, 2025 • 4
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper • 2508.18265 • Published Aug 25, 2025 • 217
Running on Zero MCP 405 Multimodal OCR Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR
Running 240 MedGemma - Radiology Explainer Demo Radiology Image & Report Explainer Demo. Built with MedGemma
Hugging Face Changelog Filter by MCP compatibility available in HF Spaces May 21, 2025 • 79
AIMO Progress Prize Collection Models and datasets used in the winning solution to the AIMO 1st Progress Prize • 7 items • Updated Jul 19, 2024 • 14
Sleeping Questionnanswer Document Upload document and ask question about it - Groq will answer