Omnilingual ASR Media Transcription 🌍 • Running on A100 • 226 • Transcribe audio or video into text in any language
Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR • Paper • 2509.18174 • Published Sep 17, 2025 • 128