Granite Vision Models Collection Multimodal models built for visual document analysis and image understanding. • 6 items • Updated about 8 hours ago • 33
Continuous Speech Synthesis using per-token Latent Diffusion Paper • 2410.16048 • Published Oct 21, 2024 • 30
Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning Paper • 2406.15334 • Published Jun 21, 2024 • 9