Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2410.13085

EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters

Paper • 2402.04252 • Published Feb 6, 2024 • 29
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models

Paper • 2402.03749 • Published Feb 6, 2024 • 14
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Paper • 2402.04615 • Published Feb 7, 2024 • 44
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

Paper • 2402.05008 • Published Feb 7, 2024 • 23

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16, 2024 • 23

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16, 2024 • 23

Running

3.06k

AnyCoder

🏆

3.06k

Generate code with AI
Running

Featured

274

Qwen2.5 Coder Artifacts

🐢

274

Generate code from natural language prompts
Running

Featured

923

QwQ-32B-Preview

🔍

923

QwQ-32B-Preview
Running on CPU Upgrade

13.8k

Open LLM Leaderboard

🏆

13.8k

Track, rank and evaluate open LLMs and chatbots

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16, 2024 • 23

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16, 2024 • 23
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine

Paper • 2408.02900 • Published Aug 6, 2024 • 30
MediAug: Exploring Visual Augmentation in Medical Imaging

Paper • 2504.18983 • Published Apr 26, 2025 • 7
Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks

Paper • 2410.18387 • Published Oct 24, 2024

Medical Report Generation

FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models

Paper • 2411.18672 • Published Nov 27, 2024
CoMT: Chain-of-Medical-Thought Reduces Hallucination in Medical Report Generation

Paper • 2406.11451 • Published Jun 17, 2024
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8, 2025 • 113
μ^2Tokenizer: Differentiable Multi-Scale Multi-Modal Tokenizer for Radiology Report Generation

Paper • 2507.00316 • Published Jun 30, 2025 • 15

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16, 2024 • 23
Retrieval-Augmented Generation for Large Language Models: A Survey

Paper • 2312.10997 • Published Dec 18, 2023 • 12

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16, 2024 • 23

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16, 2024 • 23

EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters

Paper • 2402.04252 • Published Feb 6, 2024 • 29
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models

Paper • 2402.03749 • Published Feb 6, 2024 • 14
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Paper • 2402.04615 • Published Feb 7, 2024 • 44
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

Paper • 2402.05008 • Published Feb 7, 2024 • 23

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16, 2024 • 23
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine

Paper • 2408.02900 • Published Aug 6, 2024 • 30
MediAug: Exploring Visual Augmentation in Medical Imaging

Paper • 2504.18983 • Published Apr 26, 2025 • 7
Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks

Paper • 2410.18387 • Published Oct 24, 2024

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16, 2024 • 23

Medical Report Generation

FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models

Paper • 2411.18672 • Published Nov 27, 2024
CoMT: Chain-of-Medical-Thought Reduces Hallucination in Medical Report Generation

Paper • 2406.11451 • Published Jun 17, 2024
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8, 2025 • 113
μ^2Tokenizer: Differentiable Multi-Scale Multi-Modal Tokenizer for Radiology Report Generation

Paper • 2507.00316 • Published Jun 30, 2025 • 15

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16, 2024 • 23

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16, 2024 • 23
Retrieval-Augmented Generation for Large Language Models: A Survey

Paper • 2312.10997 • Published Dec 18, 2023 • 12

Running

3.06k

AnyCoder

🏆

3.06k

Generate code with AI
Running

Featured

274

Qwen2.5 Coder Artifacts

🐢

274

Generate code from natural language prompts
Running

Featured

923

QwQ-32B-Preview

🔍

923

QwQ-32B-Preview
Running on CPU Upgrade

13.8k

Open LLM Leaderboard

🏆

13.8k

Track, rank and evaluate open LLMs and chatbots

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16, 2024 • 23

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16, 2024 • 23

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16, 2024 • 23

Previous
1
2
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs