Efficient Vision Encoding for Vision Language Models
Apple
Papers
STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flow
CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning
Organization Card
Welcome to the official Hugging Face organization for Apple!
Apple Core ML – Build intelligence into your apps
Core ML is optimized for on-device performance of a broad variety of model types by leveraging Apple Silicon and minimizing memory footprint and power consumption.
Models
- FastVLM Core ML: On-device Vision-Language Model.
- Depth Anything V2 Core ML: State-of-the-art depth estimation
- DETR Resnet50 Core ML: Semantic Segmentation
- FastViT Core ML: Image Classification
- Stable Diffusion Core ML
- Additional Core ML Model Gallery Models
Apple Machine Learning Research
Open research to enable the community to deliver amazing experiences that improve the lives of millions of people every day.
Models
- MobileCLIP 2: Mobile-friendly SOTA image-text models.
- FastVLM: Efficient Vision Language Models.
- DepthPro: State-of-the-art monocular depth estimation.
- OpenELM Base | Instruct: open, Transformer-based language model.
- MobileCLIP: Mobile-friendly image-text models.
- DCLM: State-of-the-art open data language models via dataset curation.
- DFN: State-of-the-art open data CLIP models via dataset curation.
Datasets
- FLAIR: A large image dataset for federated learning.
- DataCompDR: Improved datasets for training image-text models.
Benchmarks
- TiC-CLIP: Benchmark for designing efficient continual learning of image-text models over multiple years.
Select Highlights and Other Resources
- Hugging Face CoreML Examples – Run Core ML models with two lines of code!
- Apple Model Gallery
- New features in Core ML Tools
- Apple Core ML Stable Diffusion – Library to run Stable Diffusion on Apple Silicon with Core ML.
- Hugging Face Blog Posts
Models (137)
- apple/CLaRa-7B-E2E
- apple/CLaRa-7B-Instruct
- apple/CLaRa-7B-Base
- apple/starflow
- apple/CLaRa-7B-Base-16
- apple/mobileclip2_coca_dfn2b_s13b_context77
- apple/mobileclip2_coca_dfn2b_s13b_dci-extended_s12m_context256
- apple/mobileclip2_coca_dfn2b_s13b_dci-complete_s12m_context256
- apple/mobileclip2_coca_dfn2b_s13b_docci_s12m_context256
- apple/mobileclip2_coca_dfn2b_s13b_recap-coco-30k_s12m_context77
Datasets (9)
- apple/DataCompDR-12M-bf16
- apple/DataCompDR-12M
- apple/DataCompDR-1B
- apple/DataComp-12M
- apple/GSM-Symbolic
- apple/mmau
- apple/TiC-DataComp
- apple/flair
- apple/mkqa