johannhartmann's Collections
Computer Use Models
Updated 19 days ago
ByteDance-Seed/UI-TARS-72B-DPO • Image-Text-to-Text • 73B • Updated Jan 25 • 2.25k • 147
ByteDance-Seed/UI-TARS-7B-DPO • Image-Text-to-Text • 8B • Updated Jan 25 • 1.36k • 221
microsoft/OmniParser • Image-Text-to-Text • Updated Dec 2, 2024 • 480 • 1.7k
jadechoghari/Ferret-UI-Llama8b • Image-Text-to-Text • 8B • Updated Jan 8 • 273 • 68
microsoft/GUI-Actor-7B-Qwen2.5-VL • Image-Text-to-Text • 8B • Updated Aug 9 • 826 • 24
showlab/ShowUI-2B • Updated Mar 11 • 2.71k • 269
Zery/CUA_World_State_Model • Image-Text-to-Text • Updated Aug 7 • 10 • 4
microsoft/Fara-7B • Image-Text-to-Text • 8B • Updated 2 days ago • 66.9k • 438
Qwen/Qwen2.5-Omni-7B • Any-to-Any • 11B • Updated Apr 30 • 149k • 1.83k
Hcompany/Holo2-30B-A3B • Image-Text-to-Text • 31B • Updated 22 days ago • 1.66k • 36
Hcompany/Holo2-4B • Image-Text-to-Text • 4B • Updated 29 days ago • 3.11k • 16
Hcompany/Holo2-8B • Image-Text-to-Text • 9B • Updated 29 days ago • 943 • 16
AskUI/PTA-1 • Image-Text-to-Text • 0.3B • Updated Nov 28, 2024 • 771 • 97
OS-Copilot/OS-Atlas-Base-7B • Image-Text-to-Text • 8B • Updated Nov 19, 2024 • 896 • 42
Qwen/Qwen2-VL-7B-Instruct • Image-Text-to-Text • 8B • Updated Feb 6 • 1.44M • 1.25k
xlangai/OpenCUA-72B • Image-Text-to-Text • 73B • Updated Nov 11 • 207 • 4
xlangai/OpenCUA-32B • Image-Text-to-Text • 33B • Updated Aug 18 • 651 • 25
xlangai/OpenCUA-7B • Image-Text-to-Text • 8B • Updated about 1 month ago • 37.2k • 21
xlangai/Jedi-7B-1080p • Image-Text-to-Text • 8B • Updated Jun 18 • 113 • 29
xlangai/Jedi-3B-1080p • Image-Text-to-Text • 4B • Updated Jun 18 • 92 • 17
Qwen/Qwen3-VL-8B-Instruct • Image-Text-to-Text • 9B • Updated Oct 15 • 2.55M • 547
Qwen/Qwen3-VL-8B-Thinking • Image-Text-to-Text • 9B • Updated 17 days ago • 214k • 152