Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
VITRA-VLA-3B
like
13
Follow
Microsoft
17.6k
Robotics
Transformers
English
Robotics
Vision-Language-Action
Manipulation
Multimodal
Pretraining
Diffusion
arxiv:
2510.21571
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
VITRA-VLA-3B
15.1 GB
1 contributor
History:
3 commits
arnoldland
update the tag
4bd47d5
about 1 month ago
.gitattributes
1.52 kB
initial commit
about 1 month ago
README.md
2.56 kB
update the tag
about 1 month ago
config.json
2.08 kB
Initial commit
about 1 month ago
dataset_statistics.json
12.4 kB
Initial commit
about 1 month ago
vitra-vla-3b.pt
15.1 GB
xet
Initial commit
about 1 month ago