Spatial-Aware VLA Pretraining through Visual-Physical Alignment from Human Videos Paper • 2512.13080 • Published 15 days ago • 15