Adversarial Exploitation of Data Diversity Improves Visual Localization Paper • 2412.00138 • Published Nov 28, 2024 • 3
INTACT Probing Suite Collection A probing suite for the generalization boundaries of VLA models. This collection holds the model checkpoints and more. https://ai4ce.github.io/INT-ACT • 4 items • Updated Jun 15 • 1
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens Paper • 2506.17218 • Published Jun 20 • 29
From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models Paper • 2506.09930 • Published Jun 11 • 8 • 2
From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models Paper • 2506.09930 • Published Jun 11 • 8
INTACT Probing Suite Collection A probing suite for the generalization boundaries of VLA models. This collection holds the model checkpoints and more. https://ai4ce.github.io/INT-ACT • 4 items • Updated Jun 15 • 1
INTACT Probing Suite Collection Probing suite for the generalization boundaries of VLA models. This collection holds model checkpoints and more. • 3 items • Updated Jun 15