# Supported Models
## OpenVLA-7B
A 7B-parameter vision-language-action (VLA) model with a fused SigLIP + DINOv2 vision encoder and a Llama 2 backbone. It predicts discrete action tokens.
| Spec | Value |
|---|---|
| Parameters | 7B |
| Min GPU | A100 40GB |
| VRAM | ~28 GB |
| Inference | ~8 Hz |
| License | Apache 2.0 |
```python
model = VLAModel.from_preset("openvla-7b")
```
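OpenVLA outputs discrete action tokens rather than continuous values. A minimal sketch of how bin-based action tokenization typically works (illustrative only; bin count, ranges, and function names are assumptions, not vlarobot's API):

```python
import numpy as np

# Assumed setup: continuous actions are quantized into 256 uniform bins
# per dimension, and detokenized back to the bin-center value.
N_BINS = 256

def tokenize(action, low, high):
    """Map a continuous action in [low, high] to a bin index in [0, N_BINS-1]."""
    norm = (np.clip(action, low, high) - low) / (high - low)
    return np.minimum((norm * N_BINS).astype(int), N_BINS - 1)

def detokenize(token, low, high):
    """Map a bin index back to the center of its bin."""
    return low + (token + 0.5) / N_BINS * (high - low)

tok = tokenize(np.array([0.3]), -1.0, 1.0)
act = detokenize(tok, -1.0, 1.0)
```

The round-trip error is bounded by half the bin width, which is why a few hundred bins per action dimension are usually sufficient for manipulation tasks.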
## SmolVLA-450M
A compact 450M-parameter VLA from Hugging Face that uses flow matching for action prediction. Small enough to run on consumer GPUs.
| Spec | Value |
|---|---|
| Parameters | 450M |
| Min GPU | RTX 3090 |
| VRAM | ~8 GB |
| Inference | ~25 Hz |
| License | Apache 2.0 |
```python
model = VLAModel.from_preset("smolvla-450m")
```
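Flow matching generates actions by integrating a learned velocity field from Gaussian noise at t=0 to an action at t=1. A toy sketch of the sampling loop (not vlarobot's internals; the velocity "network" here is a stand-in that points straight at a fixed target action):

```python
import numpy as np

# Assumed 2-D action for illustration; a real model conditions the velocity
# on the camera observation and language instruction.
TARGET = np.array([0.5, -0.2])

def velocity(x, t):
    # Ideal velocity for the straight-line path to TARGET; stands in for
    # the trained flow-matching network's prediction.
    return (TARGET - x) / max(1.0 - t, 1e-6)

def sample(steps=10, seed=0):
    x = np.random.default_rng(seed).standard_normal(2)  # noise at t=0
    dt = 1.0 / steps
    for i in range(steps):
        x = x + dt * velocity(x, i * dt)  # Euler integration step
    return x

action = sample()  # converges to TARGET
```

A handful of Euler steps is typically enough, which is why flow-matching VLAs can hit higher control rates than iterative diffusion samplers with many denoising steps.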
## Dream-VLA-7B
A 7B-parameter VLA built on a diffusion language model backbone, generating all action tokens in parallel rather than one at a time.
| Spec | Value |
|---|---|
| Parameters | 7B |
| Min GPU | A100 40GB |
| VRAM | ~30 GB |
| Inference | ~6 Hz |
| License | Apache 2.0 |
```python
model = VLAModel.from_preset("dream-vla-7b")
```
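One common way diffusion LMs decode in parallel is masked refinement: every position in the action chunk starts masked, and each step predicts all positions at once, committing only the most confident ones. A toy sketch of that loop (an assumption about the decoding scheme, not vlarobot's API; the model forward pass is a random stand-in):

```python
import numpy as np

# Assumed toy sizes: 8-token vocab, 6-step action chunk, 3 refinement rounds.
VOCAB, CHUNK, STEPS, MASK = 8, 6, 3, -1
rng = np.random.default_rng(0)

def predict_logits(tokens):
    # Stand-in for the diffusion LM forward pass: random logits per position.
    return rng.standard_normal((CHUNK, VOCAB))

tokens = np.full(CHUNK, MASK)
for step in range(STEPS):
    logits = predict_logits(tokens)
    probs = np.exp(logits) / np.exp(logits).sum(axis=-1, keepdims=True)
    pred, conf = probs.argmax(axis=-1), probs.max(axis=-1)
    masked = np.flatnonzero(tokens == MASK)
    keep = masked[np.argsort(-conf[masked])[: CHUNK // STEPS]]
    tokens[keep] = pred[keep]  # commit top-confidence positions in parallel
```

Because a whole chunk is produced in a fixed number of refinement rounds rather than token-by-token, latency grows with the number of rounds, not the chunk length.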
## Custom Models
Register your own model:
```python
from vlarobot.models.registry import MODEL_REGISTRY

# Entry format: (import path to the model class, pretrained checkpoint id)
MODEL_REGISTRY["my-model"] = ("my_package:MyModel", "org/model-id")
```
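A `"module:ClassName"` string like the one above is typically resolved with a dynamic import. A minimal sketch of how such a loader could work (the actual vlarobot resolver may differ; a stdlib class stands in for `my_package:MyModel`):

```python
import importlib

def resolve(path):
    """Resolve a 'module:ClassName' string to the class object."""
    module_name, _, class_name = path.partition(":")
    return getattr(importlib.import_module(module_name), class_name)

# Stand-in target so the sketch runs without a real model package installed.
cls = resolve("collections:OrderedDict")
```

With this scheme, registering a model only requires that `my_package` be importable at load time; the class itself is not imported until the preset is requested.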