Supported Models

OpenVLA-7B

A 7B-parameter vision-language-action (VLA) model that fuses SigLIP and DINOv2 vision encoders on a Llama 2 backbone. It predicts discrete action tokens.

| Spec | Value |
| --- | --- |
| Parameters | 7B |
| Min GPU | A100 40GB |
| VRAM | ~28 GB |
| Inference | ~8 Hz |
| License | Apache 2.0 |

model = VLAModel.from_preset("openvla-7b")
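
To make "discrete action tokens" concrete, here is a minimal sketch of how a continuous action value might be mapped to a bin index and back. It assumes 256 uniform bins over a normalized range of [-1, 1]; the bin count, the range, and the helper names (`action_to_bin`, `bin_to_action`, `decode`) are illustrative assumptions, not the scheme this preset is confirmed to use.

```python
from typing import List

# Assumptions for illustration only: 256 uniform bins over [-1, 1].
N_BINS = 256
LOW, HIGH = -1.0, 1.0


def action_to_bin(value: float) -> int:
    """Map a continuous action value to a discrete bin index."""
    clipped = min(max(value, LOW), HIGH)
    idx = int((clipped - LOW) / (HIGH - LOW) * N_BINS)
    # The upper bound HIGH would land on index N_BINS, so clamp it.
    return min(idx, N_BINS - 1)


def bin_to_action(idx: int) -> float:
    """Map a bin index back to the center of its bin."""
    width = (HIGH - LOW) / N_BINS
    return LOW + (idx + 0.5) * width


def decode(bins: List[int]) -> List[float]:
    """Decode one predicted bin index per action dimension."""
    return [bin_to_action(b) for b in bins]
```

Under these assumptions, a round trip through `action_to_bin` and `bin_to_action` loses at most half a bin width (1/256) of precision per dimension.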

SmolVLA-450M

A compact 450M-parameter VLA from Hugging Face that generates actions with flow matching. Runs on consumer GPUs.

| Spec | Value |
| --- | --- |
| Parameters | 450M |
| Min GPU | RTX 3090 |
| VRAM | ~8 GB |
| Inference | ~25 Hz |
| License | Apache 2.0 |

model = VLAModel.from_preset("smolvla-450m")
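
As a rough illustration of flow-matching inference, the sketch below integrates a velocity field from noise (t = 0) toward an action (t = 1) with fixed Euler steps. The velocity field here is a toy closed-form stand-in for a learned network, and `euler_sample` and `make_toy_velocity` are hypothetical names; this is not SmolVLA's actual sampler.

```python
import random
from typing import Callable, List

Vector = List[float]


def euler_sample(
    velocity: Callable[[Vector, float], Vector],
    x0: Vector,
    steps: int = 10,
) -> Vector:
    """Integrate dx/dt = velocity(x, t) from t=0 to t=1 with Euler steps."""
    x = list(x0)
    dt = 1.0 / steps
    for i in range(steps):
        t = i * dt
        v = velocity(x, t)
        x = [xi + dt * vi for xi, vi in zip(x, v)]
    return x


def make_toy_velocity(target: Vector) -> Callable[[Vector, float], Vector]:
    """Toy stand-in for a learned model: the velocity that carries x
    along a straight line so that it reaches `target` at t=1."""
    def velocity(x: Vector, t: float) -> Vector:
        return [(ti - xi) / (1.0 - t) for ti, xi in zip(target, x)]
    return velocity


target = [0.2, -0.5, 0.1]                         # hypothetical action
noise = [random.gauss(0.0, 1.0) for _ in target]  # starting sample
action = euler_sample(make_toy_velocity(target), noise, steps=10)
```

In a real policy the velocity would come from the network conditioned on images and the instruction, and the number of Euler steps trades accuracy against inference rate.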

Dream-VLA-7B

A 7B-parameter VLA built on a diffusion language model backbone, with parallel action generation.

| Spec | Value |
| --- | --- |
| Parameters | 7B |
| Min GPU | A100 40GB |
| VRAM | ~30 GB |
| Inference | ~6 Hz |
| License | Apache 2.0 |

model = VLAModel.from_preset("dream-vla-7b")

Custom Models

Register your own model by adding an entry to MODEL_REGISTRY:

from vlarobot.models.registry import MODEL_REGISTRY
MODEL_REGISTRY["my-model"] = ("my_package:MyModel", "org/model-id")
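
The registry entry above pairs a "package:Class" string with what appears to be a model identifier. How vlarobot resolves such an entry is not shown here; the sketch below illustrates one common way a string of that shape can be turned into a class using the standard library. `REGISTRY` and `resolve_class` are hypothetical names for this example and are not part of vlarobot.

```python
import importlib
from typing import Any, Dict, Tuple

# Stand-in registry for illustration; a stdlib class is used so the
# example runs without any third-party package installed.
REGISTRY: Dict[str, Tuple[str, str]] = {
    "demo": ("collections:OrderedDict", "org/model-id"),
}


def resolve_class(name: str, registry: Dict[str, Tuple[str, str]] = REGISTRY) -> Any:
    """Look up `name` and import the class named by its "module:attr" string."""
    entry_point, _model_id = registry[name]
    module_path, _, attr = entry_point.partition(":")
    if not module_path or not attr:
        raise ValueError(f"expected 'module:Class', got {entry_point!r}")
    module = importlib.import_module(module_path)
    return getattr(module, attr)


cls = resolve_class("demo")
print(cls.__name__)
```

Storing the class as a string defers the import until the model is actually requested, so registering an entry does not require the package to be importable at registration time.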