facebook/dinov3-vits16-pretrain-lvd1689m • Image Feature Extraction • 21.6M • Updated Aug 19, 2025 • 164k • 63
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 • Text Generation • 32B • Updated 16 days ago • 30.3k • 92
VST • Collection • A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities. • 5 items • Updated Nov 12, 2025 • 6
MolmoAct Data Mixture • Collection • All datasets for the MolmoAct (Multimodal Open Language Model for Action) release. • 4 items • Updated 29 days ago • 18
MolmoAct • Collection • All models for the MolmoAct (Multimodal Open Language Model for Action) release. • 10 items • Updated 29 days ago • 33