RLinf/RLinf-OpenVLAOFT-LIBERO-90-Base-Lora • Reinforcement Learning • 8B • 67
$\pi_{\texttt{RL}}$: Online RL Fine-tuning for Flow-based Vision-Language-Action Models
RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training