Grounding Computer Use Agents on Human Demonstrations Paper • 2511.07332 • Published Nov 10 • 105
How to Train Your LLM Web Agent: A Statistical Diagnosis Paper • 2507.04103 • Published Jul 5 • 50
ServiceNow/Llama-3.2-11B-Vision-Instruct-StarFlow Image-Text-to-Text • 11B • Updated Sep 8 • 6 • 1
view article Article Releasing Common Corpus: the largest public domain dataset for training LLMs Mar 20, 2024 • 29
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding Paper • 2502.01341 • Published Feb 3 • 39