AIcell/Qwen2-VL-2B-SFT-MCP-hg3-checkpoint-300 Image-to-Text • 4B • Updated about 16 hours ago • 15
StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation Paper • 2512.09363 • Published 20 days ago • 71
Schoenfeld's Anatomy of Mathematical Reasoning by Language Models Paper • 2512.19995 • Published 7 days ago • 13
EgoX: Egocentric Video Generation from a Single Exocentric Video Paper • 2512.08269 • Published 21 days ago • 114
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling Paper • 2512.14614 • Published 14 days ago • 66
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding Paper • 2512.19693 • Published 8 days ago • 61
SpatialTree: How Spatial Abilities Branch Out in MLLMs Paper • 2512.20617 • Published 7 days ago • 42
Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction Paper • 2512.18880 • Published 9 days ago • 23
V-REX: Benchmarking Exploratory Visual Reasoning via Chain-of-Questions Paper • 2512.11995 • Published 18 days ago • 9