1 40 1

Rui Sun PRO

ThreeSR

https://threesr.github.io/

AI & ML interests

Vision and Language Multimodal Learning, CV, NLP, LLM

Recent Activity

upvoted a paper 16 days ago

RELIC: Interactive Video World Model with Long-Horizon Memory

upvoted a paper about 1 month ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

upvoted a paper about 1 month ago

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

View all activity

Organizations

upvoted a paper 16 days ago

RELIC: Interactive Video World Model with Long-Horizon Memory

Paper • 2512.04040 • Published 23 days ago • 23

upvoted 6 papers about 1 month ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24 • 60

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20 • 91

upvoted 7 papers about 2 months ago

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16 • 117

MolmoAct: Action Reasoning Models that can Reason in Space

Paper • 2508.07917 • Published Aug 11 • 44

UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning

Paper • 2510.20286 • Published Oct 23 • 23

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28 • 98

Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark

Paper • 2510.26802 • Published Oct 30 • 33

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30 • 119

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30 • 108

upvoted 3 papers 3 months ago

Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published Oct 6 • 118

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24 • 98

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Paper • 2509.16197 • Published Sep 19 • 56

authored a paper 3 months ago

Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence

Paper • 2506.15677 • Published Jun 18 • 23

upvoted 2 papers 4 months ago

VeriGUI: Verifiable Long-Chain GUI Dataset

Paper • 2508.04026 • Published Aug 6 • 161

BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

Paper • 2508.10975 • Published Aug 14 • 60

Rui Sun PRO

AI & ML interests

Recent Activity

Organizations

ThreeSR's activity