OpenMMReasoner

community


Recent Activity



wukeming11

updated a dataset 8 days ago

OpenMMReasoner/OpenMMReasoner-SFT-874K

Viewer • Updated 8 days ago • 874k • 290 • 5

wukeming11

updated 2 models 8 days ago

OpenMMReasoner/OpenMMReasoner-ColdStart

Image-Text-to-Text • 8B • Updated 8 days ago • 441 • 3

OpenMMReasoner/OpenMMReasoner-RL

Image-Text-to-Text • 8B • Updated 8 days ago • 587 • 15

mwxely

authored a paper 22 days ago

A Comprehensive Study on Visual Token Redundancy for Discrete Diffusion-based Multimodal Large Language Models

Paper • 2511.15098 • Published Nov 19, 2025

kcz358

updated a model 29 days ago

OpenMMReasoner/OpenMMReasoner-ColdStart

Image-Text-to-Text • 8B • Updated 8 days ago • 441 • 3

kcz358

updated a dataset 29 days ago

OpenMMReasoner/OpenMMReasoner-SFT-874K

Viewer • Updated 8 days ago • 874k • 290 • 5

kcz358

updated a model 29 days ago

OpenMMReasoner/OpenMMReasoner-RL

Image-Text-to-Text • 8B • Updated 8 days ago • 587 • 15

kcz358

authored a paper about 1 month ago

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

Paper • 2511.20785 • Published Nov 25, 2025 • 182

mwxely

authored 2 papers about 1 month ago

FACE: Evaluating Natural Language Generation with Fourier Analysis of Cross-Entropy

Paper • 2305.10307 • Published May 17, 2023

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

Paper • 2511.20785 • Published Nov 25, 2025 • 182

wukeming11

authored 2 papers about 1 month ago

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

Paper • 2511.20785 • Published Nov 25, 2025 • 182

EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing

Paper • 2509.26346 • Published Sep 30, 2025 • 18

mwxely

authored a paper about 1 month ago

TimeExpert: An Expert-Guided Video LLM for Video Temporal Grounding

Paper • 2508.01699 • Published Aug 3, 2025

kcz358

updated a dataset about 1 month ago

OpenMMReasoner/OpenMMReasoner-RL-74K

Viewer • Updated Nov 25, 2025 • 74.7k • 473 • 7

mwxely

authored 4 papers about 1 month ago

AI-Generated Images as Data Source: The Dawn of Synthetic Era

Paper • 2310.01830 • Published Oct 3, 2023

ToDRE: Visual Token Pruning via Diversity and Task Awareness for Efficient Large Vision-Language Models

Paper • 2505.18757 • Published May 24, 2025

Versatile Transition Generation with Image-to-Video Diffusion

Paper • 2508.01698 • Published Aug 3, 2025

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 92

wukeming11

authored a paper about 1 month ago

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 92

kcz358

authored a paper about 1 month ago

UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning

Paper • 2510.13515 • Published Oct 15, 2025 • 11