Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
MMMU
non-profit
https://mmmu-benchmark.github.io/
MMMU-Benchmark
Activity Feed
Request to join this org
Follow
83
AI & ML interests
Multimodal Model Evaluation
Recent Activity
yuexiang96
authored
a paper
8 days ago
Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents
yuexiang96
authored
a paper
8 days ago
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
yuexiang96
authored
a paper
8 days ago
Simulating Environments with Reasoning Models for Agent Training
View all activity
Team members
17
MMMU
's datasets
2
Sort: Recently updated
MMMU/MMMU_Pro
Viewer
•
Updated
Mar 8
•
5.19k
•
8.81k
•
41
MMMU/MMMU
Viewer
•
Updated
Sep 19, 2024
•
11.6k
•
72k
•
307