MercedeSnape's Collections

agent reasoning

updated 7 days ago

  • MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

    Paper • 2511.11793 • Published Nov 14, 2025 • 165

    Note: the third-dimension metric is Interactive Scaling


  • Reinforcement Learning for Self-Improving Agent with Skill Library

    Paper • 2512.17102 • Published 15 days ago • 30