Mc Sp

mcsp

AI & ML interests

None yet

Recent Activity

upvoted a collection 18 days ago

VibeVoice

upvoted an article 5 months ago

Open-source DeepResearch – Freeing our search agents

upvoted a paper 5 months ago

GAIA: a benchmark for General AI Assistants

View all activity

Organizations

upvoted a collection 18 days ago

VibeVoice

Collection

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated 24 days ago • 180

upvoted an article 5 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

•

1.31k

upvoted a paper 5 months ago

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 243

upvoted a paper 7 months ago

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Paper • 2412.07626 • Published Dec 10, 2024 • 28

upvoted a collection 7 months ago

Ovis2

Collection

Our latest advancement in multi-modal large language models (MLLMs) • 15 items • Updated Mar 25 • 65

liked a model 7 months ago

nvidia/Nemotron-Research-Reasoning-Qwen-1.5B

Text Generation • 2B • Updated Nov 21 • 2.14k • 235

liked a dataset 7 months ago

zhiyuanyou/Data-DeQA-Score

Preview • Updated Mar 3 • 246 • 6

upvoted a paper 7 months ago

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published May 23 • 81

upvoted a collection 7 months ago

Gemma 3n Preview

Collection

4 items • Updated Jul 10 • 192

upvoted 2 papers 7 months ago

Visual Planning: Let's Think Only with Images

Paper • 2505.11409 • Published May 16 • 57

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 319

upvoted 3 papers 8 months ago

Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers

Paper • 2505.04842 • Published May 7 • 12

Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving

Paper • 2505.04528 • Published May 7 • 12

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Paper • 2505.03318 • Published May 6 • 92

upvoted an article 8 months ago

Article

Vision Language Models Explained

Apr 11, 2024

•

504

upvoted 5 papers 8 months ago

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Paper • 2504.15279 • Published Apr 21 • 78

Step1X-Edit: A Practical Framework for General Image Editing

Paper • 2504.17761 • Published Apr 24 • 92

Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation

Paper • 2504.17025 • Published Apr 23 • 17

Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning

Paper • 2504.16656 • Published Apr 23 • 57

CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges

Paper • 2504.19093 • Published Apr 27 • 18

Mc Sp

AI & ML interests

Recent Activity

Organizations

mcsp's activity

Open-source DeepResearch – Freeing our search agents

Vision Language Models Explained