74

WonJae Roh

snuro

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 12 days ago

Kling-Omni Technical Report

Paper • 2512.16776 • Published 17 days ago • 163

upvoted a paper 2 months ago

A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning

Paper • 2510.15444 • Published Oct 17, 2025 • 147

upvoted 12 papers 4 months ago

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

Paper • 2508.21104 • Published Aug 28, 2025 • 35

A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code

Paper • 2508.18106 • Published Aug 25, 2025 • 347

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 228

Open Data Synthesis For Deep Research

Paper • 2509.00375 • Published Aug 30, 2025 • 70

Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth

Paper • 2509.03867 • Published Sep 4, 2025 • 210

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 195

Reverse-Engineered Reasoning for Open-Ended Generation

Paper • 2509.06160 • Published Sep 7, 2025 • 150

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19, 2025 • 118

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 211

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 138

Beyond Transcription: Mechanistic Interpretability in ASR

Paper • 2508.15882 • Published Aug 21, 2025 • 86

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28, 2025 • 89

upvoted 6 papers 5 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 180

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2, 2025 • 238

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

Paper • 2508.02193 • Published Aug 4, 2025 • 133

Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation

Paper • 2508.03320 • Published Aug 5, 2025 • 62

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 267

Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training

Paper • 2508.00414 • Published Aug 1, 2025 • 93