13 24 7

ytaewon

hamzzi

AI & ML interests

None yet

Recent Activity

updated a dataset 8 days ago

DISLab/SumFeed-CoT

published a model 8 days ago

DISLab/ReFeed-8B

updated a model 8 days ago

DISLab/ReFeed-8B

View all activity

Organizations

updated a dataset 8 days ago

DISLab/SumFeed-CoT

Viewer • Updated 8 days ago • 7.71k • 14

published a model 8 days ago

DISLab/ReFeed-8B

Summarization • 8B • Updated 8 days ago • 4

updated a model 8 days ago

DISLab/ReFeed-8B

Summarization • 8B • Updated 8 days ago • 4

published a dataset 8 days ago

DISLab/SumFeed-CoT

Viewer • Updated 8 days ago • 7.71k • 14

commented a paper 3 months ago

LIMI: Less is More for Agency

Paper • 2509.17567 • Published Sep 22 • 102 •

upvoted a paper 3 months ago

LIMI: Less is More for Agency

Paper • 2509.17567 • Published Sep 22 • 102

upvoted a paper 5 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 316

commented a paper 5 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 316 •

upvoted 2 papers 5 months ago

How Far Are We from Believable AI Agents? A Framework for Evaluating the Believability of Human Behavior Simulation

Paper • 2312.17115 • Published Dec 28, 2023 • 2

Towards Dynamic Theory of Mind: Evaluating LLM Adaptation to Temporal Evolution of Human States

Paper • 2505.17663 • Published May 23 • 15

commented a paper 5 months ago

LIMOPro: Reasoning Refinement for Efficient and Effective Test-time Scaling

Paper • 2505.19187 • Published May 25 • 13 •

upvoted a paper 5 months ago

LIMOPro: Reasoning Refinement for Efficient and Effective Test-time Scaling

Paper • 2505.19187 • Published May 25 • 13

updated a model 6 months ago

hamzzi/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

2B • Updated Jun 25 • 8

published 3 models 6 months ago

commented a paper 7 months ago

Learning from Peers in Reasoning Models

Paper • 2505.07787 • Published May 12 • 45 •

upvoted a paper 7 months ago

Learning from Peers in Reasoning Models

Paper • 2505.07787 • Published May 12 • 45

upvoted a paper 8 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 188

commented a paper 8 months ago

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Paper • 2505.04588 • Published May 7 • 65 •

ytaewon

AI & ML interests

Recent Activity

Organizations

hamzzi's activity