arxiv:2510.18821
Yuhang Wen
Necolizer
AI & ML interests
Agent RL, RL Post-train, LLM
Recent Activity
new activity
1 day ago
Quark-LLM/SSP:docs: update readme
new activity
about 1 month ago
Quark-LLM/SSP:feat: upload training and evaluation data
commented on
a paper
2 months ago
Search Self-play: Pushing the Frontier of Agent Capability without
Supervision