PaceWang

PaceWang · hill2hill

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 6 months ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 251

upvoted 3 articles 10 months ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024 • 442

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025 • 275

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11, 2025 • 106

upvoted a paper almost 2 years ago

Getting it Right: Improving Spatial Consistency in Text-to-Image Models

Paper • 2404.01197 • Published Apr 1, 2024 • 31