Seungone Kim PRO

seungone
https://seungonekim.github.io/
  • seungonekim
  • SeungoneKim

AI & ML interests

Large Language Models, LLM-as-a-Judge, Reward Model Overoptimization, Personalized Alignment

Recent Activity

liked a dataset 5 days ago
facebook/principia-bench
authored a paper 23 days ago
RefineBench: Evaluating Refinement Capability of Language Models via Checklists
upvoted a paper 23 days ago
RefineBench: Evaluating Refinement Capability of Language Models via Checklists

Organizations

  • NeuLab @ LTI/CMU
  • HAE-RAE
  • Mixture of Rewards
  • CMU-LTI
  • KAIST AI
  • Human_Eval_RLHF
  • prometheus-vision
  • prometheus-eval
  • MPA human eval
  • multilingual-reward-bench
  • Agora
  • 11777-S25 Project
  • cot_encyclopedia_human_eval
  • RefineBench
  • Carnegie Mellon University

Papers 33

arxiv:2511.22173
arxiv:2506.01789
arxiv:2505.22202
arxiv:2505.16409

Spaces 2

✍ My Argilla

Pinned • Running • Apr 12

🟧 Test3

Runtime error • Apr 12

Models 1

seungone/skywork-reward-replicate

Text Classification • 8B • Updated Dec 11, 2024 • 11

Datasets 5

seungone/ablation1_math_gpt4o_mini

Viewer • Updated Nov 25, 2024 • 5.56k • 26

seungone/ablation3_math_llama3.1_8b_instruct

Viewer • Updated Nov 25, 2024 • 24.8k • 33

seungone/ablation2_math_llama3.1_8b_instruct

Viewer • Updated Nov 25, 2024 • 5.99k • 32

seungone/ablation1_code_gpt4o_mini

Viewer • Updated Nov 25, 2024 • 10k • 30

seungone/final-math-claude3.5_sonnet-10000

Viewer • Updated Sep 16, 2024 • 10k • 25 • 1