Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
MultiRL
non-profit
Activity Feed
Follow
3
AI & ML interests
None defined yet.
Recent Activity
KimSHine
updated
a model
1 day ago
MultiRL/qwen3_1.7b_easy_rl_gamma_1_step_40
KimSHine
published
a model
1 day ago
MultiRL/qwen3_1.7b_easy_rl_gamma_1_step_40
KimSHine
updated
a model
3 days ago
MultiRL/qwen3_4b_easy_rl_our_adv_final
View all activity
Team members
3
models
76
Sort: Recently updated
MultiRL/qwen3_1.7b_easy_rl_gamma_1_step_40
2B
•
Updated
1 day ago
•
92
MultiRL/qwen3_4b_easy_rl_our_adv_final
4B
•
Updated
3 days ago
•
1.34k
MultiRL/qwen3_1.7b_easy_rl_final_group_norm
2B
•
Updated
3 days ago
•
91
MultiRL/qwen3_1.7b_easy_rl_final_gamma_1
2B
•
Updated
7 days ago
•
2.88k
MultiRL/qwen3_4b_base_easy_rl_final
4B
•
Updated
7 days ago
•
135
MultiRL/qwen3_4b_base_sft_final
4B
•
Updated
8 days ago
•
557
MultiRL/qwen3_4b_easy_rl_new
4B
•
Updated
9 days ago
•
1.15k
MultiRL/qwen3_1.7b_easy_rl_gspo
2B
•
Updated
9 days ago
•
106
MultiRL/qwen3_4b_sft_new
4B
•
Updated
10 days ago
•
792
MultiRL/qwen3_1.7b_easy_rl_final_step120
2B
•
Updated
10 days ago
•
3.48k
View 76 models
datasets
18
Sort: Recently updated
MultiRL/final_sudoku_hard_new_rl
Viewer
•
Updated
7 days ago
•
480
•
33
MultiRL/final_sudoku_hard_rl_hint_raw_new
Viewer
•
Updated
8 days ago
•
635
•
23
MultiRL/final_sudoku_hard_rl_hint_raw
Viewer
•
Updated
8 days ago
•
640
•
14
MultiRL/final_sudoku_benchmark_with_hint_solver_difficulty
Viewer
•
Updated
9 days ago
•
300
•
11
MultiRL/final_sudoku_benchmark
Viewer
•
Updated
14 days ago
•
680
•
352
MultiRL/Sudoku-Benchmark_new
Viewer
•
Updated
15 days ago
•
300
•
20
MultiRL/final_sudoku_sft_A
Viewer
•
Updated
15 days ago
•
399
•
12
MultiRL/sudoku_hard_solved_first_final
Viewer
•
Updated
17 days ago
•
640
•
21
MultiRL/sudoku_hard_solved_25_final
Viewer
•
Updated
17 days ago
•
640
•
25
MultiRL/sudoku_hard_solved_10_final
Viewer
•
Updated
17 days ago
•
640
•
10
View 18 datasets