infinitylogesh/book_dataset_no_mem_token_gte_largev1_5_M512_C1024_1B Viewer • Updated 2 days ago • 606k • 18
infinitylogesh/qwen3_1_7b_base_srt_grpo_math_12k_single_stage_fullfinetuning_ckpt50 2B • Updated 6 days ago • 8
infinitylogesh/qwen3_1_7b_base_srt_grpo_math_12k_single_stage_fullfinetuning_ckpt100 2B • Updated 6 days ago • 10
infinitylogesh/qwen3_1_7b_base_srt_grpo_math_12k_single_stage_rollout_16_fullfinetuning_merged 2B • Updated 6 days ago • 8
infinitylogesh/Qwen3-1.7B-GRPO-SRT-Math-12k-Single-Stage-Rollout-16-Full-Finetuning 2B • Updated 8 days ago • 8
infinitylogesh/Qwen3-1.7B-GRPO-SRT-Math-12k-Stage-2 Text Generation • 2B • Updated 10 days ago • 29
infinitylogesh/Qwen3-1.7B-GRPO-SRT-Math-12k-Stage-1 Text Generation • 2B • Updated 14 days ago • 11.6k