Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text Paper • 2601.22975 • Published 3 days ago • 25
MMFineReason Collection High-quality STEM reasoning dataset for Multimodal LLM post-training. • 13 items • Updated 3 days ago • 19
Continually pre-trained models Collection Language-specific LLMs continually pre-trained from fully open English base models • 2 items • Updated 12 days ago • 1
MS MARCO Mined Triplets Collection These datasets contain MS MARCO Triplets gathered by mining hard negatives using various models. Each dataset has various subsets. • 16 items • Updated 4 days ago • 13