PRefLexOR: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic Thinking Paper • 2410.12375 • Published Oct 16, 2024 • 5
Reliable Fine-Grained Evaluation of Natural Language Math Proofs Paper • 2510.13888 • Published Oct 14, 2025 • 2
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 135
Typhoon Isan Collection An ASR and a language technology artifact for Thailand’s Isan dialect • 5 items • Updated Dec 2, 2025 • 3
ThaiOCRBench: A Task-Diverse Benchmark for Vision-Language Understanding in Thai Paper • 2511.04479 • Published Nov 6, 2025 • 1
FinCoT: Grounding Chain-of-Thought in Expert Financial Reasoning Paper • 2506.16123 • Published Jun 19, 2025 • 8
Phi-4 Collection Phi-4 family of small language, multi-modal and reasoning models. • 17 items • Updated Jul 10, 2025 • 192
Prior Prompt Engineering for Reinforcement Fine-Tuning Paper • 2505.14157 • Published May 20, 2025 • 7
An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging Paper • 2502.09056 • Published Feb 13, 2025 • 31
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding Paper • 2412.10302 • Published Dec 13, 2024 • 21
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 138
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 4 days ago • 549
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought Paper • 2501.04682 • Published Jan 8, 2025 • 99