Selective Steering: Norm-Preserving Control Through Discriminative Layer Selection Paper • 2601.19375 • Published 1 day ago • 5
TensorLens: End-to-End Transformer Analysis via High-Order Attention Tensors Paper • 2601.17958 • Published 3 days ago • 1
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability Paper • 2601.18778 • Published 2 days ago • 26
OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer Paper • 2601.14250 • Published 8 days ago • 44 • 5
HeartMuLa: A Family of Open Sourced Music Foundation Models Paper • 2601.10547 • Published 13 days ago • 38
Lost in the Noise: How Reasoning Models Fail with Contextual Distractors Paper • 2601.07226 • Published 17 days ago • 32
NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation Paper • 2601.02204 • Published 23 days ago • 60
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits Paper • 2512.20578 • Published Dec 23, 2025 • 83