$\texttt{MemoryRewardBench}$: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models
Paper
•
2601.11969
•
Published
•
26
Long-context Modeling, Reinforcement-Learning, Multi-modality