Reactive Transformer MVP model with 3B total params and 190M activated in decoder. Training in progress