The DeepSeek lineage
Follow a single frontier lab's innovations in order: the math-RL algorithm they invented, the attention and MoE efficiency tricks, and the reasoning model those choices enabled.
Follow a single frontier lab's innovations in order: the math-RL algorithm they invented, the attention and MoE efficiency tricks, and the reasoning model those choices enabled.