Speculative Decoding vs Medusa vs EAGLE
Three ways to draft tokens for a target model to verify in parallel: a separate draft model (speculative decoding), extra decoding heads on the target model itself (Medusa), or feature-level autoregression (EAGLE).
A clear, side-by-side comparison with examples — part of Rudrite Research.