Speculative Decoding vs Medusa vs EAGLE

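Three ways to draft tokens that a target model then verifies in a single parallel pass: a separate draft model (classic speculative decoding), extra decoding heads attached to the target model itself (Medusa), or autoregression over the target model's hidden features (EAGLE).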
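
The draft-then-verify loop shared by all three approaches can be sketched roughly as follows. This is a minimal illustration, not any library's implementation. It assumes greedy decoding, where a drafted token is accepted only if it matches the target's own greedy choice; sampling-based variants use a probabilistic acceptance rule instead. The names `speculative_step`, `toy_target`, and `toy_draft` are hypothetical, and the toy "models" are plain functions standing in for real networks. Only the drafter changes between the three methods: a separate small model, extra heads on the target, or a feature-level predictor.

```python
from typing import Callable, List

# A "model" here is any function mapping a token context to its next token.
NextToken = Callable[[List[int]], int]


def speculative_step(target_next: NextToken, draft_next: NextToken,
                     prefix: List[int], k: int) -> List[int]:
    """Draft k tokens, then keep the longest prefix the target agrees with."""
    # 1. Draft: the cheap drafter proposes k tokens one after another.
    drafted: List[int] = []
    ctx = list(prefix)
    for _ in range(k):
        tok = draft_next(ctx)
        drafted.append(tok)
        ctx.append(tok)

    # 2. Verify: compare each drafted token with the target's greedy choice.
    #    A real system scores all k positions in one batched forward pass;
    #    this sketch loops only to stay readable.
    accepted: List[int] = []
    ctx = list(prefix)
    for tok in drafted:
        expected = target_next(ctx)
        if tok != expected:
            # First mismatch: keep the target's token and stop.
            accepted.append(expected)
            return accepted
        accepted.append(tok)
        ctx.append(tok)

    # Every draft matched, so the target's next token comes for free.
    accepted.append(target_next(ctx))
    return accepted


def toy_target(ctx: List[int]) -> int:
    # Stand-in target: counts upward modulo 10.
    return (ctx[-1] + 1) % 10


def toy_draft(ctx: List[int]) -> int:
    # Stand-in drafter: agrees with the target except right after token 3.
    return 0 if ctx[-1] == 3 else (ctx[-1] + 1) % 10


print(speculative_step(toy_target, toy_draft, [0], 4))  # [1, 2, 3, 4]
print(speculative_step(toy_target, toy_draft, [5], 2))  # [6, 7, 8]
```

In the first call the drafter's fourth token is wrong, so the target's correction replaces it; in the second call both drafts match and the step yields three tokens for one verification pass. Under this greedy rule the output is identical to what the target alone would produce, and every step emits at least one token.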

A clear, side-by-side comparison with examples — part of Rudrite Research.