Who published ProRL: Prolonged RL Expands Reasoning Boundaries, and where?

Liu et al. — arXiv 2025 (arXiv:2505.24864).

ProRL: Prolonged RL Expands Reasoning Boundaries

Prolonged RL with KL resets expands what a reasoning model can do, not just sharpens it.

Liu et al. · arXiv 2025 · Reasoning & RL. Read the paper ↗

A free, interactive, animated visual explainer of ProRL: Prolonged RL Expands Reasoning Boundaries — every exhibit computed from the real formulas, with verbatim quotes from the source.

Questions

What is ProRL: Prolonged RL Expands Reasoning Boundaries?: Prolonged RL with KL resets expands what a reasoning model can do, not just sharpens it.
Who published ProRL: Prolonged RL Expands Reasoning Boundaries, and where?: Liu et al. — arXiv 2025 (arXiv:2505.24864).
Where can I find a visual explainer of ProRL: Prolonged RL Expands Reasoning Boundaries?: Right here — a free, interactive, animated walkthrough of the whole paper, with exhibits computed from the real formulas and verbatim quotes from the source.

Related explainers

DeepSeek-R1
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Training language models to follow instructions with human feedback
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Constitutional AI: Harmlessness from AI Feedback
DAPO: An Open-Source LLM Reinforcement Learning System at Scale