NHL Playoff Prediction Methodology

How we compute series win probabilities for the 2026 Stanley Cup Playoffs.

Overview

Our model predicts the probability each team wins a best-of-seven playoff series using four factors combined into a per-game win probability, then run through a 10,000-simulation Monte Carlo of the remaining games. Predictions update live after every game played and automatically incorporate in-series data once three games have been played.

The weights for each factor were derived empirically from five seasons of historical playoff data (2020-21 through 2024-25, 75 series total) — not from intuition. We measured each factor's correlation with actual series outcomes and assigned weight proportional to predictive power.

The four factors

FactorWeightR1 correlation
Elo rating37%0.457
Starting goaltender GSAx/6028%0.355
Power play xGF per GP20%0.247
Penalty kill GA per GP (inverse)15%0.193

Elo rating (37% weight)

Elo is a continuously-updated team strength rating that incorporates every game result, opponent quality, and margin of victory. It was the single strongest predictor of playoff series outcomes in our historical analysis. Our Elo implementation updates after every game, so by the start of playoffs each team's rating reflects their full regular-season body of work — including late-season form.

Note: Elo correlates strongly with season goal differential (r=0.76), points percentage (r=0.76), and 5v5 xGF% (r=0.67). Including all of those separately would be triple-counting team strength. Using Elo alone captures the combined signal cleanly.

Starting goaltender GSAx/60 (28% weight)

Goals Saved Above Expected per 60 minutes. A shot-quality-adjusted measure of goaltender performance: positive numbers mean the goalie is stopping more pucks than an average NHL goalie would given the quality of chances faced.

GSAx is nearly independent from Elo (r=0.014 in our analysis) — a team's goaltender quality isn't baked into their team strength rating, making this a genuinely additive factor. We use current-season data only; historical multi-year goalie data has poor year-to-year repeatability.

Why not raw save percentage (SV%)? In our analysis, raw SV% had essentially zero predictive power (r=0.012 all rounds, -0.117 for Round 1). It's a well-known but noisy stat because it doesn't adjust for shot difficulty. GSAx is the correct metric.

Power play xGF per GP (20% weight)

Expected goals generated on the power play, per team game played across the season. We use expected goals rather than raw goals because xG captures chance quality — a team that creates high-danger scoring chances on the power play will continue to convert in the playoffs even if their regular-season finishing was unlucky.

Penalty kill GA per GP (15% weight, inverse)

Goals allowed on the penalty kill per team game played. Lower is better, so this factor favors the team with the stronger penalty kill. In tight playoff games, preventing power play goals is a meaningful differentiator.

How the weights were determined

We ran a correlation analysis on 75 playoff series (2020-21 through 2024-25) — 40 Round 1 series plus 35 later-round series. For each potential factor, we computed:

Factors with higher correlations received proportionally more weight. We also checked pairwise correlations between factors to avoid including multiple metrics that measure the same underlying thing — hence using Elo alone as our team-strength factor rather than stacking goal differential, points %, and CF% on top.

Factors we considered and dropped

Several metrics that are frequently cited in playoff predictions didn't hold up against our data:

Per-game win probability

The four factors plus home ice are combined into a single per-game win probability for the higher-seeded team:

per_game_prob = 0.50
  + elo_shift        // max ±0.08, scaled by 0.04 per 100 Elo points
  + gsax_shift       // max ±0.06, scaled by 0.03 per 0.5 GSAx/60
  + pp_shift         // max ±0.04, scaled by 0.02 per 0.25 PP xGF/GP
  + pk_shift         // max ±0.03, scaled by 0.015 per 0.25 PK GA/GP
  + home_ice_shift   // ±0.04 depending on venue

Clamped to [0.28, 0.72]

The clamp exists because extreme factor alignment in a small sample can produce mathematically unrealistic per-game probabilities. In practice, the best historical Round 1 favorites produced series win probabilities around 75-78%.

Monte Carlo simulation

Once we have per-game probabilities for each of the remaining games in the series (games already played are locked in at their actual result), we run a 10,000-iteration Monte Carlo simulation. In each iteration:

  1. Start from the current series score (e.g., 2-1 after Game 3)
  2. Simulate the next scheduled game using its venue-specific per-game probability
  3. Increment the winner's series score
  4. Continue until one team reaches 4 wins, or we exhaust the remaining schedule

The final series win probability is the fraction of simulations where each team reached 4 wins first. We also track series length distributions (4-, 5-, 6-, 7-game outcomes) to produce the "most likely outcome" shown on each series page.

Live updates during a series

Predictions recompute on every page load, automatically reflecting every game that's been played:

Validation against historical series-state base rates

We verified that our Monte Carlo produces realistic probabilities at each possible series state by comparing against historical NHL data:

Series stateHistorical leader win %Our MC (equal teams)
Up 1-068%~66%
Up 2-084%~81%
Up 2-171%~66%
Up 3-099%~94%
Up 3-188%~81%
Up 3-275%~75%

These match within 1-6 percentage points. The small empirical "extras" likely reflect real momentum or lineup adjustments that aren't in our model, but the gap is small enough that we don't apply an additional correction.

Round-specific behavior (and a caveat)

Historical higher-seed series win rates by round:

By Round 2 and beyond, the remaining teams are much more evenly matched in true talent regardless of regular-season seeding — upsets are common and "being the higher seed" stops meaning much.

Current weights are calibrated on Round 1 data. When Round 2 begins, we'll recompute weights specifically from Round 2+ historical data, which will likely reduce the impact of seeding-driven factors (Elo, goal differential) relative to goaltending and special teams.

Known limitations

Where to see predictions on the site

Questions or feedback? Model tuning is ongoing — we'll refine these weights after each playoff year as more data accumulates.