Home/Soccer Analytics/Penetrative Pass Probability Model (P3)

Penetrative Pass Probability Model (P3)

Passing MetricsLevel 3 — Advanced

What It Is

A deep learning model that predicts, at any passing moment, the probability that a penetrative pass is available — regardless of whether the player actually plays it. A penetrative pass is defined as one that ends inside the opponent's convex hull (the polygon formed by all outfield defending players, excluding the goalkeeper), with at least one attacking teammate inside to receive. The model uses CNN (ResNet34 with transfer learning from ImageNet) on custom pitch images encoding player positions, Voronoi space control, and the opponent's convex hull from StatsBomb 360 data. Event data alone is insufficient (AUC ~0.6); 360 data achieves AUC-ROC = 0.73.

Correct Execution

(1) Define penetrative passes: a pass whose end location falls inside the opponent's convex hull, with a receiving teammate present inside. Filter to passes originating between the first and last thirds of the pitch (exclude defensive buildup and final-third finishing). (2) Generate pitch images: for each passing moment, create an image encoding: on-ball team (blue), off-ball team (red), passer (black), Voronoi diagram (space control per team), convex hull (green). Normalize orientation so the attacking team always plays bottom-to-top. (3) Train CNN: use ResNet34 pretrained on ImageNet, fine-tune on the penetrative vs. non-penetrative classification task. 8 epochs on GPU (~2 hours). Training loss ~0.3, validation AUC-ROC = 0.73. (4) Set probability threshold: use minimum Euclidean distance from ROC curve to (0,1) point — yields threshold ~0.38. Above threshold = potential penetrative pass available.

Key design decisions: exclude pass end location from features (that would be data leakage — you only know the end location after the pass is made); exclude passer identity and foot (model should be team/player-agnostic); train on Bundesliga, validate on La Liga to prevent data leakage.

Progression Levels

Diagnostic Tree

Coaching Cues

  • "0.4% probability and he still found it. That's the kind of passer you want."
  • "The model says the pass was there. Check the video before blaming the player."
  • "Penetrative doesn't mean through ball. It means into the opponent's structure."

Common Errors

  1. Including pass end location as a feature: This is data leakage — you only know where the pass went after it was played. The model must predict from the pre-pass state.
  2. Using event data alone: Without player positions (360 data), the model can't see the convex hull or space control. Event-only AUC is ~0.6 (near useless).
  3. Blaming players for not playing penetrative passes without checking body orientation: A player facing their own goal physically can't see or play a forward penetrative pass, even if the model says the opportunity existed.

Edges

💎 Elite-Only Behavior

A 0.4% Probability Pass That Succeeds Is Your Best Recruitment Signal

The Penetrative Pass Probability (P3) model predicts whether a penetrative pass is AVAILABLE at any moment — regardless of whether the player actually plays it. The gap between probability and execution is the analytical insight: players who convert low-probability penetrative moments (0.4% probability passes that succeed) demonstrate exceptional passing ability visible nowhere else in the data. These are the signatures of genuinely special creative players.

What most people do
Evaluate passers by completion rate, progressive pass volume, or xA — all of which are dominated by volume and opportunity rather than quality of vision.
What the best do
Identify moments where the P3 model says no penetrative pass should be possible (compact defensive hull, controlled space) but the player finds one anyway. These rare moments (maybe 2-3 per match for elite players) are the strongest signal of creative passing quality available.
Why it's an edge: The market evaluates passers on volume metrics that correlate with team quality and playing time. P3-based evaluation isolates individual vision from team context — a player on a mediocre team who consistently finds penetrative passes that the model says shouldn't exist is a hidden gem.
How to exploit: For each passing target, compute the ratio of actual penetrative passes to P3-predicted opportunities. Players with high actual/predicted ratios in low-probability situations are elite creators. This metric is team-independent and identifies talent on weaker teams that volume-based metrics miss.
Hadi Sotude, StatsBomb Conference, 2021-11-04. 0.4% probability successful pass cited as recruitment signal.

Sources

  • Hadi Sotude, StatsBomb Conference 2021, YouTube, 2021-11-04 — presented CNN-based P3 model using ResNet34 on 360 data images; AUC-ROC 0.73; defined penetrative pass via convex hull; identified camera angle and body orientation as key limitations