A deep learning model that predicts, at any passing moment, the probability that a penetrative pass is available — regardless of whether the player actually plays it. A penetrative pass is defined as one that ends inside the opponent's convex hull (the polygon formed by all outfield defending players, excluding the goalkeeper), with at least one attacking teammate inside to receive. The model uses CNN (ResNet34 with transfer learning from ImageNet) on custom pitch images encoding player positions, Voronoi space control, and the opponent's convex hull from StatsBomb 360 data. Event data alone is insufficient (AUC ~0.6); 360 data achieves AUC-ROC = 0.73.
(1) Define penetrative passes: a pass whose end location falls inside the opponent's convex hull, with a receiving teammate present inside. Filter to passes originating between the first and last thirds of the pitch (exclude defensive buildup and final-third finishing). (2) Generate pitch images: for each passing moment, create an image encoding: on-ball team (blue), off-ball team (red), passer (black), Voronoi diagram (space control per team), convex hull (green). Normalize orientation so the attacking team always plays bottom-to-top. (3) Train CNN: use ResNet34 pretrained on ImageNet, fine-tune on the penetrative vs. non-penetrative classification task. 8 epochs on GPU (~2 hours). Training loss ~0.3, validation AUC-ROC = 0.73. (4) Set probability threshold: use minimum Euclidean distance from ROC curve to (0,1) point — yields threshold ~0.38. Above threshold = potential penetrative pass available.
Key design decisions: exclude pass end location from features (that would be data leakage — you only know the end location after the pass is made); exclude passer identity and foot (model should be team/player-agnostic); train on Bundesliga, validate on La Liga to prevent data leakage.
The Penetrative Pass Probability (P3) model predicts whether a penetrative pass is AVAILABLE at any moment — regardless of whether the player actually plays it. The gap between probability and execution is the analytical insight: players who convert low-probability penetrative moments (0.4% probability passes that succeed) demonstrate exceptional passing ability visible nowhere else in the data. These are the signatures of genuinely special creative players.