Home/Soccer Analytics/Buildup Sequence Markov Model

Buildup Sequence Markov Model

Tactical AnalysisLevel 3 — Advanced

What It Is

A Markov Decision Process (MDP) specifically for modeling buildup sequences — the structured phase where a defender has the ball, no offensive pressure is applied, and the team attempts to progress into the final third. Unlike general xT or EPV models that cover all possessions, this model is conditioned on the defensive setup being faced (high block, low block, left-forcing, right-forcing). Using StatsBomb 360 data, the model incorporates both ball location (30 zones) and reachable teammate information (4 binary features per state: is there a reachable teammate to the left, right, in front, or behind the ball carrier). This produces a separate behavioral model per defensive setup per team.

Correct Execution

(1) Extract buildup sequences: filter possessions where a defender has the ball with no offensive pressure. (2) Classify defensive setup from 360 data: high block vs. low block (based on defensive line height); left-forcing vs. right-forcing (based on defensive shape asymmetry). (3) Build one MDP per setup: states = 30 pitch zones; actions = move-to (with intended destination) or shoot; transition probabilities extracted from event data; reward = 1 for reaching the final third. (4) Extended MDP: add reachable teammate info as 4 binary features per state — teammate is "reachable" if no opponent is within 1.5m of them AND no opponent is close enough to the passing lane to intercept. (5) Extract most likely sequences per team per defensive setup to identify preferred buildup patterns.

Key data requirement: StatsBomb 360 data (player position snapshots for every action) is essential for defensive setup classification and reachable teammate modeling. Standard event data alone is insufficient.

Progression Levels

Diagnostic Tree

Coaching Cues

  • "Barcelona goes through the center then to the sides. Real Madrid prefers the left. The model shows you the pattern."
  • "If they play the same buildup against everyone, they're predictable."
  • "The 360 data tells you who's actually open, not just where they are."

Common Errors

  1. Building one model for all possessions: Buildups are a specific structured phase. Mixing them with counter-attacks or set pieces contaminates the transition probabilities.
  2. Not conditioning on defensive setup: An unconditional model averages across all defensive contexts. Conditioning reveals how teams adapt.
  3. Using event data alone for reachable teammate modeling: Without 360 data, you can't determine which teammates are actually reachable vs. blocked by defenders.

Sources

  • Micah, KU Leuven (DTAI Sports Analytics Lab), StatsBomb Conference 2021, YouTube, 2021-11-04 — presented MDP for buildups conditioned on defensive setup using 360 data; showed Barcelona, Real Madrid, and Atlético buildup patterns under high/low block