Home/Soccer Analytics/Incremental Analytics Adoption Strategy

Incremental Analytics Adoption Strategy

Data InfrastructureLevel 2 — Intermediate

What It Is

A communication and deployment strategy for introducing analytics into a sports organization: start with simple descriptive statistics that accurately name what happened in the game, make sure they are well-defined and discrete (A is A, not B), then build toward complex models by stacking understood building blocks. The key insight from linguistics: language has six properties — arbitrariness, cultural transmission, discreteness, displacement, duality, and productivity. Analytics metrics must be culturally transmitted (taught, not assumed), discrete (clearly separated from similar metrics), and productive (combinable into larger meanings). The more complex something becomes, the harder it is for non-specialists to follow — so build the shared vocabulary first.

Correct Execution

(1) Start with descriptive stats: define well-named, well-bounded counting stats that describe what happened in a game in terms a coach would recognize. These are the atoms. (2) Ensure discreteness: each metric must mean one thing, not two. If "superstar" can mean top-5 or top-15, the word is useless — define it precisely. (3) Name things well: put the calculation in the name when possible (xG = expected goals; WHIP = walks + hits per inning pitched). Avoid opaque names (Corsi, Fenwick, PDO) and proprietary "roll-up grades" that promise to solve everything. (4) Build incrementally: once the basic vocabulary is shared, combine building blocks into more complex models. The underlying complexity doesn't change, but the apparent complexity to the audience decreases because they understand the components. (5) Budget 3+ years: this is a multi-year project. Each building block takes months to define, validate, and get buy-in.

Progression Levels

Diagnostic Tree

Coaching Cues

  • "If they don't understand the ingredients, they won't trust the recipe."
  • "Define the word before you debate the number."
  • "Descriptive stats are incredibly underrated. They're the foundation."
  • "Naming things well doesn't ensure adoption. Naming things poorly ensures there won't be adoption."

Common Errors

  1. Skipping to complex models: Building a WAR-equivalent before establishing xG adoption will fail. The building blocks must be understood first.
  2. Using opaque or proprietary names: Metrics named after people or using meaningless acronyms block adoption regardless of their analytical quality.
  3. Selling "roll-up grades" that solve everything: Single-number player grades promise to eliminate analysis work. They undermine trust and prevent deeper engagement.

Edges

🔑 Hidden Causal Lever

Descriptive Statistics Are Incredibly Underrated — Skip Them and Your Models Will Be Ignored

Analytics adoption follows a linguistic framework: metrics must be culturally transmitted, discrete (clearly bounded), and productive (combinable). Skipping straight to complex models before establishing shared vocabulary for basic descriptive stats is the #1 reason analytics departments fail. When a coach and analyst argue about a finding, they're usually arguing about definitions, not data. Building the shared vocabulary takes 3+ years.

What most people do
Jump to complex models (WAR-equivalents, EPV, xG-based composite grades) because they're analytically interesting, expecting coaches to adopt them based on accuracy.
What the best do
Start with simple, well-named counting stats that coaches recognize from watching the game. Build incrementally: once basic vocabulary is shared, combine building blocks into models. The apparent complexity decreases because stakeholders understand the components. Budget 3+ years for this process.
Why it's an edge: Clubs that rush to sophisticated models waste analytical investment because coaches can't use what they don't understand. The clubs that dominate are the ones that built vocabulary first — their coaches trust the data because they understand the ingredients. This is a competitive advantage measured in years of organizational learning.
How to exploit: Before deploying any model, ensure the component metrics are understood by all stakeholders. Name metrics so the calculation is in the name (like "Expected Goals"). Avoid opaque acronyms and proprietary roll-up grades. Test adoption: "Can the coach explain what this metric means in football terms?" If not, step back to simpler building blocks.
Seth Partnow, StatsBomb Innovation in Football Conference, 2019-10-28. Applied linguistics framework to analytics adoption. Cited Bill James: "statistics have acquired the power of language."
🔑 Hidden Causal Lever

Analytics Departments Fail Because They Solve Problems Nobody Asked About

The most common failure mode for analytics departments is not analytical quality but problem selection. Analysts build sophisticated models that answer questions coaches never had, then are surprised when their work is ignored. The root cause: analysts optimize for analytical impressiveness rather than coach/decision-maker pain points. The fix is not better models but better problem discovery — sit in tactical meetings, listen to what coaches argue about, and solve THOSE problems.

What most people do
Build the most analytically sophisticated work they can and present it, hoping the coach will see the value.
What the best do
Spend the first month in a new role attending every tactical meeting without presenting anything. Catalog the recurring arguments, the information gaps that coaches work around, and the decisions made with insufficient data. Then build tools that close THOSE gaps — not the gaps the analyst finds interesting.
Why it's an edge: A simple tool that answers a question the coach asks every week has 100x the impact of a sophisticated model that answers a question nobody has. The analyst's skill ceiling is bounded by their ability to identify the right problems, not their modeling capability.
How to exploit: Before building any new analytical tool, verify that a specific decision-maker has the problem you're solving. If you can't name the person and the decision, the tool will be ignored.
Ted Knutson, Barcelona Coach Analytics Summit, 2018-11-18; Sam Gregory, Inter Miami, StatsBomb Conference, 2022-09-29. Both emphasize problem selection over model sophistication.

Sources

  • Seth Partnow, StatsBomb Innovation in Football Conference, YouTube, 2019-10-28 — applied linguistics framework to analytics adoption; demonstrated alternative box score exercise; argued for incremental complexity building; cited Bill James's "statistics have acquired the power of language"