Constructing or combining predictive models to generate team ratings and match outcome probabilities — then layering human judgment on top. The competitive edge is not in any single model but in the combination of models and contextual knowledge.
You use multiple models (Market-Implied + xG at minimum) as baselines. You understand what each model captures and where it's blind. You layer in human knowledge about injuries, coaching, financial situations, and matchups. You resist the temptation to overfit models to match bookmaker odds.
Tuning model drift factors to match bookmaker odds is circular — you're forcing your model to agree with the market, which is the exact thing you're trying to beat. The whole point of a model is independent assessment. Calibrating to market prices produces a model that confirms the market rather than finding where it's wrong.
EPL team ratings stay flat (99% of initial) across a season. Championship ratings end at 149% of initial dispersion, League One at 168%, League Two at 182%. This means a static-rating model works fine for EPL but is systematically wrong for lower divisions where 17-25% of teams are new each season and quality shifts dramatically.
Market-Implied ratings (what the market thinks) and xG models (what the data shows) approach team quality from completely different angles. When both agree, confidence is high. When they diverge, the divergence itself is the signal — investigate WHY they disagree, because one of them is seeing something the other can't.
Books set prop lines using the mean (average). But skewed distributions (receiving yards, points scored, rushing yards) always have a median below the mean — big games pull the mean up while most games are below average. In-game injuries can only hurt, never help, a player's total. The structural result: overs are systematically overpriced and unders systematically underpriced.
A binary garbage-time cutoff (e.g., "filter all plays under 5% WP") creates discontinuous functions and throws away real signal. Teams performing well in garbage time exhibit real skill (moving the ball, completing passes) — it's just lower-leverage. A continuous leverage multiplier that drops off steeply near blowout territory captures this signal while the public's binary "doesn't count" filter discards it.