We freeze every forecast before kickoff and grade it against the real result — then publish the numbers.
Every forecast is frozen in our prediction log before the match kicks off. After the final whistle, we compare the frozen prediction to the real result and record:
Nothing is staged or cherry-picked. Every graded prediction is published in the prediction log.
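For readers who want to see the idea in code, here is a minimal sketch of the freeze-then-grade flow. It is an illustration, not our production code: the names `Forecast`, `freeze`, and `grade` are hypothetical, and the two graded fields shown (a hit/miss flag and a Brier score) are assumed examples of what a log entry might hold.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)  # frozen=True: fields cannot be reassigned after creation
class Forecast:
    match_id: str
    predicted_winner: str
    win_probability: float  # probability assigned to predicted_winner
    frozen_at: datetime


def freeze(match_id: str, predicted_winner: str, win_probability: float,
           kickoff: datetime, now: datetime) -> Forecast:
    """Create a log entry, refusing anything submitted at or after kickoff."""
    if now >= kickoff:
        raise ValueError("forecast must be frozen before kickoff")
    if not 0.0 <= win_probability <= 1.0:
        raise ValueError("win_probability must be between 0 and 1")
    return Forecast(match_id, predicted_winner, win_probability, now)


def grade(forecast: Forecast, actual_winner: str) -> dict:
    """Compare a frozen forecast to the real result (illustrative fields only)."""
    correct = forecast.predicted_winner == actual_winner
    outcome = 1.0 if correct else 0.0
    # Brier score for the predicted side: 0 is perfect, 1 is worst.
    brier = (forecast.win_probability - outcome) ** 2
    return {"match_id": forecast.match_id, "correct": correct, "brier": brier}


# Usage with made-up values
kickoff = datetime(2024, 1, 1, 15, 0, tzinfo=timezone.utc)
before = datetime(2024, 1, 1, 14, 0, tzinfo=timezone.utc)
entry = freeze("match-001", "Team A", 0.7, kickoff, before)
result = grade(entry, "Team A")
```

The point of the frozen dataclass and the kickoff check is that a forecast cannot be created late or quietly edited once the result is known.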
We back-test our model head-to-head against every serious alternative: Dixon-Coles, a self-fitting regression (GLM), generic Elo, raw goal-scoring strength, and simply sorting by national rank. Every approach is scored on the same set of real, decisive games, so no method gets an easier schedule than another. Our model has come out on top in the states where we've validated it.
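A head-to-head comparison of this kind can be sketched as below. This is a simplified illustration under stated assumptions: it scores each method by plain pick accuracy, the teams, ranks, goal rates, and results are made up, and only two toy baselines are shown (`by_rank` and `by_goals` are hypothetical stand-ins, not our model or a real Dixon-Coles or Elo implementation).

```python
from typing import Callable, Dict, List, NamedTuple


class Game(NamedTuple):
    home: str
    away: str
    winner: str  # decisive games only, so there is always a winner


Picker = Callable[[Game], str]  # a method is anything that picks a winner


def accuracy(picker: Picker, games: List[Game]) -> float:
    """Share of games where the picker chose the actual winner."""
    if not games:
        raise ValueError("need at least one game to score")
    hits = sum(picker(g) == g.winner for g in games)
    return hits / len(games)


def backtest(pickers: Dict[str, Picker], games: List[Game]) -> Dict[str, float]:
    """Score every method on the same list of games."""
    return {name: accuracy(p, games) for name, p in pickers.items()}


# Made-up toy inputs, for illustration only
RANK = {"A": 1, "B": 2, "C": 3}                  # lower is better
GOALS_PER_GAME = {"A": 1.2, "B": 2.5, "C": 1.9}  # higher is better


def by_rank(g: Game) -> str:
    return g.home if RANK[g.home] < RANK[g.away] else g.away


def by_goals(g: Game) -> str:
    return g.home if GOALS_PER_GAME[g.home] > GOALS_PER_GAME[g.away] else g.away


GAMES = [
    Game("A", "B", "B"),
    Game("B", "C", "B"),
    Game("A", "C", "A"),
    Game("C", "A", "A"),
]

scores = backtest({"rank": by_rank, "goals": by_goals}, GAMES)
```

Because every picker sees the identical `GAMES` list, any difference in score comes from the method rather than from the games it happened to be tested on.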