Forecasting and Anticipating Minutes

How many Premier League minutes do you think Mohamed Salah will play next season?

It may seem a strange question, but accurately anticipating how many minutes players will play in future seasons is absolutely key to building successful squads.

Salah generally starts every game, has a good fitness record, and is young enough that you wouldn’t expect a sudden decline.

We know 38 x 90 = 3420 minutes (excluding injury time) is the maximum possible for any Premier League player so let us say that 2500+ minutes would be a good guess for 2020/2021.

What about for 2021/22? Or 22/23? The certainty decreases. We lack information about future form and fitness, maybe Liverpool have a youngster who is better than 30 year old Salah or maybe Real Madrid make an offer that can’t be refused.

The further ahead we try and anticipate the more difficult it is to have certainty.

What is even trickier is predicting future growth of players.

Sticking with Liverpool, how many Premier League minutes do we expect from Curtis Jones next season?

He is an excellent young player but Liverpool have established players ahead of him and very specific tactical requirements. Maybe 300 minutes?

What about in 2022/23? Do we anticipate 2500 minutes and him as a first-team regular?

This series of articles will show why forecasting matters, and look at the logical steps clubs can take to plan their recruitment effectively.

For as we will see there is a very high correlation between coherently assembled squads and success.

Quality not quantity

The below graphs show the number of players used for >100 minutes this season in every club from the Premier League to League Two.

In each step we drop the average number of players used per team increases.

Premier League = 21.8 players used

Championship = 24.5 players used

League One = 25.5 players used

League Two = 26.1 players used

The correlation is interesting too between the number of players used and the points gained.

The X axis shows the number of players used and the Y axis the points gained in 2019/20 to date.

In the Premier League data we see a slight negative correlation between the number of players used (for >100 minutes) and the number of points gained.

With an R value of -0.27 the relationship isn’t that strong.

In the Championship we see a medium-strength negative correlation. This time R-0.66

League One comes in a strong negative correlation of R-0.797

And League Two also had a moderate negative correlation of R-0.57

In summary there is a good chance that the more players a team uses over the course of a season the fewer points they will gather.

But as everyone who has ever mentioned correlation on the internet is always told; correlation does not equal causation.

In other words we need to look at why might a team change frequently?

Injuries, financial problems, losing lots of games, managerial changes, different systems.

So perhaps we’d be better saying that the problem is instability.

Stable teams do better. They have smaller, tighter, and more coherently formed squads.

It isn’t a surprise to see that the teams with the fewest used players in the Premier League are Burnley, Sheffield United, Watford, Wolves, and Brighton. These are fairly coherent squads who are the leagues overachievers financially.

We can take this further and look at the core players, those squads members playing at least 2000, roughly half a season, of game time each year.

This time we want to compare the size of the core (defined as those players with over 2000 league minutes played) with the number of points won in the season.

The Premier League season (within our dataset) that correlates most strongly with the theory that playing a stable core lineup leads to good results is the 2015/16 season. Leicester’s famous title winning team was basically unchanged each week. 

Even so in the Premier League core stability is only moderately correlated to points won. 0.44 in this cherry-picked season and usually around 0.25.

In the 2019/20 season to date we see a 0.26 correlation. Perhaps if we were to list which teams are most coherently assembled for the money available to them we may see a pattern though with Wolves, Leicester and Sheffield United the top three for using the same core regularly. 

How about the EFL divisions during 2018/19?

The Championship had a moderate correlation of 0.51. The relegated Premier League clubs tended to have large squads and failed to find a regular core. Birmingham’s point deductions forced them into using a small squad and actually performed well.

League One showed the lowest correlation at 0.18 but looking into the data it is obvious as to why. Luton Town topped the division with a core of 9 who played almost every week. Other high performing teams showed up well too. However both Sunderland and Charlton rotated players throughout the season and finished in the top 6. It is certainly possible to brute force the creation of a winning team by trying lots of different players but it doesn’t seem to be financially prudent. Remove the outliers and the correlation is around 0.40.

League Two seems to be the best example of a league where a small number of key players playing regularly helps. A 0.70 correlation this time between the number of players with 2000+ played minutes and total points won.

Our hypothesis is that the quality of replacement players is the key to why the Premier League teams show the smallest correlation in general between points and core stability. If Sergio Aguero is injured then playing Gabriel Jesus is not too much of a drop off. However, if the League Two top scorer is out for a month his replacement may be an untested kid. 

The lower down the leagues you go the best ability may be availability.

And assessing availability is the key to anticipating minutes.

And anticipated minutes is a key part of forecasting.

Leave a comment

Your email address will not be published. Required fields are marked *