Don’t give fast bowlers the new ball so often

Analysing the effectiveness of bowlers with the old ball, I think Fast bowlers should be used sparingly with the new ball, which should mainly be in the hands of swing/seam bowlers.

I was looking at the impact of the new ball in Tests, and how it varies by country*. The general trend is that once the ball is 20 odd overs old the pace bowlers get little help. But then Australia bucked the trend.

Enigmatic Australia. I couldn’t find a new ball benefit for pace bowlers, other than the first three overs.

Fig 1: Pace bowler average against batsmen that average over 30, since 2005, in Australia. Note that the green line is a nine-over-rolling average, while the blue line is how I see the trend.

If you knew nothing of cricket, and just went by the chart, you’d say overs 6-20 (average 37) are as hard to bowl in as any of the first 60 overs.

Why isn’t the new ball helping the bowlers in Australia? I think it’s because selectors pick fast bowlers who are best suited to the quick and bouncy wickets. Think Steven Finn rather than Chris Woakes. “Fast” bowlers are more consistent through an innings than other pace bowlers**. Here’s the performances of fast bowlers*** in all countries:

Fig 2: Fast bowler average against batsmen that average over 30, since 2005. All countries.

Figures 1 and 2 are very similar! Averages by over in Australia look just like those of fast bowlers generally. The pitches in Australia encourage fast bowling, so the graphs are basically the same. Putting it another way, the new ball effect looks small in Australia because most bowlers don’t rely on the new ball.

What about pace (not fast) bowlers? Contrastingly, they are deadly with a brand new ball, dangerous until the 20th over, but then rather ineffective – especially overs 60-80.

Fig 3: Pace (excl Fast) bowler average against batsmen that average over 30, since 2005

Pace bowlers are not a homogeneous group. From now on, my model won’t just look at spin vs pace, it will split pace bowlers into “fast” and “not fast”. The “fast” bowlers don’t need the new ball, but do get an edge in the first five overs as the batsmen aren’t set. Other pace bowlers get a boost through the first 20 overs.

Who should bowl and when?

A big question. The fielding team’s goal is to minimise the expected runs of the batting team. That means managing resources – 68% of innings last over 70 overs, so four bowlers are going to at least three spells. When should those spells be?

There’s a point in the innings when a fast bowler becomes more effective than a swing bowler. It depends on the ground, the relative quality of bowlers, and the weather.

On an average pitch, the crossover is in the fifteenth over. If you had an equally talented attack of three swing bowlers and one fast bowler, the fast bowler should be held back until after the crossover, and bowl as much as possible with the old ball.

The trend of the above charts (since 2005) still holds true: current Fast bowlers average 4% more in overs 20-80 compared to overs 1-19. The equivalent figure for other pace bowlers is a whopping 19%****. You don’t want to make Medium-Fast / Fast-Medium bowlers use and old ball.

Please forgive the absolutes above. Of course, there’s no sudden leap between “Fast” and “Fast Medium” bowlers. And if the old ball is reversing then by all means pass it to the swing bowler.

But – I think this kind of analysis is important. It’s only by codifying and quantifying we can get closer to understanding the game. The assumptions and simplifications can be ironed out later.

What’s next? Once someone (maybe me, maybe you) has the data on how spell length impacts performance, and a reliable way of combining specific (head-to-head) and general matchups (eg. OS vs LHB), we’ll have a model for the optimum bowler for the next over. From there it’s a small step to planning optimum bowlers for the next session.

Footnotes

*Methodology: Pace bowlers (career average under 35) against batsmen (career average over 30). That way we’re avoiding the effect of cheap wickets at the end of the innings, and just looking at the real contest between bat and ball. All my ball-by-ball data comes from cricsheet.org.

** Fast bowlers should be about as quick later in the day. Two bits of evidence for this- firstly the academic research implies it (link and link). Secondly the speed data for Jasprit Bumrah and Olly Stone from India v England, 2021.

Data from India vs England 2021, screengrabs from the BCCI website

And, because it’s interesting I’ll give you two more footnotes to this footnote:

  1. According to this there is a speed decrease of around 4kph from bowling in heat on consecutive days
  2. Jofra Archer had a tendency to decrease in speed through the innings in India. He might not be like other fast bowlers. That’s not necessarily a criticism – being able to switch from RF to RFM might allow him to bowl more overs.

***Note this is 20 Fast bowlers, among the leading wicket takers of the last 20 years. Not quite the top 20, as I tried not to have too many from one country (Australia).

****Based on cricinfo’s classification of bowlers (F, FM, MF). Only includes balls when bowling to batsmen who average over 30. The population in question are the 25 leading pace wicket takers from March 2019 – March 2021.

Adjusting averages (Lyon vs Ashwin)

Pick a big enough sample size and conditions should average out… At least that’s what I’d always assumed.

Let’s disprove that conceptually and then with numbers.

Is it fair to directly compare the averages of these two players? Bowler A plays his home Tests in Sri Lanka, ending his career with a bounty of Bangladesh wickets. Player B is part of a four man attack. Bowls a lot against the top order, and his home games are in Australia. He never gets a sniff of the tail – less than 10% of his wickets are batsmen averaging 10 or less.

No matter how big the sample size, A and B aren’t on a level playing field; there is a bias in favour of A.

How would we expect A or B to perform against a batsman that averaged 30, on an average pitch? Our best bet is to adjust their stats for:

  1. Ground
  2. Batsmen bowled to
  3. Innings number
  4. Specific pitch condition
  5. Ball age (maybe)
  6. Match situation (eg. Team playing for draw / declaration)

Here, I’ll do the first two, looking at the players with 50 wickets over the last four years. Will assume that factors 3-6 average out over a career.

For “Ground” take a weighted average of spinners’ averages at the stadia where each bowler has taken wickets (ie. Mehidy Hasan Miraz’s 29 wickets at 19 at Dhaka are still valuable, but worth more like 29 wickets at 21).

For “Batsman bowled to” each run conceded is worth one run – but the wickets are awarded a value based on who was dismissed – so getting Virat Kohli gives you more credit than Ishant Sharma.

Data is the four years to 17th Feb 2021. Note positive adjustments to averages are bad; negative is good.

The mean adjustment is really interesting: increasing spinners’ averages by 4%. This indicates that just looking at raw averages flatters spinners. Why is this? I think it’s a function of when spinners bowl. If they don’t get much action in the first 30 overs, three wickets will already be down. Thus they’ll disproportionately dismiss the (weaker) lower middle order.

Lyon vs Ashwin

The similarity of their adjusted records looks striking when compared to raw averages. Let’s take a closer look and see if it stacks up.

Firstly, who they dismiss:

It’s not like Ashwin is getting an easy ride, but 30% of Lyon’s wickets come against batsmen who’ve averaged over 40, while for Ashwin that figure is 20%.

Again, Ashwin plays on a mix of pitches, while Lyon has taken over half his wickets at grounds where spinners traditionally struggle.

Overall, Lyon has done amazingly well to average under 30 over the last four years given where he has bowled and to whom.

Other observations

While Ravi Jadeja’s raw average of 24.6 is flattering, he’s still right up there.

Moeen Ali can feel aggrieved not to be ahead of Dom Bess as England’s second spinner.

Roston Chase is better than his average would say – but with relatively little data the error bars get large (60 wickets means his rating is 37 +/- 5).

Nathan Lyon is the best current spinner – we adjust his average down by 11%, of which 8% comes from where he plays. He also gets a boost from who he bowls to: as part of a four man attack, Lyon does feature more against the top order.

Where do we go with this? Extending this to pace bowlers is harder, as strictly one should adjust for when in the innings they bowl (the new ball is helpful). This would need a model of wicket and run probability by ball bowled, and then to compare each player’s actual results to what the average player would achieve.

PS. This would be easy to check… if you had CricViz data. Expected averages would tell the story. Especially comparing head-to-head for the games in which both Lyon and Ashwin played. And splitting LHB and RHB so there was no bias driven by matchups.

India vs England Preview February 2021

Test probability: India 62%, England 24% Draw 12%.

Series probability: India 72%, England 11%, Draw 17%. India are more likely to win 4-0 than England are to win the series.

Or at least that’s what my model thinks. Betting markets have England as low as 18% for the first Test. That’s reflecting low expectations of England’s batsmen against spin, and higher home advantage that my reckoning.

Hereafter are some notes that inform my thinking:

Country – Spin takes 60% of wickets in India. For England, that means Root will probably bowl a bit to support Leach and Bess. However, since 2011 overseas spinners average 43 in India, for the hosts that figure is 25. India seems a tough place to crack.

Grounds – Chennai has only had two Tests since 2011, Ahmedabad has been rebuilt since it last hosted a Test. So not as much to go on as usual. What I can tell you is that in the last seven FC games at Chennai only twice has a team gone past 350. If I’m awake, it’ll be interesting to see how the wicket plays (and how CricViz rate the batting conditions).

Batting Talent – India are 10% stronger than England. Add 15% home advantage that becomes 25%.

Bowling Talent – adds a further 9% advantage to India. There’s no area of the game where England are stronger than India in India. That doesn’t mean they can’t win, it would just be an upset.

Matchups

  1. India’s current lineup are really good against spin (Pujara averages 76, Kohli 71). England’s batsmen mostly have better stats against pace. Ashwin averages more batting against the twirlymen than Ben Stokes. Bairstow’s skills in this area will be missed (1,685 at 46). England may need someone to Make Things Happen with the old ball.
  2. Ravi Jadeja is injured. Thus England’s right handers benefit from facing two off spinners (Ashwin and Sundar). Ashwin averages 31 against RHB (SR 60), 20 against LHB. So while England’s right handers might have a good series, expect to see Ashwin into the attack early when Stokes comes to the crease, and if Burns starts well.

Format – back-to-back Tests at Chennai, and back-to-back Tests at Ahmedabad. One silver lining for England is that Anderson and Stone can rotate in for Broad and Archer. Bumrah is harder to replace. England may benefit from the 7% increase in a bowler’s average playing back-to-back Tests.

Home advantage – 15% (lower than the usual 21%, might flatter England as they won in 2012 with peak Swann and Panesar, which distorts the stats). Maybe I’m being generous to England putting 15% into the model.

Sri Lanka vs England “Preview” January 2021

Here’s some brief notes written ahead of the first Test. I really should have put this up before the Test started. Anyway:

I give England only a 31% chance in the first Test. The betting markets say 39%. Why the difference? The toss is vital and England’s batting isn’t at full strength.

  • Batting first is key. SL are W7 L1 D1 batting first, W3 L4 D0 batting second recently. Batting first is worth 148 runs (runs per wicket by innings over the last 10 years: 40, 28, 29, 26). A 400 pitch becomes a 280 one after the successful tossers have had their fun with it.
    • Note spin is no good in first innings (average 42, SR 77). If you field first and get nowhere in 20 overs, you are in very deep trouble.

  • England have a lot of right handers. A tasty matchup for a leg spinner or SLA bowler. There are two in the Sri Lanka squad: Lasith Embuldeniya averages five wickets per FC game, PWH de Silva is more an all rounder who averages two per game. Embuldeniya averages 40 after seven Tests, but with a FC average of 25 in Sri Lankan conditions, he has a great opportunity. Surprised to see Embuldeniya’s odds 25-1 for Man of the Match. Oh, and he’s Sri Lanka’s leading wicket taker over the last two years.

  • On the topic of Sri Lankan FC averages, there’s a gulf between Test Cricket and the Sri Lanka Premier League Tier A. It’s hard to estimate because there are few (if any) overseas players for calibration, but I make the increase in bowling average 70%: a 25 average in Tier A translates to a Test average of 43. Here’s the expected averages for Sri Lanka’s attack:
Expected averages for Sri Lanka’s attack. Lakmal will be missed in the first Test. Fernando looks useful.
  • Away teams pick too many spinners (over the last ten years away spinners average 35 at Galle) likely because teams pick more spinners than are Test standard. The relevant decision is “who will do better, our third spinner or our first change pace bowler”?
    • In England’s case they that’s not a question of spinning ability, more the balance of the side. With Ali unavailable, England don’t have the batting depth to pick a third specialist spinner. Expect Curran+Bess+Leach+Two Pacers+Root. Sri Lanka will know this, so have an incentive to prepare a spinning pitch and nullify England’s pace attack. Unclear what the pitch will be like as has to be good enough to take back-to-back Tests.

  • Curran and Bess may not offer enough in either batting or bowling to balance the team. Maybe in a couple of years, but today England look beatable.
  • Put all that together, England have the better bowlers, but the toss is so important that it’s a great leveller. Win the toss, bat, win the game.

Changing conditions

I hadn’t noticed this change – it used to be that the 2nd innings was the time to bat in Sri Lanka. Now it’s the 1st innings. See below the difference in runs per wicket from batting first/third versus second/fourth. A big advantage to winning the toss and batting.

Why should that change happen? Different groundsmen? Different grass? Playing at a different time of year? Either way it shows the importance of “live” queries feeding models rather than fixed assumptions.

  • PS. Reflecting after the first day’s play I need to think about specific matchups. Bairstow and Root are good against spin, even when it turns away from them.
  • PPS. There’s a lot of Test series happening right now – will December/January become the annual window of international red ball cricket?
  • PPPS. The comments about the importance of the toss look silly when spin took 6-85 in the first innings. Was I wrong or were Sri Lanka’s batsmen wrong? Hard to gauge without xW data.

Country vs Country matchups in Test Cricket

It’s naive to assume that England will play as well in Mohali as they do in Melbourne.

But how to measure this? Results are misleading: a 20 run win is not as dominant as a 220 run one. Hence runs per wicket (RPW) is the best approach.

We should adjust for the relative strength of teams: Bangladesh have lost all four games in England this century – but is that purely because of the gulf in talent? If Bangladesh were as good as England, how much would they lose expect to lose by because they were playing in English conditions?

Here’s my approach: use all data since 2000 to calculate the number of runs per wicket scored by each team, and the equivalent conceded when fielding. Comparing runs per wicket when fielding to the average team gives a measure of each team’s bowling strength (eg. India’s 32 makes them 3% better than the average fielding team). New Zealand average 31 runs per wicket batting, so we would expect New Zealand to score 31 * 0.97 = 30.1 runs per wicket when playing India.

Repeating that for every pair of teams gives a set of ratios of relative strength:

Relative team strengths 2000-2020 based on runs per wicket batting and bowling. eg. Australia would expect to outscore England by 26% in a Test on neutral territory.

See where we’re going with this? Now all we need to do is compare actual relative runs per wicket when the countries play each other to get specific country vs country matchups.

Here are the actual RPW ratios when the home team (first column) plays a specific away team (first row):

Actual RPW ratios 2000-2020. For example, Australia outscored England by 45% when at home, while England were outscored by 13% when hosting Australia. Minimum 50 wickets – blanks reflect a lack of data.

Let’s take stock. When Australia host West Indies they’ve dominated them – scoring 2.09 runs for every run scored by West Indies. Most of this can be explained by Australia being 73% better than West Indies. The remainder is from conditions and player-on-player matchups. Even if West Indies were able to field a team as strong as Australia, they would still be outscored by 2.09 / 1.73 = 1.21 times (or 21%) playing in Australia.

That 21% happens to be the average Home Advantage over the last 20 years. For the penultimate table, I’ll take the ratio of the first two tables, and adjust for the “normal” 21% home advantage to be left with specific additional adjustment factors for when two teams play each other.

This is noisy- reds and greens everywhere. Time for some judgement: I don’t think one can rely on the data for pairs of countries, because for some pairs of teams there just aren’t enough games. Instead I’ve grouped teams to pick up bigger trends.

Findings

  1. India & Australia get an average 27% home advantage (for most teams it’s 21%)
  2. Asian teams in SENA (South Africa, England, New Zealand, Australia) countries do on average 10% worse than expected.
  3. Sri Lanka don’t travel well

Based on that, here’s an adjusted version:

Home advantage (%) for specific pairs of teams – after adjustments made by me. For example, Australia get a 25% boost hosting Bangladesh

This analysis is crude. I’m not totally persuaded by it (yet). Such as why are New Zealand terrible in South Africa, when crudely similar teams like Australia and England do well there? Would we expect that trend to continue? Is it too reductive to assign characteristics to nations rather than specific players?* Perhaps, but if it helps understand why teams are winning then I’ll use it.

For instance, South Africa have a habit of beating England in England. This could be because conditions are similar in the two countries, so England lack their usual home advantage.

I’ll keep an eye on this in 2021. The four remaining series in 2020/21 are all fairly normal for home advantage. Relevant to the World Test Championship final, it’s worth noting the raw data for India and Pakistan in England hints that the location of the final suits Pakistan more than India.

Another good test for this approach will be India touring England next summer. Is this Indian team (armed with Bumrah), sufficiently talented in the pace department to avenge the 4-1 defeat from 2018? If so, that will hint that Team A being forever doomed touring Team B is twaddle.

*There’s a part of me that finds this analysis distasteful too – assigning characteristics to a whole nation.

DRS: The story so far

As the internet matures, the amount of freely available data has reduced. So I was excited when this popped up on twitter:

A chance to examine some of the received wisdom on the review system. I’ve got five myths and three trends to share with you.

Before we get into that, a summary. Over the decade of Decision Reviews, most reviews have been by the fielding team (57%). However, batsmen have had greater success overturning dismissals (35%, compared to 21% for the fielding team). The 907 overturned decisions are 6% of the wickets over the last decade, so while umpires are getting the overwhelming majority of decisions right, DRS is making a noticable difference to the accuracy of umpiring.

On with the show. Firstly, five myths:

I’m going to have to ask you to reverse your opinions

Myth 1 – Umpires favour the home team

Crunching the numbers, the hosts and visitors have uncovered almost exactly the same number of incorrect and borderline decisions. In terms of overturned decisions it’s 416-413 in favour of the home team, while the marginal decisions that haven’t been overturned (“Umpire’s Call”) have benefitted the home team slightly, with 109 reviews by the visitors being adjusted Umpire’s Call, against 100 for the home team.

If umpires were being influenced by the crowd, there would be more decisions against the away team then being overturned – this isn’t happening, so whatever home advantage is in Cricket, it’s not from umpires.

Myth 2 – Having a decision overturned gets into an umpire’s head

I took each example of an umpire who had a decision overturned, and looked at the next DRS review for that umpire on the same day in the same innings. If umpires were trying (even subconsciously) to even things up, you’d expect the umpire to give the next close one out, which the batsman would review. Putting this in terms of data, we’d look for a decision overturned against team A to be followed by a review by team B.

No evidence for this exists – of the 449 times when a decision was overturned and another review occurred on the same day, same innings, same umpire, 235 were the other side reviewing, 214 the same side. Umpires are considering each ball on its merits.

Myth 3 – Teams use reviews “just for the sake of it”

This one really surprised me. I’d expected to need to cleanse the data of the pointless reviews at the end of an innings when there’s no harm in reviewing. So I looked for those pointless reviews, but they don’t exist.

Opportunistic reviews should be visible by a dire success rate. Here’s the split of success rate by the batsman’s average:

Maybe a handful of spurious reviews from the worst batsmen, but they aren’t taking the mickey.

Myth 4 – Some teams are better at DRS than others

Not true – all the teams are very tightly bunched. I’ve excluded Afghanistan (30%), Ireland (50%) and Zimbabwe (30%) as they just haven’t played enough.

Myth 5 – Some umpires like to give things out and some like to say “not out”

There are two ways this would manifest itself for Outers: “Umpire’s Call” would tend to be batsmen reviewing balls, clipping the stumps, that were given out; and the proportion of successful reviews would be higher for batsmen.

Because of the small sample sizes, it looks like there are trends, but when you put the two methodologies side by side, the pattern disappears. Which is a shame, because I’d hoped that the umpires who were bowlers would be Outers and those that were batsmen would be Not Outers. Turns out Elite Umpires are just professionals. Here’s the chart for good measure.

Now for the true trends

Stay with your original opinions; you’re on screen now.

Trend 1 – Quality of reviews drops by day

Tony Corke (@matterofstats) got in before me with this trend – here’s the chart he produced

Trend 2 – Resetting reviews after 80 overs (2013-17 rules) reduced review effectiveness

The long term trend is fairly consistent – flitting around the 27% mark. Except for 2014 and 2015. I think I can explain that dip.

In 2013 a rule was brought in whereby reviews reset after 80 overs. This was to avoid punishing a team who lost reviews to marginal decisions. A better rule took over from autumn 2017 – “Umpire’s Call” decisions would not cost a review.

The impact of the resetting reviews was felt in overs 60-80: teams were in a position of “use it or lose it”, so did the logical thing and reviewed liberally. Thus, from 2013-16 the success rate in overs 60-80 was only 20%, having been 28% for those overs before 2013. Naturally, once the new rules took over from 2017, the success rate for overs 60-80 returned to 28%.

Trend 3 – DRS success rates differ by ground

The harder batting conditions are, the better the relative performance of fielding reviews versus batting reviews. Any scatter plot for this looks ugly, so you’ll just have to take my word for it that this is a statistically significant correlation. In lieu of that, here’s a chart of batting and bowling DRS success rates by ground.

Now, I’m not sure which way the causation runs. One possibility is that at high scoring grounds the umpires get lulled into thinking batsmen aren’t going to get out, so they don’t believe their eyes when a batsman is out.

The key point I’d like you to take from this is just how consistent umpires are.

T20 batting: running out of steam

When should you consolidate? I’ve devised a general rule to calculate when the batting team should slow down and conserve wickets.

Let’s recap the current state of T20 International batting. Teams usually end their innings with wickets in hand. Since 2016 the average first innings score is 166-6*. Teams rarely get bowled out, having enough batting depth to attack throughout, even if a wicket falls early on.

Looking at this another way, on average it takes more than 120 balls to bowl a team out. The openers can go out and play naturally, expecting the top seven to do the business. The number eight batsman averages only three balls per game. I think of limited overs batting in terms of “Expected Balls”: how long would you expect it to take to bowl this team out if they were batting normally? For example, England bat deep, expecting to last 172 balls before being bowled out – this gives them licence to attack in a game that’s only 120 balls long.

~~~

The more you get in the first innings, the higher your chance of victory. But get greedy, take too many risks, and you may fall short of a middling score that might have been enough. Any approach to batting has a range of possible outcomes. The goal is to pick the approach that maximises expected win %. How do you do that?

Here’s one example – consider a binary choice where number four in the first innings can either bat normally or anchor (Strike Rate down 10%, Average 20% higher). According to my model, for this current England T20 team, pre innings or at 0-1 anchoring is not optimal. 0-2 it’s marginal. It’s only worthwhile if you’ve slumped to 0-3. Which, coincidentally, is the point at which England’s expected balls drops below 120 (ie. they run the risk of the median innings not lasting 20 overs). This makes intuitive sense: tailor your batting aggression so you almost (but not quite) get bowled out.

Note this assumes England are playing against an equally talented team – hence win % pre game is 50%

The general rule: bat normally unless Expected Balls < Balls Remaining.

A recent example – England were 34-3 (5.3) – which looks precarious, but the Bairstow-Stokes-Morgan middle order meant it was more-likely-than-not that England would bat all 20 overs, and have a reasonable chance of chasing their 180 target. England won the game in the 20th over. Maybe that “lose three wickets in the powerplay, lose the game” maxim is outdated as T20 averages improve. For England, Expected Balls exceeded Balls Remaining, even having lost three wickets in the powerplay.

But this is too simplistic. Not everyone can strike at 150: you can’t expect fireworks from every tail. Here’s the strike rates of the top 10 T20I teams over the last five years. Numbers 9-11 just aren’t as good. I think of teams “Running out of steam” when all the quick scorers are out.**

“Running Out Of Steam” depends of the composition of one’s batting order. England currently have Jofra Archer at number nine. Deep. West Indies aren’t so lucky – Keemo Paul bats at eight with a domestic career SR of 107 – so they Run Out Of Steam at six down.

My hunch is that cricketers know what their tail is like, and how likely it is that tail will be exposed, and bat accordingly. Take another recent example – WI T20 #1 – at 59-5 (5.1) West Indies were vulnerable. One more wicket and they were done for. So Pollard and Allen consolidated, taking 37 from the next five overs. A rain interruption meant the innings was reduced to 16 overs. With just six overs left – it was time to attack, lifting the score to 180 by the end of the innings. Subsequent discussion focussed on the impressive assault, missing the responsible consolidation period that made it possible.

~~~

Here’s the “Balls to Run Out Of Steam”*** for the Top 8 T20I sides, based on their most recent XI

As at December 2020

This tells us that England, Australia and Pakistan have the capacity to score more quickly than each player’s career record (ie. if they bat naturally, they are wasting resources by being too conservative). If wickets fall, that should be reassessed****.

Note Sri Lanka put out a particularly weak XI in their last game. Numbers four and below would expect to strike at below 130 – way off the pace. Hence they run out of hitters unusually quickly.

Teams should tailor their aggression, aiming to not quite run out of steam. To do this, throughout the innings the batting team should compare EBTROOS to balls remaining, adjusting the EBTROOS as wickets fall.

Just imagine those clipboards showing live updates of Expected Balls To Run Out Of Steam, and Optimum Strike Rate. (Screenshot from Sky Sports)

Footnotes

*Top 10 teams against each other. Sorry Luxembourg.

**I wish I was good at writing. Spent ages trying to come up with a better name for it than “running out of steam”. Ideas welcome.

***BTROOS = Balls To Run Out Of Steam. This is clunky stuff.

**** There’s an added complexity which I’ll keep for the footnotes: median innings length is not the same as balls per wicket. The difference is only 4% at the start of the innings, but gets bigger as fewer wickets are left. Here’s the same table, but with Median Balls To Run Out Of Steam*****. MBTROOS = Median Balls To Run Out Of Steam. Perhaps MBTROOS could rhyme with albatross. Anyway, here’s the MBTROOS for the latest England T20 lineup:

MBTROOS for England. Note 3 down after the powerplay would mean MBTROOS 86, which are two balls more than the 84 remaining. So keep attacking!

The perils of batsmen switching counties

In a recent article for “County Cricket Matters” magazine, I looked at the impact of changing counties on a batsman’s average. You can buy it here.

The main conclusion was that a transfer tends to negatively impacts batting.

A surprising observation was that younger players are more adaptable and may improve, while the over thirties rarely benefit from a move. Small sample sizes, so just a tantalising hypothesis for now.

This data covered 2016-2019. The curtailed 2020 season was too short to extend the analysis, so we’ll have to wait for 2021 to see if I’m onto something.

P.S. Much of my analysis considers trends such as these. Since we’ve had two years’ since this blog began, at some point I’ll check if those trends continued. Trends that continue after you’ve noticed them are much more valuable than mere things-that-have-been-true-lately.

The ODI “who is winning?” formula

Modelling a chase is hard. I was looking for a rule of thumb: a quick calculation that could support the monte-carlo simulation I run. And here it is:

Decimal odds of chasing team winning = 1 + (Required Runs/Expected Runs)^8

Jonas (@cric_analytics)

Jonas gave the example of Australia needing 145 more to win an ODI against England. He thought Australia could on average expect to score 110 from their last 20 overs. Australia’s decimal odds were thus 1+(145/110)^8 = 10.1 (or roughly a 10% chance of winning).

To successfully unpack (or steal!) the formula, the element that needs a bit of thought is “Expected Runs”. We can use Duckworth-Lewis, combined with ground data to give an approximation. 20 overs & 5 wickets left meant 38.6% of resources remaining. On a 285 par pitch, that’s the 110 Expected Runs that Jonas calculated.

Taking the formula one step further, “Expected Runs” can be adjusted for the quality of the batting and bowling teams to give a more precise calculation for a specific run chase. I have added this expanded formula to my model to better understand who is winning and why.

Here’s an example of what this looked like when Australia were 222-5, needing another 81 from the last 10 overs (third ODI, 16th Sept 2020):

Aus 222-5 (40). Maxwell 74* Carey 77*. Target 303.

The raw formula gave Australia a 22% chance with 26.1% of resources remaining (Expected Runs = 69, on the basis that a normal par score is 264 – that may be an underestimate as scores keep rising). However Old Trafford slightly favours the batsmen, and England’s attack is sub par – lifting Australia to 32%.

My model had Australia at a 42% chance – the extra 10% coming from the strength of Australian batting, the two batsmen being set, and any other differences between my model’s Monte Carlo simulation and Jonas’ formula. The right hand column is the output of my model, and the penultimate column is the one that goes haywire if something is wrong: a useful check.

What’s the message? Firstly, if the model is working, I can see who is winning during a chase and why. Secondly, matchups and other complexity have made my model something of a “black box” – Jonas’ formula will be a useful check that my model isn’t off piste.

Rating the Blast teams … Lancashire’s batting is better than it looks

I rated the county 20-20 batting in one hour of analysis. Reckon I need to understand 20-20 eventually.

My starting position: scoring quickly is good but getting out is bad. Thus the best teams will score quickly, with high “balls per dismissal”.

Here’s how that looks for this year’s teams (rated using the most commonly used XI this year):

Quarter final qualifiers in green, eliminated teams in red. Only the top eight batsmen’s SR has been included as the tail rarely bats, but the whole team has been included in the BPD calculation.

Averages matter

Surrey and Gloucestershire score just as quickly as Yorkshire and Kent, but with higher balls per wicket they are less likely to fail – so are more consistent and better batting units.

I had heard that averages don’t matter in 20-20: I think they do.

A team has to be confident of lasting 120 balls. I don’t know how many balls you need to expect the unit to survive before getting bowled out in 120 becomes unlikely – maybe 180? Only three teams on the right of this chart are at that level. Once all teams are there then wickets cease to be a limiting factor, and it’s all about strike rate.

Lancashire – skewed by strong bowling

Bottom right should contain bad teams: trundling to 140 and losing.

Yet Lancashire won five games this summer with a team where no-one has a four year SR over 135. They even scored 190 (SR 158) against Durham. What’s going on?

The key is that they are a strong bowling team that often have easy chases. They thus play within themselves to secure the win. This makes their players look like plodders. Yet batting first they score 177 on average over the last two years. While chasing that drops to 129.*

Lancashire’s true position on the chart would be somewhere up and to the left. Repeating this chart with first innings data would help.

Here’s my attempt at a Boston Consulting Group** view of T20 team batting:

Stars: good teams. Dogs: bad teams. Question marks: Could be bad teams, could be players looking slow from chasing small totals. Roller-coasters: Might score 200 one game, 105 all out the next.

I’ve only looked at the batting, but I feel like this view might have some predictive power. 2/6 Dogs qualified, 1/5 Roller-coasters, 2/3 Question Marks, 3/4 Stars. The three Stars were the three group winners.

*Data up to 19th Sept 2020.

** Boston Consulting Group suggested in 1970 that companies could consider any product as being one of four types in a market (Star, Dog, Question Mark, Cash Cow). I’ve ripped off their idea to try to look like I know about business as well as cricket.