This is a major attempt to generate a set of measures for Test Cricket by period. The purpose is two-fold. The first is to look at the way the figures change over the years, letting us get a handle on the evolution of the game. The second is to establish a criteria for adjusting any analysis we do which spans across the years. Many a time have I found myself in a situation needing to adjust a particular period's figures and I have re-invented the wheel every time. Now I hope to have a set of figures which can be used as a ready reckoner for such adjustments. Readers who do similar analysis are welcome to use these figures.
Readers should also realize that after I thought of this complex topic, I have put in nearly a month's work, on and off. into preparing this complicated analysis. I would appreciate avoiding of a superficial read and flippant off-the-cuff comments.
The analysis covers various aspects of Test Cricket. Since the article has become too long, it has been split into two parts. The first part covers Matches, Innings, Results, Partnerships and Extras. The second part covers Batting, Bowling, Keeping and Dismissals.
To start with let me divide the 130 years into 8 periods, taking into account the evolution of the game, years and the number of Tests played. The following are the periods.
1. 1877 - 1914 (Pre World war 1)
2. 1920 - 1939 (In between the two World Wars)
3. 1946 - 1959 (1940s & 1950s)
4. 1960 - 1969 (1960s)
5. 1970 - 1979 (1970s)
6. 1980 - 1989 (1980s)
7. 1990 - 1999 (1990s)
8. 2000 - 2008 (2000s)
These are logical and reasonably evenly spaced periods. Anything more will result in too many periods with consequent difficulty in following the tables and anything less will telescope multiple differing periods into one and we will lose out in analysis.
Even the formatting of the article required a lot of thinking. I tried having the periods horizontally. It was difficult to read. There was also the need to present the core data such as runs, wickets, balls, wickets et al to the readers. So I adopted a dual presentation approach. In the main body of the article I show the calculated measures in a grouped form and the base core data in the supporting pages. That way all the information is shown and the main report is not cluttered. I have also avoided showing the variance of each period figure to the all-Test averages to avoid showing too many numbers. That will indeed be the key figure to make adjustments.
During the first period, timeless Tests and 3-day Tests alternated. Later 3-day, 4-day, 5-day, 6-day and timeless Tests were played through the years until 1979, from which year almost all the 1000+ tests have been played over 5 days. As recently as 1973, 4-days tests were played between New Zealand and Pakistan. Please remember these pertinent facts while perusing this table.
Surprisingly the Balls per match figure during the first period has been quite high despite the number of 3-day tests. This, despite 4-ball overs during most of these years requiring more change over time. During 1960s the balls per match is the highest. More than the match days, I feel this is certainly a result of lot more drawn matches during this period and to a lesser extent the 1970s.
The runs per match is the highest during the current decade and the lowest during the first period when batting was indeed difficult. The relatively high 1960s and 1970s figure must no doubt be due to the number of drawn matches.
More Wickets per match fell during the first period. Barring this period the figure has remained fairly static.
The RpO figure is the most important measure we have seen until now. It has varied quite significantly over the years. Surprisingly the Rpo figure was quite high during the first two periods despite the pitches. It fell drastically during the post-112 period, certainly due to a combination of accurate bowling and defensive batting and attitudes. The figure picked up later and has crossed the key value of 3.0 for the current decade, where it is 14% higher than the all-test average. This has been the result of most teams, led by Australia, scoring quickly in a bid to go for a result.
There have been 4/5/8 ball overs at different times in Test cricket, however all RpO figures have been standardized to 6 bpo for this table.
Barring the first period the Runs per wicket figure has remained fairly stable. The figure is highest during current decades. For most of the periods the RpW figure has exceeded 30.
The average completed innings size has followed the pattern. Quite low (15% below all-Test average) during the first period and then around the all-Test average mark subsequently, barring the low-scoring 40s-50s period..
During the first period, there was an extraordinarily high instances of sub-100 innings. Over 12.5% of the innings completed (53 out of 494) were below 100. The second period was a major drop in the sub-100 innings. However the figure almost doubled during the 40s-50s. Then it has settled down. The 1990s had the lowest figure. Surprisingly the current decade's is double that of the previous decade. There have been 36 such instances out of 1052 completed innings.
I was so intrigued by this sudden escalation that I decided to make a detailed study. As expected the culprits were Bangladesh with 8 sub-100 scores and Zimbabwe with 7. However the situation has been worsened by the West Indian decline. They have had 5 sub-100 scores. At the other end, Australia and South Africa have had one instance each.
Predictably there were very few 500+ innings during the first period. Then the % stabilized to the all-Test average during the next 6 periods. There has been a noticeable increase during the current decade with 147 of the 1489 innings crossing 500. Remember that these are not just completed innings but all innings.
Australia leads with 28 500+ scores while India is close with 24. At the other end Zimbabwe has only 2 scores in excess of 500 while Bangladesh has not crossed 500.
The paradoxical current decade situation of high number of 500+ scores and high number of sub-100 scores is a pointer to the wide gap between teams as well as the drive to achieve results.
This is an analysis of two types of partnerships. The first wicket partnership is the most important one since it lays the foundation for the innings. The average first-wicket score has been reasonably scattered around the all-Test average of 36.7 barring the first period when it fell below 30. In between the wars the partnership average went past 40, possibly owing to the strong opening partnerships of England and Australia.
Even though I am not a fan of measuring quality through individual 100s (I always treat the 100th run as nothing more than the run(s) scored around the 99 mark), a 100 partnership is more significant since it delivers a psychological blow for the team. A fairly low number of partnerships during the first period crossed 100. Surprisingly this was followed by a doubling during the next period with over 10% of the partnerships crossing 100 (56 out of 494). There has been a recent increase during the current decade, also at a good scoring rate.
The next is a measure of opening failures. These are the sub-10 (single digit) partnerships. This includes only instances where the first wicket has fallen. During the early days, especially during the first period, well over a third of the partnerships have been failures. This figure improved over the years but has picked up now and we are back to a fairly high (either side of) 30% figure. It may have to do with the attacking attitude of the opening batsmen nowadays. I could have done a "opener dismissed at 0" analysis. However I feel that a single digit partnership is a failure and a 0 is no worse than a 5 or 9.
The next measure is the number of runs added for the last 3 wickets. This has not varied much barring the 40s-50s when it fell to around 40. For the current decade the value is around 50, indicating a more committed late order batting set-up with better techniques and application.
All the extras calculations have been done per 1000 balls. This is just a convenient measure and is to be used only as a relative measure for comparison. All the extras components have been analyzed.
The number of Extras per Tb has increased over the years and the current decade figure is the highest, about 20% higher than the all-Tests average.
The Byes per Tb started at a high figure and now stands around the all-Test average. Have the keepers become that much better?
Leg Byes follows the reverse pattern. Starting at a low level it is now at a fairly high level.
No Balls per Tb have increased significantly. They were extremely low during the first 70 years and suddenly zoomed up during the 1970s and have remained there. Possibly the changing of the No ball rule during the 1960s must have contributed to this increase.
Wides per Tb have also increased during the current decade, almost double of the all-Test average and the previous period of value. Possibly the bowlers are striving for too much. May also be that the unmpires, no doubt influenced by the ODI experience, are calling wides more often now.
The increase in LB/Nb/Wides per Tb has more than odffset the drop in Byes per Tb and this has resulted in the overall increase in Extras per Tb.
This is a very interesting table. The overall Results % started at an incredible 82+ value during the first period, dropped to a low 50+% during the miserable 1960s and has risen again now to a near-80% value. Australia might be stuttering now. However they are the team which started the equivalent of "total cricket", hard, attacking and always striving for results. Due credit should be given to them for changing the face of Test cricket, especially after the miserable 1960s-1980s periods.
A similar pattern emerges in the Home wins measure. The first and last periods have high Home wins % values.
The best period for Away wins was the first one when the 3 month sea travel seemed to have done something good since 38% of the matches finished with Away wins. This value has since dropped and stood at its lowest during the 1960s when "Not to lose" was the motto. The value has picked up significantly during the current decade with over 30% Away wins.
The Draws % shows low values during the first and last periods. The most boring period in Test history was during 1960s when nearly half of the matches ended in draws, not all of them the exciting ones.
The second part of the article will follow in a week's time covering the Batting, Bowling, Keeping and Dismissals aspects. I will also try and do some changes based on any significant comments. I invite readers' comments, both on these areas and the ones being analyzed next week. At the end of the second part, if readers so desire, I will also make available an XL sheet containing all the measures analysed, including % variances to the all-Test averages.
Posted by: David Barry at November 1, 2008 10:19 AM
Run rates: the period from around 1895 (or 1890) to WWI is called the Golden Age, in large part because of the attacking batting of the time.
The front foot no-ball rule is indeed the cause of the rise in no-balls.
There was an artificially low percentage of draws before WWII because, for about 60 years, all Tests in Australia were timeless and played to completion.
The high proportion of century stands in the inter-War years is probably because of Hobbs and Sutcliffe, who had 15 of them. [[
Ananth:
David, thanks. All your points are relevant and throw light on the numbers.
]]
Posted by: Aaron at November 1, 2008 10:46 AM
An interesting analysis, and it could be used in lots of ways. Look forward to reading part 2.
Posted by: monkey fuel at November 1, 2008 11:09 AM
nice work!
if anything, this suggests to me that the game has changed less that we imagine it has [[
Ananth:
You have got in one.
My own reaction after I finished the work was "Hey! I expected lot more variations." Other than Extras, the other measures have not changed as much as one thinks.
]]
Posted by: Mish at November 1, 2008 1:28 PM
Do you know how much the high volume of positive results in the 2000s down to the performance of Bangladesh and Zimbabwe (and WI), in the same way that they have contributed to the sub-100 scores? [[
Ananth:
Hamish
Will do a calculation and post it as a comment. I will keep you informed. My gut feel is that Bangladesh, with their high-90% losses should certainly have contributed to the high % of results.
]]
Posted by: RM at November 1, 2008 3:06 PM
David Barry: "There was an artificially low percentage of draws before WWII because, for about 60 years, all Tests in Australia were timeless and played to completion."
I thought that only the last tests (of the series) were timeless!
Good effort, Ananth!
[[
Ananth:
Ravi
There were a total of 99 timeless tests, all played on or before 1939. Out of this 58 were played during the first period and 41 during the second period. Also not all the timeless tests were the last ones of series.
For instance 4 tests were played during 1982-83 season. All were timeless.
]]
Posted by: Vaibhav at November 1, 2008 4:53 PM
Extremely interesting and thought-provoking tables.
So the number of Test matches per decade has increased almost linearly. The percentage of results is the highest since WWII. It's tempting
to say that test cricket has never been healthier.
For all the variables measured, it would be revealing to have a matrix of per-country measurements over time (Country on one axis, time interval on the other). That should answer many questions: is the increase in Test matches simply due to more countries playing test cricket ? Are some countries playing far more matches per year than others ? Have some countries disproportionately increased / decreased the number of tests played every year ?
What does your analysis indicate about the impact of ODIs (starting mid 70s), or lack thereof on Test cricket ?
Could you also share the standard deviations for these measures ? They are often as interesting as the averages.
Thank you,
Vaibhav. [[
Ananth:
Good idea to provide Std Deviation. My worry is that the number of "numbers" increases quite a bit. Maybe I will incorporate the same in the Xl sheet I have promised at the end of the second article.
]]
Posted by: SRK at November 1, 2008 5:01 PM
In home/away win analysis, home win for someone is an away loss for the other team and vice versa. I do not understand how have you calculated the percentage of away wins and away losses? [[
Ananth:
Pl note that only the following have been calculated.
Home wins
Away wins
Draws
The total of these three is 100%.
Where do "Home loss" and "Away loss" come in.
]]
Posted by: D.V.C. at November 1, 2008 10:41 PM
An interesting article, I look forward to the second half.
If I may I'd like to suggest a couple of additions: (1) In the extras section could was also have Penalty runs per 1000 balls? (2) In the results section could we also have a Tie percentage (yes I know there haven't been many). I assume you have counted Ties as a result?
I would also be interested to see how the results percentage changes as a function of the intended duration of the game and also the actual duration of the game. This could be done as days, or as number of balls bowled (or minimum required). In this way we might be able to glean what is the "ideal" length of a Test, in terms of the percentage of results we would like to see.
Posted by: R Sivasubramaniam at November 1, 2008 11:42 PM
Fantastic work Ananth!
It's great that you have the time and the ability to link cricket with IT and produce a masterpiece!
Looking forward to your future instalments
Posted by: David Barry at November 2, 2008 12:20 AM
RM, from 1882/3 and WWII, there were no drawn Tests in Australia.
Posted by: Ananth at November 2, 2008 2:29 AM
This is in response to Hamish's request to show the tables excluding Bangladesh and Zimbabwe. I am not a great fan of this. There have been weak teams at all times (India/Nzl in 50s-60s, England at various times, Sri Lanka during early 80s, even Australia during early 80s etc). One has to accept this fact just as one accepts Bolton and Newcastle in the Prenmier League. However I have done the work and shown the results below.
It is a big surprise that the Results % actually falls when we exclude Ban/Zim. Paradoxical but tells us something. Possibly Zim/Ban might play for a draw more often.
Posted by: Marcus at November 2, 2008 5:15 AM
With regard to measure #2 (Rpo and Rpw) I wonder what the figures would look like if you split it between pace and spin? This is probably impractical unless you were to do an analysis that only takes into account bowling, but my hunch is that the averages and ERs of particularly pace bowlers in 2000 would increase noticeably compared to the equivalent figures of 90s pace bowlers. But then, it's possible that the performances of Sami and Agarkar have influenced my perceptions of pace-bowling strength somewhat! [[
Ananth:
Marcus
Good point. I will try and include this in my Part 2 posting.
]]
Posted by: David Barry at November 2, 2008 5:47 AM
That's not paradoxical at all. Bangladesh hardly ever draw, so excluding them will make the result percentage go down.
For the 1990's the exclusion of Zim makes the result percentage rise, because Zim played for the draw a lot.
Anyway, there are still a lot of results these days. [[
Ananth:
David
Good point on the 1990s figures. It is true that Zimbabwe were overall the better test team.
]]
Posted by: shrikanthk at November 2, 2008 6:37 AM
Also, the high run-rates in the pre WWII period can be in part attributed to the widespread use of spin as the primary form of attack. A four pronged pace attack was unheard of in the first half of the century. South-africa for instance had an attack spearheaded by four googly bowlers in the first decade of the century. The Australian attack was led by two great leg-spinners in the 30s.
Since batsmen tend to score faster against slow bowlers than against the fast men, it is no wonder that run rates were a tad higher.
Posted by: D.V.C. at November 2, 2008 7:14 AM
@Marcus: What do we count Underarm lobs as? Pace or spin?
Posted by: Geoffe at November 2, 2008 7:26 AM
This is fascinating analysis Ananth. It's amazing that '30s flat pitches, 50-60's negative tactics and the game since 1980 produce such consistent results - there is hope for meaningful comparisons of players' stats.
I'd be interested in knowing average length of timeless tests - did they make it to 5 days very often?
Posted by: Chris at November 2, 2008 11:41 AM
Considering that your own stats show that the average number of runs scored by the last three batsmen in the 2000s are barely any different to the 60s and actually lower than the 80s (in which bowling standards were probably much higher), I find your comment suggesting a more committed late order batting set-up with better techniques and application in the current decade to be at odds with your own stats. [[
Ananth:
Chris/James, a common reply.
Both of you are correct. The current decade figures are only at par with the previous two decades. It is however true that the current decade and 1980s have seen a 5% increase over the all-time average. 5% is a significant variation.
I agree that the only DEFINITE statement I can make is on the 1940s-50s when the figure was nearly 15% below all-test average.
]]
Posted by: James at November 2, 2008 11:43 AM
A look at the variation in the numbers you give doesn' appear to really support the statement about a 'more committed' tail, e.g. 49.8 doesn't seem very different from the 48.2 and 50.0 of the previous two decades. Not being a statistician, would the use of measures of statistical significance (p values and all that) to objectively assess qualitative statements such as 'tail order batting has improved in recent years' be applicable? It would be useful to differentiate between likely random fluctuations and significant trends.
Many thanks for an interesting article [[
Ananth:
Chris/James, a common reply.
Both of you are correct. The current decade figures are only at par with the previous two decades. It is however true that the current decade and 1980s have seen a 5% increase over the all-time average. 5% is a significant variation.
I agree that the only DEFINITE statement I can make is on the 1940s-50s when the figure was nearly 15% below all-test average.
]]
Posted by: Chris at November 2, 2008 11:54 AM
Byes were more prevalent in the early days as most keepers stood up to nearly all bowling, irrespective of the pace. The marginal increase in byes over the last decade is probably a result of the drive to pick the best batsman among keepers rather than the best keeper that has become prevalent in many countries during recent years.
Posted by: Cas at November 2, 2008 5:13 PM
I'll probably get a lot of stick for this but I find it hard to asknowledge the first period as real TEST cricket. Just a couple of teams playing against each other? From what i've read so far, the wickets were very contrasting/ Lohman getting his wickets at that average. I mean was he really that good or was it the batsman who were so crap!
Posted by: Gareth at November 3, 2008 4:37 AM
There was a law change for wides (amongst other things) in 2000. Previously to be a wide, the ball had to be "out of reach of the striker"; now it is a wide unless the batsman can "hit it with his bat by means of a normal cricket stroke."
Posted by: Ananth at November 3, 2008 5:38 AM
This is in response to Daniel's comments.
I have treated a tied match as a draw. That is the correct interpretation. To treat a tie as a result means there has a winner, consequently a loser which is clearly not the case. Neither team deserves that tag.
I accept that a tie may be a positive result. No more than many a great drawn match. There are draws and draws and let us consider a tie as a wonderful draw.
I have not considered Penalty runs separately since I felt that it would have very little impact on the overall conclusions. Only purpose would be to be 100% accurate.
Posted by: Mish at November 4, 2008 10:47 AM
In my defence, I wasn't suggesting that you removed Zimbabwe/ Bangladesh from the figures. I was more interested in how much the increased results of the 2000s were due to those weak sides or whether it was part of an overall trend due to increased run rates and generaly more agressive play.
Posted by: redneck at November 5, 2008 5:58 AM
very interesting read! with all this talk these days of batsman friendly wickets through out the test world, im suprised to see the current periods has a higher result percentage over every other period bar the golden age. excelent summary Ananth
Posted by: SRK at November 5, 2008 2:13 PM
I am sorry for troubling you again. However I am still confused. Can you explain how the entries in the table in the paragraph 6 are calculated? [[
Ananth:
I think there is a small blind spot which is preventing you from understanding. Happens to all of us. Also please read my earlier reply.
Let me explain.
If India and Australia play two series of 5 tests each, one at India and the other at Australia. The first series in India ends in a 2-1 win for India and the one at Australia ends in a 3-1 win for Australia.
There have been 10 matches, 7 results and 3 draws. That is simple.
Out of the results, there have been 5 home wins (2 for India and 3 for Australia) and 2 away wins (1 each for India and Australia). So this analysis will yield 5 home wins, 2 away wins and 3 draws.
As I have explained already, do not think about losses, only wins.
]]
Posted by: CM at November 7, 2008 6:45 AM
The increase in Leg Byes seems to correspond to the period when wearing helmets became common. Perhaps this figure indicates that now they are better protected batsmen are increasingly prepared to get behind the ball and stay there, resulting in more hits on the body and hence more LB's.
Posted by: Martin at November 7, 2008 12:42 PM
I like to do a bit of statistical analysis myself but with individual players performances, and find many of today's players more often do better than most of even the best of the 70s to 90s players. Trying to find some reason I have left statistics aside and studied financial motivation. In all sports especially football, but also very much in cricket the money incentive has changed the reason for playing well or achieving results. Your fascinating analysis seems to show that the last 20 years' cricket has been energised and players have become more professional and motivated. How about comparing players test match incomes and performances over the years?
Y Anantha Narayanan has over 35 years of IT background. Over the past 15 years, he has been concentrating on Cricket analysis and software development. He has been involved with StumpVision, Wisden, Hallmark Software and his own site www.thirdslip.com during this period.
David Barry was cricket-starved when teaching English in France, and
study of cricket stats was his only way to stay sane. He is now back
in Brisbane, Australia, and working towards a PhD in Physics. He once
played for the worst team in the G-division of Muscat's cricket
league.
After doing an MBA in marketing and working in an advertising agency, S Rajesh decided that his skills might be put to better use by number-crunching on cricket. He hasn’t regretted that decision in the last six years, and edits the Numbers Game column on cricinfo.com every Friday.
Andrew Samson had his moments with bat and ball, once scoring 43 and taking 3 for 14 with his legbreaks, but he was much better at arithmetic, which explains why he is where he is today. Andrew has been keeping cricket stats since the days when it used to be done with pen and paper, and has been involved in scoring/stats for Radio and TV since 1987. He has been Cricket South Africa's official statistician since1994.
A former scientist and occasional TV quiz champion, Charles Davis now works full time at sports statistics in Melbourne.
His only real contribution to the Test record books came at age 4, when he formed part of the record 90,800 crowd
who saw West Indies at the MCG in 1961. He has two books to his credit, and claims to be the only cricket statistician
ever who has been quoted in the New York Times and in Australian Federal Parliament on the same day. Not to be
confused with the West Indian batsman Charlie Davis, especially in terms of ability.
Having just taken early retirement as a Mathematics teacher in Hobart, Ric
Finlay now fully devotes his time to recording cricket, both past and
present, for the popular CSW cricket database, along with his colleague
David Fitzgerald (www.tastats.com.au). His interest in the game is
inversely proportional to his ability as a player, but he did once score a
century after being dropped at 3 and running out three of his team-mates.
His first memory of international cricket is the 1962-63 MCC tour of
Australia, described as one of the most boring ever. Totally fascinated, he
was instantly hooked, and has never looked back. Author of three books on
cricket of a historical nature, he has provided statistics and scored for
radio and television cricket coverage since 1983.