Featured

My Inability to Specialize Makes Me Special

A long time ago …

A long time ago near the start of my career as an economist, my mentor at that time sat me down to tell me one word. “Specialize” he advised. I outwardly acknowledged his wisdom while secretly knowing that if specialization was necessary for success in our profession, then I was doomed.

Since that conversation I’ve published peer-reviewed articles on a bizarrely diverse set of subjects. I started with an article on how to accurately measure consumer welfare in the face of non-linear prices. I then segued to the effects that unexpected changes in wheat harvests have on net exports in Australia. Next I published evidence that professional baseball players with long-term contracts don’t work as hard as other players. I followed that up with an oft-cited article that proved gamblers do not randomly select lottery numbers even though they would be better off if they did. My seemingly random list of articles also includes subjects such as the effect of physician fee capitation on consumer satisfaction, a test of the arbitrator exchangeability hypothesis using final-offer arbitration data from Major League Baseball, and how to estimate personal consumption using a statistical modelling technique called “instrumental variables”.

The closest I have come to a “specialty” is a series of articles dealing with cancer care, including cervical cancer in the Vietnamese-American population, the incidence of colorectal cancer by bowel section and the effect of distance to provider on the incidence of breast cancer. I’ve also published articles about the effectiveness of an employer-sponsored weight-management program and the rate of moral hazard caused by health insurance.

I should mention that I published these articles and much more while working at four different universities, a state agency, a county government and four different consulting companies. I’ve testified as an expert witness in both state and federal courts of law on things as unrelated as the government’s efforts to eradicate an infectious plant disease, anti-trust behavior in the stretch limousine market and hospital payment rates by Workers Compensation. Twenty-five years after earning my PhD in economics, I went back to school to earn a Masters’ degree in health services and completed my post-doctoral training at a well-known cancer research center.

Someone unfamiliar with the economics profession might be impressed with these accomplishments, but I can assure you that other economists are not. When they see my long array of seemingly random academic pursuits, they invariably wrinkle their noses in bewilderment. That mentor who advised me in my youth to specialize stopped talking to me nearly twenty years ago when he realized I was a lost cause.

I would like to claim that all this was part of some well-reasoned plan, but the truth is that I struggle with a restlessness that I am incapable of controlling. I have learned to embrace my weirdness. Ironically, my inability to specialize makes me special.

Hence the name of my blog: The Lone Economist. I plan to pursue a data-driven analysis of topics about which I am well-versed, such as Medicare-For-All vs. the Public Option, and a new type of baseball statistic.

Of course, given my history, who knows what I might talk about.

Who is the Greatest Pitcher of All Time?

In my quest to create a predictive set of baseball statistics, I’ve stumbled upon a statistic by which we can assess who is the greatest batter over the last 100 years. So far, by adjusting only for bases, outs and batter age, I have Babe Ruth in front with Ted Williams hot on his heels. But Mike Trout, the only current player in the running, remains within striking distance.

The next step is to bring pitchers into the equation. To do this we need to make some modifications to our new statistic, average individual run production (IRP). The IRP is basically the number of runs the team scores plus the improvement or deterioration of the game “location” during the batter’s plate appearances. This number is divided by the beginning locations (i.e. the number of runs expected to be scored by the end of the half-inning) in order to even out the advantage batters who come to the plate with lots of runners on base have.

For example, during Babe Ruth’s 8,954 plate appearances, his team scored 1,995 runs. The typical batter decreases his team’s location (i.e. increases the number of outs and decreases the number of runners on base) during a plate appearance. Babe Ruth did this also, but he only decreased his team’s locations by 597 runs over his career. The net difference is 1,398 runs. So, this was Ruth’s individual contribution to his teams’ runs. Under the same circumstances, his teams would have scored only 1,094 runs if an average batter had been at the plate.

The formula is (R + E)/B, where R is runs scored, E is the sum of the ending locations, and B is the sum of the beginning locations. For Ruth this results in an average IRP of 1.277 or 27.7% above average.

Except for a home run with no one on base, scoring in baseball is a team effort.  A batter reaches base during his plate appearance and is then batted in by another batter. So, crediting all the runs scored during a batter’s plate appearance to only that batter overstates his individual contribution to his team’s score.

Traditional Pitching Statistics

Early in my series of baseball statistics postings I stated that traditional batting statistics, like batting average, on-base percentage and slugging average, were based on statistical contrivances in order to produce individual performance statistics from what is fundamentally a group effort. If anything, traditional pitching statistics are even more contrived.

Starting pitchers are credited with a “win” if they pitch at least 5 innings and leave the game ahead in the score and if the rest of the team can maintain the lead to the end of the game. Relievers are credited with a “save” if they maintain the lead and do not allow more than three runs. Both of these statistics depend severely on the performance of other players.

Earned run average is the number of earned runs allowed per 27 outs. Even if we ignore the problem of inconsistent error scoring, this statistic suffers from the same inaccuracy that the batting statistics suffer from. Preventing runs is as much a team effort as scoring them is. When a relief pitcher comes into a game, any runs scored by inherited base runners are credited to the starting pitcher. This tends to overstate the relief pitcher’s contribution to preventing runs. But relievers typically enter a game when the location, i.e. the number of outs and base runners, is very precarious. Under those conditions, it is harder to prevent subsequent batters from reaching base and scoring even more runs.

What is needed is a statistic that doesn’t rely on errors at all, gives proportional credit and blame to both starters and relievers and takes the location of the game into account.

The Pitcher Formula

Pitchers don’t typically face just one batter during a game. They face a sequence of batters. Consequently, to assess their relative performance we need to know the beginning and ending game locations for each sequence and all the runs scored in between. This allows us to calculate the number of runs the pitcher should be credited with allowing and the number of runs the average pitcher would have allowed under the same circumstances.

I’ll name the pitcher statistic the individual run allowance (IRA) and the formula is (R + E)/(B + nH) , where R, E and B are defined the same as in the IRP formula, n is the number of outs recorded by the pitcher after his beginning inning and H is the number of runs scored per out allowed by the average pitcher.

Pitching is more specialized than batting. Pitchers are normally known as being either a starter or a reliever. So, I’ve segregated pitchers into these two categories. Table 1 lists the top 25 starting pitchers by average IRA for those who have recorded at least 3,000 career outs. Table 2 lists the top 25 relief pitchers by average IRA for those who have recorded at least 1,000 career outs.

Only two active batters made the top 25 list (i.e. Mike Trout and Joey Votto). However, the top 25 list of starters includes seven active pitchers, including the top two, Clayton Kershaw and Jacob deGrom. Nine active relievers made their top 25 list, including the top one, Craig Kimbrel. None of these three pitchers is close to retirement, so their career rankings can still change. And I have yet to adjust for age and ballpark.

Twelve of the top 25 starters pitched during the 21st century, but nearly all the top 25 relievers (i.e. 23) pitched in the current century. This shows how much the role of pitcher has changed over time. The game is now dominated by closers.

Lastly, notice that 10 of the top 25 starters are left-handed, whereas only three of the top 25 relievers are southpaws. We’ll explore this factoid in later postings.

The next step is to adjust for pitcher age and tackle the lefty vs. righty dichotomy.

Greatest of All Time and Still in His Prime?

6-D baseball statistics rely on 288 discrete “locations” that result from the 6 dimensions of baseball: outs, first, second and third base, balls and strikes. This is kind of a “quantum” alternative to traditional baseball statistics. In fact, I might start calling it Quantum Baseball. I’ll explain this in greater detail in a later post.

Although the power of this new type of statistic is predictive – unlike traditional baseball statistics – an interesting side-benefit is that it helps answer the question: who is the greatest hitter in baseball history? Our initial results showed this to be a two-man race between Babe Ruth and Ted Williams with Lou Gehrig in third place.

There are several effect modifiers that need to be considered to provide a more refined answer to this question. I introduced age as a leading candidate in my last post. Age is particularly important because Babe Ruth and Ted Williams had very different career paths. Babe Ruth didn’t become a full-time batter until his mid-twenties and retired at age 40. Ted Williams started his career at age 21 and retired at 42, but missed several years in between serving in the U.S. Marine Corps. Ruth’s best year – the best year any batter ever had –  was his first as a New York Yankee when he was only 25 years old. His productivity declined steeply in his late 30’s, while Ted Williams’ best season in his remarkable career occurred when he was 39 years old, when most players are a decade past their prime.

To adjust for age, we need to select a common weighting scheme and apply it to every batter. Figure 1 is the distribution of batter age for full-time batters from 1918-2019. That will be used as our weighting scheme.

Applying a common weighting scheme to all batters would be easy if not for one significant problem. How do we give a 10% weight to a season in which a batter didn’t play? For example, Ted Williams missed the entire 1945 season. He turned 27 during that year. His IRP wasn’t zero that year. It was null. It didn’t exist.

Therefore, the next step is to, in effect, estimate what Ted Williams’ individual runs production (IRP) statistic would have been if he had played baseball in his mid-twenties and what Babe Ruth’s IRP would have been in his early twenties if he had not been a pitcher at that age.

From my previous post, I postulated that baseball batters share a common trajectory over their careers. They improve from their rookie season, reach their prime somewhere in the middle and slowly decline as they approach retirement. The result of this model specification is Figure 2.

What this arc says is that batters typically reach their prime during ages 27-29. Most retire by the time they reach their late 30’s. Only extraordinary batters are good enough to play into their 40’s.

The steps to calculating a batter’s age-adjusted average IRP are

  1. Estimate a batter’s yearly average IRP for his missing years
  2. Calculate a weighted average IRP using the common weights illustrated in Figure 1.

First, we identify every season of the batter’s career in which he had at least 300 plate appearances. This is to remove seasons in which the player didn’t play enough to establish his batting ability at that age. Taking the relative IRP-by-age percentages from Figure 2, we estimate the average IRP for each missing season to be the average of the observed yearly IRPs weighted by the relative percentage at that age.

For example, Ted Williams missed the entire 1945 season – a year in which he would have been in his prime. For the 17 seasons he did play, his weighted IRP (relative to his prime years) was 1.292 or 29.2% greater than the IRP for the average batter.  So, if Ted Williams had played that year, we estimate that his IRP would have been 1.292.

I’ll illustrate how this calculation works for Ruth and Williams. Figure 3 shows the estimated and actual average IRP values over Babe Ruth’s career. The orange dots represent the actual IRPs for seasons in which he had at least 300 plate appearances. [The season when he was age 24 is missing due to incomplete data.] The blue dots are the estimated values based on the age trajectory from other batters and Ruth’s actual IRPs.

Notice that the orange dot at age 25 corresponds to a 1.406 IRP. That’s a record. The highest observed for any batter since 1918. Also notice that at age 32, Ruth’s average IRP was 1.304. For almost any other batter that would have been a career best, but it was only an average year for Ruth in terms of IRP. It also happens to be the year in which he hit 60 homeruns – an awe-inspiring baseball record that stood for 35 years – further evidence that homeruns are overrated as a measure of hitting ability.

Figure 4 shows the actual vs. estimated average annual IRPs by age for Ted Williams. Williams’ actual IRPs are in green and his estimated IRPs are in red. Notice that the highest green dot appears at age 39. Williams’ IRP that year was 1.352. That is one of the highest annual IRPs ever recorded and doubly impressive since less than one percent of batters even play at that age.

Also notice that at age 23 Williams’s average IRP was 1.328 – excellent – but only the third best of his career. That was the year his batting average was .406 and is frequently cited as one of the greatest achievements in baseball history. This is further evidence that batting average is a woefully inadequate statistic for measuring batting ability.

If we put the estimated age-trajectories for both players in the same graph, we get Figure 5. The two trajectories are almost coincident, that is, they are virtually identical. Ruth just barely edges out Williams.

Figure 6 summarizes the age-adjusted average IRP’s for the top 25 batters plus Pete Rose. Ruth, Williams and Gehrig still top the list. However, at 0.4%, the difference between Ruth and Williams is barely visible. Ty Cobb jumps from 47th, according to on-base plus slugging percentage (OSP), to 4th, according to age-adjusted IRP. This is especially impressive since this average is based on only one fourth of his lifetime plate appearances that occurred after his physical prime. If I had more complete data on Cobb, I might conclude that he was the greatest batter ever, but we’ll probably never know.

As expected, two famous batters whose career averages suffered because they played well into their 40’s benefitted the most from age-adjustment. Carl Yastrzemski jumped from 184th, according to OSP, to 24th, according to age-adjusted IRP. Pete Rose jumped from 532nd to 88th according to the same measures. Not bad for a self-proclaimed singles hitter.

Also as expected, the two active players on the list declined slightly in rank due to age adjustment. Mike Trout falls from 6th to 8th and Joey Votto falls from 16th to 17th. At 37 this year, Votto is past his prime, but still the highest ranked batter not born in the U.S. (he’s from Canada).

Greatest of All Time and Still in His Prime?

I’ll end this post with something for you to ponder. There are only seven batters ranked higher than Mike Trout. All of them batted left-handed except for Mickey Mantle, a switch-hitter, and Rogers Hornsby, for whom we only have partial data and who played during an era of exceptionally weak pitching. This means that Mike Trout is quite possibly the greatest right-handed batter to ever play baseball and at age 29 this August, he’s still in his prime. And get this, since most pitchers are right-handed, batting right-handed is a disadvantage. Once I adjust for handedness and pitching quality, Mr. Trout might just be revealed to be the greatest batter of all time, period.

I know of only a handful of all-time-greatest-players in their respective sports who played during my lifetime. Michael Jordan, Tiger Woods and Serena Williams are three that come to mind. I regret that I failed to see any of them play in person during their primes when I had the chance. Now that I realize Trout’s place in baseball history, I am precluded from seeing him in person due to the coronavirus pandemic.

If you ever have the chance, you really should make an effort to see him in the flesh. It’s something we can tell our grandkids about.

Hiyo, Silfur!

6-D Baseball and the Art of Extrapolation

In my last post, I stated that extrapolation is how we fill in gaps in the data created by the Curse of Dimensionality.  It is a blending of observation and theory, a tradeoff between accuracy and simplicity. As much art as it is science, its partial reliance on subjective judgement can make it susceptible to manipulation. That is why Mark Twain once said, “There are three kinds of lies: lies, damned lies and statistics.” Four, if you count economic impact studies.

It is, therefore, with great care and transparency that one should approach extrapolation. Economists, in particular, are guilty of creating arcane statistical models that no one outside our profession can understand, much less believe.

This is the reason why medical journals tend to have a low tolerance for studies that rely on complex statistical modeling. The more complex the model, the more difficult it is to be sure one is drawing the correct inference. Unlike economics, medical research is an experimental science. A well-designed experiment need not rely on a confusing array of equations.

For statistical analyses, the medical profession has, in effect, adopted the KISS principle: keep it simple, stupid. In keeping with this principle, the Lone Economist designed the 6-D baseball statistics using simple ratios, like runs per plate appearance, and a relatively simple rule for extrapolation.

I divided the observed data into six dimensions that results in 288 separate game locations. But every baseball fan knows there are many other bits of information by which to predict outcomes. There is the identity of the batter, the pitcher, and the stadium. There are even individual characteristics of the player that are useful predictors, such as handedness (i.e. which side of the plate the batter swings his bat) and age. Some players might be better at night, some during daylight. The list is quite lengthy.

The Curse of Dimensionality prevents the data from being subdivided into all possible combinations of predictors, so extrapolation must be used for the non-dimensional factors. I call them effect modifiers.

Effect Modification

The objective of 6-D Baseball is to predict what happens next during a live game. I want to make it easy for the casual spectator to know when a team is most likely to score without resorting to a hand calculator. The first step is to look at the separate effect of the current batter.

In an earlier post, I introduced the Individual Run Production (IRP) statistic. This statistic provides a value by which we can assess the scoring potential during a plate appearance for each of the 288 game positions. Although it can be used to compare batters across time, its main purpose is to measure the likelihood of scoring in a live game.

Every batter has two objectives. The first is to drive in runs during his plate appearance. The second is to setup the situation for the next batter. Consequently, the individual run production statistic (IRP) is the sum of two components: the runs scored during the plate appearance (RBI) and the value of the change in the game location.

The average or expected number of runs scored until the end of the half-inning is a function of the IRPs of the current and subsequent batters. In equation form it looks like this:

I’ll explain what these equations mean, one by one.

L represents “location” and refers to the count of balls and strikes, the disposition of each base and the number of outs. Specific values are six digits long in the following order: outs, 3rd, 2nd, 1st, Balls, and Strikes. For example, at the start of each half-inning, there are no outs, the bases are empty and the count of balls and strikes are 0 and 0. Therefore the value of L would be 000000. If there were two outs, a man on 1st and the count of balls and strikes was three and two, the value of L would be 200132.

Hb(L) is the average number of runs scored by the end of the half-inning when the game location is L. He(L) is the average number of runs scored after the current plate appearance.

Equations 1 and 2 say that Hb(L) is simply the average number of runs scored during the current PA, R(L), plus He(L), all the runs scored after the current PA.

Equation 3 says the average number of runs when the current batter is j and location is L is in part Hb(L) times the ratio of the average runs scored during the current PA regardless of the batter, R(L), to Hb(L). This ratio is denoted s(L) and is divided by the overall average value of s regardless of L, A(s). Since there are 288 values of L, there are 288 values of s(L). They range from 116.3% for L = 211130 to 3.1% for L = 000002.  The overall average value of s(L), A(s), is 35.8%.

The last component of equation 3, sj, is specific to batter j. It is the batter’s average ratio of runs scored during the PA to Hb(L), the first part of the IRP statistic.

Equation 4 determines the average number of runs scored after the PA, the situation the current batter sets up for the next batter. d(L) is the average ratio of He(L) to Hb(L) at location L. A(d) is the average value of d(L) over all values of L and dj is the average ratio for player j. d(L) ranges from 160.2% for L = 200030 to 12.5% for L = 211102.  The value of A(d) is 64.9%.

Equations 5 – 8 show that the expected number of runs scored is a function of the current location, L, and the components of the IRP’s for the current and subsequent batters.

Next time I’ll provide some illustrative examples using the IRP’s of active batters.

Baseball and the Curse of Dimensionality

Napoleon once said, “An army marches on its stomach”. Or was that Frederick the Great? I get my 18th century warmongers mixed up. No matter, his meaning was that the quantity and quality of food for one’s troops is of critical importance when fighting a war.

In the case of the Lone Economist, the quantity and quality of data — preferably free and publicly-available, not every masked vigilante can be as loaded as Bruce Wayne — is the vital fuel for crusading against bias, confounding and all other manner of inappropriate statistical inference. And don’t even get me started about post-randomization, sub-group analyses!

Fortunately, baseball (and for that matter healthcare) generates a huge volume of free data. I have data, courtesy of Retrosheet, covering 173,947 MLB games that date back to 1918. These games include 13,561,443 plate appearances by 13,196 different batters. I even have data on every pitch thrown since 1988, all 21,734,609 of them.

And yet, it isn’t enough.

Every masked vigilante has his or her nemesis and the Lone Economist is no exception. Mine is the Curse of Dimensionality. My fists clench at the mere thought of it. Curse you Curse of Dimensionality!

Can one curse a curse?

What am I saying? I’m a masked vigilante. I can do anything.

This curse refers to a data scarcity problem encountered often in statistical analysis. No matter how much data one possesses, dividing the data into even a small number of dimensions will quickly exhaust the supply.

For example, I have identified only six dimensions of baseball games: outs, three bases, and called balls and strikes. Since each of these dimensions has but a few discrete values, there are just 288 possible “locations” a ballgame can find itself in.

Whenever the count is three balls and less than two strikes and the bases are loaded, the pitcher is at a distinct disadvantage. He can’t afford to throw another pitch outside of the strike zone. The batter knows this and can count on the next pitch being where it can be hit hard. It’s known as a cripple pitch. The worst cripple pitch from the pitcher’s perspective is when there are no strikes and no outs. Put these two situations together and we get the Ultimate Cripple Pitch (UCP).

For the average batter facing the average pitcher, the number of runs scored from that point until the end of the half-inning is 2.88, higher than any other game location. So, if the batter were say Mike Trout, the best hitter in the major leagues today and arguably the sixth best hitter over the last hundred years, the average number of runs scored from that location would be even higher. Right? I imagine this nightmare scenario has caused many pitchers to wake up screaming in the night.

There’s only one problem with this assessment. Mike Trout has never faced a UCP and he probably never will. Out of over 21 million pitches thrown since 1988, only 781 of them have been UCPs. That’s less than four thousandths of one percent.

If I added just one more dimension to my list of six, e.g. the identity of the batter, a huge number of gaps would appear in the data. Are we then to conclude that when, if ever, Mike Trout finds himself in an UCP situation, that it is completely unknown what is likely to happen next? Is he just as likely to strike out as the average batter?

Of course not. Just because it has never happened before, doesn’t mean we know nothing about what is likely to happen. We know how well lesser hitters do in that situation and we know how well Mike Trout does in situations less advantageous to the batter. It doesn’t take a crystal ball to conclude that the average pitcher would be in extremely deep doodoo.

Well, Curse of Dimensionality, the Lone Economist has a silver bullet with your name on it and it’s called “extrapolation”.

Although, if the bullet is called “extrapolation”, shouldn’t that be written on it? These mythology idioms can be so confusing.

I’ll explain how extrapolation works in my next post.

Back to the Red-Zone

My last post was devoted to answering the ultimate baseball question: who was the greatest hitter? I plan to answer similar questions in the future, such as ‘who was the greatest pitcher?’ and so forth. But all of that is a side excursion from our main path, predicting what happens next in a live baseball game.

Anticipation of what is about to happen next is a primary cause of interest in observing a sporting event. Most spectator sports consist of numerous small skirmishes between the opposing sides. Winning skirmishes leads to winning more significant contests, battles. And winning battles leads to winning the war, the game. For American football, seeking a first down is a skirmish, a series of downs is a battle. For baseball, a plate appearance is a skirmish and an inning is a battle.

For today’s post, I want to return to the concept of a baseball Red-Zone. In a previous post, I showed a heat map of the 288 locations of a baseball game that can occur within its six dimensions: balls, strikes, outs and three bases. That Red-Zone was calculated at the inning level or in other words for battles. When I say I want to predict what happens next, I mean by the end of the plate appearance and even on the next pitch.

Consequently, a baseball Red-Zone would apply to a plate appearance as well as an inning. To see how this is done, look at Figure 1, the half-inning scoring heat map below. Illustrating six dimensions in a two-dimensional space is hard to do. I used one axis to measure outs and balls (the east-west dimension) and the other axis to measure bases and strikes (the north-south dimension). This resulted in a 12 by 24 matrix of 288 cells.

Figure 1. Half-Inning Scoring Zones (1988-2019)

Note: The information used here was obtained free of charge from and is copyrighted by Retrosheet.  Interested parties may contact Retrosheet at “www.retrosheet.org”. Retrosheet provides play-by-play data on most MLB baseball games from 1918 to 1931 and all games since 1932; however, data on every pitch in every MLB game dates back only to 1988.

Notice that there are three rows for each combination of occupied bases and eight rows for each value of strikes. There are four columns for each value of outs and three columns for each value of balls. This juxtaposition of dimensions is dictated by the need to produce a rectangular matrix.

What is not dictated though is the order in which the axes are sorted. The north-south axis is sorted first by bases and then by strikes. The east-west axis is sorted first by outs and then by balls.  I could have sorted them differently, but I chose to sort them in this way to achieve a visual effect.

I wanted to illustrate the fact that the more balls and the less strikes that are called, the greater is the potential for scoring. And I wanted to illustrate that the fewer outs and the more runners on base closer to home there is, the greater is the potential for scoring. Since outs and bases dominate in the determination of scoring at the inning-level, it was necessary to sort by those dimensions first. This led to a clear picture of three relatively intact scoring zones: a blue zone (i.e. low-scoring) in the southwestern cells, a red zone (i.e. high-scoring) in the northeastern cells and a yellow zone that clearly delineates them.

If, instead, I had sorted the axes primarily by balls and strikes, the picture would have looked like Figure 2.

Figure 2. Half-Inning Scoring Zones Resorted (1988-2019)

The relationships between balls vs. strikes and outs vs. bases is the same as in Figure 1. But due to the different sorting of the axes, it is harder to discern this from looking at the heat map.

The Plate Appearance Red-Zone

For long-time baseball fans, it should come as no shock to learn that when it comes to determining the outcomes of plate appearances, balls and strikes — not outs and bases — dominate. What might come as more of a surprise though, is the relationship between positive outcomes and outs vs. bases.

Figure 3 shows a heat map of the average plate appearance outcomes for each of the 288 baseball game locations. The north-south axis is sorted first by strikes and then by bases. The east-west axis is sorted by balls and then by outs. If you recall from my last post, a plate appearance outcome is the number of runs

scored during the PA (RBI+) plus the change in game location (ΔGL). The outcomes range from negative 0.31 to positive 0.62 and average zero. The lower the value, the bluer the background and the higher the value, the redder the background.

A matrix of 288 numbers can be very tedious to contemplate, so Figure 4 presents a trichotomized version that drops the Arabic numbers. Although the axes of Figure 4 are sorted in the same way as the axes in Figure 2, the color pattern in Figure 4 is similar to that of Figure 1. Blue cells cluster around the southwestern region and red cells cluster around the northeastern region. Yellow cells form a diagonal from the northwest to the southeast.

The pattern confirms the theory that positive outcomes generally increase with the count of balls and the number of runners on base and decrease with the count of strikes and the number of outs, but there are several exceptions. For example, according to this theory, the worst location for the batter should be two outs, no runners on third, second or first, no balls and two strikes (location 200002). According to Figure 3 however, the worst location for an individual batter is one out, the bases loaded, no balls and two strikes (111102). The reason is that with two strikes and no balls, the batter can’t afford to not swing at the next pitch unless it is far outside the strike zone. This makes a strike out or even worse, a double play, quite likely. Since the bases are loaded, the opportunity cost of that outcome would be very high.

Another exception is that this theory predicts location 011130 (i.e. no outs, the bases loaded, three balls and no strikes) would be the best location for the batter. But the best outcome is at 211130 (i.e. two outs) instead. As to why, I can only conjecture. When there are two outs, a double-play is not possible. But I suspect the main reason is that with two outs, the batter is less likely to try for a homerun and therefore is more likely to put the ball into play or walk in a run.

From this matrix of plate appearance outcomes, we can estimate the impact an individual batter has, based on his past history, as well as other effect modifiers like who is pitching, the stadium in which the game is played, etc.

The Ultimate Baseball Question

How does one assess individual production within a group activity, like manufacturing? There’s overhead and sunk costs, the law of diminishing marginal product and substitutability of labor and capital to contend with. Accountants and economists have struggled with this problem since the invention of money itself.

So have baseball statisticians, which brings me to today’s topic. How can we measure an individual player’s contribution to his team’s score? It’s not just the runs he scores himself, because that often relies on who batted him in. And it’s not just the runs he bats in, because that depends on who batted previously.

To solve this riddle, statisticians invented many of the terms and concepts we associate with the fundamental parts of baseball today. For example, the idea of a “base hit” has nothing to do with the design or execution of the game of baseball. If the batter puts the ball into play on the ground and beats the ball to first base, he is safe. Whether it’s a “single” or an “error” or a “fielder’s choice” is irrelevant.  These designations are merely statistical contrivances to facilitate measuring the productivity of an individual batter.

With that history in mind, we need to construct an individual batting statistic that is congruent with the goals of this study, that is, to predict scoring based on the six discrete dimensions discussed in previous posts plus several effect modifiers, like who is batting, pitching, etc. Even if the traditional individual batting performance measures, e.g. batting average (BA), on-base percentage (OBP) and slugging percentage (SLP), did not suffer from many flaws, they would not serve this purpose well. So, a completely new type of statistic is called for.

Traditional Batting Statistics

The flaws of these statistics are as well-known as their many proposed remedies. BA weights all base hits equally but ignores bases on balls and advancing base runners (i.e. sacrifice bunts and fly balls). OBP counts bases on balls but still counts a single as much as a homerun. Like BA, SLP only counts base hits but does weight doubles more than singles and so on. However, the weights (i.e. 4 for a homerun, 3 for a triple, etc.) are arbitrarily derived.

Modern statistics, such as Pete Palmer’s and John Thorn’s Linear Weights or Weighted Runs Created (wRC), improve upon the traditional batter performance measures, yet still rely on the same flawed contrivances, like base hits and sacrifice flies. Adding OBP and SLP together (aka OSP) is also a popular remedy, but this literally compounds the flaws rather than eliminates them.

A common flaw of all these statistics is that they suffer from confounding, the assignment of a spurious causal association between two variables due to missing information. Let me explain via anecdote.

When I was 13 years old, I started wearing a hat, because I thought it would look “cool”. This was before I realized that I was genetically incapable of judging what other people consider cool. My father saw me and said “Take that hat off! Don’t you know it will make you go bald?”

I thought about this for a while. Bald people wear hats to protect their bare scalps from the sun. Ergo, most bald people wear hats and most people with hair do not. My father had observed this and correctly deduced that there was a causal relationship between wearing a hat and going bald. Only, he got the causal direction wrong. Bald people are not bald because they wear hats. They wear hats because they are bald. His analysis suffered from confounding.

In baseball statistics, confounding results in a batter’s relative productivity being over or under measured. For example, some stadiums are easier to score in than others. A lot of work has gone into trying to figure out how many more homeruns Babe Ruth hit because Yankee Stadium had a short right field fence or how many fewer homeruns Willie Mays hit because he played in wintery Candlestick Park.

Adjusting for stadium effects is commendable, but the old and new statistics fail to adjust for the most reliable determinant of scoring of all, the multi-dimensional location of the ball game. A batter’s BA, OBP, and SLP all improve dramatically with runners on base. Some batters slogged it out during eras when scoring was relatively hard to do. I call this baseball’s Death Valley Days (another 1950’s TV show reference!), – I’m talking about you Mickey Mantle, Willie Mays and Hank Aaron – while others enjoyed the bountiful 1920’s and 30’s. The average number of runners on base when the terrific trio were at the plate was 0.64, 0.63 and 0.66, respectively. These are all close to the overall average of 0.64, but each of these guys batted third in the order. Their number-of-runners-on-base averages should have been higher. The corresponding numbers for Babe Ruth, Lou Gehrig and Rogers Hornsby were 0.73, 0.78 and 0.74, about 17% higher.

Mantle, Mays and Aaron played against a pervasive headwind that is not accounted for by the traditional statistics. What is needed is a statistic that recognizes how the game is designed and played and does not rely on subjectively determined events, like errors and base hits.

What’s that you hear? Why, it’s the William Tell Overture signaling the Lone Economist coming to the rescue. Hiyo Silfur!

Individual Run Production

If you recall from previous posts, baseball has six dimensions. At the individual pitch level there are exactly 288 discrete “locations” the game can find itself in. These range from the very start of the half-inning (i.e. no balls or strikes, no outs and the bases are empty) to a full count (i.e. three balls and two strikes), two outs and the bases are loaded. Some of these locations are more propitious for scoring than others, i.e. the Baseball Red-Zone.

At the individual plate appearance (PA) level, each PA starts with a zero count (i.e. no balls and no strikes). So, when we assess the change in the team’s prospects for scoring from the beginning of one PA to the beginning of the next, we need only consider four dimensions (i.e. outs and the three bases) and 25 possible outcomes (i.e. eight configurations of the three bases (2 x 2 x 2) times three values of outs, plus one for the end of the half-inning).

That last paragraph may be hard to understand at first, so let me explain via example. Below is a heat map of the average additional runs scored for the 24 starting locations. These figures cover the 102-year period from 1918 to 2019. My source, Retrosheet, provides data on all games played from 1932 to 2019, but from 1918 to 1931, only 75% of major league baseball games are covered. So, some of the games played by the likes of Babe Ruth and Ty Cobb are missing. But as a professional statistician, the Lone Economist has an aversion to discarding useful data. So, I left those years in.

Now suppose a batter is first up in a half-inning. The game’s “location” is at the bottom right-hand cell, there are no outs and no one is on base. Under average conditions, the batting team would be expected to score 0.49 runs by the end of the half-inning. Just what impact on the team’s expected runs can the individual batter make? There are exactly five possible outcomes by the end of this PA. He can reach first, second or third safely; score a run or be out. That’s it. None of the remaining 19 locations are possible.

If he reaches first base safely, he increases expected additional runs from 0.49 to 0.86, i.e. 0.37. Reaching second increases expected runs by 0.61 and reaching third increases it by 0.83. If he is out, expected additional runs decrease from 0.49 to 0.27 or by 0.22. A homerun doesn’t change expected additional runs at all (i.e. the next batter starts at 0.49 also), but a run is scored so that is the best possible outcome from the PA. The following table lists the possible individual run production (IRP) outcomes when there are no outs and no one is on base.

Notice that the IRP difference between a homerun and an out (1.22) is approximately double the difference between a single and an out (0.59). Remember that slugging percentage assumes this ratio is four, not two. Of course, this is true only for the special case when the bases are empty and there are no outs. But even under different conditions, a homerun is worth far less than four singles, usually less than two.

How about when the bases are loaded and there are no outs? That’s more complicated because there are a lot more than five possible outcomes in that scenario. The exact number of possible outcomes is 24.  These include all 24 PA locations except for one, the bases loaded and two outs. If the batter hits into a double play, there can be no more than two runners left on base.  Plus, there is the outcome of the triple play which ends the half-inning. Here is a heat map of 23 of those possible outcomes.

I won’t explain the value of every cell, but I can explain a couple of examples. The top right-hand cell is the outcome where the batter either walks, is hit by a pitch, reaches first due to a fielding error or hits a single and each baserunner advances one base. One run is scored and there is no change in the expected runs specified in Figure 1, the game is at the same location when the next batter comes to the plate. So, the IRP is one run exactly.

The bottom right-hand cell is the outcome when the batter hits a grand slam, i.e. the bases are cleared and there are still no outs. Four runs are scored, but the average additional runs from the start of the PA to the start of the next PA falls from 2.27 to 0.49. So, the IRP of that cell is 4 – 2.27 + 0.49 = 2.22.

The only possible outcome missing from Figure 3 is the triple play. If no runs are scored, the average additional runs from Figure 1 drops from 2.27 to zero.  Therefore, the IRP would be -2.27. It is possible that a run scores before the third out is recorded. In that case the IRP would be -1.27. The average value over the past 102 years is -1.63.

From Figure 3, we can see that relative to an out where no one scores (i.e. -0.70), a homerun is worth 2.92 (2.22 + 0.70) runs and a single is worth 1.7 (1.00 + 0.70) runs. The ratio of a homerun to a single is less than 2. Consequently, we can see how much slugging percentage over-values homeruns relative to singles.

Average IRP

The above discussion establishes the basis for our new statistic. Every time a batter comes to the plate, he is at one of the 24 locations. What he does with this opportunity depends on his ability and chance. He is credited with any runs that score from his plate appearance, i.e. Runs Batted-In plus any runs scored due to fielding errors (RBI+), and the change in the game location.

For example, suppose the game location is the bases are loaded and there is one out. According to Figure 1, expected runs are 1.57 (top row, middle column). This is a very favorable location, the third highest out of 24.

Suppose the batter hits a fly ball to right field, the runners on third and second tag up. The third base runner scores a run and the runner on second advances to third base. The batter is out, but one runner scores and another is closer to home. This might be considered a good outcome for the batter, but is it really?

The game location moves from 1.57 in Figure 1 to 0.51, two outs and runners on first and third. The IRP of that PA is therefore 1 (the run batted-in) + 0.51 – 1.57 (the change in the game location) = -0.06.

The negative value seems to indicate that this was not a good outcome, but we need to consider what the alternatives are to put this outcome into context. The batter could have hit a grand slam with an IRP of 2.7 (4 + .027 – 1.57) or into an inning-ending double play with an IRP of -1.57. Compared to that worst-case scenario, the -0.06 IRP is an improvement of 1.51 runs. An above-average batter might be disappointed with the outcome, but a below-average batter would be happy to hit the fly ball to right field.

The creators of the traditional statistics didn’t have a good solution to measuring this outcome. They labeled it a “sacrifice” and excluded it from batting average and slugging percentage, thus violating a cardinal tenet of statistical analysis to count all useful information. On-base percentage is even worse. A sacrifice is counted in the denominator, so it is just as bad as a strike out. And it counts double plays the same as single outs.

We take the number of IRPs and divide it by the number of plate appearances to calculate the average IRP. Although I used Figure 1 to explain the IRP concept, actual IRPs should be calculated using annual averages. So, when calculating Barry Bonds’ IRP in 2004, for example, I used the average runs for each game location in 2004.

Notice there is no reliance on base hits, errors, sacrifice flies or fielder’s choices. All plate appearances count. Nothing is excluded.  Batters that hit into double and triple plays are fully penalized. Batters who advance base runners are given proportional credit.

From an economist/statistician viewpoint, the beauty of this statistic is that it adheres to the “adding up” constraint. When devising a system of equations – in this case each batter’s statistic represents one equation – the sum of the individual parts should equal the total. By the way this statistic is defined, at the end of the year the sum of all players’ IRPs will be equal to the sum of runs scored during the season.

Career Average IRP vs. OPS

Figure 4 lists the top 25 players by average IRP. Only players with at least 3,000 plate appearances during the 1918-2019 time-span are ranked. For comparison sake, the right-hand column ranks each player’s OSP (on-base percentage plus slugging percentage) statistics.

The first thing to note is how similar the two rankings are. The top seven players are the same in both rankings. Babe Ruth, Ted Williams and Lou Gehrig are at the top of both lists. Several other familiar names also appear in both Top 25 lists: Joe DiMaggio, Mickey Mantle, Stan Musial, Willie Mays, etc.

The next things to notice are the players that fair much better with this new ranking as compared to that by OSP. Hank Aaron rises from 33rd by OSP to 22nd by average IRP. Ty Cobb jumps from 47th to 15th. And this was for only some of his games played after his prime years.

There are a few players who fair relatively poorly and are not shown in Figure 4. For example, Vladimir Guerrero drops from 27th by OSP to 121st by average IRP. Alex Rodriguez drops from 32nd to 47th

RBI+ vs. Change in Game Location

For anyone who thinks this is simply an RBI per plate appearance statistic, lets break average IRP into its two separate parts, RBI+ and the change in game location (ΔGL). In the aggregate, RBI+ equals the negative value of ΔGL. Figure 4 shows this to be 0.118 vs. -.117. This happens because when runners on base are batted in, the game location usually becomes less favorable.  Therefore, players who tend to have above average RBI+ per plate appearance will have below average ΔGL.

Hank Greenberg is an extreme example of this relationship. He has the highest RBI+ average over the last 102 years, but ranks only 1,312th (out of 1,598) in average ΔGL. I doubt that it is just a coincidence that he also had the highest average number of runners on base (0.82) when he came to bat. The 102-year average of this statistic is 0.64.

Despite his high number of runners on base, Greenberg did not enjoy the highest average game location when he came to bat. He ranked 30th in this department, below the king of cleanup hitters himself, Lou Gehrig.

Greenberg was known as a slugger, not for his speed around the bases. He was called Hammerin’ Hank before Aaron went by that nom-de-guerre.  So, a high average RBI+ and a low average ΔGL might be a marker for a power hitter. If so, then Gehrig, DiMaggio, Ramirez, McGwire and Aaron fit this description.

But what about Bonds and Mantle? They rank 127th and 164th, respectively, in average RBI+ and 36th and 33rd in average ΔGL. If those two weren’t power hitters, then nobody was. So, the power vs. average divide does not explain this relationship.

But to prove that a high average RBI+ does not always result in a low average ΔGL (and vice-versa), look at Babe Ruth and Ted Williams. Ruth’s average RBI+ was second only to Greenberg’s, but his average ΔGL was ranked 34th. Ted Williams had the 9th best average RBI+, but his average ΔGL was ranked 10th. Those two guys were great no matter the situation.

Who Was the Greatest Batter?

This is the ultimate question for a baseball statistician. However, the objective of 6-D statistics is to change the analytical focus (i.e. perspective) from comparing players’ abilities to predicting outcomes during a game. It is therefore ironic that a by-product of this refocusing is a statistic that attempts to answer the ultimate question. The Lone Economist admits that he would love to discover some new nugget of information that sheds light on the answer.

Average IPR is just the start. I plan to look at many other factors that affect baseball productivity. But before I end this post, I want to address an obvious source of confounding that even average IPR suffers from. I am referring to the low ranking of batters who played during the 1960’s (e.g. Mantle, Mays and Aaron) relative to those of the 1920’s and 1930’s (e.g. Ruth, Gehrig, and Hornsby) and 1990’s and 2000’s (e.g. Bonds, Ramirez and McGwire).

A change in the way baseballs were manufactured and the banning of the spitball in 1920 likely inflated the batting statistics of the 20’s and 30’s. Performance-enhancing drugs fueled the scoring surge of the 90’s and 2000’s.  So how can we reduce the effects of these confounding missing factors and level the playing field?

Notice that the average beginning game location was lower for Mantle, Mays and Aaron (i.e. 0.471, 0.459, and 0.464) than for any other member of the Top 25 club except for Mike Trout, the only active player to make the list. This happened not because they were on poor-hitting clubs, but because they played during a poor-hitting era.

One way to correct for this difference is to calculate the average IRP as the percentage change from the average beginning game location. Figure 5 does this and recalculates the rankings. Notice that Mays and Aaron rise from 18th and 22nd, respectively, to 13th and 14th. Mickey Mantle jumps from 9th to 4th and Mike Trout climbs in the rankings from 11th to 6th. And I’m happy to see Frank Robinson, Dick Allen and Wille Stargell climb into the Top 25. The traditional statistics never treated these great hitters fairly.

But we still have far to go to answer the ultimate question.

The Six Dimensions of Baseball

In a previous post, I stated that goal-line sports, like football and soccer, are basically one-dimensional. The closer the ball is to the goal, the greater the chance of scoring. If I calculated the probability of scoring as a function of distance to the goal, I imagine it would look something like this:

I believe this simplicity helps explain the broad appeal these games have for the general public.

The scoring in a baseball game is not so simple to predict or to illustrate, however. The chance of scoring is a function of six discrete variables or dimensions, not just one. These are the counts of balls and strikes, the dispositions of three separate bases and the number of outs. Since there are four values for balls, three for strikes, two for each of the three bases and three outs, there are 288 (i.e. 4 × 3 × 2 × 2 × 2 × 3) discrete values or locations that determine the probability of scoring during a half-inning.

Thinking in six dimensions is hard enough. But illustrating six dimensions on a two-dimensional space is even harder. For example, if the probability of scoring a touchdown were a function of even one more dimension than the distance to the goal, the above graph would need a third axis; one that is perpendicular to the two axes that already exist. In short, a three-dimensional drawing would be necessary.

But how do we illustrate five more dimensions when we can only see a total of three? The answer is to rely on another aspect of our visual senses, color. The frequency spectrum of visible light waves ranges from the lowest (i.e. blue) to the highest (i.e. red). So, if we associate light wave frequency with scoring probability, the range is from blue to green, yellow, orange and finally red. Think of it like temperature. Blue is for cold and red is for hot.

Balls vs. Strikes

If you recall from an earlier post, the average runs scored per inning is almost exactly one. So, at the start of each half-inning, when the balls-strikes count is 0-0, the average runs scored by the end of the half-inning is one half of a run.

Each additional ball should favor the batter and each additional strike should favor the pitcher. Do the data bear this out? Look at the following color-coded table (aka heat map) and see.

These numbers cover years 1988 to 2019.  The upper left-hand cell of the table represents the average additional runs scored by the end of the half-inning when the balls-strikes count is 0-0. This value is near the middle of the range from 0.39 to 0.73, so it has a yellow color. The highest value, when there are 3 balls and no strikes, is colored red. Conversely, the lowest value, when there are no balls and 2 strikes, is colored blue. The axes are ordered so that the highest values are in the upper right-hand cells and the lowest values are in the lower left-hand cells.

From this table we can infer three conclusions:

  1. Each additional ball favors the batter
  2. Each additional strike favors the pitcher
  3. Balls and strikes have a measurable, but small causal impact on scoring.

To find a larger causal impact on scoring, we look at outs and bases.

Outs and Bases

Here is a heat map of the average additional runs scored by the end of the half-inning for the 24 unique values of outs and bases.

Again, the axes are ordered so that the higher values are in the northeast cells and the lowest are in the southwest cells. Clearly, average runs decrease with the number of outs and increase with the number of runners on base and when they are closer to home.

Another conclusion is that the range is much greater for outs and bases (0.10 to 2.24) than it is for balls and strikes (0.39 to 0.73). In other words, outs and bases dominate balls and strikes in the determination of runs scored.

The Baseball Red-Zone

It is now time to combine all six dimensions into one 288-cell heat map.

Notice the familiar shading from the blue end of the spectrum to the red end as we move in a northeasterly direction. Also notice that the range is even greater, 0.06 to 2.88.

The detail of this heat map can be useful, if overwhelming. For example, it shows that average runs when there are no outs and a runner on first and there are no balls or strikes is 0.88. Directing the batter to perform a sacrifice bunt in order to advance the runner to second base would result in a decrease in average runs to 0.72. So, under average conditions, this would be a bad decision. However, when the batter is below-average, like a pitcher at bat in the National League, it can be a good decision.

The normal spectator will find this heat map tedious. So, I trichotomized it into three scoring zones: cold (blue), medium (yellow) and hot (red).

This is the baseball red-zone for the average batter facing the average pitcher in the average ballpark and so on. The 3-zone heat map for an above average batter would have more red cells and fewer blue ones. I imagine the red-zone for Mike Trout would be very large, unless he is facing Clayton Kershaw.

My mission is to determine the red-zones for all batters, pitchers, stadiums, etc. This is just the beginning of a long and interesting road.

What Matters to Baseball Spectators

Productivity is measured by rates, i.e. quantities of outputs relative to quantities of inputs. So, our first step is to choose the appropriate outputs and inputs. Let’s start with the choice of outputs.

For all types of prediction, there is usually one main outcome. For healthcare, it’s death. When a new cancer drug is tested, we want to know how many lives it will potentially save (the output) versus the lives lost if resources are diverted from some other use (the input). But death is thankfully a rare event and a clinical trial that uses death as the main outcome could take years to complete.

This is why so many clinical trials of potentially life-saving cancer drugs choose an intermediary outcome like the resumption of disease progression rather than just death. If they waited for all the trial participants to die before concluding that the drug is safe and effective compared to a placebo, several decades might pass.

From the baseball spectator’s perspective, the main outcome of interest is obviously which team wins the game. But we don’t want to wait to the end of the ball game to see who won. We want to anticipate which team is likely to win using intermediate outcomes. For baseball fans, there are two intermediate outcomes that build upon each other: reaching base and scoring.

Reaching base is not the team’s end objective, but it is corelated with scoring runs which is corelated with the end objective, i.e. winning the game. A game of baseball typically lasts three hours. To maintain spectator interest over such a long time-span, one must be able to appreciate when your side is about to win a small contest (the plate appearance) that may lead to winning a larger contest (scoring during the inning) that may lead to winning the game.

The little/medium/big contest strategy for maintaining spectator interest is not unique to baseball. For example, American football’s small contest is making a first down. If the team achieves that, then it might win the medium contest (score a touchdown) and ultimately the game.

Each output measure is associated with an input measure in order to calculate the rate of production. The input measure for reaching base is the plate appearance. For scoring, the input measure is the team’s side of the inning, aka the half-inning.

Runs Per Inning

A rate is the ratio between the amount of output and the amount of input. For example, from 1918 to 2019, a 102-year time span, 1,545,462 runs were scored by major league baseball teams in 1,598,551 innings. That comes to almost 1 run per inning (RPI) (i.e. 0.967/inning).

So, if you went to an MLB game tomorrow, would you expect nine runs to be scored? If both teams were more or less average, the answer is yes. But that happens to be a lucky guess because when it comes to scoring per inning, MLB is highly episodic. The present day just happens to be in concordance with the long-run average. Here is a graph of the yearly runs per innings from 1918 to 2019. Notice that there have been many peaks and valleys, but the latest year was close to the long-run average of one run per inning.

Note: The information used here was obtained free of charge from and is copyrighted by Retrosheet.  Interested parties may contact Retrosheet at “www.retrosheet.org”.

This graph makes RPI look highly variable by year, but that is more a result of my choice of the upper and lower bounds of the vertical axis than its actual variability. If I had chosen the bounds of the vertical axis to be 3 and 0, for example, the graph would have looked like this.

Relying on how the graph looks, can lead to mistaken inferences. The standard deviation, a measure of variability, is only 0.089, less than one tenth of the mean RPI. But what makes runs per inning episodic is the serial correlation of the annual figures, not its standard deviation.

What I mean by serial correlation is correlation of consecutive deviations from the mean. In other words, annual runs per inning are not independent random events. A higher than average year is likely to be followed by another higher than average year. The serial correlation coefficient for this time series (represented by the Greek letter “rho”, ρ) is 0.733. This is a high value for this statistic. If annual RPI was independent, ρ would be close to zero instead of close to 1.

The broad takeaway from this graphic is that the 1920’s, 30’s and 90’s were epochs of relatively high-scoring, while the 1960’s experienced relatively low-scoring, but that there is no time trend. The time trend coefficient is practically zero. Annual runs per inning that deviate from the long-run average tend to regress to the mean in the following years. This has led to a fairly stable RPI value for over a century.

Probability of Reaching Base

Since 1918, there have been 13,266,945 plate appearances which resulted in the batter reaching base 4,426,673 times. The rate is therefore 0.334. This means that for the last century the odds of the batter reaching base has been 1 to 2. Conversely, the odds of getting out have been 2 to 1.

This might seem high since a .300 batting average is considered quite high, but please keep in mind this is not the batting average and it isn’t even the on-base percentage. Both traditional statistics exclude reaching base by fielding error and batting average excludes bases on balls and hit by pitch. From the spectator’s perspective, the credit or blame for why the batter reached base is irrelevant. A walk is as good as a single and a two-base error is as good as a double. Like RPI, the PRB varies from year to year. The following graph illustrates the vicissitudes of PRB from 1918 to 2019. It looks quite similar to the graph of RPI. And just like RPI, it is serially correlated with multi-year peaks and valleys, but no long-term time trend. In fact, PRB’s variability is less than that of RPI (SD = 0.012, ρ = 0.83).

A Time Trend for Baseball

Before I end this post, I want to show that there is at least one long-run time trend in MLB. The following graph illustrates homeruns per plate appearance since 1918. It shows that in 1918, only 0.39% of plate appearances resulted in a homerun. In 2019, that percentage was 3.63%, a nearly tenfold increase.

Some see this as progress. The Lone Economist does not. I’ll explain why in a later post.