Kerry's CALCULUS

ELIMINATING RUN SUPPORT:  NEUTRAL WINS AND LOSSES

I've said it before and I'll say it again (right now...aren't you lucky?):  a pitcher's won-loss record is the most overrated stat in baseball, bar none, because it's so context-driven and, ultimately, so misplaced.  After all, pitchers don't win games, teams do.  It has long amazed me that people moan and groan about other context-driven pitching statistics--saves, for instance, and quality starts--as being susceptible to "cheap" occurrences but seem utterly unperturbed by the same problem plaguing W-L record.  A pitcher can throw a no-hitter and lose; a pitcher can throw nine perfect innings and get no decision at all.   A pitcher can allow, say, seven runs in five innings and win.  A relief pitcher can blow a lead and get a win.  All of these things have happened in baseball, and the latter two examples happen with stirring regularity, yet no one seems bothered by it.  Fair enough.  I'm increasingly tired of tilting at this particular windmill.  Pitchers' wins and losses are apparently here to stay, evidently too burned into the baseball public's psyche to be susceptible to being dumped in the ash can in which they belong.  Oops...sorry, I promised to stop trashing the stat.

How ingrained are wins credited to pitchers?  So much so that it's the one stat regarded as a sure-fire pitcher's guarantee of enshrinement in the Hall of Fame:  if a pitcher wins 300 games, he's a lock for Cooperstown, and everyone knows it.  There is no other guaranteed pitching benchmark.  Position players have 3000 hits or 500 home runs to shoot for, but pitchers, while not needing 300 wins for enshrinement (just as position players don't need 3000 hits or 500 home runs), know they're in if they reach the mark.  And it's the only such mark that offers that kind of lock.

So we're stuck with the W-L record.

But let's see if we can't look at the matter with a little more objectivity.

The central problem with a pitcher's win-loss record is that it's so heavily influenced by what a pitcher's team does on the offensive end of the game, where the pitcher's role is limited or, in the case of the designated hitter enabled American League (1973 to the present), totally irrelevant.  This much we all know.  A pitcher who allows five runs while his team is scoring nine will be credited with a win, as long as he throws at least the first five innings, leaves the game with the lead, and his team holds that lead to the end.  On the other hand, a pitcher who allows one run in a complete game is tagged with a loss if his club gets shut out.  There are plenty of ways to get around the idiosyncratic nature of W-L record in analyzing pitchers (ERA comes to mind as a good starting point), but the public is so focused on won-loss record...

For instance, does anyone seriously believe the Roger Clemens would have been the "obvious choice" as the 2001 AL Cy Young Award winner if his record hadn't been 20-3?

Enter "neutral wins" and "neutral losses," which together make for "neutral win-loss record."  This stat eliminates the effect of run support and, after adjusting for ballpark effects, tells us, for a given number of decisions, what a pitcher's record could be expected to be given an average amount of run support.  For instance...Roger Clemens finished 20-3 last year, and pitched well, for the most part, posting a 3.51 ERA in a league with a 4.48 earned run average.  That's good.  But 20-3?  Clemens benefited from exceptional run support--6.6 runs per game...the Yankees team average was 5.0 runs per game.  What if Clemens had received a league average amount of support (4.86 R/G)?  Assuming 23 decisions, he would have gone 14-9 in neutral wins and losses terms.  Clemens was clearly a better than average pitcher in 2001, but nowhere near as much as his traditional won-loss record would imply.

*            *            *

Mathematical Explanation of Neutral Wins and Losses 

Those not interested in the quasi-technical gobbledy-gook should skip this section.

The measure is, loosely put, a comparison of a pitcher's ERA to the league ERA.  This provides a benchmark for comparing the player to the league average--the more earned runs a pitcher allows per game compared to the league, the higher a percentage of decisions he can be expected to win.  An average pitcher--someone whose park-adjusted ERA matches that of the league as a whole--can be expected to win half his decisions.  The concept is, obviously, the fewer earned runs a pitcher allows, the more games he should be expected to win, and the fewer games he should be expected to lose, all other things being equal.  Let's look at Clemens/2001 as an example:  he posted a 3.51 ERA for the year.  Park-adjusting does not significantly change his ERA, so let's go with the raw value for the sake of simplicity.  The AL ERA for 2001 was 4.48.  3.51/4.48 is approximately .78, which we will refer to as the Loss Index.  The average mark is 1.00.  The Win Index is 1.22 (2 - LI = WI) Clemens' ERA this past year was 22 percentage points better than the league average; Clemens "should" win 14 of his 23 decisions:

(Loss Index/2)*No. of Decisions) = Neutral Losses

No. of Decisions - Neutral Losses = Neutral Wins

In the case of Clemens, the PI is 122, No. of Decisions is 23.  (122/2)*23 = 14

*            *            *

There has been some discussion on the Belfry message boards over the past few months about the possibility that run support itself can be a function of who the pitcher on the mound is.  The theory, loosely stated, is that some pitchers routinely get better than expected run support than others...because they're better liked by their teammates.  I am, to put it mildly, extremely skeptical of this claim, which arose in part from John Eisenberg's From 33rd Street to Camden Yards.  In that book, a number of ex-Orioles discuss the fact that, when he was with the club, Milt Pappas claimed that he suffered from lousy run support; some of his teammates from those years imply that this was because players on the team thought that Pappas was something of a malingerer.  Without dealing with the substance of the charge itself, it goes without saying that if Pappas suffered from a chronic run support problem, it would show up by his recording an actual win-loss record that was substantially worse than his neutral win-loss record.

Pappas pitched for the Orioles from 1957 to 1965, inclusive.  During those years, he compiled a record of 110-74--his ERA was 3.24 over those years while the league fashioned a 3.76 composite mark.  Pappas' neutral W-L mark over that time was 102-82.  In other words, Pappas won eight more games during his time with the Orioles than one would expect, given the number of decisions he had, which stands the run support theory on its head.  Now, it's possible that Pappas suffered from a larger than expected number of non-decisions over that time, but since Pappas earned decisions in roughly 80% of his starts during those years--approximately equivalent to, if not higher than, the league and team average during that time frame, it's difficult to see any evidence in support of the claim that Pappas received less run support overall than expected.

One of the interesting things about the whole run support thing is that it really does appear to play out according to Hoyle...that is to say, it does seem to even out, as evidenced by neutral win-loss records, over a lengthy career.  There certainly are significant seasonal blips here and there--witness the Clemens example above--but over the course of many years, things tend to even out.

Take Clemens as an example again.  He was six wins better than expected in 2001.  But for his career, Clemens is 280-145.  His neutral record?  282-143.  He was eight games under for his career until last year; now he's two under.  That's pretty close.

Remember 1987?  Sure you do.  One thing that happened that year was Nolan Ryan's winning the National League ERA title as a member of the Houston Astros.  Nothing all that remarkable about that...but he posted a W-L record that year of 8-16.  His neutral W-L that season was 15-9, nearly the reverse of his actual mark and a substantial single-season difference of seven wins.  Ryan had a reputation as a guy who pitched better than his W-L record all through his career, but is it so?  His real life record was 324-292.  His neutral mark was 336-280.  So, for his career, Ryan was down 12 wins.  That's a fair number...but keep in mind that seven of them came in one season.  For the rest of his career, he was only down by five wins--not so large a number.

What about our buddy Milt Pappas, who finished his Orioles career eight wins down?  Pappas pitched eight seasons after he was traded away from Baltimore and posted a career mark of 209-164.  His neutral mark was 206-167, so he finished +3 wins over his lengthy career and actually made up 11 wins after moving over to the NL.

What follows is a list of the top 50 pitchers in terms of career wins, and how their neutral win totals compare.

RANK Player W NW DIFF % of W
1 Cy Young 511 533 -22 -4.3%
2 Walter Johnson 417 470 -53 -12.7%
T3 Grover Alexander 373 374 -1 -0.3%
T3 Christy Mathewson 373 361 12 3.2%
5 Warren Spahn 363 353 10 2.8%
6 Kid Nichols 361 373 -12 -3.3%
7 Pud Galvin 360 359 1 0.3%
8 Tim Keefe 341 346 -5 -1.5%
9 Steve Carlton 329 327 2 0.6%
10 John Clarkson 328 323 5 1.5%
11 Eddie Plank 326 311 15 4.6%
T12 Don Sutton 324 310 14 4.3%
T12 Nolan Ryan 324 336 -12 -3.7%
14 Phil Niekro 318 337 -19 -6.0%
15 Gaylord Perry 314 336 -22 -7.0%
16 Tom Seaver 311 312 -1 -0.3%
T17 Old Hoss Radbourn 309 300 9 2.9%
T17 Mickey Welch 309 291 18 5.8%
T19 Lefty Grove 300 298 2 0.7%
T19 Early Wynn 300 288 12 4.0%
21 Tommy John 288 284 4 1.4%
22 Bert Blyleven 287 313 -26 -9.1%
23 Robin Roberts 286 296 -10 -3.5%
T24 Tony Mullane 284 289 -5 -1.8%
T24 Ferguson Jenkins 284 291 -7 -2.5%
26 Jim Kaat 283 282 1 0.4%
27 Roger Clemens 280 282 -2 -0.7%
28 Red Ruffing 273 273 0 0.0%
29 Burleigh Grimes 270 259 11 4.1%
30 Jim Palmer 268 251 17 6.3%
T31 Bob Feller 266 252 14 5.3%
T31 Eppa Rixey 266 291 -25 -9.4%
33 Jim McCormick 265 275 -10 -3.8%
34 Gus Weyhing 264 262 2 0.8%
35 Ted Lyons 260 284 -24 -9.2%
36 Greg Maddux 257 268 -11 -4.3%
T37 Red Faber 254 269 -15 -5.9%
T37 Jack Morris 254 232 22 8.7%
39 Carl Hubbell 253 253 0 0.0%
40 Bob Gibson 251 264 -13 -5.2%
41 Vic Willis 249 258 -9 -3.6%
42 Jack Quinn 247 265 -18 -7.3%
43 Joe McGinnity 246 230 16 6.5%
T44 Amos Rusie 245 257 -12 -4.9%
T44 Jack Powell 245 265 -20 -8.2%
46 Juan Marichal 243 228 15 6.2%
47 Dennis Martinez 241 229 12 5.0%
T48 Herb Pennock 240 212 28 11.7%
T48 Frank Tanana 240 252 -12 -5.0%
50 Three Finger Brown 239 244 -5 -2.1%

Note how clustered these pitchers are around their real-life totals in a more or less normal manner.  The biggest percentage differences--indeed the only two of 50 on the list to reach double digits--are Walter Johnson, who comes in a staggering 53 wins (12.7%) below his neutral total, and Herb Pennock who finishes 28 wins (11.7%) on the upside.  These are the outlying cases--something one expects in virtually every sample, even non-random ones like that above.

The implication is that while there is a fair amount of season-to-season fluctuation in wins/losses from expected totals, it appears that the career won-loss marks are usually a reasonably accurate reflection of a pitcher's ability.  Perhaps I should rethink that rant at the beginning of this article.   Nah. <g>

As a postscript, here's the Top 50 list for the Orioles (1954 to the present):

RANK Player W NW DIFF % of W
1 Jim Palmer 268 251 17 6.3%
2 Dave McNally 181 158 23 12.7%
3 Mike Mussina 147 141 6 4.1%
4 Mike Cuellar 143 125 18 12.6%
5 Mike Flanagan 141 130 11 7.8%
6 Scott McGregor 138 121 17 12.3%
7 Milt Pappas 110 102 8 7.3%
8 Dennis Martinez 108 94 14 13.0%
9 Steve Barber 95 95 0 0.0%
10 Mike Boddicker 79 84 -5 -6.3%
11 Scott Erickson 74 67 7 9.5%
12 Dick Hall 65 59 6 9.2%
13 Hal Brown 62 58 4 6.5%
14 Storm Davis 61 57 4 6.6%
15 Ben McDonald 58 62 -4 -6.9%
16 Tippy Martinez 52 52 0 0.0%
T17 Ross Grimsley 51 43 8 15.7%
T17 Sammy Stewart 51 53 -2 -3.9%
19 Tom Phoebus 50 48 2 4.0%
20 Chuck Estrada 48 45 3 6.3%
21 Mark Williamson 46 42 4 8.7%
22 Wally Bunker 44 36 8 18.2%
T23 Hoyt Wilhelm 43 57 -14 -32.6%
T23 Arthur Rhodes 43 41 2 4.7%
25 Robin Roberts 42 44 -2 -4.8%
26 Steve Stone 40 32 8 20.0%
T27 Stu Miller 38 50 -12 -31.6%
T27 Jim Hardin 38 37 1 2.6%
T29 Bob Milacki 37 35 2 5.4%
T29 Eddie Watt 37 41 -4 -10.8%
T31 Pat Dobson 36 35 1 2.8%
T31 Jeff Ballard 36 37 -1 -2.8%
33 Doyle Alexander 35 36 -1 -2.9%
34 Sidney Ponson 34 36 -2 -5.9%
35 Ray Moore 33 29 4 12.1%
36 Alan Mills 32 29 3 9.4%
T37 Jack Fisher 30 34 -4 -13.3%
T37 Pete Richert 30 32 -2 -6.7%
T39 Connie Johnson 29 31 -2 -6.9%
T39 Billy O'Dell 29 37 -8 -27.6%
T39 Rudy May 29 24 5 17.2%
42 Dave Schmidt 28 24 4 14.3%
43 Wayne Garland 27 25 2 7.4%
T44 Rick Sutcliffe 26 22 4 15.4%
T44 Ken Dixon 26 24 2 7.7%
46 Jamie Moyer 25 25 0 0.0%
47 Grant Jackson 24 21 3 12.5%
48 George Zuverink 23 23 0 0.0%
T49 Jimmy Key 22 21 1 4.5%
T49 Jose Mercedes 22 21 1 4.5%

Just one note on the above table...take note of the fact that this is not a normally distributed sample.  Only 14 of the players on the list are on the minus side, and the top eight players are all on the plus side, four of them by double-digit percentages.  This is at least partly function of how good the Orioles were--offensively--during the careers of most of the above pitchers.  The run support that these pitchers received, relative to the league, most of the time gave them a larger than average margin for error.  One of the great myths of Orioles history is how the team based its success on pitching and defense, with relatively little attention paid to offense.  That's simply not true--the Orioles were routinely among the upper half of the American League in runs scored from the mid-1960s into the 1980s.