Our Inspiration

Saturday, April 27, 2013

Ichiro's Batting Stats: The April Edition

Ichiro Suzuki has hit poorly so far this season, with a slash line of .243/.289/.314. There's even been some media speculation that this means the Yankees might bench Ichiro in favor of Vernon Wells when Granderson returns.

How reasonable is it to pass judgement on Ichiro after 77 PAs in 22 games? This is a rhetorical question, as it's obviously unreasonable. However, for fun let's use the binomial distribution to estimate the probability that Ichiro's performance to date indicates his true talent level.

To use the binomial distribution, we need to estimate Ichiro's likely talent level. If we look at his Marcel projections, Ichiro's projected batting stats are as follows:
  • BA: .273
  • OBP: .311
  • XBH/AB (Extra-base hits per AB)*: .056
It's doubtful that the Yankees would have given Ichiro a 2-year deal if these rather anemic numbers represented the front office's estimation of Ichiro's expected performance. Since the Yankees don't broadcast their performance projections, we'll have to make an educated guess. To do so, I am going to do is use Ichiro's 2001 season stats; since he's now a few years older, I'll reduce those numbers by 5%, meaing we will use the following:
  • BA: .300
  • OBP: .341
  • XBH/AB*: .054
Here are the binomial probabilities that Ichiro's poor performance represents his actual skill levels.

Projection TypeBAOBPXBH/AB
Marcels34%37%44%
2010-based estimate18%18%47%

Not that this is surprising, but the probabilities indicate it is unlikely that Ichiro's first 77 PAs of the season reflect how he will perform in 2013. Moreso if you assume the Yankees' projections of his performance significantly exceed Marcels'.

Let's look at one other set of numbers, which are the expected range of these statistics after 77 PAs. The following table shows the range of values that the binomial distribution would expect, with 90% confidence, after 77 PAs:

Projection TypeBAOBPXBH/AB
Marcels.186 - .357.221 - .403.014 - .100
2010-based estimate.214 - .386.260 - .429.014 - .100

In other words, after 77 PAs a player's batting statistics could be either those of a superstar or those below a replacement-level player.


* To gauge Ichiro's projected power, I used extra-base hits per AB (XBH/AB) instead of slugging percentage because the latter cannot be used to estimate probability. This is because the binomial distribution requires that each trial have only 1 of 2 outcomes, and slugging percentage allows for 5 outcomes: out, single, double, triple, homer. So for today's purpose we'll just look at the probability that Ichiro gets an extra-base hit or not in HIS at-bats.

Monday, April 8, 2013

Rob Thomson's Reckless Tendency

On Saturday, Yankee third-base coach helped kill a rally. With a runner on 3rd and no outs, Francisco Cervelli lofted a shallow fly ball to Torii Hunter. Thomson sent the runner, who was thrown out - and by a mile, too!

Baseball conventional wisdom is that you should not make the first out of an inning at home. Statistics back this up.

Let's look at the impact of Thomson's coaching decision. According to the Run Expectancy Matrix published by Tom Tango:
  • Teams with a runner on third and one out on average score .989 runs in the rest of the inning, with a 67.4% chance of scoring at least one run. 
  • Conversely, a team with no one on base and 2 outs on average scores .112 runs in the rest of the inning, with a 7.5% chance of scoring at least one run. 
In other words, Thomson's decision reduced the Yankees' expected runs in the inning by .877 runs, and reduced the likelihood of scoring at least one run by 60%.
This unfortunately is hardly the first time Rob Thomson has so harmed the Yankees' chances to score runs. According to the Retrosheet play-by-play data, between 2009 (when Rob Thomson because the 3rd-base coach) and 2012, the Yankees have had 6 instances where a runner has been thrown out at home with zero outs. This is the 3rd worse such record in the major leagues - between 2009-2012, only the Red Sox and the Twins have had more instances of such bonehead coaching decisions.

There really is no excuse for this. Rob Thomson's standard response to such criticism is that he is aggressive. However, there is a difference between being aggressive and being reckless; and Thomson's tendencies here are merely reckless.

If you are interested, the following table lists the number of times each team has had a runner thrown out at home with no outs, ordered from worst to best:

TeamTimes Out at
Home, No Outs,
2009-2012
BOS9
MIN7
BAL6
NYA6
HOU6
KCA5
WAS4
OAK4
ARI4
LAN4
ANA4
SEA3
NYN3
CLE3
CHN3
SFN3
PIT3
TBA2
FLO2
SDN2
SLN2
MIL2
DET2
CIN2
TOR1
COL1
CHA1
TEX1
ATL1
PHI0

Sunday, March 31, 2013

2013 AL East Projections: Are the Yankees the Favorites To Win the AL East?

Using the Marcels projection systems, I generated some team projections for the AL East.

First, a few comments on how I went about this, which I mention so that you can form your own judgement on the merits or pitfalls of this approach:

  • To get to these team numbers, I started with the projected stats for players on each team most likely to play significant roles; these players were selected primarily by using MLB's depth charts. I then adjusted each team's totals to a full-season's worth of plate appearances and innings pitched - details are at the bottom of this post, below the fold.
  • The won-lost records were calculated using the the Pythagorean expectation formula. For runs scored, I used the team total batting runs. For runs allowed, I used the team total of projected runs.
  • The won-lost records should be used only to compare the AL East rivals; they are not intended for predicting actual wins. To do the latter properly, I would have had to total up the won-lost records of the entire league, and then make some adjustment to make sure the league total win-losses were equal. (Since I was primarily interested in gauging the Yankees' chances of winning the decision, and since the weather has been awesome this weekend I did not go to that length.)
Here are the projected AL East standings:

TeamWLRRAWinning %
Yankees9270770675.565
Blue Jays9072780695.557
Red Sox9072800716.555
Rays8874727663.546
Orioles8280734723.508

The projections indicate what I'm sure we all thought: a tight race in which any of the 5 teams has a realistic chance of winning. That being said, for what it is worth, the Marcels projections make the Yankees a slight favorite.

That's nice to see. However, let's not pretend this is the "championship caliber" team that Hal Steinbrenner claimed he's built. The Yankees could certainly win it all with this team; the pitching especially should give us hope as Yankee fans. However, they are not one of the elite teams going into the season, which is what "championship caliber" means from the perspective of evaluating the front office's efforts. The team has a lack of depth which may make it harder to withstand injuries than Yankee teams of the recent past. That is why the Yankees go into the season with something like a 22-25% chance to win the AL East; a championship calibre team would have better odds than that.

For a basis of comparison, I put together team projections for the 2 elite teams in the AL, the Tigers and the Angels. As you can see, they project as significantly better teams:

TeamWLRRAWinning %
Angels9963796634.612
Tigers9567799672.586

The glass half empty perspective is that the Yankees project to be worse than last year's team. The glass half full perspective is that they still project to be competitive for the AL East.

Finally, here are the Marcels projections for your 2013 Yankees.

First, the hitters:

HitterAgePAABH2B3BHRBBSOIBBHBPSHSFSBCSBAOBPSLGwOBABatting RunsReliability
Granderson, Curtis3261153313020634651582625144.244.330.495.3528887%
Cano, Robinson30617560169393274689970452.302.360.527.36910087%
Teixeira, Mark33530459114241265693280731.248.336.475.3467486%
Youkilis, Kevin3450643511023318551022120431.253.350.444.3436983%
Wells, Vernon3438435785172172362120252.238.286.440.3104582%
Boesch, Brennan28499457118242153499250373.258.315.418.3165983%
Hafner, Travis3636832282161133677480221.255.342.432.3314878%
Jeter, Derek396315761632621144931533124.283.338.392.3197487%
Rodriguez, Alex*3725422559111102452130341.262.339.453.3423584%
Nix, Jayson303162846515192277145163.229.293.384.2893269%
Francisco, Ben313332997417182764131352.247.313.391.3063770%
Overbay, Lyle363092776617182969211121.238.312.394.3033478%
Suzuki, Ichiro39604567155234730717223286.273.311.365.2936487%
Nunez, Eduardo2628425870132620391123164.271.323.407.3193563%
Stewart, Chris312972676314162345223231.236.299.363.2842956%
Gardner, Brett2927724064103430501331195.267.354.383.3293578%
Cervelli, Francisco27215189508141938132231.265.338.381.3122456%

* I prorated A-Rod's Marcels' projections by 50%, since we know he will not play until at least mid-season.

Here are the pitching projections:
PitcherAgeIPGGSHRERHRBBSOHBPIBBERAFIPWHIPHR/9K/9BB/9K/BB Reliability
Kuroda, Hiroki38190292918682752349147643.554.001.2371.17.02.33.0081%
Hughes, Philip27161302816084782546136514.364.431.281.47.62.62.9676%
Sabathia, C.C.32184262617478691949175633.383.461.2120.98.62.43.5782%
Nova, Ivan26161272616683772153131734.304.301.361.27.33.02.4776%
Pettitte, Andy41981616944339123182213.584.071.2761.17.52.82.6455%
Pineda, Michael2477131365353392477213.863.661.161.19.02.83.2154%
Phelps, David26872910763735113179423.624.131.231.18.23.22.5551%
Robertson, David286266053222152473223.053.021.2420.710.63.53.0457%
Logan, Boone285784053262462258233.793.611.3160.99.23.52.6451%
Eppley, Cody274961048232251739124.043.881.3270.97.23.12.2936%
Kelley, Shawn294848046221961643133.563.881.2921.18.13.02.6939%
Rapada, Clay324685042222051841113.913.971.304183.52.2835%
Chamberlain, Joba273838037181751235214.034.061.2891.28.32.82.9240%
Rivera, Mariano43353703214133930113.343.341.1710.87.72.33.3340%

Play Ball!


Friday, March 29, 2013

Yankees are Worth $2.3 Billion: The Jayson Nix Edition

Forbes Magazine released its annual valuation of MLB teams. The Yankees are worth $2.3 billion.

$2.3 billion. Clearly the classic Yankee business model of going all in - started by Jacob Ruppert and revived by George Steinbrenner - pays off.

Hopefully Hal give this some thought and stop running the team as a cheap nitwit. It should be a sobering thought to the Yankees' CEO that the Opening Day lineup against lefty Jon Lester is likely to be something like this:

  1. Brett Gardner
  2. Ichiro Suzuki
  3. Robinson Cano
  4. Kevin Youkilis
  5. Vernon Wells
  6. Eduardo Nunez
  7. Ben Francisco
  8. Chris Stewart
  9. Jayson Nix
My first thought when pondered this lineup was: "Wow. Hopefully the pitching lives up to its potential."

And you can't blame the injuries for the number of replacement-level and near-replacement players in this lineup. The lineup is this thin only because Hal put a lock on the petty cash drawer.

$2.3 billion dollars - that's a pretty nice return on the $8.8 million George Steinbrenner paid for the Yankees back in 1973. If the Yankee owners had run the team in Hal's penny-wise and pound-foolish fashion, the Yankees would be worth 1/3 of that. Just look at the Mets and do the math. The Mets have the same market as the Yankees, but are worth a mere $811 million - approximately 1/3 the worth of the Yankees.

Sunday, March 24, 2013

WTF Did the Yankees Get Vernon Wells?


This is reminding me of a certain type of test question that you get as a kid. For example:


    Which one of the following baseball players does not belong in the same group as the other 2:
  • Josh Hamilton
  • Nick Swisher
  • Vernon Wells
Pondering that one brought to mind another such question:

  Which one of the following people does not belong in the same group as the other 2:
But since hope springs eternal, the one good thing I can say is that Wells has been truly horrible in only 4 of the last 6 seasons. Oddly, he was very good in the other 2 seasons - 2008 and 2010.

Saturday, March 16, 2013

Dan Johnson, Projections, and Filling In for Mark Teixeira

There's a good chance that Dan Johnson will fill in for Mark Teixeira until Teixeira's wrist heals. So I thought it would be fun to compare the 2 players' batting projections, in order to estimate how much the injury might harm the Yankees in terms of runs scored.

This thought-experiment assumes that Teixeira will miss 45 games - about 28% of the season - and so we'll prorate their projected numbers to 190 PAs.

Prorating to 190 PAs assumes Johnson will play regularly in this period. If he makes the team he may just platoon. But since Johnson's career splits indicate no difference in his stats against righties vs. lefties, we'll keep this comparison simple by assuming he plays regularly.

We'll look at prorated projections from 2 systems - Marcels and Cairo - with the projected numbers prorated to 190 PAs. The reason for looking at both systems is that Marcels only uses major league statistics from recent years, while Cairo also incorporates minor-league statistics. As you'll see, this makes a difference for Johnson.

First, here are the Marcels projections:

PlayerPAABH2B3BHRBBBAOBPSLGwOBABatting RunsReliability
Teixeira1901654190920.248.337.467.3432686%
Johnson1901653771721.224.317.406.3122244%

Marcels projects Johnson as being significantly worse than Tex, as would be expected. In terms of a quarter-season, if we apply the sabermetric convention of 10 runs correlating to 1 win, the 4-run difference in batting runs has Teixeira contributing an extra .4 wins. So Marcels evaluates the injury to Teixera as being significant, but not so much that it risks ruining the season.

One thing to note before moving onto Cairo is the reliabilty column. In Marcels, the reliability percentage describes how much of the player's projection was based on his actual stats, versus the regression-to-the-mean component. This means that Teixeira's projections are significantly more likely to be accurate.

Now here are the prorated Cairo projections:

PlayerPAABH2B3BHRBBBAOBPSLGwOBABatting Runs
Teixeira1901644190921.250.342.470.34627
Johnson1901623970825.241.347.432.33825

Cairo is much more favorable to Dan Johnson. This is because it incorporates his minor league stats, which have been pretty good. Cairo thus has Teixeira as only marginally better than Johnson over 190 PAs.

As a fan, the differences in the 2 projection systems make Dan Johnson a more interesting player to me. The way I look at it is that Marcels rates Johnson the way GMs' decisions have rated him - as a marginal AAAA player. Cairo however rates him as an underrated bargain - a player whose skills GMs have been wrong about. If Johnson gets a chance, it'll be interesting to see which of these evaluations is closer to the mark. 190 PAs won't prove anything; but if Johnson hits similarly to what Cairo projects, he'll get an opportunity for more PAs.



Monday, March 4, 2013

Age and the 2013 Yankees: Historical Perspective

There's been a lot of talk about the Yankees' age. When I ran Marcels projections, the average ages of the 2013 Yankees were as follows:
  • Batters: 33.2
  • Pitchers: 30.6
The batters' collective age is relatively old; since 1996, only 4 teams had hitters averaging age 33 or older. One of them was last year's Yankees - they won 95 games and reached the ALCS.

Do these numbers mean anything?

One way to examine this question is to look at baseball history. For this exercise, we'll take all the teams since 1996, group them into ages, and compare the age groups' average winning percentages.

The following table shows the results for hitting. As you can see, the older teams actually do better than the younger teams:

Team Average Batter Ages and Winning Percentage (since 1996)
Age# of TeamsAvg. W. Pct.
32+15.540
3134.541
3079.528
29119.506
28132.491
2795.477
2624.461
Below 268.418

This surprised me - I excepted that the sweet spot would be teams averaging around 28-30.

Certainly I'm not suggesting that being older causes teams to be better; age is more nuanced than that. There have been plenty of mediocre old teams, such as the 1998 Orioles.

Rather, the point merely that being an older team does not mean the team will do poorly in a given season. This may seem obvious. But I mention this because there has been a fair amount of premature fork-sticking into the Yankees due to their age; these numbers provide a counter weight.

That being said, I am concerned about age and this year's team, because the team has lost depth. Starting players such as Nick Swisher and Russell Martin have not been well-replaced.  Travis Hafner's roster spot is devoted to a player who cannot play in the field. Invaluable bench player Eric Chavez was let go.

If you've got an aging team, it's wise to plan for needing to rest some players and to have perhaps more than your share of injuries. Hopefully players such as Corban Joseph, Ronnier Mustelier, Melky Mesa, Zoilo Almonte, etc., will exceed expectations; what scares me is the risk of players such as Jayson Nix, Dan Johnson, and Matt Diaz getting 400 ABs.