Jump to content
Twins Daily
  • Create Account
  • entries
    5
  • comments
    18
  • views
    9,706

Danny Santana: Projecting Regression and Estimating Value


Sam Morley

5,581 views

 Share

Twins Video

blog-0274815001415832917.jpgDanny Santana: Projecting Regression and Estimating Value

 

There has been occasional to frequent mention of the “fluke” seasonal performance of certain players; namely, Danny Santana and Eduardo Escobar- Brian Dozier in the not so distant past. Usually, the notion is raised within the contexts of position placement and possible trade packages. One recent thread in particular discusses, in depth, the idea of trading Santana for starting pitching help of varying degrees of quality.

 

http://twinsdaily.com/topic/15926-mets-need-a-ss-twins-are-a-viable-trade-partner/

 

When a young player, or a historically average player, performs better than expected it is certainly prudent to keep our expectations for future performance tempered. But, is it mutually wise to expect significant regression? To what extent is significant regression projectable? We have the tools of metrics and scouting reports available to help in such endeavors, but is their sum enough on which serious team and organizational decisions can be based?

 

First, within the contexts of starting lineup/player position and possible trade packaging, I think you have to acknowledge the value of the player based on his most recent season’s performance. What is the value of a SS who plays solid defense, and hits .275 with 35 doubles in 133 games? What is the value of a CF who plays solid, improving defense, runs plus, and hits .320 with 70 runs and 41extra base hits in 101 games? Obviously, the value is excellent. In combination with Brian Dozier at 2B, it might be the best up the middle value in the league (even without considering payroll). In the case of Santana, I would say that that offensive line in combination with his ability to play the two most premium defensive positions on the field, make him the most valuable player on the team. I would not trade him for any pitching prospect. In the case of Eduardo Escobar, his ability at SS, in combination with his offensive stat line, I think is good enough to stick at SS to open the season and keep Santana out of his natural position and in CF- although, this not happening would be far more palatable to me than trading Santana for a SP. Without getting into too many other factors and variables, I understand that Santana is probably the team’s SS of the future, and it would be perfectly reasonable for Molitor to want to start him at SS. After all, he can’t be a SS stop unless he plays SS.

 

But I have veered slightly off course, become slightly carried away at the notion Escobar and Santana will continue to rake so prodigiously. I have wandered from my own warning: prudence begs temperance.

 

.405

 

The BABIP of the MLB categorical leader, who is Danny Santana. Damn, that’s high. BABIP is probably the number one metric for projecting BA regression. BA rises and falls with BABIP and a BABIP this high is not realistically sustainable. Over the past three seasons, only four players have recorded a BABIP of .390 or higher, eleven at .380 and up, eighteen at .370 and up. Discounting his 24 games at AAA in 2014, Santana’s highest MiLB BABIP was .353- a high number, but not quite four hundo (there is a steady rise in both BABIP and BA from level to level for Santana). According to this logic, we can expect to see a drop in BA for Santana in 2015.

 

But while .405 might be exceptionally high, does it mean that Santana will not be able to maintain a more reasonably high BABIP, in the vicinity of, say, .350 plus? Over the past three seasons, on the list of the top 25 players in BABIP (.340-.400) are some regular names; some are exceptional: Mike Trout, Yasiel Puig, Joe Mauer, Andrew McCutchen, Paul Goldschmidt; some are not: Dexter Fowler, Chris Johnson, Starling Marte (with the exception of Chris Johnson, all of these players have a MiLB track record of hitting for high BABIP). So, hitting for a high BABIP can be done with some measure of expected consistency. Additionally, very few of the players on the lists were outliers- without previous seasons of high BABIP (Jhonny Peralta in a shortened 2013 and Jer Saltalamachiado also in 2013 are two examples).

 

Of course, the success of the aforementioned players bears little influence on that of Danny Santana. But there are additional batted ball statistics in support of Santana’s ability to maintain a high BABIP. The type of batted ball that results in the most hits is the line drive (this is actually an assumption, I haven’t looked it up). Santana’s LD% of 26 is excellent, 14th in baseball. His GB% of 45.9 puts him in the high middle, and his FB% of 28.1 puts him in the low middle. Fly balls that don’t result in home runs usually result in outs. Santana’s percentage of home runs to fly balls was on the low side, 8.5%, so the proportion of ground balls he hits to fly balls I think is appropriate for the type of hitter he is. I didn’t find average fly ball distances for Santana, but I think that would be helpful in projecting his future home run production- whether or not we might see more balls travel over the fence or more balls fall short for outs in 2015 (he’s listed at 5’11 160. 160 is pretty light for 5’11. It seems like he could easily put on 15 to 20 lbs without losing speed, and actually add some distance to those fly balls). The other significant stats here are his infield hit total/percentage and his bunt hit/percentage, and both are pretty good. He had 17 IFH with a percentage of 12.7 (the percent is relative to ground balls) and 9 bunt hits in 18 attempts. These, to me, seem like projectable statistics because of their relativity to a player’s speed (which is of course relative to his health).

 

Now, if I was a little savvier, or a little more motivated, I would figure out how to put Santana’s spray charts and heat maps from both sides of the plate into this piece right here:

 

http://www.fangraphs.com/zonegrid.aspx?playerid=10322&position=SS/OF

http://www.fangraphs.com/spraycharts.aspx?playerid=10322&position=SS/OF&type=battedball

 

The links will have to do. The spray charts illustrate his ability to hit line drives to all fields from both sides of the plate. The power is all pull, but relative to his ability to hit for high average it’s the line drives that count. The heat maps illustrate some susceptibility high in the zone, mostly as a right handed hitter, but overall, little weakness.

 

And here, check out his highlight video:

 

 

I don’t see anything mechanically to pick on. The swing is balanced and strong. There doesn’t seem to be any exploitable timing mechanism. His hands are still, save a slight movement back in the load, and they stay in to the point of contact, finishing high- evidence of that slight Ted Williams uppercut. I love to watch his feet the most. They also are quite still, starting wide in the stance and foregoing a stride but still generating power with a back leg load and forward weight transfer so smooth it’s almost unnoticeable. If you can freeze frame on the point of contact on one of his homers, it’s picture perfect. The homers are definitely not moon shots, but they’re not all wall scrapers either. My favorite of all of these are his steps out of the box. His swing finishes so balanced that he is very efficient coming out of the box. It’s almost identical whether he is hitting a GB single or a dinger- he plants his back foot and his first step is down the line toward first. I think this speaks to his infield hit totals as well as stretching singles into doubles. But he is not sacrificing any authority in the way that Nori Aoki does with that hideous running slap swing. Honestly, I got the notion to watch some film on Santana after I started writing this piece, and now that I have I’m even higher on him than I already was. Dude’s sharp; definitely more polished at the plate than I realized.

 

Granted, I just watched the good stuff. (Well, I did find one clip of a MiLB strikeout, but it didn’t unfound any of the goodness). But it looks like homey will hit, and he has too. Because if he doesn’t, he won’t be getting on base much, as his plate discipline is not so great. To begin with, his O-swing% (% of pitches swung at outside the strike zone) is 40.3, .1% behind our other friend, Eduardo Escobar, ranking them 11 and 12 in the bigs. Unfortunately, as is the case with other hitters who have a high O-swing%, neither Santana nor Escobar have a particularly high Z-swing% (% of pitches swung at in the strike zone) 63.1 for Santana and 67.4 for Escobar. I don’t know about you guys, but I just don’t have the historical standards of these advanced metrics locked into my brain yet (like BA, OPS, SLG, etc) so I have to look at lists and charts to compare. Anyway, having a high O-swing isn’t necessarily bad if you have a high O-contact to go along with it (i.e. Pablo Sandoval, Victor Martinez) but this is not really the case for Santana- his O-contact% was 65.9 which isn’t terrible either. Both his Z-contact and overall contact% end up fairly average (the pitch f/x numbers are slightly more favorable).

 

Optimistically, and somewhat reasonably, one would hope that as Santana gains experience, his plate discipline skills/numbers will improve and will translate to a BB% closer to league average and an even higher OBP%, which due to his BA is much higher than league average currently. I was about to suggest that with the success of some of the Twins hitters in the plate discipline department (Mauer, Dozier, and Plouffe) that some of that success might rub off on Santana. But then I considered the likes of Oswaldo Arcia and Kennys Vargas and had to laugh at the pretty dramatic (I’ll say coincidental) contrast between the approaches of our American hitters and our Latino hitters. I’m glad we have both. I wonder if/hope Brunansky is able to help individual hitters craft their own personal approach and style to be the best it can be, or if he has the tendency to force players into a single desired approach. It seems like in the past, the staff has been problematically forceful (not necessarily in temperament) with a philosophical bias toward things like patience, seeing pitches, and taking the ball the other way. I certainly think those are good things, but I also noticed that the two world series teams this season had lineups full of free swingers (although I think the team with the more balanced lineup-in these terms- won).

 

Aside- did anybody else hear Delmon Young’s quote after hitting that bases clearing double against Detroit? They were asking him about it after the game and he said he was trying to hit a grand slam. They were surprised and said something like, “You weren’t trying to stay within yourself and just get a base hit?” and he laughed and said, “No, man, every time I come in I just try to step up and hit the foul pole.” I think that was my favorite quote from the season. I love Delmon.

 

http://m.mlb.com/video/v36734763/delmon-young-joins-scott-braun-on-mlb-tonight

 

Haha, okay, so finally, I would like to come back around to Danny Santana’s projected value, both on the field for the Twins and as a possible trade piece. I don’t really understand the Steamer projections (maybe somebody has a great link to a site that explains it) particularly their estimations of games played, which are always really low. For 2015, it seems like they are not very kind to Daniel. They give him about 35 more games (138) without any uptick in counting stats and that BA comes down about 50 points. Ouch. Bastards. For no legitimate reason whatsoever, let’s just split the dif between last season and the steamer projection. Let’s say he plays 150 games, mostly at SS, with steadily improving D. Let’s say he slashes 290/330/420, scores 90, drives in 60, and steals 35. What is the value of that? Let’s say he’s better the next year. What about then?

 

In conclusion, I would like to see more research and writing done by some peeps more knowledgeable than me on this notion of the “fluke” season. What are some recent historical examples of players having “fluke” good seasons (Chris Davis?)? I probably have overlooked and/or am totally ignorant of other tools used in projecting regression, and it would be good to be educated. Although I am no authority, I do venture to assert that it is not wise to base serious team and/or organizational decisions on the estimation that a player’s recent success of significant sample size may have been a “fluke”. Further, I think that in the particular case of Danny Santana, while it makes sense to anticipate regression, it would be a terrible mistake to overestimate the amount and trade him on its account. I think that both his short and long term value is greater than 90% of starting pitchers.

 

-All the stats I pulled are from fangraphs.com.

 Share

11 Comments


Recommended Comments

I love Santana's approach at the plate but it is safe to project some regression. I think his offense at shortstop along with his range and arm will more than make up for the errors. Looking forward to watching him for many years.

Link to comment

Very nice write-up. Santana broke out last year and we can expect a bit of a sophomore slump, but he looks like a solid major leaguer. I'd like to see some reduction in strikeouts to make up for what almost certainly will be a lower BABIP. I'm also concerned that his RH numbers will regress more significantly than his LH stats.

Link to comment

Here's a comparable.  http://www.baseball-reference.com/players/s/smithdw01.shtml this guy hit very similar to Santana his rookie year.  Didn't do to well afterwards. 

 

I am a huge fan of Santana though and I actually hope he remains in the OF and Escobar keeps SS.  Santana could prove me wrong and reclaim SS but Escobar did just fine there.  Why fix what ain't broken?  Though there are lots of teams with starting pitchers looking for SS this offseason. 

Link to comment

One thing that I believe doesn't get enough attention in discussions like these is the age and playing experience of the player.  He will play next season at the age of 24 with one MLB season under his belt and AA prior to that.

 

There is a very real chance that he is still learning and improving as a player.  So, while the BABIP has to come down, there isn't any reason he can't get better in all facets of his game.

Link to comment

One thing that I believe doesn't get enough attention in discussions like these is the age and playing experience of the player.  He will play next season at the age of 24 with one MLB season under his belt and AA prior to that.

 

There is a very real chance that he is still learning and improving as a player.  So, while the BABIP has to come down, there isn't any reason he can't get better in all facets of his game.

 

You don't think this idea is given attention here? I think it's one of the main themes in the piece. Will Santana improve in enough in certain facets of his game to make up for the anticipated "regression" in others? In 100 games, 70 runs scored supported by a 350 OBP is nearing elite production. The OBP is supported by a 320 BA- definitely elite production- but when the BA is supported by an astronomic BABIP 405, you wonder whether he will improve his plate discipline, which he will need to do in order to maintain a high OBP and score an elite level run total. 

 

How much BABIP regression will be offset if he can improve on that 23% K rate?

Can he reduce his K rate? resulting in more balls in play and more on base/run scoring opportunity? Can he improve his BB rate (4.4%) enough to maintain that 350 OBP, making up for the inevitable drop in BABIP? I think the O-swing and Z-swing numbers are telling- he needs to swing at less pitches outside the zone and more pitches in the zone. 

 

Linus, I agree with you that Santana will likely get better next season, and I think his plate discipline is the facet of his game most primed for improvement. I think that this discussion and the piece focus so much on the BABIP is because of this:

 

Let's say that Santana actually does get better in all facets of his game, that even his hit tool- his ability to make consistent good contact- actually improves. This kind of improvement might statistically show up in something like line drive % (which at 26% is already quite good). BABIP is not something a player can 'improve'- it's basically a result of the combination of a hitter's ability to hit, the type of hitter the player is, and luck. So even if he strikes out less, putting the ball in play more, and even if a higher percentage of those extra balls he puts into play are line drives, his BABIP will still probably go down- because of luck. 

 

But again, Linus, I agree with you in that I think there is a reasonable likelyhood that Santana will make improvements. I'd hoped I was clearer about that in the piece. 

 

 

Santana broke out last year and we can expect a bit of a sophomore slump

The sophomore slump is an interesting phenomenon I didn't really talk about, made even more interesting considering Kennys Vargas as well. A sophomore slump is certainly different from a 'fluke' season. Pitching staffs make adjustments and Santana's season two success is probably tied to how he adjusts to those adjustments. Sounds like a whole new article topic to me. I will say that the heat maps from 2014 on both sides of the plate didn't seem to show much weakness, but I haven't looked at pitch specific heat maps yet either.

Link to comment

How much BABIP regression will be offset if he can improve on that 23% K rate?

Consider if Santana became league average for AL shortstops the last 5 seasons in BB%(6%) and K% (slightly less than 17%).  He would be striking out less, but walking more.  In a 400 AB season, it would result in about 16 more balls in play.  That would not be enough to offset regression.  On the other hand, if his 2013 AA season was indicative of what Santana could do, BA .297, K% 16, BB% 4.1 his BABIP was .353 Less strikouts, but also less walks. Thus it ofsets regression of his BABIP if while stiking out less, he also gets less bases on balls

Link to comment

What we are ultimately concerned with here is on base percentage, and whether or not we will see a regression in that and to what extent. The OBP is supported by batting average (which is tied to BABIP) plus BB%. So if Santana's BA drops along with his BABIP, can he add enough walks to make up for the lost hits and thus keep his OBP high enough to score lots of runs?

 

Consider if Santana became league average for AL shortstops the last 5 seasons in BB%(6%) and K% (slightly less than 17%).  He would be striking out less, but walking more.  In a 400 AB season, it would result in about 16 more balls in play.  That would not be enough to offset regression.  On the other hand, if his 2013 AA season was indicative of what Santana could do, BA .297, K% 16, BB% 4.1 his BABIP was .353 Less strikouts, but also less walks. Thus it ofsets regression of his BABIP if while stiking out less, he also gets less bases on balls

I want to understand you clearly here; in your first example, you are presuming a drop in BABIP, and that in terms of OBP the extra walks will not be enough to offset that drop. That would depend on the extent to which the BABIP drops, yes? So how much are you guessing, for that example his BABIP will drop? As for your second example, Santana's OBP that season was 333, 20 points lower than his 2014 MLB OBP, so I am confused about your assessment that this would be less regression. 

 

There are two things that I keep coming back to on this topic:

 

1) The factors resulting in BABIP, which I think are A) a hitter's ability to make contact B) the type of contact the hitter makes (LD, GB, FB, Bunt) C) the hitter's speed to first base and D) luck. With Santana, his contact rate isn't especially great (I assume contact rates include foul balls though- so maybe he hits fewer foul balls, and his rate of balls put in play to pitches swung at is actually higher than his contact rate- I couldn't find that statistic though) but he hits a high % of line drives and ground balls and bunts, and his speed to first base helps him turn a high % of GBs and bunts into hits. But he also must've been really really lucky. The way I see it, the luck factor is the biggest in terms of BABIP fluctuation. We have to assume that he will not be as lucky in 2015, but do people also think Santana's ability to make contact, good contact, will get worse? This could result from pitchers' adjustments, I suppose. Maybe that is a bigger factor than I am acknowledging. A hitter being less lucky is not really regression. So the regression that I am afraid of lies in this factor of his capacity to make adjustments to the adjustments pitchers make to him. Lastly, unless he gets hurt, he's not getting slower.

 

2) Even slight improvement in plate discipline and increase in BB rate will help offset a drop in BA in terms of OBP. I think that even if he doesn't make an intentional adjustment in his approach toward better swing selection, it will happen naturally with experience. It would be interesting to see his O/Z swing and O/Z contact rates from his time in the minor leagues, but they aren't available on fangraphs. Looking at other major league hitters for comparison doesn't yield much of anything conclusive. Some hitters are consistent in these categories from year to year (Pablo Sandoval, Jose Reyes). Some stayed consistent in O-swing but steadily increased their Z-swing (Carlos Gomez, Dexter Fowler). Few decrease their O-swing while also increasing their Z-swing (Trevor Plouffe is an example). So, who knows, but why not, in this case, be optimistic?

Link to comment

Apparently, I have entered a world in which a capital letter 'b' followed directly by a right parenthesi, ) automatically morphs into a yellow circle wearing sunglasses.

Link to comment
Guest
Add a comment...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

Loading...
×
×
  • Create New...