Jump to content
Twins Daily
  • Create Account

Gavin_Sanford

Provisional Member
  • Posts

    7
  • Joined

  • Last visited

Gavin_Sanford's Achievements

  1. I didn't think of BABIP that way but you bring up a good point. I checked a lot of the people who are high in the residuals and it was a mix of each. generally the players were around their career numbers and I consider myself pretty handy on the computer programming but pulling from all the individual pages is what I had to do for others so I am sure it'd be the same. I will definitively revisit the suggestions you made at some point. Thanks for the input as it has been very valuable in looking at other points and changes I can make.
  2. Thanks for the feedback. Another thing i was going to look into was batting average on balls in play. I was hoping this could account for some randomness in the model and variance that was unaccounted for. And do they have these numbers for 2017? I would totally be willing to add those and look at how sprint speed and flares and burners work together. Uploading the charts was honestly a mess for me so I went this route. Thanks for the feedback!
  3. Being a Twins fan, and a statistics major, there are always connections to be made between on the field performance and statistics for a player. This has never been more available as statcast has allowed for measuring exit velocities, launch angles, and many other statistics for both pitchers and hitters. I wanted to look into which of these statistics has the greatest influence on Weighted Runs Created Plus. This is an offensive statistic which tries to credit each hit and situation for its true value. It measures hitters against each other and is able to be compared across years. It does this by including park effects and a season’s offensive comparison to others. In my research this is the dependent variable and is what analysis is done on. The variables originally included in the study are a long list. They are Average exit velocity, Strike-out percentage, Walk Percentage, Sprint Speed, Balls hit ninety five plus miles per hour, barrels per plate appearance, line drive percentage, fly ball percentage, ground ball percentage, pull percentage, center percentage, opposite field percentage, and average launch angle. All of these were included as they describe a makeup of a hitters profile in direction, launch angle, velocity and a player’s speed. Barrels are balls that have a certain launch angle and exit velocity that leads to a high batting average and slugging percentage on similarly hit balls. Obviously, all of these are not going to be important to the model. The first thing I did was test for multicolinearity between variables. Variables that were multicolinear include pull percentage, opposite field percentage, and center percentage and this is intuitively pretty to understand. These are percentages that add up to one so they clearly affect each other and the value that the others become. The same is said about the multicolinearity between line drive percentage, ground ball percentage, and fly ball percentage. Focusing on variables I deemed possibly most important, I Looked at multicolinearity on average exit velocity, barrels per plate appearance, average launch angle and line drive percentage because these all conceptually want to quantify the same thing. There was no multicolinearity. Also in variable manipulation, I took sprint speed, pull percentage, and balls hit ninety five plus miles per hour and made them discrete variables with a value of 0, 1, or 2. This was to quantify around the top 10 % in each category based on the distribution they followed. In selecting the final variables for a model I looked at proc glm in the programming language SAS. I did a forward and backward model selection and forced into the model average exit velocity and launch angle because these were not considered significant in model selection but we know they are in research. The reason glm was used instead of proc reg is because it gives extra attention to the discrete created variables and defines them as discrete instead of continuous which is what would be done with proc reg. I will touch on this later but essentially exit velocity and launch angle were extremely high in consideration for the model until barrels per plate appearance was entered. There was no multicolinearity between the two but it is interesting how average exit velocity and average launch angle do not get entered when barrels per plate appearance is added. This leads to the final model of: wRC+ = β1 + β2aev + β3ala + β4bbp + β5 nss + β6nnfp + β7bbpa + β8ldp + β9kp + u The following regression models are compared, looking at significance of variables and the overall models. The first model includes barrels per plate appearance while the second has that variable removed to look at interaction of the other variables within the model. * Means significant at 95% **means significant at 99% Parameter | Estimate | Standard Error | T-Value | Pr > |t| | Intercept | 66.1834 | 47.434822 | 1.4 | 0.1639 | AEV | -0.02094 | 0.55293177 |-0.04 | 0.9698 | Ala |-0.13735 | 0.21858149 | -0.63 | 0.5302 | Bbp** | 227.2026 | 27.9982043 | 8.11 | <.0001 | Kp** | -174.353 | 17.3155471 | -10.07 | <.0001 | nss 0** | -11.3559 | 3.13939906 | -3.62 | 0.0003 | nss 1** | -7.22713 | 2.58785107 | -2.79 | 0.0055 | nss 2 Is the baseline of the equation nnfp 0* |-9.880559 | 4.05187092 | -2.44 | 0.0153 | nnfp 1 | -0.36816 | 2.70815033 | -0.14 | 0.8919 | nnfp 2 Is the baseline of the equation Bppa** | 7.184027 | 0.64032719 | 11.22 | <.0001| Ldp** |158.30838| 27.2845322 | 5.8 | <.0001 | The above model has an F value of 57.18 vs. Parameter | Estimate | Standard Error | T-value | Pr > |t| | Intercept** | -226.969 | 46.59810 | -4.87 | <.0001 | AEV** | 3.55070 | 0.53212884 | 6.67 |<.0001 | Ala* | 0.61035 | 0.24502777 | 2.49 | 0.0132 | Bbp** | 250.163 | 32.86669476 | 7.61 | <.0001 | Kp** |-95.6383 | 18.63300806 | -5.13 | <.0001 | nss 0* | -9.3594 | 3.68923955 | -2.54 | 0.0117 | nss 1 | -5.67298 | 3.04162271 | -1.87 | 0.0631 | nss 2 Is the baseline of the equation nnfp 0** | -17.7382 | 4.69740546 | -3.78 | 0.0002 | nnfp 1 |-5.008342 | 3.15019426 | -1.59 | 0.1128 | nnfp 2 Is the baseline for the equation Ldp** |107.2872 | 31.66561379 | 3.39 | 0.0008 | The above model has an F value of 35.75 From the above models we see that including barrels per plate appearance is more efficient based on the F-values below for each model. We see in each model that walk percentage, strikeout percentage, and line drive percentage are statistically significant at 99%. In the first model, both sprint speeds are statistically significant at 99% meaning the faster you are the higher your wRC+ should be. In the second model only the nss0 which is the discrete variable for the slowest people in the category is significant and the negative coefficient on the parameter shows us the effect it has on the model. Balls hit ninety five miles per hour plus is significant only to those who have the least as well or around the bottom 10 % in both models in the first model at 95% and the second 99%. The first model also has barrels per plate appearance which is statistically significant at 99%. The value barrels contribute is immense to the model and we see average launch angle and average exit velocity as insignificant. In model two, without the barrels per plate appearance we see average launch angle and average exit velocity become statistically significant; exit velocity at 99% and launch angle at 95%. We see from above that strike-out percentage negatively effects wRC+. We see that balls hit ninety five miles per hour plus,sprint speed, walk percentage, and line drive percentage have a positive effect on wRC+. Barrels per plate appearance is the most important in deciding wRC+, but without this an increase in average exit velocity and average launch angle are valuable. The fly ball revolution has led to more players having success in the air and altering swing paths to lead to more fly balls. As we see from the negative values in average launch angle in model one and a positive in model two, it is good to get a general increase in launch angle but the value is more hitting the ball hard and "barreling" it on these or increases will lead to more fly outs and less grounders that may make it through the holes. Pairing the said analysis with predictions, there are players who based on the model including barrels had very large differences in their wRC+ and the predicted value for it. This means some players got "lucky" and had better results than expected and some played worse. These players should have bounce back years in 2018 if they can replicate what they did in 2017. Name | wRC+ | Resid | Predicted value | Studentized_residual D.J. Lamehieu | 94 | -58 | 152 | -4.04 Miguel Cabrera | 91 | -47 | 138 | -3.2 Mitch Moreland | 98 | -35 | 133 | -2.33 Dansby Swanson |66 | -34 |100 | -2.33 Brandon Moss | 84 | -34 |118 | -2.28 Alex Gordon |62 |-31 |93 | -2.1 Austin Romine |49 | -30 | 79 | -2.05 Maikel Franco | 76 | -30 | 106 | -2.04 Chris Herrman |58 | -29 | 87 |-2.01 Taylor Motter |57 |-30 | 87 | -1.99 Hyun Soo Kim |61 | -28 | 89 | -1.94 Pablo Sandoval |64 | -29 | 93 |-1.94 J.J. Hardy |50 | -28 | 78 | -1.9 Randal Grichuk |94 | -27 | 121 | -1.85 Tony Walters | 49 | -27 | 76 | -1.81 There are a handful of interesting names on this list. D.J. Lamehieu increasing his wRC+ is extreme but him being a bounce back player has already showed in a hot start so far this year. Maybe it continues. Miguel Cabrera is coming off his worst season since entering the league and numbers say he should have been better. Others on this list like Randal Grichuk are intriguing as he was traded and hadn't lived up to his potential. Can he catch up to his numbers? An interesting look at the numbers here show J.J. Hardy as a candidate to have a bounce back year but that would still be to be 22 % below average. For those wondering if he was an option at short to play for Polanco during the suspension, this could be why.Now for a list of players who could slow down. (Caution:Good Players on this List, and No I don't think they will regress, this much anyway) Name | wRC+ | Residual | Predicted Value | Studentized-Residual Jose Altuve | 160 |38 | 122 | 2.56 Mike Trout | 181 | 35 | 146 | 2.42 Mitch Haniger | 129 | 36 | 93 | 2.41 Marwin Gonzalez| 144 | 35 | 109 | 2.38 Zack Cozart |141 |32 | 109 | 2.12 Austin Jackson |131 | 31 | 100 | 2.08 Jose Rameriz |148 |31 |117 | 2.07 Eduardo Nunez |112 | 29 | 83 | 1.96 Marcell Ozuna |142 |27 | 115 | 1.85 Scooter Gennett |124 | 27 | 97 |1.84 George Springer |150 | 27 |123 |1.8 Like I said, good players. Mike Trout and Jose Altuve top the list but some players will always outplay projections. The interesting names here are those who had predicted values around average and were far batter. These are difference makes like Marwin Gonzalez, Austin Jackson, and George Springer that help push a team over the top and far into the playoffs. They also help a team on the fringe get in. Now what does this mean for the Twins? Good news is no Twins were on either lists, meaning the offense that was so explosive last year performed near expected as no players extremely over performed and no one under performed hurting the offense. A list of Twins players below shows a prediction if output is close to last year of what could happen. Name | wRC+ | Residual | Predicted Value Joe Mauer | 116 | -4.9345 | 121 Kennys Vargas | 98 | 14.2709 | 84 Logan Morrison | 130 | -.2185 | 130 Miguel Sano | 124 | 11.5868 | 112 Brian Dozier | 124 | 7.8157 | 116 Jorge Polanco | 89 | -9.8506 | 98 Eduardo Escobar | 96 | -16.9470 | 113 Max Kepler | 92 | 3.9207 | 88 Robbie Grossman | 102 | -9.9625 |112 Byron Buxton | 90 | .6321 | 89.388 Eddie Rosario | 116 | 6.8566 | 109 We see some players fluctuate around their number and some go up or down around 10 % No drastic changes should be expected from this roster of young players. We could see an increase in plate discipline with age which could lead to an increase and could be something worth watching. Though Logan Morrison got off to a bad start, looking at his numbers from last year, if he replicates this output from last year the Twins will have gotten a steal. Another thing to look at is Joe Mauer, though he has aged and has a huge contract expiring, he still has a lot of value in getting on base and that is hard to get at the rate Joe does. This could be an argument to pay him and keep him around.
  4. As most of us know, April in Minnesota can be brutal. Snow, sleet, and rain accompanied by cold temperatures are always a possibility early in the season. The old saying April showers bring May flowers is thrown around everywhere, but it especially rings true in Minnesota. With the Twins having a 14 game home stand beginning the first week in April, and having 18 games total for the month, weather becomes a huge factor for the Twins’ attendance early in the year. Even the most die-hard baseball fans find it difficult to sit outside at a game when it is cold and raining. This makes me wonder why so many northern teams get scheduled to have early home games in the first place, when it is known that weather affects attendance, but that isn’t my job so I won’t try to tackle it. April weather is definitely detrimental to the Twins’ attendance numbers, but it’s also going to hurt to lose All-Star and Cy Young vote-getter Ervin Santana, whom is still recovering from a right middle finger injury and will be out until at least May after having surgery. Missing him for a month and the weather could potentially make people wait until summer nights to come enjoy a game, but I say the time to make it to a game is now. There are a ton of reasons to be excited about the Twins and to make it to Target Field in April; I could go on forever. But to save us all time, here are the top 10 reasons why you should get your tickets and go see a game. 1. Home Opener, the Giveaway, and the Future The first home game of the year is always a big deal. It means spring is in full swing and baseball is back. It means there are 78 more home games to attend and good times to be had at the ballpark. The reason there is not 80 games this year is because there are 2 games in Puerto Rico versus the Indians that are home games for the Twins. The giveaway for opening day is always good, but the sweatshirt this year is awesome. Even if there’s a snow storm, we should all be willing to weather it on Opening Day to get these sweet sweatshirts. Another thing that can’t be over looked is Jose Berrios’s first Opening Day start. The guy flat out has some nasty stuff in his pitching arsenal and has showed moments of dominance that should make us all excited. He has been a Twins top prospect and could be the ace for years to come. May 18th of last year he went 7 and 2/3 innings with 11 K’s, a walk, and 2 hits against a talented Rockies lineup that was 3rd in runs scored. This is the kind of player the Twins want to see and that will get fans to show up every time he pitches. He is a must-watch in the month of April and hopefully, for months to come. 2. The World Series Champions are coming to Town Every baseball fan can agree; it is always a big deal to see the reigning World Series Champs play. The Astros led the league in runs scored with 5.5 runs a game last year, and with offense this good and the AL MVP on your team in Jose Altuve, you’re not going to want to miss watching them in action. The Astros’ offense is good enough that perhaps they could slide by with mediocre pitching, but the Astros have great pitching too. Verlander, McCullers, and Kuechel could put on a quite a show for us, and the Twins are slated to get the number one pitcher in game one. Regardless of how early it is, the only chance to see Houston at Target Field barring a playoff series is early on, April 9th-11th, so don’t miss it. 3. Seattle and the Potential Hall of Famers Ichiro, who will turn 45 in October, is the definition of a living legend. Imagine how much more he would have accomplished in the major leagues if he hadn’t spent as long in Japan. He has 3,080 hits in the Big leagues, and he could have had even more if he had come to America earlier. Who knows how often he will be in the lineup or how often he will play, but this could be the last time he comes to Minnesota as a professional baseball player. I’m confident that he will be enshrined into the Baseball Hall of Fame in Cooperstown, and besides this great accomplishment, he’s also a great individual. As great as he is, he isn’t the only player to see on Seattle worth the price of admission. Felix Hernandez, Robinson Cano, and Nelson Cruz make up a list of players who have had impressive careers. Felix and Robinson could be pushing towards the Hall if they continue the dominance they have had in the past. 4. Logan Morrison Many people suspected that the Twins were going to try adding pitching to their lineup this year, and they did just that via trade. Jake Odorizzi is one of the new players to join the Twins’ pitching force, and I will touch on him later. It is known that the free agent market has been historically slow this year as many top free agents, such as Jake Arrieta and Alex Cobb, still remain unsigned. The Twins had money to spend this year as they have a lot of young players in pre-arbitration seasons, and many thought it would be on a pitcher. The Twins got that pitcher in Lance Lynn and it was cheap enough to have money left over. The Twins enjoyed success with Eduardo Escobar as he hit a career high 21 homeruns, and many people thought that would suffice with Sano, Mauer, and Escobar holding down the DH and corner duties. Sano has sexual assault allegations against him, and what the MLB does suspension wise is still up in the air. Enter Logan Morrison, who increased his launch angle 12 degrees from the year prior, and who also had a career year with 38 homeruns. The Twins saw a cheap option to get a hitter who could help DH for $6.5 million guaranteed. This is a steal for someone who had the season he did, though it may be hard to repeat. There is a vesting option depending on at-bats, but seeing him put on a Twinkies jersey and go to bat for us will be exciting. 5. A Strong Bullpen The Twins’ Bullpen has had a fair share of ups and down over the past ten years. Despite this, the Twins’ list of all-star closers is composed of Joe Nathan, Glen Perkins, and Brandon Kintzler, and teams with this many closers of that caliber must have good bullpens. But actually, the twins bullpen has ranked 23rd in ERA over the last 5 years .The bullpen also includes names like Caleb Thielbar and Jared Burton. Good news for the Twins though is that entering this offseason with a weaker bullpen means free agent relievers were abundant. The Twins added three veteran relievers who had solid seasons last year and will be penciled in as contributors this year. Fernando Rodney was successful on 39 of 45 save opportunities, but he did have a 4.23 ERA. This scares a lot of folks, but his FIP of 3.03 shows solid chance for that ERA to fall this year. Addison Reed was another signee and reminds me of a poor man’s Andrew Mille, but that by no means is a slam. He doesn’t get the recognition he deserves, but is a solid reliever in the 7th and 8th innings. His 2.84 ERA, if replicated, will go a long way in the Twins’ pen. Zach Duke is another player to look at. He is a bounce back candidate whose move to the bullpen has had ups and downs, but his FIP of 2.85 in 2016 shows his potential as a reliever. Last year, that number was in the 5’s, but I remain optimistic. All three look to contribute and help hold leads late in games or at least keep them close, so this electric Twins’ offense can have a shot late in the game. 6. New Pitchers When the Twins traded for Jake Odorizzi, they made a move for a pitcher that didn’t cost much on the prospect front and saved money compared to going out and getting a starter. The Twins, who have made brilliant moves all offseason, do so yet again by signing Lance Lynn to a 1 year, twelve million dollar deal. Lynn was arguably the third or fourth best free agent starter on the market and, according to reports, chose to play here over money and financial security. Too bad Yu Darvish didn’t feel this way. Lynn has had 5 seasons with 175 inning plus and an ERA below 4. A drop in his strike out rate and a rising FIP have put into question if he can maintain this consistency, but that’s a bet I’m willing to take. When Ervin returns, a pitching staff of Santana, Berrios, Lynn, Odorizzi, and Gibson is far better than last years, and it’s quite intriguing. It would be totally worth coming out to watch early in the season. 7. Nothing Falls but Rain Drops Nothing falls but raindrops was the motto of the Twins’ outfield last year, and they stuck to their words. The outfield of Buxton, Kepler, and Rosario, according to DRS, saved the Twins 34 defensive runs last year. All of these guys make frequent appearances in the highlight reel, and they seem to get better each year. As they enter their primes, this will be an exciting time to be a Twins fan and a Twins pitcher. It be noted that the trio also had 18 outfield assists last year. They are one of the most exciting outfields to watch, and that will stay the course this year and early in April when the raindrops, do in fact, drop, as we expect they will. 8. Games with Canadians The Toronto Blue Jays have been an awesome team to watch the past few years with stars like Donaldson, Bautista, Stroman, Ozuna, and defensive wizard Kevin Pillar scattered throughout their lineup. On top of being a team with all of these players, they have a fan base that likes to travel. Obviously there are probably some Canadians who are big Twins fans being that they are closest to Minnesota, but Toronto is Canada’s team and they travel well. It is easier for people from Winnipeg to come to Minneapolis and they will, rain or shine. With all the changes that the Twins have made, coming out and supporting them would be huge in this series as the Toronto fans have a history of making their presence felt at Target Field. 9. Byron Buxton Who doesn’t want to watch Byron Buxton play? Search YouTube and you could spend a whole day watching highlight reel catch after catch provided by Mr. Buxton. His defense is stellar and he runs the bases so well and effortlessly. Last year, he was 29 of 30 on stolen bases, and the one time he was caught stealing he slid over the bag after initially beating the ball there. He has the fastest home to home time ever recorded in a game in the Statcast era. His gap hits are often stand-up triples due to his league-leading sprint speed of 30.2 ft/s in 2017. This helps his defensive prowess and makes him one of the league’s must-see. The Twins also have a Giveaway for Brian Dozier and Byron Buxton’s gold glove awards in April so you won’t want to miss that. 10. Presentations Paul Molitor won manager of the year last year and Byron Buxton and Brian Dozier both won Gold Gloves. There was a lot to celebrate last year along with the Wild Card appearance. There are two ceremonies that will happen pre-game on different days in April. April 7th versus the Mariners, there will be a ceremony congratulating Paul on winning AL manager of the year. I attended the World Series pre-game celebration last year and these events are things you don’t want to miss out on. I would try and get tickets to this one because it is a Saturday game, and it will be a great way to start off the season. The other event I mentioned is a Friday night Gold Glove presentation for Buxton and Dozier. A T-shirt giveaway on this day makes it a hot ticket and another must-attend event. There are a lot of exciting things surrounding Twins baseball this year. The reasons listed above, along with a hope for a successful season, are just a few of the reasons you don’t want to miss Twins baseball this April.
  5. Gavin_Sanford

    wRC+ research

  6. Gavin_Sanford

    Research Album

×
×
  • Create New...