Rockies vs. Giants final score: Angel Pagan's walk-off inside-the-park HR sinks Colorado, 6-5
Lately I have been spending a lot of time with the FanGraphs leader boards, excel, and SQL Pro. I've been interested in f-strike% (first pitch strike percentage) and wanted to see if it was a repeatable skill or not. The project started off as fairly basic: I grabbed f-strike percentage numbers from all starting pitchers with at least 120 innings pitched a season from 2002-2012. I then got the year n and year n+1 data and ran a linear regression. The linear regression returned an r^2 of .4075 and an r of .63, which lines up almost exactly with work that Bill Petti did.
I wasn't satisfied though, so I decided to see how year n f-strike% predicted year n+1 BB%. The regression gave a r^2 = .32372, and r = -.56. I thought since I was looking at one pitcher "skill", I would look at a few more pitcher "skills". Still, I didn't have the data that I exactly wanted, so I moved on to another idea of mine.
The Process:
I went into FanGraphs' custom leader boards and exported f-strike%, strikes, total pitches, and innings pitched. I divided strikes by total pitches to get strike percentage. Then imported the data into my database, and got all eligible pitchers who pitched greater than or equal to 120 innings in year n, and year n+1. My final number was n = 797.
The Study:
I subtracted f-strike% from strike%, because I figured that the metric would show that throwing first pitch strikes is a skill, otherwise it would be possible that a certain pitcher just throws a lot of strikes in general.
I then ran F-Strike%-Strike% during year n, and during year n+1 against each other.

The r^2 tells us that only 19% of the variation of F-Strike%-Strike% in year n+1 can be explained by year n. This isn't that strong. The correlation coefficient was .44 though, so that means there is some type of positive relationship there.
Al Leiter had the worst F%-S% in year n, posting a -11.39% mark. During that 2004 season, Leiter showed awful command. He walked 13% of batters, but he always had a notoriously high walk rate. He retired with a 11% walk rate. The following year he actually threw more first pitch strikes, even though he walked 1.5% more batters. Scott Baker actually had a lower walk rate in 2007 than he did in 2008. The same can be said for Lilly's 2009-2010 seasons, and Gil Meche's 2004-2005 seasons. In 2010 Lilly's walk rate was higher than 2009, yet he did a better job at throwing strikes.
On the other hand, these are the top five pitchers who had best positive F-Strike%-Strike%. Radke, Lohse and Mussina hardly walked anyone in their career. Lackey has walked batters slightly above average in his career though.
I don't know if you noticed but two of those pitchers are former Minnesota Twins pitchers. When I saw that I began to wonder if there were more former Twins pitchers on the list. It's no secret that the Twins love pitchers who have good command, so I was interested to see what I'd find.
As it turns out, former Twins pitchers made up 11 of the top 100. 10 of those appeared on their list as a Twin, while Lohse appeared as a Cardinal. The Angels also had 10 pitchers on the top 100. Is it possible both of these teams know something that we don't? As I continue to look at F-Strike% hopefully I find out.
While the numbers that were returned were decent, I wanted to look at this on a league wide level, so I took the league average F-Strike%-Strike%, and ran that with the individual numbers.

When looking at the individual F-Strike%-Strike%, and the league average F-Strike%-Strike% we get a somewhat strong relationship. We got a .26 r^2, which equates to a .51 r.
When I first started this project I was hoping to get a little more insight as to whether F-Strike% can tell us more about a pitcher. Turns I didn't find much at all really, which was admittedly a tad disappointing. I'm going to a little more researcy though. Next I'm going to see if certain pitches can tell us why one pitcher has a higher f-strike%-strike% than other pitchers. Again, I'm not sure what I'll find but we'll found out together.
All info was taken from FanGraphs, and manipulated in SQL.
Follow Alex on Twitter: @AKienholzBtB
Add to del.icio.us
Digg this
Post to Furl
Add to reddit
Add to myYahoo!
Powered by blogdig.net