2008 Player Cards
Well the new season is upon us and even though I haven't updated this blog in ages I do have a treat for anyone who happens to stumble on this blog. The 2008 player cards are here! Just click here or the player cards link to the right and off you go.
A few notes about the new cards. First, absolutely no corrections have been done to the data. Everything is straight from Sportvision. Second, Sportvision was kind enough to add in pitch types to the 2008 data so instead of running my pitch identifying code I am just using their pitch types. Third, I have lowered the number of pitches necessary to have a player card from 100 to 10 at least for the beginning part of the season. This applies to both batters and pitchers. Lastly, while I had to upload today's batch by hand tomorrow's should be automatically uploaded so you should have completely up to date player cards at your disposal. Enjoy!

11 Comments:
Thank you for posting these very neat cards. If you have time, I have always wondered how much the correction matters? I don't have the skills to do it to the data but I have the data in a database as well.
Thanks for taking the time to check them out Kyle. As for your question the corrections were huge in 2007 some parks needing correction factors of nearly a foot. You can check out my post about correcting the data here: http://www.baseball.bornbybits.com/blog/2007/11/explanation-of-correction-code.html
Everything I do to the data is explained there. Hopefully the 2008 data won't need such corrections though. In about a month when I have enough data I will post about it.
Josh,
any idea what the 'type confidence' number is all about in this years data?
I noticed that this year they are also adding in the Pitch F/X operator ID number (the one in Cincinnati slacked off today, missing over 100 pitches) as well as what you noted in the pitch types. Obviously right now its too early to see how accurate that is, but I can't wait until there is enough data to see if their pitch types are accurate.
As always though, great job with your blog/player cards.
Doug,
All that I know is confidence is between 0 and 1 and 0 is less confident. How are they are calculating it, I have no idea. The interesting thing is they are rounding all of their other variables but confidence seems to be unrounded. No idea if that means anything though. Having the operator ID should be very handy too.
Josh,
Just wanted tell you that I love the site and it's one of my most favorite baseball resources on the internet.
Thanks for all the incredible work.
Are you going to make the 08 searchable database? I really like the old one for checking out the pitch's effectiveness. I was specifically looking for Nick Masset and it looks like his numbers have changed? His fastball definitely didn't average 93 mph last year.
Josh,
This data fascinates me and I'm trying to figure out what would be a cool web application I can make from this data. Your articles and work has been inspiring, there's no doubt.
I have a nagging question ... why is the z-axis not called the y-axis? I mean, I understand that we're talking 3 dimensions, but ultimately, aren't we just taking a y-point (the release point) and subracting the 2nd y-point (where it crosses the plate?
Thanks in advance!
Hey Josh, huge fan of your blog here. I finally upgraded to Office 2007 so I could use Excel with the Pitch F/X data.
Just wondering, if I go to Brett Myers' card, is it just his '08 data? If so, where can I find his '07 data, if possible?
If it's not possible to grab his '07 data from your blog here, any ideas on how I could get it without going through all those Gameday files?
I've been viewing games on the Gameday applet and have to say that some of the pitch types are just wrong. I'm seeing 85mph fastballs as changeups, curveballs as sliders, splitters as changeups and vice versa.
I looked at Brandon Webb's 2008 card and it shows him throwing 7 different pitch types. Well he only throws 3, Sinker, Changeup, and Curveball. If you look at the second graph for "speed/horizontal movement", you can see three clusters. Those clusters are his three different pitches.
They need to get operators that know something about how a baseball moves.
Live updating isn't panning out, fwiw.
What is fun about using their pitch classifications is that you get to make fun of your team for making
a guy whose fastball consistantly registers as a changeup their "number 1" starter.
Post a Comment
<< Home