Tuesday, May 20, 2008

2008 corrections are now online!

Yes that is right, after just one set of interleague series I now can run my correction code on the 2008 data. The player cards have now been updated with the corrections and my pitch classifications. These classifications are not perfect at the moment as the 2008 corrections still aren't quite as strong as the 2007 corrections but they appear to be better than the MLBAM corrections which are done on the fly (and therefore are much harder to do). Automated updates will be down for a few days but expect them to be back in full force by the weekend.

2 Comments:

At June 10, 2008 12:30 PM , Blogger Darth Blog said...

Great site. I was wondering how you get your data for your site? I have been trying to find a way to grab up to date data via a down-loadable comma or space deliminated file, an xml feed, or a database connection. I want to do this so that I can make some equations to run against the data. Thanks for your help.

 
At June 12, 2008 10:18 AM , Blogger Josh Kalk said...

The data is in .xml format and you can find it here:
http://gd2.mlb.com/components/game/mlb/year_2008/

I then download it with python, parse it, correct it, and classify it with C++, and then upload it to the server.

 

Post a Comment

<< Home