(Author's note: I still don't know what I'm doing on this blog; I go from silly to serious to outraged to drunk-and-outraged. I'm still trying to mix it up, but I got one more nerdy task to accomplish.)
And that is to tell somebody with skill how to make a million dollars.
Step One Go here. This is the play-by-play of tonight's CAR-BUF game. They exist through NHL.com Stats for all NHL games from '02-'03 to present. Somehow find a way to access them all (I think there should be a pattern in the naming convention).
Step Two Use all this data to form a clean data table for all this. All events and all games should be given unique ID numbers. Record whatever data is available for each event, including shift charts if you're really bold.
Step Three This might be tricky, but things should also be 'known' or 'queriable' about each event, such as the score, or how long it has been 5-on-5. These things can be appended on to each line or if there is a better way, do that. Also if you can somehow figure each team's record at the time of each event, that might be helpful also.
Step Four Answer the following questions:
- How much more likely is a team down by one going to get a power play than take a penalty?
- Same question, but only on the road.
- Same question, but only Eastern Conference teams.
- Same question, but only in the 3rd period.
- How long into a power play is the average PPG scored?
- How long after a faceoff is the average goal scored? Is there a cluster shortly after faceoffs?
- Same question, but only after icings.
- Given two specific teams and a tied score, who is likely to score first?
- How likely is it that the other team scores next?
- Compare pre-lockout to post-lockout, regular season to postseason, etc.
(Oh, and if you've already done this on your own, you can owe me.)
* it might be Canadian dollars