A rare live blog post today. I'm writing this from Triangle Open Data Day 2014. This will basically be a page of links that I'll try to get around to later. GIS resources: QGIS - an open source GIS app! Download it here: http://www.qgis.org/ I can't wait to play around with this. PostGIS - Geospatial [...]
Category: Publicly Available Data
24 Days of R: Day 8
I saved the data from the last post which shows the percentage of Republican voters in each county. In addition to that column, I also have figures from the 2010 census. This will show things like age, ethnicity, urbanization and home ownership. Those census figures show actual population counts, so they'll need to be altered [...]
24 Days of R: Day 4
So my first attempt to sort out the career of Michael Caine via parsing of HTML data was a wash. I'm going to try this again, using Wikipedia. They've got a nice, easy list of his films in an HTML table. Reading an HTML table into R is incredibly easy. The XML library has a [...]
24 Days of R: Day 2
Need a Hanukkah or Christmas gift for an R analyst? You could do loads worse than Applied Spatial Data Analysis with R by Bivand, Pebesma and Gomez-Rubio. I don't have my copy yet- I'm still working through all of the other books I've bought this year- but I'll likely pick it up sometime early in [...]
Hurricanes in South Carolina
In a recent post, I discussed the occurrence of hurricanes in the North Atlantic basin. The data comes from the National Oceanic and Atmospheric Association, a member of the US federal government and it spans a bit more than 150 years. In that post, I make the observation that the data supports a model wherein [...]
Hurricanes and Reproducible Research
On vacation with my family this week and that means I have a few minutes now and again to read. One of the books I brought along is Christopher Gandrud's excellent “Reproducible Research with R and RStudio”. Looking for some data as a test project, I latched onto Hurricane data. Folly Beach was hit pretty [...]
Loss reserving has a new, silly name
I started using Git some time ago, but mostly for local work files. Today, I finally sync'ed up a repository for loss reserving analysis. It may be found here: https://github.com/PirateGrunt/MRMR MRMR stands for Multivariate Regression Model for Reserves. When pronounced "Mister Mister" it also sounds like a thankfully forgotten American soft pop band from the [...]
You can’t spell loss reserving without R
Last year, I spent a morning trying to return to first principles when modeling loss reserves. (Brief aside to non-actuaries: a loss reserve is the financial provision set aside to pay for claims which have either not yet settled, or have not yet been reported. If that doesn't sound fascinating, this will likely be a [...]
NFL Code on Github
I've made some revisions and simplifications to the code to compile NFL data. It's now all out on Github for anyone to play with in advance of the Superbowl. In the meantime, here's a lovely picture comparing every team's offense- as measured by total offensive yards- against their defenders. Note the anemic Chicago offense. https://github.com/PirateGrunt/NFL
Turnovers are poison
This is probably a slightly useless post, but a bit of fun all the same. If nothing else, it allows me to take a stab at learning a bit more about logistic regression. I'm still trying to unravel the mystery of why the Bears lost to the Vikings two weeks ago. This mystery is compounded [...]