Using R for classification in small-N studies
The question was something like: how can we look at a small-to-medium set of cases (like a dozen or a hundred countries or development programs) and tease out which factors are associated with some outcome. In Rick’s example, he looked at some African countries to see which characteristics are associated with a higher percentage of women in parliament.
Over at rpubs.com, I wrote a little post to show an easy way for evaluators to do classification trees using the open-source statistic software R rather than the Rapid Miner and BigML tools which Rick used. The problem I address at the end is how we can be sure if parts of the resulting models are not spurious.