Debugging BlogPulse
I love to use BlogPulse. I always get a kick out of seeing trends like this:
Or this:
These graphs often have a straightforward story behind them, allowing for a reasonable comparison between mentions of different words. Note, of course, that the graphs show the percentage of blog posts that contain a term, giving a normalized view.
But, what could explain something like this:
Here we see a term 'movie' which appears to have some sort of seasonal trend, dipping in autumn and rising again in the winter. However, the term 'guitar' appears to have a very odd shape, with a dramatic and sharp increase in the winter. Looking at this term on its own, we see:
There is a reason for this. If we look at links to weblogs published on MySpace, we see a matching pattern.
The reason that there are changes in the number of blog posts which link to MySpace blogs is that BlogPulse (Nielsen Online) is adjusting its crawling strategy over time (the above suggests that there was an increase in July, a decrease in September and another increase in October).
So, while I continue to believe that BlogPulse, and the trending tool in particular, are very useful, one has to be careful (and informed) regarding the base data that these analytics are built on.
























