My Photo

 

  • Subscribe with Kindle

« I See Dead Blogs | Main | Ambiguity In Comparitive Search: A Text Mining Puzzle »

August 09, 2006

Blogosphere Statistics Proposal

A logical continuation of my comments on Sifry's State of the Blogosphere post is to make some proposals regarding what would be acceptable observations to make about the blogosphere. Firstly, we can consider the things that we would like to know:

  • The number of new blogs created per day.
  • The number of posts published per day.
  • The number of blogs which have updated at least N times within the last K time periods of duration D. For example, the number of blogs that have published posts 2 times per week for the last 10 weeks.

Secondly, we can consider the observations that can be made:

  • The number of new blogs discovered per day by some system (e.g. Technorati or BlogPulse).
  • The number of posts harvested per day by some system.
  • The number of blogs which meet the post rate criteria according to some up-to-date index.

There are two key points here. One is the distinction between the true numbers that we could report if we had perfect visibility into the blogosphere and the observations made according to looking inside some index - this is the difference between the two blocks of points above. The second point is the proposal of some metrics that are transparent and useful rather than accumulations of historical data (as I described in my previous post).

The interesting part - the science - is figuring out how to take observations and project these with some confidence to predict the desired measurements.

Note comments that Kevin Burton (TailRank) has made regarding Sifry's claims.

TrackBack

TrackBack URL for this entry:
http://www.typepad.com/services/trackback/6a00d8341c994053ef00d834a68eb253ef

Listed below are links to weblogs that reference Blogosphere Statistics Proposal:

Comments

Don't forget the tricky bit: filtering out auto-generated blogs that just steal random text from other blogs to generate google-juice...

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Working...
Your comment could not be posted. Error type:
Your comment has been posted. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.

Working...

Post a comment

Twitter Updates

    follow me on Twitter

    July 2009

    Sun Mon Tue Wed Thu Fri Sat
          1 2 3 4
    5 6 7 8 9 10 11
    12 13 14 15 16 17 18
    19 20 21 22 23 24 25
    26 27 28 29 30 31  

    Categories

    Blog powered by TypePad