My Photo

 

  • Subscribe with Kindle

« Images are Data | Main | Technorati Brings Charts Back »

October 23, 2007

The Most Important Blogs for Efficient Readers

Cascade Current systems for ranking blogs are largely about inlinks. Technorati and BlogPulse both use this basic measure of citation to create their lists; TechMeme - whose new list created plenty of discussion on the topic - takes the algorithm it uses for placing stories on its home page (essentially, another citation based approach) and aggregates visibility information. Additional features to consider include the number of feed subscribers and the number of visitors to the blog site. However, there are plenty of alternative approaches to creating a list of important blogs.

The above approaches are motivated by some (vague) notion of influence - a term that is central to the analysis of social media and blogs in particular, but one which has not really been given a full, well grounded definition in the space. However, there is also the issue of reader efficiency - ensuring that the consumer of blog data maximises the value they get from reading blogs.

A group of researchers at CMU have been considering a notion of blog importance based on how likely a set of blogs is to ensure that you will be informed of topics bursting in the blogosphere. By analogy, they consider a graph of water pipelines. Their paper - Cost-Effective Outbreak Detection in Networks Leskovec, Krause, Guestrin, Faloutsos, VanBriesen, Glance - poses the problem:

Given a water distribution network, where should we place sensors to quickly detect contaminants? Or, which blogs should we read to avoid missing important stories? These seemingly different problems share common structure: Outbreak detection can be modeled as selecting nodes (sensor locations, blogs) in a network, in order to detect the spreading of a virus or information as quickly as possible.

As a result of this work, the authors have published some blog lists which answer a fundamentally important question in terms of weblog reading habits: Which weblogs should I read to be most up to date? The lists answering this question - generated by the approach described in their paper - come in a number of varieties to be found on the project's page.

Highlights from the work include the top 10 and bottom 10 from the list of blogs to read to be the most up to date on stories if you only have time to read 100 blogs. It must be noted that this work is a theoretical exploration - the dataset mined to create the list is not a live corpus of blogs; thus some of the blogs may be stale or even abandoned.

1 http://instapundit.com
2 http://donsurber.blogspot.com
3 http://sciencepolitics.blogspot.com
4 http://www.watcherofweasels.com
5 http://michellemalkin.com
6 http://blogometer.nationaljournal.com
7 http://themodulator.org
8 http://www.bloggersblog.com
9 http://www.boingboing.net
10 http://atrios.blogspot.com
... ...
91 http://www.saysuncle.com
92 http://www.privacydigest.com
93 http://www.londonist.com
94 http://www.shanghaiist.com
95 http://markshea.blogspot.com
96 http://www.singleservecoffee.com
97 http://jeremy.zawodny.com/blog
98 http://www.scienceblogs.com
99 http://www.basicthinking.de/blog
100 http://scobleizer.wordpress.com

Note that another view of the data - which blogs to read if you can only read 500 posts - generates quite a different list of blogs.

TrackBack

TrackBack URL for this entry:
http://www.typepad.com/services/trackback/6a00d8341c994053ef00e54efd13cb8833

Listed below are links to weblogs that reference The Most Important Blogs for Efficient Readers:

» The science of blog reading from Rough Type: Nicholas Carr's Blog
The problem of detecting contaminants in a public water system is analogous to the problem of figuring out what's going on in the blogosphere, write a team of Carnegie-Mellon researchers in an award-winning paper called Cost-effective Outbreak Detectio... [Read More]

Comments

Interestingly, the #3 spot is a blog that hasn't been updated in a month.

Jason - note that the data used for this work is not a live corpus of weblog posts, but a (recent) historical set. I've updated the post to underline this point (the publication I link to makes this clear). Thanks!

Wow...what a great list. I will have to take the time to read them all. Thanks for putting together them all for me!

Calculating a blog's influence is important for ROI for potential advertisements from companies looking to get their products out there as well as companies working in brand management. The problem is right now there is not a set of standard metrics to form an algorithm to get a raw influence number for each site.

I like this approach because it gets away from old metrics like in link and out link counts, which do not speak to the topics being posted. Thanks

To me as a user, I am much more interested in the interaction between blogs. IN other words, which blogs like to read mine and interact with it, and which do I like and interat with. Instead of a "who has the biggest..." contest we would be able to see how blogs interact with each other and (re-) discover information in new ways. So, it is time for the mast of attraction to take the podium. You know him, there is only one, Sir Isaac Newton's Universal Law of BLOG attraction:
http://vanelsas.wordpress.com/2007/10/09/newtons-universal-law-of-blog-attraction-better-than-a-techmeme-leaderboard/

Something's wrong with either the conception or execution of that research because there isn't a Ron Paul Revolution blog on that list.

How can anyone be more influential than Ron Paul.

Something's wrong with either the conception or execution of that research because there isn't a Ron Paul Revolution blog on that list.

How can anyone be more influential than Ron Paul.

Something's wrong with either the conception or execution of that research because there isn't a Ron Paul Revolution blog on that list.

How can anyone be more influential than Ron Paul.

Something's wrong with either the conception or execution of that research because there isn't a Ron Paul Revolution blog on that list.

How can anyone be more influential than Ron Paul.

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Working...
Your comment could not be posted. Error type:
Your comment has been posted. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.

Working...

Post a comment

Twitter Updates

    follow me on Twitter

    July 2009

    Sun Mon Tue Wed Thu Fri Sat
          1 2 3 4
    5 6 7 8 9 10 11
    12 13 14 15 16 17 18
    19 20 21 22 23 24 25
    26 27 28 29 30 31  

    Categories

    Blog powered by TypePad