September 06, 2007

Search Trends: Data Mining, Text Mining

Below is the graph generated by Google Trends for the terms 'data mining', 'text mining', 'social media' and 'visualization'. How I wish this graph were labeled. The 'about' page suggests that the graph shows 'search volume', and it mentions normalization only in the context of the auxiliary data (regions, cities and languages). Should we assume, then, that the graph shows absolute counts? If so, how should we interpret it? The total number of searches for 'data mining' and 'visualization' is decreasing over time, i.e. fewer people are searching for these terms, while searches for 'social media' and 'text mining' are increasing or holding steady. So why would searches for 'data mining', for example, be decreasing?

[Figure: Google Trends graph for 'data mining', 'text mining', 'social media' and 'visualization']
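To make the absolute-versus-relative question concrete, here is a minimal sketch of how a relative y-axis could show a falling line even while the raw number of searches for a term keeps rising. All numbers are invented for illustration and are not Google data. The function names are hypothetical, and the idea that the graph plots a term's share of all searches, rescaled so the peak is 100, is an assumption that the 'about' page does not confirm.

```python
# Hypothetical yearly counts, invented for illustration only (not Google data).
term_searches = [100, 120, 140, 160]            # absolute searches for one term
total_searches = [1_000, 2_000, 4_000, 8_000]   # all searches in the same years


def relative_share(term_counts, total_counts):
    """Fraction of all searches that were for the term, per period."""
    return [term / total for term, total in zip(term_counts, total_counts)]


def scale_to_peak(values, peak=100):
    """Rescale a series so its largest value equals `peak` (assumed y-axis)."""
    top = max(values)
    return [round(peak * value / top) for value in values]


shares = relative_share(term_searches, total_searches)
scaled = scale_to_peak(shares)

print("absolute:", term_searches)
print("relative:", scaled)
```

In this toy series the absolute counts rise every year while the rescaled share falls, because total search volume grows faster than searches for the term. If the graph were relative in this sense, a declining 'data mining' line would not by itself mean that fewer people are searching for it.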

[Thanks to Ron Kass for inspiring this post.]

Comments

Is the y-axis absolute or relative? If it is relative, that would explain the trend (more non-tech users are searching); if it is absolute, the explanation could be that tech searchers bypass search.

Without knowing what the y-axis depicts the information is worthless.

Anjo.

Anjo - wasn't that the point I was making?

It's the students: whether the y-axis is absolute or not, the spikes are at the beginning and the end of the autumn semester and in the middle of the spring semester ... my uneducated guess is that data mining lost its hype value and moved to the mainstream, which means better curricula and more books in the libraries, while "social media" is still hyped ... "text mining" is for the humanities and social sciences what "data mining" is for business or computer science ... so there the curricula might be less well developed.

My other uneducated guess is that the number of Google searches says less about interest in a subject and more about how hard it is to find relevant info on it: seven years ago I went to Google to find info on Perl; now I go to perlmonks, use perl and cpan.

I don't have an answer to this issue, but I agree that interpreting Google Trends results is very difficult. Many factors can influence the results. As written on the Google Trends page, we should "Keep in mind that instead of measuring overall interest in a topic, Google Trends shows users' propensity to search for that topic on Google on a relative basis". I think that interpreting even the basic results (i.e. the search volume) of Google Trends is already a challenge. I have a related post on my blog about interpreting Google Trends results for data mining terms.

By the way, your blog is excellent! It's a pleasure to read it.

