Monday, May 28th, 2007

Blogosphere and Time Series

Filed under: — Daniel Lemire @ 8:01

Though blogpulse seems to be going nowhere, as far as I can see, it is still one of the most fascinating tool out there. What it does is plot word occurrences versus time on the blogosphere. The recall is rather poor compared to Technorati but the time series plot are very nice.

Here’s one comparative plot that a student in my Information Retrieval course (Mahmoud El-Bachir) has submitted:

You can see clearly when Christmas is (Noël in French) and when the new year is… I think you also have the Chinese New Year too! (Seek the smaller bump).

My only beef is that I do not have access to the raw data: it would be really cool to build applications on top of blogpulse, but I guess it goes against their business model.

No Comments »

No comments yet.

RSS feed for comments on this post.

Leave a comment

Warning: When entering a long comment, please ensure that you make copy of your text prior to submitting it. If the server should fail or if you hit a bug, you might lose your work. I am not responsible for your lost effort.

To spammers: I carefully review every single post and make sure that spam gets deleted. You are wasting your time if you are manually entering spam using this form. Read my terms of use to see what I consider to be abusive.

Example: I + II + IX= XII. Yes, you have to enter a roman numeral. (Answer must be in upper case.)

« Blog's main page

34 queries. 0.911 seconds. Valid XHTML

Powered by WordPress

Subscribe to this blog in a reader or by Email.