OK, let’s do this: some statistics and visualization of the tweets for ISMB 2012.
In previous years, when FriendFeed was used as the micro-blogging platform for the annual ISMB meeting, I’ve written a post describing some statistical analysis of the conference coverage. Here’s my post from last year.
This year, it appears that the majority of the conference coverage happened at Twitter, using the #ISMB hashtag. Here’s what happened on July 18th when I used the R package twitteR to retrieve ISMB-related tweets for July 13/14:
library(twitteR) ismb1 <- searchTwitter("#ISMB", since = "2012-07-13", until = "2012-07-14") length(ismb1) #  383
383 tweets. Here’s what happened when I ran the same query today:
library(twitteR) ismb1 <- searchTwitter("#ISMB", since = "2012-07-13", until = "2012-07-14") length(ismb1) #  0
Zero tweets. Indeed, run the same query via the Twitter web interface and you’ll see only a very few tweets with the message “Older Tweet results for #ismb are unavailable.”
So far as Twitter is concerned, ISMB 2012 never happened. Or if it did, the data are buried away in a data centre, inaccessible to the likes of you and I. Did you ever hear anything more about that plan to archive every Tweet at the Library of Congress? Neither did I. I very much doubt that it’s going to happen.
I think Twitter is great – for broadcasting short pieces of information, such as useful URLs, in near real-time. For conference coverage which benefits from threaded conversation, longer comments and archiving, I think it’s rubbish.
On July 18 I did manage to retrieve 3162 Tweets for ISMB 2012, created between July 13 and July 17. I’ll write about them in a forthcoming post. All I’ll say for now is – lucky I was able to grab them when I did.
I wonder if part of the drop off is live bloggers moving to platforms like Twitter? I can tell you it seemed like there were almost as many tweets for one SIG (#bosc2011) as for the whole of #ISMB / #ECCB2011, and I personally didn’t post anything to FriendFeed but posted lots on Twitter.
Well, there’s a problem with using Twitter for analysis of conference coverage. Let’s try searching for ISMB-related tweets using the twitteR package:
library(twitteR) ismb <- searchTwitter("ismb", 1000) length(ismb) #  3030? Are we using twitteR properly? Running the same search at the Twitter website gives roughly the same results, plus this unhelpful message.
I like Twitter – as a real-time communication tool. As a data archive? Forget it.
Of course, having just analysed online coverage of the ISMB conference, I had to run all 6 906 comments from the 2008-2011 meetings through some code. If you followed along via the Sweave code, I went as far as generating the data frame of comments, ismb.comments, then pulled the comment text into a new data frame using:
It was then simply a case of following along with the excellent example code from the post Word Cloud in R, over at One R Tip A Day, limiting myself to the 1000 most-used words. Watch out, the TermDocumentMatrix() function from the tm package uses quite a lot of memory.
Result shown at right: click image for full-size version. I think that word in the centre says it all.
Last year, I wrote a post about how to use R to analyse the coverage. I was planning something similar for 2011 when I thought: we have 4 years of ISMB at FriendFeed now – why not look at all of them?
So I did. Read on for the details.
Read the rest…
Great to see more reports describing the use of online tools to cover scientific meetings. Here are the publications, from PLoS Computational Biology:
Live Coverage of Scientific Conferences Using Web Technologies.
Live Coverage of Intelligent Systems for Molecular Biology/European Conference on Computational Biology (ISMB/ECCB) 2009.
And here’s Ally a.k.a the robo-blogger on Social Networking and Guidelines for Life Science Conferences.
Looks like we’ve started a trend, long may it continue at future meetings.