November 30, 2010
As so often happens these days, a brief post at FriendFeed got me thinking about data analysis. Entitled “So how many retractions are there every year, anyway?”, the post links to this article at Retraction Watch. It discusses ways to estimate the number of retractions and in particular, a recent article in the Journal of Medical Ethics (subscription only, sorry) which addresses the issue.
As Christina pointed out in a comment at Retraction Watch, there are thousands of scientific journals of which PubMed indexes only a fraction. However, PubMed is relatively easy to analyse using a little Ruby and R. So, here we go…
Read the rest…
Posted in publications, bibliography, statistics, ruby, R |
11 Comments »
November 16, 2010
A couple of posts ago, I outlined a small bash script to generate an index.html file, containing links to other files in a directory. This was for generating links to files in a Dropbox public directory.
I had completely forgotten about the very useful UNIX/Linux command named tree. If not installed, it should be in your distribution repository (e.g. sudo apt-get install tree for Ubuntu/Debian). Then simply:
cd Dropbox/Public/mydirectory
tree -H . > index.html
|
Next, navigate to index.html at the Dropbox website and you should see something like the tree on the right. It’s a little ugly and obviously, not as convenient as something like Github, but can be a good quick and dirty fix if you need to share a hierarchy of directories and files.
|
|
Posted in computing, linux, web resources |
Comments Off
November 9, 2010
This is the story of how an open source project and a science communication tool combined to save the day.
Read the rest…
Posted in bioinformatics, computing, web resources |
8 Comments »
November 9, 2010
You know that Dropbox is terrific, of course. No? Go and check it out now.
One issue: files in your Public folder have a public URL, that you can send to other people. Unfortunately, directories do not. So how do you share a public directory full of files?
Answer: create an index.html file and share that. Let’s say that your files end in “.txt” and reside in ~/Dropbox/Public/entrez. Do this:
cd ~/Dropbox/Public/entrez
echo "<ol>" > index.html
for i in `ls *.txt`; do echo "<li><a href='$i'>$i</a></li>" >> index.html; done
echo "</ol>" >> index.html
Now you can share the link to the index.html, which when clicked will display a list of links to all the other files in the directory.
Posted in computing, web resources |
3 Comments »
November 8, 2010
Thanks to my former colleagues at UQ for involving me with:
Marfori, M., Mynott, A., Ellis, J.J., Mehdid, A.M., Saunders, N.F.W., Curmi, P.M., Forwood, J.K., Bodén, M. and Kobe, B. (2010)
Molecular basis for specificity of nuclear import and prediction of nuclear localization.
Biochimica et Biophysica Acta (BBA) – Molecular Cell Research: in press – uncorrected proof.
A real group effort this one; I made a small contribution to Section 3 – Nuclear localization databases and computational tools to predict nuclear localization.
Note that this version is an uncorrected proof and may change slightly before full publication. If nuclear localization is your thing, I think you’ll enjoy it; it’s a pretty comprehensive review.
Posted in bioinformatics, publications, research diary |
1 Comment »
November 5, 2010
You know the problem. You want to qualify your NCBI/Entrez database search term using a field. For example: “autism[TIAB]“, to search PubMed for the word autism in either Title or Abstract. Problem – you can’t find a list of fields specific to that database.
Now you can. Follow the links in this public Dropbox file, to see a CSV file containing name, full name and description of the fields for each Entrez database.
Code to generate the files is listed below. This may or may not be the first in an occasional, irregular “what the world needs” series.
#!/usr/bin/ruby
require 'rubygems'
require 'bio'
require 'hpricot'
require 'open-uri'
Bio::NCBI.default_email = "me@me.com"
ncbi = Bio::NCBI::REST.new
ncbi.einfo.each do |db|
puts "Processing #{db}..."
File.open("#{db}.txt", "w") do |f|
doc = Hpricot(open("http://eutils.ncbi.nlm.nih.gov/entrez/eutils/einfo.fcgi?db=#{db}"))
(doc/'//fieldlist/field').each do |field|
name = (field/'/name').inner_html
fullname = (field/'/fullname').inner_html
description = (field/'description').inner_html
f.write("#{name},#{fullname},#{description}\n")
end
end
end
Posted in bioinformatics, programming, ruby |
7 Comments »