Archive for category: Software

Bollocks to waiting 10 years for progress

March 23rd, 2011 by
Open Data warrior Mark Hahnel (), the creator of FigShare, explains in this guest post the motivation behind the project and asks researchers why they aren’t publishing their research data.

I read a good quote the other day:

“Bollocks to waiting 10 years for progress. I want people to know about it now, and then do something about it” – Dr Paul Fisher

So why do we wait? Why isn’t there immediate publication, analysis and dissemination of data? Publication of Scientific data as it stands is a broken business model…for the most part. The advent of journals like PLoS and their subsequent success shows that the scientific community is taking note of what steps need to be taken. In my short life as a scientist, there has always been one thing that really annoys me. The inefficiency of scientific publishing and subsequent global sharing of knowledge. In terms of making significant advances available to wide audiences as peer reviewed publications, PLoS has it covered. But what about the rest of your research?

What percentage of the figures that went into your undergrad, masters or doctorate thesis were ever published? The ones that you didnt publish were probably good basic science, or figures that didnt tell a complete story. As a PhD student, I became very aware of the fact that a large amount of my data, although good, would never be published as it did not show significant differences. I then began wondering how many times experiments had been repeated globally unnecessarily. And so FigShare started life as an idea for researchers to publish all of their data that would otherwise never leave their lab books. By categorising and tagging the research, it becomes very searchable and other scientists should not reproduce experiments and waste money when they have been conducted several times by other labs. Following the alpha release, FigShare received a lot of attention and a lot of feedback. This caused the site to develop and it now allows the upload of Figures, Datasets and most recently media (eg. videos).
(more…)

Knuth announces iTeX

July 3rd, 2010 by

Donald Knuth announced he would make an earthshaking announcement at TUG 2010. The breaking news is the plan to release the next-generation TeX engine. Read the comments and a brief summary of the announcement in perfect Stevenote style from Slashdot.

Paperpile: A new kid on the block

March 17th, 2010 by

The first public beta of Paperpile–the latest entry in the crowded arena of free reference management software–has been recently announced. As I write, a test version is available for Linux, but Mac and Windows versions should be released soon. From the screenshots gallery, it looks like Paperpile will feature a streamlined (although quite typical) 3-column interface, support for tags/labels as well as the standard Web import functionality from online bibliographic databases.

paperpile screenshot

It will be interesting to see how this software compares with cross-platform biggies such as Mendeley or established tools for specific environments such as BibDesk or JabRef.

Introducing citeproc-js

January 29th, 2010 by

Citation copy-editing is one of those deceptively small burdens that have a way of taking over the working day. If left untended, the task of tidying up casually scribbled references can snowball to crisis proportions as a submission deadline approaches. Similarly, when a submission to one publisher is unsuccessful, significant effort may be required to recast its citations in the format required by another. Collaboration outside of one’s own field can bring with it an unwelcome tangle of fresh style-guide quandaries to ponder and fight through. These are things that the machines, if they want to make themselves useful, should be doing for us.

There is plenty of collective experience in this line, and as fate would have it, there are also plenty of collective solutions. In the TeX/LaTeX world, authors and their editors can today choose between BibTeX and BibLaTeX — both of them excellent utilities — with the several variants of the former supported by no fewer than four separate versions of the BibTeX program. [1] Users of WYSIWYG word processors can look to the bibliographic support built into Word or Open Office, or they can turn to an external solution such as EndNote ™, ProCite ™, Reference Manager ™, or more recently Zotero or Mendeley. Migrating data between these environments is a process fraught with uncertainty, but it is sometimes unavoidable when you need this kind of output, and it can only be produced on that kind of system …

(more…)

Blog-sized lit reviews

November 6th, 2009 by

When I started my DPhil, I set myself assignments in order to cover the lit review in easy bite-sized chunks. This worked pretty well but the collated material was scattered across different Word documents, which meant that I couldn’t look at everything at one glance or search the content.

However at about the same time, I also started free-writing to generate ideas. If you’re not familiar with the technique, it’s simply writing for a fixed amount of time without stopping. You don’t delete anything on the fly and just go, writing down anything that comes to mind until the timer sounds or your wrists cramp up, whichever comes first.

Instead of putting this writing into Word documents though, I decided to set up a local installation of the blogging engine Movable Type. In retrospect, I think this was probably just an excuse to play with blogging software but it turned out to be a good decision. I could add content from any computer with access to the server, the basic input form meant that I focused on content not style, and of course, I had all the blogging bells-and-whistles attached. Comments could be added at a later date, the information sorted and searched, shared with my supervisor and so on.

I’m not sure why I didn’t think of it at the time but I would now recommend combining the two ideas: use a blog to write your lit review. (I say “write”, but I really mean “draft”. The structure of blog writing is quite different from academic writing and besides, you’ll want to tidy up references, tables, etc. for the final version. But a blog post is still big enough to cover the bulk of the material and help you organize your thoughts.)

The final stumbling block of course is how to get the content out of the blogging engine and into a presentable format. Fortunately, there’s a script called WPTEX that will convert your WordPress blog into a LaTeX document. I found this software about six months ago and it does what it says on the tin: give it some basic details and it will parse all of your posts, tidying up the code and creating LaTeX source files which you can then compile as a standalone PDF book or for inclusion as part of your thesis.

The script’s not perfect and I can think of several improvements, particularly in the way that it converts URLs for paper presentation. But if you’re starting a PhD, I’d recommend giving it a go. A blog-sized lit review is a great way to manage this difficult task and of course, if you make the blog public, you can engage with the wider community in your field, getting feedback and maybe even making a bit of a name for yourself. Happy blogging!