RT @neilfws: Bioinformaticians have been writing for years about how data preparation is >= 80% of the job; good to see "big data science" catching up :)