Nailing Down Next-Gen Data
April 2010
By Matthew Dublin

With all of the nail-biting that supposedly goes hand-in-hand with the next-generation sequencing “data deluge,” the non-informaticist may be surprised to learn that the real worry of the folks tasked with making sense of this data lies not in its quantity but in the ambiguity of what these machines are spitting out. Uncertainty about error rates, and about how to improve base calls to account for those errors, leads researchers to develop a sort of informatics hoarding disorder: they sometimes feel the need to store images, base calls, second-best base calls, third-best base calls, and process intensity information, all because of a lack of knowledge about the data.