Vladimir Brusic, John S. Wilkins, Clement A. Stanyon, and John Zeleznikow
The four most important data-related considerations for the bioinformatic analysis of biological systems are understanding of: the complexity and hierarchical nature of processes that generate biological data, fuzziness of biological data, biases and potential misconceptions in data, and the effects of noise and errors. We discuss these issues and summarize our findings by defining a Data Learning Process (DLP). DLP comprises a series of steps for comprehension of biological data within the bioinformatics framework. DLP is a formalization aimed at facilitating knowledge discovery in biological databases.