It is estimated that 1-2 exabytes of data are now generated each year, almost all of it in purely digital form (Lyman et al. 2000). Properly structured, this information could form a global knowledge base. Currently, however, it exists in many different forms, many of which are suitable only for human consumption and are largely opaque to computer-based understanding.