Figure 5.

Data integration. Individual elements d from source databases are translated to their representation in Biozon as per the transformation function TD. The graph ∑ resulting from integration of these elements has non-redundant objects, serving to merge the data from disparate sources into a cohesive whole. As shown, six records from GenPept, SwissProt, BIND and DIP are translated into Biozon graph form. Each record is transformed into a set of objects (e.g. Math) and descriptors (e.g. Math). Identical proteins from SwissProt and GenPept records, Math and Math respectively, are instantiated as a single non-redundant protein object P1 on the graph. Similarly, Math and Math are mapped to a single P2. As a result, the two interaction objects Math (BIND) and Math (DIP) are mapped to the same object I1.
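
The merging step described in this caption can be illustrated with a minimal Python sketch. It is not Biozon's implementation: the `Graph` class, the `integrate` function, the accessions and the sequences are all hypothetical, and it assumes that two protein records are identical when their sequences match exactly and that two interaction records are identical when they link the same pair of protein objects.

```python
from dataclasses import dataclass, field


@dataclass
class Graph:
    """Hypothetical stand-in for the integrated graph of non-redundant objects."""
    proteins: dict = field(default_factory=dict)      # sequence -> protein object id
    interactions: dict = field(default_factory=dict)  # frozenset of protein ids -> interaction id
    descriptors: list = field(default_factory=list)   # (source, accession, object id)


def integrate(records):
    """Translate source records into objects and descriptors, merging duplicates."""
    graph = Graph()
    by_accession = {}  # source accession -> protein object id

    # First pass: protein records. Assumption: identity means identical sequence.
    for rec in records:
        if rec["type"] == "protein":
            new_id = f"P{len(graph.proteins) + 1}"
            pid = graph.proteins.setdefault(rec["sequence"], new_id)
            by_accession[rec["accession"]] = pid
            graph.descriptors.append((rec["source"], rec["accession"], pid))

    # Second pass: interaction records, keyed by the set of protein objects they link.
    for rec in records:
        if rec["type"] == "interaction":
            members = frozenset(by_accession[a] for a in rec["partners"])
            new_id = f"I{len(graph.interactions) + 1}"
            iid = graph.interactions.setdefault(members, new_id)
            graph.descriptors.append((rec["source"], rec["accession"], iid))

    return graph


# Six made-up records mirroring the layout of the figure (all values are invented).
records = [
    {"type": "protein", "source": "SwissProt", "accession": "sp1", "sequence": "MKTAYIAK"},
    {"type": "protein", "source": "GenPept", "accession": "gp1", "sequence": "MKTAYIAK"},
    {"type": "protein", "source": "SwissProt", "accession": "sp2", "sequence": "GAVLIPFW"},
    {"type": "protein", "source": "GenPept", "accession": "gp2", "sequence": "GAVLIPFW"},
    {"type": "interaction", "source": "BIND", "accession": "b1", "partners": ["sp1", "sp2"]},
    {"type": "interaction", "source": "DIP", "accession": "d1", "partners": ["gp1", "gp2"]},
]

graph = integrate(records)
print(sorted(graph.proteins.values()))
print(sorted(graph.interactions.values()))
```

In this sketch every source record keeps its own descriptor entry, while the objects it points to are shared, so the provenance of each merged object remains traceable to its original database.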

Birkland and Yona BMC Bioinformatics 2006 7:70   doi:10.1186/1471-2105-7-70