Table 1

Document Types in Biozon. Each type is represented differently in Biozon's implementation. Each representation may be decomposed into a number of atomic units for the purpose of comparison.

Document type
Representation
Atomic units

protein sequence
string
amino acids
nucleic acid sequence
string
nucleic acids
protein family
set
proteins
pathway
set
protein families
domain
ordered pair
sequence coordinates
domain family
set
domains
interaction
set
proteins, nucleic acids
descriptor
text
characters
structure
list
3D coordinates
unigene cluster
set
nucleic acids (ESTs)

Birkland and Yona BMC Bioinformatics 2006 7:70   doi:10.1186/1471-2105-7-70