Additional file 2.
Hierarchical clustering of secreted salivary gland proteins from six Anophelesspecies. A three step clustering was performed at ≥ 90%, ≥ 70% and ≥ 40% identity threshold with the H-CD-HIT server on secreted salivary proteins from An. gambiae, An. arabiensis, An. stephensi, An. funestus, An. albimanus and An. darlingi. Clusters are sorted into protein families. The NCBI accession number is indicated for each protein. * indicate the representative (i.e., longest) protein sequence of each cluster. The percentage identity between the representative protein sequence (*) and other protein sequences is given for each cluster. Protein in bold are new clusterised proteins at each identity threshold. Results from this table are graphically represented on Figure 2.
Format: XLS Size: 5.6MB Download file
This file can be viewed with: Microsoft Excel Viewer
Fontaine et al. BMC Genomics 2012 13:614 doi:10.1186/1471-2164-13-614