Additional file 3.ORF determination. Blastx run of a metagenomic contig, indicated by the red bar on the top. Eleven hits (homologue proteins) have been found, shown in black. Grey segments in the hits indicate the part of the homologue protein that has not been found. In the merging step, protein hits 1–4 are considered homologues since they are in the same position and frame containing the full length of the protein hit. Therefore they are merged in a single hit. Hits 7–10 are also in the same position and frame, but the alignment covers less than 50% of the protein hit because it is truncated by the end of the contig. In this case, as the other extreme of the protein has been found, the hit is considered valid, and homologues merge as before. Protein hit 5 is in the same frame as hits 1–4, but is much shorter. Therefore, it does not merge with hits 1–4, and is removed in the filtering step where all short hits overlapping with others in the position and frame are removed. Note that protein hit 6 overlaps slightly with hits 1–5, but it is considered a different ORF since they overlap by less than 50% of the length of both proteins. Protein 11 is removed in the filtering step since it covers less than 50% of the protein hit and is not truncated by the extremes of the contig. Format: PPT Size: 27KB Download file This file can be viewed with: Microsoft PowerPoint Viewer Tamames et al. BMC Genomics 2008 9:136 doi:10.1186/1471-2164-9-136 |