Analysis workflow for identifying composite proteins. (A) Work flowchart to determine gene fusions and functional linkages between Trypanosoma brucei and seven other protists. Amino acid sequences collected from NCBI and UniProt databases were used to identify fusion linked sequences by an automatic in-house built software module. The top scoring hits were verified by various criteria, such as the top scoring hit on the reverse BLAST process, the E-value threshold, the similarity and functionality of domain architecture between the component and composite proteins. (B) Outline of the software, with a focus on the parameters which can be altered by the user at various steps of the process.
Dimitriadis et al. BMC Evolutionary Biology 2011 11:193 doi:10.1186/1471-2148-11-193