Assessing complexity and depth of the P. hawaiensis transcriptome. Randomly sampled subsets of increasing percentages of the total number of reads, in increments of 10%, were used to generate progressively larger sub-assemblies using Newbler v2.5. (A) The number of unique BLAST hits (performed against nr) continues to increase as more sequences from this sample are assembled. (B) Full range of isotig length distribution for each sub-assembly. The isotig length distribution remains similar across all sub-assemblies for the shorter (< 5000 bp) reads in the assembly. (C) High read length (> 4000 bp) range of assembled read length distribution for each sub-assembly Assembly of reads with length ≥ 5000 bp requires assembly of at least ≈60% of our reads, or ≈1.9 million reads. Legend shown in B applies to both B and C.
Zeng et al. BMC Genomics 2011 12:581 doi:10.1186/1471-2164-12-581