Read lengths of raw, trimmed, and assembled reads. (A) The raw reads (grey) ranged in length from < 40 bp to 1196 bp, with a mean read length of 400 bp. The distribution of read lengths of those reads chosen for assembly (red) was comparable to that of raw reads for the Newbler assembly (grey). (B) Read length distributions from all products of assembly of trimmed reads. The longest isotig per isogroup is shown. Removing singletons (unassembled reads) from these data shows that most assembly products under ~600 bp are singletons, i.e. that the vast majority of assembly products are transcript models over 600 bp (pink).
Zeng et al. BMC Genomics 2011 12:581 doi:10.1186/1471-2164-12-581