This article is part of the supplement: Eleventh International Conference on Bioinformatics (InCoB2012): Computational Biology
Feasibility of using 454 pyrosequencing for studying quasispecies of the whole dengue viral genome
1 Bioinformatics and Data Management for Research Unit, Office for Research and Development, Faculty of Medicine Siriraj Hospital, Mahidol University, Thailand
2 Department of Biochemistry, Faculty of Science, Mahidol University, Bangkok 10400, Thailand
3 Medical Molecular Biology Unit, Office for Research and Development, Faculty of Medicine, Siriraj Hospital, Mahidol University, Thailand
4 National Center for Genetic Engineering and Biotechnology, 113 Phaholyothin Rd., Klong 1, Klong Luang, Pathumthani, 12120, Thailand
5 Medical Biotechnology Research Unit, National Center for Genetic Engineering and Biotechnology, Faculty of Medicine Siriraj Hospital, Mahidol University, Thailand
6 Dengue Hemorrhagic Fever Research Unit, Office for Research and Development, Faculty of Medicine Siriraj Hospital, Mahidol University, Thailand
BMC Genomics 2012, 13(Suppl 7):S7 doi:10.1186/1471-2164-13-S7-S7Published: 13 December 2012
Dengue is the world's most common mosquito-borne viral disease. Poor proofreading by RNA polymerase during its replication results in the accumulation of mutations in its genome. This leads to a diversity of genotypes in the viral population termed quasispecies. Quasispecies play an important role in disease severity. The study of quasispecies in dengue has been hindered because of the requirement for large amounts of cloning and sequencing, which could be overcome by 454 pyrosequencing. In this study, we attempted to demonstrate the feasibility of using 454 pyrosequencing to study genome diversity of dengue virus quasispecies by sequencing a pool of known dengue viral strains.
Two sets of dengue DNA templates were sequenced by 454/Roche GS FLX. The total number of reads for data 1 and data 2 were 54,440 and 134,441, with average lengths of 221 and 232 bp, respectively. Reads containing ambiguous base Ns were excluded (6.00% in data 1, 7.05% in data 2). More than 99% of reads could be aligned back to the correct serotypes by BLAST. The reads covered the whole genome without any gaps, and the minimum coverage depth was 50×. Frequencies of known strains detected from each data set were highly correlated with the input ratios. We also explored criteria for filtering error reads and artifacts from true variations.
This study showed that 454 pyrosequencing, coupled with our analysis procedure, could sequence the whole genome of dengue virus with good coverage. The ratio of detected variants in the sequencing data reflected the starting ratio, proving that the proposed technique could be used to study the frequencies of variants in quasispecies.