Open Access Research article

Synonymous site conservation in the HIV-1 genome

Itay Mayrose1*, Adi Stern234, Ela O Burdelova26, Yosef Sabo27, Nihay Laham-Karam25, Rachel Zamostiano2, Eran Bacharach2* and Tal Pupko2*

Author Affiliations

1 Department of Molecular Biology and Ecology of Plants, Tel Aviv University, Tel-Aviv 69978, Israel

2 Department of Cell Research and Immunology, Tel Aviv University, Tel-Aviv 69978, Israel

3 Current address: Department of Microbiology and Immunology, University of California, San Francisco, USA

4 Department of Integrative Biology, University of California, Berkeley, USA

5 Current address: Department of Biotechnology and Molecular Medicine, University of Eastern Finland, Kuopio, Finland

6 Current address: Center for Digestive Tract and Liver Diseases, Tel Aviv Sourasky Medical Center, Sackler Faculty of Medicine, Tel-Aviv University, Tel-Aviv, Israel

7 Current address: Department of Biochemistry and Molecular Biophysics, College of Physicians and Surgeons, Columbia University, New York, NY 10032, USA

For all author emails, please log on.

BMC Evolutionary Biology 2013, 13:164 doi:10.1186/1471-2148-13-164

Published: 4 August 2013

Abstract

Background

Synonymous or silent mutations are usually thought to evolve neutrally. However, accumulating recent evidence has demonstrated that silent mutations may destabilize RNA structures or disrupt cis regulatory motifs superimposed on coding sequences. Such observations suggest the existence of stretches of codon sites that are evolutionary conserved at both DNA-RNA and protein levels. Such stretches may point to functionally important regions within protein coding sequences not necessarily reflecting functional constraints on the amino-acid sequence. The HIV-1 genome is highly compact, and often harbors overlapping functional elements at the protein, RNA, and DNA levels. This superimposition of functions leads to complex selective forces acting on all levels of the genome and proteome. Considering the constraints on HIV-1 to maintain such a highly compact genome, we hypothesized that stretches of synonymous conservation would be common within its genome.

Results

We used a combined computational-experimental approach to detect and characterize regions exhibiting strong purifying selection against synonymous substitutions along the HIV-1 genome. Our methodology is based on advanced probabilistic evolutionary models that explicitly account for synonymous rate variation among sites and rate dependencies among adjacent sites. These models are combined with a randomization procedure to automatically identify the most statistically significant regions of conserved synonymous sites along the genome. Using this procedure we identified 21 conserved regions. Twelve of these are mapped to regions within overlapping genes, seven correlate with known functional elements, while the functions of the remaining four are yet unknown. Among these four regions, we chose the one that deviates most from synonymous rate homogeneity for in-depth computational and experimental characterization. In our assays aiming to quantify viral fitness in both early and late stages of the replication cycle, no differences were observed between the mutated and the wild type virus following the introduction of synonymous mutations.

Conclusions

The contradiction between the inferred purifying selective forces and the lack of effect of these mutations on viral replication may be explained by the fact that the phenotype was measured in single-cycle infection assays in cell culture. Such a system does not account for the complexity of HIV-1 infections in vivo, which involves multiple infection cycles and interaction with the host immune system.

Keywords:
Codon models; HIV-1; Synonymous substitutions; Silent substitutions; Ka/Ks