The rapid generation of chimerical genes expanding protein diversity in zebrafish
1 The Key Laboratory of Aquatic Biodiversity and Conservation of Chinese Academy of Sciences, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, PR China
2 Graduate University of the Chinese Academy of Sciences, Beijing 100039, PR China
3 Department of Ecology and Evolution, University of Chicago, Chicago 60637, Illinois
BMC Genomics 2010, 11:657 doi:10.1186/1471-2164-11-657Published: 24 November 2010
Variation of gene number among species indicates that there is a general process of new gene origination. One of the major mechanism providing raw materials for the origin of new genes is gene duplication. Retroposition, as a special type of gene duplication- the RNA-based duplication, has been found to play an important role in new gene evolution in mammals and plants, but little is known about the process in the teleostei genome.
Here we screened the zebrafish genome for identification of retrocopies and new chimerical retrogenes and investigated their origination and evolution. We identified 652 retrocopies, of which 440 are intact retrogenes and 212 are pseudogenes. Retrocopies have long been considered evolutionary dead ends without functional significance due to the presumption that retrocopies lack the regulatory element needed for expression. However, 437 transcribed retrocopies were identified from all of the retrocopies. This discovery combined with the substitution analysis suggested that the majority of all retrocopies are subject to negative selection, indicating that most of the retrocopies may be functional retrogenes. Moreover, we found that 95 chimerical retrogenes had recruited new sequences from neighboring genomic regions that formed de novo splice sites, thus generating new intron-containing chimeric genes. Based on our analysis of 38 pairs of orthologs between Cyprinus carpio and Danio rerio, we found that the synonymous substitution rate of zebrafish genes is 4.13×10-9 substitution per silent site per year. We also found 10 chimerical retrogenes that were created in the last 10 million years, which is 7.14 times the rate of 0.14 chimerical retrogenes per million years in the primate lineage toward human and 6.25 times the rate of 0.16 chimerical genes per million years in Drosophila. This is among the most rapid rates of generation of chimerical genes, just next to the rice.
There is compelling evidence that much of the extensive transcriptional activity of retrogenes does not represent transcriptional "noise" but indicates the functionality of these retrogenes. Our results indicate that retroposition created a large amount of new genes in the zebrafish genome, which has contributed significantly to the evolution of the fish genome.