Email updates

Keep up to date with the latest news and content from BMC Genomics and BioMed Central.

Open Access Highly Accessed Research article

Whole transcriptome analyses of six thoroughbred horses before and after exercise using RNA-Seq

Kyung-Do Park1, Jongsun Park2, Junsu Ko3, Byung Chul Kim23, Heui-Soo Kim4, Kung Ahn4, Kyoung-Tag Do5, Hansol Choi2, Hak-Min Kim2, Sanghoon Song3, Sunghoon Lee2, Sungwoong Jho2, Hong-Sik Kong1, Young Mok Yang6, Byung-Hak Jhun7, Chulhong Kim3, Tae-Hyung Kim3, Seungwoo Hwang8, Jong Bhak23*, Hak-Kyo Lee1* and Byung-Wook Cho5*

Author Affiliations

1 Department of Biotechnology, Hankyong National University, Anseong, 456-749, Republic of Korea

2 Personal Genomics Institute, Genome Research Foundation, 443-270, Suwon, Republic of Korea

3 Theragen BiO Institute, TheragenEtex, 443-270, Suwon, Republic of Korea

4 Department of Biological Sciences, College of Natural Sciences, Pusan National University, Busan, 609-735, Republic of Korea

5 Department of Animal Science, College of Life Sciences, Pusan National University, Miryang, 627-702, Republic of Korea

6 Department of Pathology, School of Medicine, and Institute of Biomedical Science and Technology, Konkuk University, Seoul, 143-701, Republic of Korea

7 Department of Nanomedical Engineering, College of Nanoscience and Nanotechnology, Pusan National University, Miryang, 627-702, Republic of Korea

8 Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, 305-806, Daejeon, Republic of Korea

For all author emails, please log on.

BMC Genomics 2012, 13:473  doi:10.1186/1471-2164-13-473


The electronic version of this article is the complete one and can be found online at: http://www.biomedcentral.com/1471-2164/13/473


Received:9 November 2011
Accepted:6 September 2012
Published:12 September 2012

© 2012 Park et al.; licensee BioMed Central Ltd.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Background

Thoroughbred horses are the most expensive domestic animals, and their running ability and knowledge about their muscle-related diseases are important in animal genetics. While the horse reference genome is available, there has been no large-scale functional annotation of the genome using expressed genes derived from transcriptomes.

Results

We present a large-scale analysis of whole transcriptome data. We sequenced the whole mRNA from the blood and muscle tissues of six thoroughbred horses before and after exercise. By comparing current genome annotations, we identified 32,361 unigene clusters spanning 51.83 Mb that contained 11,933 (36.87%) annotated genes. More than 60% (20,428) of the unigene clusters did not match any current equine gene model. We also identified 189,973 single nucleotide variations (SNVs) from the sequences aligned against the horse reference genome. Most SNVs (171,558 SNVs; 90.31%) were novel when compared with over 1.1 million equine SNPs from two SNP databases. Using differential expression analysis, we further identified a number of exercise-regulated genes: 62 up-regulated and 80 down-regulated genes in the blood, and 878 up-regulated and 285 down-regulated genes in the muscle. Six of 28 previously-known exercise-related genes were over-expressed in the muscle after exercise. Among the differentially expressed genes, there were 91 transcription factor-encoding genes, which included 56 functionally unknown transcription factor candidates that are probably associated with an early regulatory exercise mechanism. In addition, we found interesting RNA expression patterns where different alternative splicing forms of the same gene showed reversed expressions before and after exercising.

Conclusion

The first sequencing-based horse transcriptome data, extensive analyses results, deferentially expressed genes before and after exercise, and candidate genes that are related to the exercise are provided in this study.

Keywords:
Transcriptome; Equus caballus; Gene expression; Racing performance

Background

The thoroughbred, a “hot-blooded” horse breed, is the favorite breed for use in horse racing [1]. The speed and agility of thoroughbred horses has resulted in the emergence of an industry involved in the breeding, training, and racing of elite racehorses worth many billions of dollars [2]. Until now, relatively few genes related to their athletic phenotypes have been identified, even though physical and physiological adaptations underlying their elite athleticism are well characterized [3]. Muscle is the most critical tissue for athletic performance. The skeletal muscle of the thoroughbred horse comprises over 55% of its total body mass [4,5] and has remarkable functional and structural plasticity [3]. Furthermore, over 90 hereditary conditions in horses have corresponding human disorders [3,6], and many muscle disorders in humans and horses share common clinical and histopathological characteristics, as well as molecular features [7-9]. Therefore, the horse can be an invaluable animal model for muscle diseases.

An international team of researchers has decoded the genome of the domestic horse, Equus caballus, and has reported that its genome structure is remarkably similar to the human genome [8]. An additional nine domesticated horse breeds have also been sequenced, identifying around one million single nucleotide polymorphisms (SNPs) [8]. However, there has been little progress in refining the functional annotation of horse genome using expressed genes. Although a small number (around 30,000) of expressed sequence tags (ESTs) has been deposited in the dbEST [10], this is insufficient to identify all the key genes related to specific functions, such as racing performance.

RNA-Seq is one of the most useful next generation sequencing (NGS) methods used to fully understand the landscape of a transcriptome, because it produces several tens of millions of short reads (17 bp to 101 bp) from the expressed genes in vivo. RNA-Seq has been used successfully to investigate the transcriptome profiles of human, mouse, Arabidopsis, and yeast [11-14]. The RNA-Seq data generally exhibit a high degree of concordance with established gene annotations [15,16]. Using RNA-Seq, researchers have identified numerous novel genes and additional alternative splicing forms [14,17,18], as well as unraveling expression profiles underlying phenotypic changes, such as development stages [19,20]. In addition, RNA-Seq permits the identification of single nucleotide variations (SNVs) in coding regions from various organisms because of the large number of reads [14,21,22]. Moreover, RNA-Seq has identified novel unannotated transcriptionally active regions in rice [23], indicating that there are novel genes that cannot be detected by conventional gene prediction methods.

In horses, two transcriptome studies using RNA-Seq have been reported: one study refined the structural annotation of protein-coding genes based on the RNA-Seq sequences from several equine tissues [24], while the other analyzed RNA-Seq sequences acquired from skeletal muscle to find long-term training-related genes [25]. Both studies used very short RNA-Seq sequences, 17 bp and 35 bp, respectively, because of the limitation of NGS technologies available at that time. These very short-read RNA sequences have one critical limitation. When aligned against the reference genome, the typical success rate is as low as 66% in the case of 17 bp RNA fragments [24]. This was caused not only by the very short-read sequences, but also by intron junctions, which were not included in the short-reads. This disadvantage must be overcome to advance horse transcriptome research.

Here, we present a large scale analysis of whole transcriptome data. The samples were taken from blood and muscle tissues of six thoroughbred horses before and after 30 minutes of exercises, resulting in 24 samples.

Results and discussion

Gene cluster analysis and identification of novel transcripts from the horse RNA-Seq sequences

To construct high quality horse transcriptome data, we generated over 1.3 billion 90-bp pair-end reads using an Illumina HiSeq2000 (Additional file 1: Figure S1, Additional file 1: Table S1, and Table S2). Using TopHat [26] and Cufflinks [27], 84.60% of all the reads were successfully mapped against the current horse reference genome (Additional file 1: Table S3). A novel bioinformatics protocol for processing large amounts of transcriptome sequences was built (Additional file 1: Figure S2). RNA sequences were obtained from 24 different samples; therefore, we defined a new concept, unigene cluster (UC), which contains overlapped unigene sequences originating from multiple samples. Utilizing the current annotation (Ensembl 62), 32,361 unigene clusters (UCs), with a total length of 51.83 Mb, were identified. 11,933 UCs matched current gene models, which comprised 36.87% of the 32,361 UCs (Figure  1A and Additional file 1: Supplementary Methods) [8]. The remaining 20,428 UCs (63.13%), which contained more than 60% of the transcripts, were novel (Additional file 1: Supplementary Methods and Additional file 1: Figure S3). The expressions of eight randomly selected novel UCs were confirmed by reverse transcription PCR (Additional file 1: Figure S4 and Additional file 1: Table S4). In addition, the unmapped raw sequences were processed by SOAPdenovo [28], resulting in assemblies of 42,476 to 72,011 scaffolds for each sample. These assembled sequences increased the extent of the current horse genome (Additional file 1: Supplementary Methods, Additional file 1: Figure S2, and Additional file 1: Tables S5, S6, S7, S8). When we pooled the scaffolds together, we identified around 670,000 non-redundant unigenes. 27% to 46% of these unigenes from each sample were matched to human genes using tBLASTx (Additional file 1: Table S9).

Additional file 1. Figure S1. Six thoroughbred horses. Table S1. 24 sample names from six thoroughbred horses used in this study. Table S2. The statistics of RNA-Seq raw data from 24 different samples. Table S3. The mapping results against the horse reference genome (Ensembl 62). Figure S2. Procedure for identifying horse unigenes. Figure S3. Distribution plot of the exons identified without gene models which contain ORFs. Figure S4. Reverse transcript PCR (RT-PCR) confirmation of the 8 novel unigene clusters (UCs) Table S4. Primer information for the RT-PCR experiment. Table S5. Statistics of the filtered de novo transcripts identified from cufflink. Table S6.de novo assembly results from one sample with various k-mer values. Table S7.de novo assembly of unmapped sequences originated from 24 samples. Table S8. Filtered and clustered unigenes from the scaffolds assembled from unmatched sequences. Figure S5. The whole process of identifying SNVs. Table S9. The proportion of the scaffolds from the unmapped sequences which were matched against the human genome. Table S10. The statistics of total SNVs identified from 24 samples. Table S11. The number of total SNVs identified in thoroughbred horses. Table S12. The number of individual-specific SNVs. Table S13. Conformation of the SNPs identified from the mouse sample [2]. Table S14. Distribution of SNP locations in three datasets. Table S15. The list of transcripts which have ten or more non-synonymous SNPs. Figure S6. GO classification of all expressed genes in human, mouse, and horse muscle tissue. Figure S7. GO classification of all expressed horse genes in blood and muscle tissue. Figure S8. Correlation matrix of the 24 samples. Figure S9. Correlation matrix of three human samples from kidney and liver tissues. Figure S10. Histogram of average expression level of the unigene clusters in the 24 samples. Table S16. List of DEGs in muscle and blood tissues. Table S17. Expression profiles of known exercise-related horse genes. Table S18. Comparison between DEGs in muscle tissue and the DEGs which are responsible to exercise training [25]. Table S19. List of transcription factors differentially expressed in muscle and blood tissues. Table S20. RT-PCR primers for seven transcription factors. Table S21. RT-PCR results of differentially expressed transcription factors in muscle tissue. Table S22. The list of four genes of which alternative splicing forms showed reversed expression patterns before and after exercising. Figure S11. Expression profiles of the genes of which alternative splicing forms showed reversed expression patterns before and after exercising. Table S23. Number of filtered de novo transcripts identified by Cufflink. Supplementary methods.

Format: DOC Size: 8.6MB Download file

This file can be viewed with: Microsoft Word ViewerOpen Data

thumbnailFigure 1. Enhanced genome annotation, single nucleotide variation analyses, and differentially expressed genes before and after exercise in horse. (A) Red and green circles indicate expressed genes in the blood and muscle tissues, respectively, and the blue circle shows the current Ensembl annotation (Release 62). The grey rectangle indicates the coverage of the current horse genome. (B) Green circle: SNPs provided by the Broad Institute, red circle: SNPs provided by Ensembl (release 62), blue circle: SNPs identified from this study. (C) SNV profiles of six horses for the titin (TTN) gene. The top of the blue arrow is the 5' end and the bottom is 3' end of TTN gene. The X-axis shows the names of the horses. Dark green horizontal bars are non-synonymous SNVs. Light green horizontal bars are synonymous SNVs. (D) Blue bars: >2-fold upregulated genes, red bars: >2-fold downregulated genes, white bars: not differentially expressed. The four pie charts display the composition of the DEGs supported by four horses (white), five horses (light grey), and six horses (grey).

Identification and dissection of novel SNVs from a large amount of horse RNA-Seq sequences

We identified 182,722 non-redundant SNPs and 7,251 non-redundant INDELs from the 24 samples using several filters, including an exon-intron boundary misalignment filter (Additional file 1: Figure S5, Additional file 1: Tables S10, S11, S12, and Additional file 1: Supplementary Methods). The filters had been validated on mouse RNA-Seq data [29] by showing that 80.28% of the identified SNPs were confirmed in two inbred mouse genomes (Additional file 1: Supplementary Methods and Additional file 1: Table S13). Each horse showed a similar number of SNVs, ranging from 72,000 to 77,000 (Additional file 1: Table S11). 82,476 individual-specific SNVs were identified (Additional file 1: Table S12). Only 7,316 out of 182,722 SNPs (4.00%) overlapped with both of the two existing databases (Figure  1B). This is because only 1% (10,229 from Broad Institute and 4,287 from Ensembl) of the SNPs from the two databases were located in the exonic region of the current genome annotation (Additional file 1: Tables S11 and S14). These results demonstrate the usefulness of identifying novel SNPs from transcriptome data. Moreover, 116,650 (61.40%) of 189,973 SNVs were located in exons of unigene clusters, and 67,788 (58.11%) of the 116,650 SNVs caused amino acid changes. Some of the transcripts are possibly related to the horses’ running ability. For example, titin (TTN) is related to the passive stiffness of muscle by limiting the range of motion of the sarcomere in tension [30], such that TTN affects the ability of muscle directly [31]. Other examples include obscurin (OBSCN), which is associated with TTN and ANK1 genes [32,33], and the skeletal muscle calcium release channel gene (RYR1) located in the sarcoplasmic reticulum [34,35], whose mutations caused several muscle-related diseases, including central core disease [36]. The SNV distribution profile of the TTN was specific to individual horses (Figure  1C and Additional file 1: Table S15).

Comparison of the expressed genes in blood and muscle tissues with other organisms

In muscle and blood tissues, 17,484 and 25,220 UCs were identified as expressed genes, respectively, showing that blood expressed about 40% more genes than muscle tissue (Figure  1A). By comparison with two previous RNA-Seq studies conducted in human (Illumina BodyMap2 transcriptome) and in mouse skeletal muscle tissue [12], we observed that the GO classifications of all the expressed genes in the three organisms were similar to each other (Additional file 1: Figure S6). The differences in GO assignments between tissues (muscle and blood) were larger than those of the same tissue among the three species (Additional file 1: Figure S7).

Identification and characterization of differentially expressed genes in blood and muscle tissues regulated by exercise

We calculated the expression level of all unigenes from the 24 samples. The distribution of 32,361 UCs’ average expression levels in the 24 samples showed that half of the UCs were expressed at less than 0.19 FPKM (fragments per kilobase of exon model per million mapped reads) and three-quarters of the UCs were expressed at less than 4.81 FPKM (Additional file 1: Figure S10). The correlation coefficients of the FPKM values among the samples were comparable to previous RNA-Seq studies [15,37] (Additional file 1: Figures S8 and S9). By comparing before and after 30 minutes of exercise, we identified 1,285 differentially expressed genes (DEGs), consisting of 62 up- and 80 downregulated UCs in blood and 878 up- and 285 downregulated UCs in muscle (Additional file 1: Supplementary Methods and Figure  1D). While the overall number of all the differentially expressed UCs was much larger in muscle than in blood, the number of novel differentially expressed UCs was larger in blood (42 UCs) than in muscle (8 UCs) (Additional file 1: Table S16).

We examined 28 genes that are known to be associated with racing performance in horses [38]. Twelve of the 25 genes successfully mapped on the genome annotation were expressed in muscle and blood, and six were differentially expressed in muscle (Additional file 1: Table S17). The rest were detected in neither of the two tissues. The six genes upregulated by exercise were: HIF1A, which encodes a transcription factor that responds to hypoxia; ADRB2, which is involved in the regulation of energy expenditure and lipid mobilization from adipose; PPARD, which regulates expression of genes involved in lipid and carbohydrate metabolisms; VEGF, which is an important angiogenic factor recovering the oxygen supply to tissues when blood vessels are blocked; TNC, which is located in positive-selected regions for racing performance [4]; and BDNF, which is a candidate gene that may be associated with exercise behavior [39].

We also compared differentially expressed genes (DEGs) in muscle tissue with the 15 upregulated and 53 downregulated DEGs that are associated with exercise training [25] (Additional file 1: Table S18). Among these 68 DEGs, only five genes, ACTR3B, FBXO32, PER3, C1orf51, and GATM, were identified as DEGs in this study, among which C1orf51 and PER3 showed a different expression profile.

Sampling the transcriptomes immediately after exercise enabled the identification of differentially expressed early response genes that are rapidly induced by exercise. Many early response proteins include transcriptional regulators, such as Mitogen-activated protein kinases (MAPKs) and NF-κB, which promote fuel homeostasis and prevent skeletal muscle atrophy [40]. In addition, important oxidative stress-sensitive enzymes that can be activated by NF-κB and MAPKs after exercise, such as inducible nitric oxide synthase (iNOS; ENSECAT00000026843) [41], were upregulated in horse muscle after exercise. Among the 1,285 DEGs from the two tissues, we identified 91 transcription factors, which might regulate downstream components of exercise-triggered signaling pathways (Additional file 1: Table S19). GATA2, which can interact with AP1 transcription factors to regulate MAPK and NF-κB signaling [42], was underexpressed in blood, while CREB5, whose zinc-finger and bZIP domain can specifically bind to the CRE with c-Jun or CRE-BP1 [43], was overexpressed in muscle. Upregulation of CREB5 might be explained by the fact that CREB5 and c-Jun genes are involved in calcium-dependent transcriptional pathways in skeletal muscle [44].

At least 56 uncharacterized transcription factors could be candidates for novel primary transcriptional regulators accompanying exercise. We validated the expression levels of seven randomly chosen transcription factors using quantitative RT-PCR from the remaining sample materials (Additional file 1: Tables S20 and S21).

Switching the expression pattern of alternative splicing forms of the gene before and after exercising

Four genes (three from muscle, one from blood) showed interesting RNA expression patterns, in which two different alternative splicing forms of the same gene showed reversed expression patterns before and after exercising, similar to that of the SXL gene in Drosophila [45]. This observation suggested a cost-effective method of regulation: the cells do not have to produce completely new exons and proteins, but merely change the composition of the existing exons [46]. The genes with reversed expression are: AXL, DYNC1, PLEKHG1, and COBLL1 (Additional file 1: Table S22 and Additional file 1: Figure S11). Figure  2 shows cytoplasmic dynein intermediate chain (ENSECAG00000020218) protein (DYNC1) [47] as an example of the reversed expression pattern in muscle before and after exercising.

thumbnailFigure 2. Switching expressions of alternatively spliced forms before and after exercise. (A) Red bars are the exons of the two transcripts of the DYNC1 gene: ENSECAT00000021919 and ENSECAT00000021863. (B) Each plot shows the gene expression level (FPKM value; fragments per kilobase of exon per million fragments mapped) of the two transcripts (Blue lines represent the ENSECAT00000021919 transcript and red lines represent the ENSECAT00000021863 transcript) in each individual horse, whose name is shown as the plot title. Percentages inside the plots are the coverages of the transcripts.

Conclusion

We generated a large amount of horse transcriptome data. Their analyses provided candidate genes that are related to horse racing performance: six previously identified exercise-associated genes and 91 early regulated transcription factors that are differentially expressed by exercise, three genes that display high SNV density, and four alternatively expressed splicing variants. In addition, all 1,258 differentially expressed genes could be important candidate genes for further research.

Methods

The muscle and blood samples from six retired thoroughbred horses were taken before and after trotting (30 minutes). 90-bp pair-end sequences were obtained with an Illumina HiSeq2000, San Diego, US, from the samples. These sequences were mapped against the horse reference genome (Ensembl release 62) using TopHat 1.2.0 with two options (−−mate-inner-dist = 200 and --allow-indels) for paired-end sequences and identified unigenes using the Cufflinks program without genome annotation data. From the results of 24 samples, novel genes were clustered based on the genomic coordination to define unigene clusters (UCs). The generated UCs were subjected to the filter, which extracted UCs overlapping with the genome annotation. The expressed genes annotated by the pipeline and the filtered UCs were merged as the final set of UCs. In addition, unmapped sequences were assembled by the SOAPdenovo [28] program and were subjected to an ORF length filtering process. In-house bioinformatics pipelines were used to identify SNVs under several stringent conditions, especially for the exon-intron boundary misalignment filter (See Additional file 1: Supplementary Methods). Differentially expressed genes were selected by comparing the expression profiles of the six horses, with a selection criterion of more than two fold up- or downregulation in four or more horses.

Data access

All raw sequences of the horse transcriptomes are openfreely available at http://www.horsegenome.net/ webcite.

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

KDP, THK, SL, HKL, BWC, and JB designed and supervised the experiments and analyses. KDP, JP, BCK, HKL, BWC, and JB supervised the progress of the project. KDP, BWC, KTD, YMY, HSK, CK, and THK prepared blood and muscle samples from the six horses. CK, BCK, and THK generated sequences from the samples. JP, HC, HMK, SS, and SJ conducted the bioinformatics analyses. HSK, JP, and JB designed the validation experiments, and KA and KTD conducted the experiments. JP, BCK, SL, SH, JK, and JB wrote the manuscript and BCK, HC, SL, SH, and YMY participated in improving the manuscript. All authors read and approved the final manuscript.

Acknowledgements

This work was supported by a grant from the Next-Generation BioGreen 21 Program (No.PJ0081062011), Rural Development Administration, Republic of Korea. The Theragen team was supported by the Industrial Strategic Technology Development Program, 10040231, "Bioinformatics platform development for next generation bioinformation analysis", funded by the Ministry of Knowledge Economy (MKE, Korea). SH was supported by a grant from KRIBB Research Initiative Program. We thank Maryana Bhak for editing.

References

  1. Sons W: An Introduction to a General Stud Book. Weatherby and Sons, London; 1791. OpenURL

  2. Gordon J: The Horse Industry Contributing to the Australian Economy.

    Canberra: Rural Industries Research and Development Corporation 2001, 1:1-58. OpenURL

  3. Booth FW, Tseng BS, Fluck M, Carson JA: Molecular and cellular adaptation of muscle in response to physical training.

    Acta Physiol Scand 1998, 162(3):343-350. PubMed Abstract | Publisher Full Text OpenURL

  4. Gu J, Orr N, Park SD, Katz LM, Sulimova G, MacHugh DE, Hill EW: A genome scan for positive selection in thoroughbred horses.

    PLoS One 2009, 4(6):e5767. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  5. Gunn HM: Muscle, bone and fat proportions and muscle distribution of Thoroughbreds and other horses. ICEEP Publications, Davis, Calif (USA); 1987. OpenURL

  6. Das PJ, Paria N, Gustafson-Seabury A, Vishnoi M, Chaki SP, Love CC, Varner DD, Chowdhary BP, Raudsepp T: Total RNA isolation from stallion sperm and testis biopsies.

    Theriogenology 2010, 74(6):1099-1106.

    1106e1091-1092

    PubMed Abstract | Publisher Full Text OpenURL

  7. Eisen A: Recent considerations in the etiopathogenesis of ALS.

    Suppl Clin Neurophysiol 2004, 57:187-190. PubMed Abstract OpenURL

  8. Wade CM, Giulotto E, Sigurdsson S, Zoli M, Gnerre S, Imsland F, Lear TL, Adelson DL, Bailey E, Bellone RR, et al.: Genome sequence, comparative analysis, and population genetics of the domestic horse.

    Science 2009, 326(5954):865-867. PubMed Abstract | Publisher Full Text OpenURL

  9. Poole D: Current concepts of oxygen transport during exercise.

    Equine Comp Exerc Physiol 2004, 1:5-22. Publisher Full Text OpenURL

  10. Pascual I, Dhar AK, Fan Y, Paradis MR, Arruga MV, Alcivar-Warren A: Isolation of expressed sequence tags from a Thoroughbred horse (Equus caballus) 5'-RACE cDNA library.

    Anim Genet 2002, 33(3):231-232. PubMed Abstract | Publisher Full Text OpenURL

  11. Cloonan N, Forrest AR, Kolle G, Gardiner BB, Faulkner GJ, Brown MK, Taylor DF, Steptoe AL, Wani S, Bethel G, et al.: Stem cell transcriptome profiling via massive-scale mRNA sequencing.

    Nat Methods 2008, 5(7):613-619. PubMed Abstract | Publisher Full Text OpenURL

  12. Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B: Mapping and quantifying mammalian transcriptomes by RNA-Seq.

    Nat Methods 2008, 5(7):621-628. PubMed Abstract | Publisher Full Text OpenURL

  13. Nagalakshmi U, Wang Z, Waern K, Shou C, Raha D, Gerstein M, Snyder M: The transcriptional landscape of the yeast genome defined by RNA sequencing.

    Science 2008, 320(5881):1344-1349. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  14. Gan X, Stegle O, Behr J, Steffen JG, Drewe P, Hildebrand KL, Lyngsoe R, Schultheiss SJ, Osborne EJ, Sreedharan VT, et al.: Multiple reference genomes and transcriptomes for Arabidopsis thaliana.

    Nature 2011, 477(7365):419-423. PubMed Abstract | Publisher Full Text OpenURL

  15. Marioni JC, Mason CE, Mane SM, Stephens M, Gilad Y: RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays.

    Genome Res 2008, 18(9):1509-1517. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  16. Fu X, Fu N, Guo S, Yan Z, Xu Y, Hu H, Menzel C, Chen W, Li Y, Zeng R, et al.: Estimating accuracy of RNA-Seq and microarrays with proteomics.

    BMC Genomics 2009, 10:161. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  17. Li M, Wang IX, Li Y, Bruzel A, Richards AL, Toung JM, Cheung VG: Widespread RNA and DNA sequence differences in the human transcriptome.

    Science 2011, 333(6038):53-58. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  18. Ju YS, Kim JI, Kim S, Hong D, Park H, Shin JY, Lee S, Lee WC, Yu SB, Park SS, et al.: Extensive genomic and transcriptional diversity identified through massively parallel DNA and RNA sequencing of eighteen Korean individuals.

    Nat Genet 2011, 43(8):745-752. PubMed Abstract | Publisher Full Text OpenURL

  19. Severin AJ, Woody JL, Bolon YT, Joseph B, Diers BW, Farmer AD, Muehlbauer GJ, Nelson RT, Grant D, Specht JE, et al.: RNA-Seq Atlas of Glycine max: a guide to the soybean transcriptome.

    BMC Plant Biol 2010, 10:160. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  20. Zenoni S, Ferrarini A, Giacomelli E, Xumerle L, Fasoli M, Malerba G, Bellin D, Pezzotti M, Delledonne M: Characterization of transcriptional complexity during berry development in Vitis vinifera using RNA-Seq.

    Plant Physiol 2010, 152(4):1787-1795. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  21. Canovas A, Rincon G, Islas-Trejo A, Wickramasinghe S, Medrano JF: SNP discovery in the bovine milk transcriptome using RNA-Seq technology.

    Mamm Genome 2010, 21(11–12):592-598. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  22. Chepelev I, Wei G, Tang Q, Zhao K: Detection of single nucleotide variations in expressed exons of the human genome using RNA-Seq.

    Nucleic Acids Res 2009, 37(16):e106. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  23. Lu T, Lu G, Fan D, Zhu C, Li W, Zhao Q, Feng Q, Zhao Y, Guo Y, Huang X, et al.: Function annotation of the rice transcriptome at single-nucleotide resolution by RNA-seq.

    Genome Res 2010, 20(9):1238-1249. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  24. Coleman SJ, Zeng Z, Wang K, Luo S, Khrebtukova I, Mienaltowski MJ, Schroth GP, Liu J, MacLeod JN: Structural annotation of equine protein-coding genes determined by mRNA sequencing.

    Anim Genet 2010, 41(Suppl 2):121-130. PubMed Abstract | Publisher Full Text OpenURL

  25. McGivney BA, McGettigan PA, Browne JA, Evans AC, Fonseca RG, Loftus BJ, Lohan A, MacHugh DE, Murphy BA, Katz LM, et al.: Characterization of the equine skeletal muscle transcriptome identifies novel functional responses to exercise training.

    BMC Genomics 2010, 11:398. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  26. Trapnell C, Pachter L, Salzberg SL: TopHat: discovering splice junctions with RNA-Seq.

    Bioinformatics 2009, 25(9):1105-1111. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  27. Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, van Baren MJ, Salzberg SL, Wold BJ, Pachter L: Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation.

    Nat Biotechnol 2010, 28(5):511-515. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  28. Li R, Yu C, Li Y, Lam TW, Yiu SM, Kristiansen K, Wang J: SOAP2: an improved ultrafast tool for short read alignment.

    Bioinformatics 2009, 25(15):1966-1967. PubMed Abstract | Publisher Full Text OpenURL

  29. Gregg C, Zhang J, Weissbourd B, Luo S, Schroth GP, Haig D, Dulac C: High-resolution analysis of parent-of-origin allelic expression in the mouse brain.

    Science 2010, 329(5992):643-648. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  30. Granzier HL, Labeit S: The giant protein titin: a major player in myocardial mechanics, signaling, and disease.

    Circ Res 2004, 94(3):284-295. PubMed Abstract | Publisher Full Text OpenURL

  31. Gajdosik RL: Passive extensibility of skeletal muscle: review of the literature with clinical implications.

    Clin Biomech (Bristol, Avon) 2001, 16(2):87-101. Publisher Full Text OpenURL

  32. Young P, Ehler E, Gautel M: Obscurin, a giant sarcomeric Rho guanine nucleotide exchange factor protein involved in sarcomere assembly.

    J Cell Biol 2001, 154(1):123-136. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  33. Kontrogianni-Konstantopoulos A, Jones EM, Van Rossum DB, Bloch RJ: Obscurin is a ligand for small ankyrin 1 in skeletal muscle.

    Mol Biol Cell 2003, 14(3):1138–-1148. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  34. Jayaraman T, Brillantes AM, Timerman AP, Fleischer S, Erdjument-Bromage H, Tempst P, Marks AR: FK506 binding protein associated with the calcium release channel (ryanodine receptor).

    J Biol Chem 1992, 267(14):9474-9477. PubMed Abstract | Publisher Full Text OpenURL

  35. Zorzato F, Fujii J, Otsu K, Phillips M, Green NM, Lai FA, Meissner G, MacLennan DH: Molecular cloning of cDNA encoding human and rabbit forms of the Ca2+ release channel (ryanodine receptor) of skeletal muscle sarcoplasmic reticulum.

    J Biol Chem 1990, 265(4):2244-2256. PubMed Abstract | Publisher Full Text OpenURL

  36. Robinson RL, Brooks C, Brown SL, Ellis FR, Halsall PJ, Quinnell RJ, Shaw MA, Hopkins PM: RYR1 mutations causing central core disease are associated with more severe malignant hyperthermia in vitro contracture test phenotypes.

    Hum Mutat 2002, 20(2):88-97. PubMed Abstract | Publisher Full Text OpenURL

  37. Ramskold D, Wang ET, Burge CB, Sandberg R: An abundance of ubiquitously expressed genes revealed by tissue transcriptome sequence data.

    PLoS Comput Biol 2009, 5(12):e1000598. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  38. Schroder W, Klostermann A, Distl O: Candidate genes for physical performance in the horse.

    Vet J 2011, 190(1):39-48. PubMed Abstract | Publisher Full Text OpenURL

  39. Bryan A, Hutchison KE, Seals DR, Allen DL: A transdisciplinary model integrating genetic, physiological, and psychological correlates of voluntary exercise.

    Health Psychol 2007, 26(1):30-39. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  40. Kramer HF, Goodyear LJ: Exercise, MAPK, and NF-kappaB signaling in skeletal muscle.

    J Appl Physiol 2007, 103(1):388-395. PubMed Abstract | Publisher Full Text OpenURL

  41. Ji LL, Gomez-Cabrera MC, Vina J: Exercise and hormesis: activation of cellular antioxidant signaling pathway.

    Ann N Y Acad Sci 2006, 1067:425-435. PubMed Abstract | Publisher Full Text OpenURL

  42. Kawana M, Lee ME, Quertermous EE, Quertermous T: Cooperative interaction of GATA-2 and AP1 regulates transcription of the endothelin-1 gene.

    Mol Cell Biol 1995, 15(8):4225-4231. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  43. Nomura N, Zu YL, Maekawa T, Tabata S, Akiyama T, Ishii S: Isolation and characterization of a novel member of the gene family encoding the cAMP response element-binding protein CRE-BP1.

    J Biol Chem 1993, 268(6):4259-4266. PubMed Abstract | Publisher Full Text OpenURL

  44. Chin ER: The role of calcium and calcium/calmodulin-dependent kinases in skeletal muscle plasticity and mitochondrial biogenesis.

    Proc Nutr Soc 2004, 63(2):279-286. PubMed Abstract | Publisher Full Text OpenURL

  45. Bell LR, Maine EM, Schedl P, Cline TW: Sex-lethal, a Drosophila sex determination switch gene, exhibits sex-specific RNA splicing and sequence similarity to RNA binding proteins.

    Cell 1988, 55(6):1037-1046. PubMed Abstract | Publisher Full Text OpenURL

  46. Blencowe BJ: Alternative splicing: new insights from global analyses.

    Cell 2006, 126(1):37-47. PubMed Abstract | Publisher Full Text OpenURL

  47. Kuta A, Deng W, Morsi El-Kadi A, Banks GT, Hafezparast M, Pfister KK, Fisher EM: Mouse cytoplasmic dynein intermediate chains: identification of new isoforms, alternative splicing and tissue distribution of transcripts.

    PLoS One 2010, 5(7):e11682. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL