|Correspondence to mammalian genes and estimated efficiencies of cloning of start codons of EST assemblies|
|Unique Gene ID (without HomoloGene ID)||Unique HomoloGene ID||Assemblies matched to protein sequences||Assemblies estimated to include start codons|
Numbers of genes that had unique NCBI Gene IDs and corresponded to contigs and singlets generated by assembly of expressed sequence tags (ESTs) are indicated. Also shown are the numbers that had unique Gene IDs in the NCBI HomoloGene database (a database of orthologs among species) and corresponded to the contigs and singlets generated. Numbers in parentheses indicate numbers of gene IDs that had no corresponding HomoloGene IDs. HomoloGene IDs in pigs are not indicated, because there is no HomoloGene ID database for pig genes.
EST assemblies were estimated to contain start codons if the length upstream of the matches (BLAST score >50) in the assemblies was greater than that between the start base of the coding sequence and the matched region of the corresponding gene. Numbers of assemblies (contigs and singlets) corresponding to protein sequences in humans, mice, cattle, dogs, and pigs are also shown.
Uenishi et al.
Uenishi et al. BMC Genomics 2012 13:581 doi:10.1186/1471-2164-13-581