Large gene overlaps in prokaryotic genomes: result of functional constraints or mispredictions?1 Biochemistry and Biotechnology Department, Rovira i Virgili University, C/Marcel·lí Domingo s/n, 43007 Tarragona, Catalunya, Spain 2 European Molecular Biological Laboratory, Meyerhofstrasse, 1, 69012 Heidelberg, Germany 3 Max Delbrück Centre for Molecular Medicine, Berlin-Buch, Robert-Rössle-Strasse 10, D-13092 Berlin, Germany
BMC Genomics 2008, 9:335doi:10.1186/1471-2164-9-335
Additional filesAdditional file 1: The 968 overlaps manually analysed. The co-directional, convergent and divergent overlaps analysed. They are separated depending on the orientation of the pair. The genes identification is made joining the Taxonomy ID of the species which contains the gene and the gene name separated by a dot. The columns are the upstream and the downstream gene ids, the functions of the protein encoded in the genes, the orientation, the overlapping length and the type of misannotation. Notice that the types of misannotations are described at the end of each of the lists. Format: XLS Size: 274KB Download file This file can be viewed with: Microsoft Excel Viewer Additional file 2: Number of misannotations per genome in each category. Summary of the mispredicted overlaps found within the genome of each species sorted by categories. Format: XLS Size: 77KB Download file This file can be viewed with: Microsoft Excel Viewer Additional file 3: Misannotations related to some genome features. Table summarizing the genomes with more misannotations and some features of the genome such as genome length, gene content, GC content, sequencing method, annotating method and sequence date. Format: XLS Size: 34KB Download file This file can be viewed with: Microsoft Excel Viewer Additional file 4: Start codons analysis. Study of the start codons usage found among the three normal gene sets (random set I, II and II), which contains well-characterized non-overlapping genes randomly selected, and within the mispredicted start codon gene set. The usage and percentage of usage of each alternative start codon considered (AUG, GUG, UUG, other) is shown in the rows. Format: DOC Size: 100KB Download file This file can be viewed with: Microsoft Word Viewer |




on Google Scholar







author email
corresponding author email