References

  1. Miller W: Comparison of genomic DNA sequences: solved and unsolved problems.

    Bioinformatics 2001, 17:391-397. PubMed Abstract | Publisher Full Text OpenURL

  2. Frazer KA, Elnitski L, Church DM, Dubchak I, Hardison RC: Cross-species sequence comparisons: a review of methods and available resources.

    Genome Res 2003, 13:1-12. PubMed Abstract | Publisher Full Text OpenURL

  3. McClure MA, Vasi TK, Fitch WM: Comparative analysis of multiple protein-sequence alignment methods.

    Mol Biol Evol 1994, 11:571-592. PubMed Abstract | Publisher Full Text OpenURL

  4. Thompson JD, Plewniak F, Poch O: A comprehensive comparison of multiple sequence alignment programs.

    Nucleic Acids Res 1999, 27:2682-2690. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  5. Sauder JM, Arthur JW, Dunbrack Jr RL: Large-scale comparison of protein sequence alignment algorithms with structure alignments.

    Proteins 2000, 40:6-22. PubMed Abstract | Publisher Full Text OpenURL

  6. Brenner SE, Chothia C, Hubbard TJ: Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships.

    Proc Natl Acad Sci U S A 1998, 95:6073-6078. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  7. Bray N, Dubchak I, Pachter L: AVID: A Global Alignment Program.

    Genome Res 2003, 13:97-102. PubMed Abstract | Publisher Full Text OpenURL

  8. Brudno M, Do CB, Cooper GM, Kim MF, Davydov E, Program NC, Green ED, Sidow A, Batzoglou S: LAGAN and Multi-LAGAN: Efficient Tools for Large-Scale Multiple Alignment of Genomic DNA.

    Genome Res 2003. OpenURL

  9. Stoye J, Evers D, Meyer F: Rose: generating sequence families.

    Bioinformatics 1998, 14:157-163. PubMed Abstract | Publisher Full Text OpenURL

  10. Hillis DM, Huelsenbeck JP, Cunningham CW: Application and accuracy of molecular phylogenies.

    Science 1994, 264:671-677. PubMed Abstract | Publisher Full Text OpenURL

  11. Thorne JL, Kishino H, Felsenstein J: An evolutionary model for maximum likelihood alignment of DNA sequences.

    J Mol Evol 1991, 33:114-124. PubMed Abstract | Publisher Full Text OpenURL

  12. Thorne JL, Kishino H, Felsenstein J: Inching toward reality: an improved likelihood model of sequence evolution.

    J Mol Evol 1992, 34:3-16. PubMed Abstract | Publisher Full Text OpenURL

  13. Holmes I, Durbin R: Dynamic programming alignment accuracy.

    J Comput Biol 1998, 5:493-504. PubMed Abstract | Publisher Full Text OpenURL

  14. Stoye J: Multiple sequence alignment with the Divide-and-Conquer method.

    Gene 1998, 211:GC45-56. PubMed Abstract | Publisher Full Text OpenURL

  15. Hein J, Wiuf C, Knudsen B, Moller MB, Wibling G: Statistical alignment: computational properties, homology testing and goodness-of-fit.

    J Mol Biol 2000, 302:265-279. PubMed Abstract | Publisher Full Text OpenURL

  16. Katoh K, Misawa K, Kuma K, Miyata T: MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

    Nucleic Acids Res 2002, 30:3059-3066. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  17. Lassmann T, Sonnhammer EL: Quality assessment of multiple alignment programs.

    FEBS Lett 2002, 529:126-130. PubMed Abstract | Publisher Full Text OpenURL

  18. Metzler D: Statistical alignment based on fragment insertion and deletion models.

    Bioinformatics 2003, 19:490-499. PubMed Abstract | Publisher Full Text OpenURL

  19. Celniker SE, Wheeler DA, Kronmiller B, Carlson JW, Halpern A, Patel S, Adams M, Champe M, Dugan SP, Frise E, Hodgson A, George RA, Hoskins RA, Laverty T, Muzny DM, Nelson CR, Pacleb JM, Park S, Pfeiffer BD, Richards S, Svirskas R, Tabor PE, Wan K, Scherer SE, Stapleton M, Sutton GG, Venter C, Weinstock G, Myers EW, Gibbs RA, Rubin GM: Finishing a whole genome shotgun sequence assembly: Release 3 of the Drosophila euchromatic genome sequence.

    Genome Biology 2002, 3:research0079.1-research0079.14. BioMed Central Full Text OpenURL

  20. Misra S, Crosby MA, Mungall CJ, Matthews BB, Campbell K, Hradecky P, Huang Y, Kaminker JS, Millburn GH, Prochnik SE, Smith CD, Tupy JL, Whitfield EJ, Bayraktaroglu L, Berman BP, Celniker SE, A.D.N.J. de Grey., Drysdale RA, Harris NL, Richter J, Russo S, Shu S, Stapleton M, Yamada C, Ashburner M, Gelbart WM, Rubin GM, Lewis SE: Annotation of the Drosophila euchromatic genome: a systematic review.

    Genome Biology 2002, 3:research0083.1-research0083.22. BioMed Central Full Text OpenURL

  21. Baylor College of Medicine Drosophila Genome Project [http://www.hgsc.bcm.tmc.edu/projects/drosophila/]
  22. Petrov DA, Lozovskaya ER, Hartl DL: High intrinsic rate of DNA loss in Drosophila.

    Nature 1996, 384:346-349. PubMed Abstract | Publisher Full Text OpenURL

  23. Petrov DA, Hartl DL: High rate of DNA loss in the Drosophila melanogaster and Drosophila virilis species groups.

    Mol Biol Evol 1998, 15:293-302. PubMed Abstract | Publisher Full Text OpenURL

  24. Kaminker JS, Bergman CM, Kronmiller B, Carlson J, Svirskas R, Patel S, Frise E, Wheeler DA, Lewis S, Rubin GM, Ashburner A, Celniker SE: The transposable elements of the Drosophila melanogaster euchromatin – a genomics perspective.

    Genome Biology 2002, 3:research0084. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  25. Bergman CM, Kreitman M: Analysis of conserved noncoding DNA in Drosophila reveals similar constraints in intergenic and intronic sequences.

    Genome Res 2001, 11:1335-1345. PubMed Abstract | Publisher Full Text OpenURL

  26. Nekrutenko A, Makova KD, Li WH: The K(A)/K(S) ratio test for assessing the protein-coding potential of genomic regions: an empirical and simulation study.

    Genome Res 2002, 12:198-202. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  27. Waterston RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, Agarwal P, Agarwala R, Ainscough R, Alexandersson M, An P, Antonarakis SE, Attwood J, Baertsch R, Bailey J, Barlow K, Beck S, Berry E, Birren B, Bloom T, Bork P, Botcherby M, Bray N, Brent MR, Brown DG, Brown SD, Bult C, Burton J, Butler J, Campbell RD, Carninci P, Cawley S, Chiaromonte F, Chinwalla AT, Church DM, Clamp M, Clee C, Collins FS, Cook LL, Copley RR, Coulson A, Couronne O, Cuff J, Curwen V, Cutts T, Daly M, David R, Davies J, Delehaunty KD, Deri J, Dermitzakis ET, Dewey C, Dickens NJ, Diekhans M, Dodge S, Dubchak I, Dunn DM, Eddy SR, Elnitski L, Emes RD, Eswara P, Eyras E, Felsenfeld A, Fewell GA, Flicek P, Foley K, Frankel WN, Fulton LA, Fulton RS, Furey TS, Gage D, Gibbs RA, Glusman G, Gnerre S, Goldman N, Goodstadt L, Grafham D, Graves TA, Green ED, Gregory S, Guigo R, Guyer M, Hardison RC, Haussler D, Hayashizaki Y, Hillier LW, Hinrichs A, Hlavina W, Holzer T, Hsu F, Hua A, Hubbard T, Hunt A, Jackson I, Jaffe DB, Johnson LS, Jones M, Jones TA, Joy A, Kamal M, Karlsson EK, Karolchik D, Kasprzyk A, Kawai J, Keibler E, Kells C, Kent WJ, Kirby A, Kolbe DL, Korf I, Kucherlapati RS, Kulbokas EJ, Kulp D, Landers T, Leger JP, Leonard S, Letunic I, Levine R, Li J, Li M, Lloyd C, Lucas S, Ma B, Maglott DR, Mardis ER, Matthews L, Mauceli E, Mayer JH, McCarthy M, McCombie WR, McLaren S, McLay K, McPherson JD, Meldrim J, Meredith B, Mesirov JP, Miller W, Miner TL, Mongin E, Montgomery KT, Morgan M, Mott R, Mullikin JC, Muzny DM, Nash WE, Nelson JO, Nhan MN, Nicol R, Ning Z, Nusbaum C, O'Connor MJ, Okazaki Y, Oliver K, Overton-Larty E, Pachter L, Parra G, Pepin KH, Peterson J, Pevzner P, Plumb R, Pohl CS, Poliakov A, Ponce TC, Ponting CP, Potter S, Quail M, Reymond A, Roe BA, Roskin KM, Rubin EM, Rust AG, Santos R, Sapojnikov V, Schultz B, Schultz J, Schwartz MS, Schwartz S, Scott C, Seaman S, Searle S, Sharpe T, Sheridan A, Shownkeen R, Sims S, Singer JB, Slater G, Smit A, Smith DR, Spencer B, Stabenau A, Stange-Thomann N, Sugnet C, Suyama M, Tesler G, Thompson J, Torrents D, Trevaskis E, Tromp J, Ucla C, Ureta-Vidal A, Vinson JP, Von Niederhausern AC, Wade CM, Wall M, Weber RJ, Weiss RB, Wendl MC, West AP, Wetterstrand K, Wheeler R, Whelan S, Wierzbowski J, Willey D, Williams S, Wilson RK, Winter E, Worley KC, Wyman D, Yang S, Yang SP, Zdobnov EM, Zody MC, Lander ES: Initial sequencing and comparative analysis of the mouse genome.

    Nature 2002, 420:520-562. PubMed Abstract | Publisher Full Text OpenURL

  28. Castillo-Davis CI, Hartl DL: Genome evolution and developmental constraint in Caenorhabditis elegans.

    Mol Biol Evol 2002, 19:728-735. PubMed Abstract | Publisher Full Text OpenURL

  29. Stein LD, Bao Z, Blasiar D, Blumenthal T, Brent MR, Chen N, Chinwalla A, Clarke L, Clee C, Coghlan A, Coulson A, D'Eustachio P, Fitch DH, Fulton LA, Fulton RE, Griffiths-Jones S, Harris TW, Hillier LW, Kamath R, Kuwabara PE, Mardis ER, Marra MA, Miner TL, Minx P, Mullikin JC, Plumb RW, Rogers J, Schein JE, Sohrmann M, Spieth J, Stajich JE, Wei C, Willey D, Wilson RK, Durbin R, Waterston RH: The Genome Sequence of Caenorhabditis briggsae: A Platform for Comparative Genomics.

    PLoS Biol 2003, 1:E45. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  30. Zeng LW, Comeron JM, Chen B, Kreitman M: The molecular clock revisited: the rate of synonymous vs. replacement change in Drosophila.

    Genetica 1998, 102-103:369-382. PubMed Abstract | Publisher Full Text OpenURL

  31. Bergman CM, Pfeiffer BD, Rincón-Limas DE, Hoskins RA, Gnirke A, Mungall CJ, Wang AM, Kronmiller B, Pacleb J, Park S, Stapleton M, Wan K, George R, de Jong PJ, Botas J, Rubin GM, Celniker SE: Assessing the impact of comparative genomic sequences data on the functional annotation of the Drosophila genome.

    Genome Biology 2002, 3:research0086.1-research0086.20. BioMed Central Full Text OpenURL

  32. Stoye J, Evers D, Meyer F: Generating benchmarks for multiple sequence alignments and phylogenetic reconstructions.

    Proc Int Conf Intell Syst Mol Biol 1997, 5:303-306. PubMed Abstract | Publisher Full Text OpenURL

  33. Ptak SE, Petrov DA: How intron splicing affects the deletion and insertion profile in Drosophila melanogaster.

    Genetics 2002, 162:1233-1244. PubMed Abstract | Publisher Full Text OpenURL

  34. Thomas JW, Touchman JW, Blakesley RW, Bouffard GG, Beckstrom-Sternberg SM, Margulies EH, Blanchette M, Siepel AC, Thomas PJ, McDowell JC, Maskeri B, Hansen NF, Schwartz MS, Weber RJ, Kent WJ, Karolchik D, Bruen TC, Bevan R, Cutler DJ, Schwartz S, Elnitski L, Idol JR, Prasad AB, Lee-Lin SQ, Maduro VV, Summers TJ, Portnoy ME, Dietrich NL, Akhter N, Ayele K, Benjamin B, Cariaga K, Brinkley CP, Brooks SY, Granite S, Guan X, Gupta J, Haghighi P, Ho SL, Huang MC, Karlins E, Laric PL, Legaspi R, Lim MJ, Maduro QL, Masiello CA, Mastrian SD, McCloskey JC, Pearson R, Stantripop S, Tiongson EE, Tran JT, Tsurgeon C, Vogt JL, Walker MA, Wetherby KD, Wiggins LS, Young AC, Zhang LH, Osoegawa K, Zhu B, Zhao B, Shu CL, De Jong PJ, Lawrence CE, Smit AF, Chakravarti A, Haussler D, Green P, Miller W, Green ED: Comparative analyses of multi-species sequences from targeted genomic regions.

    Nature 2003, 424:788-793. PubMed Abstract | Publisher Full Text OpenURL

  35. Morgenstern B, Frech K, Dress A, Werner T: DIALIGN: finding local similarities by multiple sequence alignment.

    Bioinformatics 1998, 14:290-294. PubMed Abstract | Publisher Full Text OpenURL

  36. Averof M, Rokas A, Wolfe KH, Sharp PM: Evidence for a high frequency of simultaneous double-nucleotide substitutions.

    Science 2000, 287:1283-1286. PubMed Abstract | Publisher Full Text OpenURL

  37. Arndt PF, Burge CB, Hwa T: DNA sequence evolution with neighbor-dependent mutation.

    J Comput Biol 2003, 10:313-322. PubMed Abstract | Publisher Full Text OpenURL

  38. Siepel A, Haussler D: Phylogenetic Estimation of Context-Dependent Substitution Rates by Maximum Likelihood.

    Mol Biol Evol 2003. OpenURL

  39. AlignmentBenchmarking [http://rana.lbl.gov/AlignmentBenchmarking]
  40. Boffelli D, McAuliffe J, Ovcharenko D, Lewis KD, Ovcharenko I, Pachter L, Rubin EM: Phylogenetic shadowing of primate sequences to find functional regions of the human genome.

    Science 2003, 299:1391-1394. PubMed Abstract | Publisher Full Text OpenURL

  41. Elnitski L, Hardison RC, Li J, Yang S, Kolbe D, Eswara P, O'Connor MJ, Schwartz S, Miller W, Chiaromonte F: Distinguishing regulatory DNA from neutral sites.

    Genome Res 2003, 13:64-72. PubMed Abstract | Publisher Full Text OpenURL

  42. Cooper GM, Brudno M, Green ED, Batzoglou S, Sidow A: Quantitative estimates of sequence divergence for comparative analyses of mammalian genomes.

    Genome Res 2003, 13:813-820. PubMed Abstract | Publisher Full Text OpenURL

  43. Ludwig MZ, Bergman C, Patel N, Kreitman M: Evidence for stabilizing selection in a eukaryotic cis-regulatory element.

    Nature 2000, 403:564-567. PubMed Abstract | Publisher Full Text OpenURL

  44. Cuadrado M, Sacristan M, Antequera F: Species-specific organization of CpG island promoters at mammalian homologous genes.

    EMBO Rep 2001, 2:586-592. PubMed Abstract | Publisher Full Text OpenURL

  45. Costas J, Casares F, Vieira J: Turnover of binding sites for transcription factors involved in early Drosophila development.

    Gene 2003, 310:215-220. PubMed Abstract | Publisher Full Text OpenURL

  46. Emberly E, Rajewsky N, Siggia ED: Conservation of regulatory elements between two species of Drosophila.

    BMC Bioinformatics 2003, 4:57. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  47. Mungall CJ, Misra S, Berman BP, Carlson J, Frise E, Harris N, Marshall B, Shu S, Kaminker JS, Prochnik SE, Smith CD, Smith E, Tupy JL, Wiel C, Rubin G, Lewis SE: An integrated computational pipeline and database to support whole genome sequence annotation.

    Genome Biology 2002, 3:research0081.1-research0081.11. BioMed Central Full Text OpenURL

  48. Comprehensive R Archive Network [http://cran.r-project.org/]
  49. Weir BS: Genetic Data Analysis II.

    Sunderland, MA, Sinauer Associates, Inc. 1996, 445. OpenURL

  50. Burge C, Campbell AM, Karlin S: Over- and under-representation of short oligonucleotides in DNA sequences.

    Proc Natl Acad Sci U S A 1992, 89:1358-1362. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  51. Katz RW: On some criteria for estimating the order of a Markov chain.

    Technometrics 1981, 23:243-249. OpenURL

  52. Yang Z, Nielsen R: Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models.

    Mol Biol Evol 2000, 17:32-43. PubMed Abstract | Publisher Full Text OpenURL

  53. PAML (version 3.13) [http://abacus.gene.ucl.ac.uk/software/paml.html]
  54. Hasegawa M, Kishino H, Yano T: Dating of the human-ape splitting by a molecular clock of mitochondrial DNA.

    J Mol Evol 1985, 22:160-174. PubMed Abstract | Publisher Full Text OpenURL

  55. Moriyama EN, Hartl DL: Codon usage bias and base composition of nuclear genes in Drosophila.

    Genetics 1993, 134:847-858. PubMed Abstract | Publisher Full Text OpenURL

  56. Moriyama EN, Powell JR: Intraspecific nuclear DNA variation in Drosophila.

    Mol Biol Evol 1996, 13:261-277. PubMed Abstract | Publisher Full Text OpenURL

  57. Comeron JM, Kreitman M: The correlation between intron length and recombination in Drosophila. Dynamic equilibrium between mutational and selective forces.

    Genetics 2000, 156:1175-1190. PubMed Abstract | Publisher Full Text OpenURL

  58. ROSE (version 1.3) [http://bibiserv.techfak.uni-bielefeld.de/rose/]
  59. PHYLIP (version 3.5c) [http://evolution.genetics.washington.edu/phylip.html]

    Seattle OpenURL

  60. Zhu J, Liu JS, Lawrence CE: Bayesian adaptive sequence alignment algorithms.

    Bioinformatics 1998, 14:25-39. PubMed Abstract | Publisher Full Text OpenURL

  61. Tatusova TA, Madden TL: BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences.

    FEMS Microbiol Lett 1999, 174:247-250. PubMed Abstract | Publisher Full Text OpenURL

  62. Jareborg N, Birney E, Durbin R: Comparative analysis of noncoding regions of 77 orthologous mouse and human gene pairs.

    Genome Res 1999, 9:815-824. PubMed Abstract | Publisher Full Text OpenURL

  63. Delcher AL, Phillippy A, Carlton J, Salzberg SL: Fast algorithms for large-scale genome alignment and comparison.

    Nucleic Acids Res 2002, 30:2478-2483. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  64. Ogurtsov AY, Roytberg MA, Shabalina SA, Kondrashov AS: OWEN: aligning long collinear regions of genomes.

    Bioinformatics 2002, 18:1703-1704. PubMed Abstract | Publisher Full Text OpenURL

  65. Pearson WR, Lipman DJ: Improved tools for biological sequence comparison.

    Proc Natl Acad Sci U S A 1988, 85:2444-2448. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  66. Needleman SB, Wunsch CD: A general method applicable to the search for similarities in the amino acid sequence of two proteins.

    J Mol Biol 1970, 48:443-453. PubMed Abstract | Publisher Full Text OpenURL

  67. Schwartz S, Kent WJ, Smit A, Zhang Z, Baertsch R, Hardison RC, Haussler D, Miller W: Human-mouse alignments with BLASTZ.

    Genome Res 2003, 13:103-107. PubMed Abstract | Publisher Full Text OpenURL

  68. Morgenstern B: DIALIGN 2: improvement of the segment-to-segment approach to multiple sequence alignment.

    Bioinformatics 1999, 15:211-218. PubMed Abstract | Publisher Full Text OpenURL

  69. Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.

    Nucleic Acids Res 1994, 22:4673-4680. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  70. Rice P, Longden I, Bleasby A: EMBOSS: the European Molecular Biology Open Software Suite.

    Trends Genet 2000, 16:276-277. PubMed Abstract | Publisher Full Text OpenURL

  71. Kent WJ, Zahler AM: Conservation, regulation, synteny, and introns in a large-scale C. briggsae-C. elegans genomic alignment.

    Genome Res 2000, 10:1115-1125. PubMed Abstract | Publisher Full Text OpenURL