Large-scale mutagenesis screens in the zebrafish employing the mutagen ENU have isolated several hundred mutant loci that represent putative developmental control genes. In order to realize the potential of such screens, systematic genetic mapping of the mutations is necessary. Here we report on a large-scale effort to map the mutations generated in mutagenesis screening at the Max Planck Institute for Developmental Biology by genome scanning with microsatellite markers.
We have selected a set of microsatellite markers and developed methods and scoring criteria suitable for efficient, high-throughput genome scanning. We have used these methods to successfully obtain a rough map position for 319 mutant loci from the Tübingen I mutagenesis screen and subsequent screening of the mutant collection. For 277 of these the corresponding gene is not yet identified. Mapping was successful for 80 % of the tested loci. By comparing 21 mutation and gene positions of cloned mutations we have validated the correctness of our linkage group assignments and estimated the standard error of our map positions to be approximately 6 cM.
By obtaining rough map positions for over 300 zebrafish loci with developmental phenotypes, we have generated a dataset that will be useful not only for cloning of the affected genes, but also to suggest allelism of mutations with similar phenotypes that will be identified in future screens. Furthermore this work validates the usefulness of our methodology for rapid, systematic and inexpensive microsatellite mapping of zebrafish mutations.
Large-scale mutagenesis screens in the zebrafish employing the mutagen ENU have isolated several hundred mutant loci that represent putative developmental control genes [1,2]. In order to realize the potential of such screens, systematic genetic mapping of the mutations is necessary. Genome scanning by bulked segregant analysis with microsatellite markers is the method of choice for such purposes, as a rough map position can be quickly obtained [3,4]. In the zebrafish it is easy to perform mapcrosses against a polymorphic reference line, followed by brother-sister matings among the F1 generation. Linkage to a microsatellite marker can then be found by comparing the band intensities of marker alleles in a pool of mutant F2 individuals with a pool of their wildtype siblings. Because full sibships are analyzed the genetic distance between the mutant locus and a microsatellite can be determined by a simple count of recombinations.
The established reference map for the zebrafish genome is the MGH map [5-7] which was generated by scoring 3,881 microsatellite markers (all of them CA repeats) on a panel of 48 diploid F2 fish of an India × AB reference cross. It covers 2,295 centimorgans (cM) at a resolution of 1.2 cM. Because the MGH markers do not necessarily show a usable polymorphism in reference crosses of Tü × WIK our first task was to identify markers that could be used in such a cross.
Results and discussion
Selection of markers for genome scanning
Two sets of microsatellite markers for scanning the genome were developed in parallel with the mutant mapping effort. The starting point was the testing of 314 markers for polymorphism in Tü × WIK crosses . 72 markers (3 per chromosome) were selected that showed a polymorphism between Tü and WIK, with bands easily distinguishable on agarose gels, in at least three out of five reference crosses (zebrafish genome scan set version 1, or G1). Additional markers from the MGH map that had shown a robust polymorphism in fine-mapping experiments were subsequently added, while markers that never gave any confirmed linkage in our experiments or that were omitted from the MGH map were removed from our set, eventually resulting in the G4 set of 192 markers . An alternate set of markers was generated by testing another 1,092 microsatellite markers from the MGH map in five reference crosses. 178 of these markers were polymorphic in all five reference crosses. Together with 14 additional markers these were selected for the H2 set of 192 markers (Table 1).
Table 1. Sets of microsatellite markers used for scanning the zebrafish genome
The average distance between markers of the G4 set is 11.6 cM, and all distances are smaller than 36 cM, except for a 71.1 cM interval on LG21 (between Z4425 and Z1497). Within this particular interval few MGH markers are available, and no suitably polymorphic marker could be identified in our reference crosses. For the H2 set the average distance is 11.5 cM, and all distances are smaller than 53.8 cM, except for a 83.3 cM interval on LG21. The more uneven chromosomal distribution of markers in the H2 set reflects the fact that frequently the best markers available were already used in the G4 set.
Our mapping methodology as described below can theoretically detect significant linkage over a distance of approximately 36 cM (assuming the genotyping of 48 mutant individuals). However, since the LOD score is proportional to the number of individuals scored, this range can be easily increased by adding more mutant individuals if a linkage is questionable. Our marker sets therefore cover the genome adequately to detect significant linkage with the great majority of mutant loci. All the mutant loci mapped in this work have confirmed linkage to at least one G4 or H2 marker (not shown if the closest flanking markers were selected from outside the sets).
Mapping of mutant loci
We report here on the mapping of 319 mutant loci identified in the ENU-based Tübingen I mutagenesis screen [1,2] and subsequent screening among the mutant collection (1). For 42 of the loci the corresponding genes have already been identified by other researchers, as listed by the ZFIN database ; they are included as controls for our mapping procedure (see below). Not included are 70 successfully mapped loci for which the corresponding genes were already published by ourselves or such a publication is in preparation, or the carriers of which were lost after mapping.
Additional file 1. Mapping results. Listed are 319 mutant loci identified in the ENU-based Tübingen I mutagenesis screen [1,2] and subsequent screening among the mutant collection. Descriptions of all listed mutations are available from the ZFIN database . The list includes 42 mutations for which the corresponding gene was identified by other researchers according to ZFIN. References to the original publications are not included due to space constraints, but are likewise available from ZFIN. We recognize the priority of mapping results already published for these 42 mutations, but include the mutations here as controls for our mapping data (see Table 2). Allele: allele name. Abbr: mutant abbreviation (or "unm" for unnamed mutant). LG: linkage group. Pos: map position deduced from the two markers listed further to the right, in cM from the top of the linkage group. Gene: symbol of the corresponding gene as listed by ZFIN (or "n.d." if not identified). Marker1: closest marker. Marker2: closest marker on the other side of the mutation (if available) or second closest marker on the same side. Pos1, Pos2: marker position on the MGH map, in cM from the top of the linkage group. Dis1, Dis2: distance between mutant locus and marker, in cM. LOD1, LOD2: LOD score for two-point linkage between the mutant locus and marker ("inf" for infinite if no recombinants were found). n1, n2: number of F2 individuals successfully scored. * Marker 2 was determined to be on the same side of the mutant locus as marker 1, since the majority of recombinants for marker 1 is also recombinant for marker 2. However, no usable marker was found on the other side. In these cases only the distance from marker 1 is used to calculate the map position. ** One of the F1 individuals is homozygous for marker 2, allowing only recombinations in the other individual to be scored. Such linkages are listed in order to support chromosomal assignments if no other usable marker was found. However, the distances are not comparable to the sex-averaged map because male and female recombination rates differ to an unknown extent. In these cases the mutation is placed at the position of marker 1. *** ali and bxe have ambiguous positions because marker 1 and marker 2 are on the same side of the mutant locus and the distance to both markers is the same. Thus, ali can be placed either at 107.5 or 134.0 cM from the top of LG21, and bxe either 34.7 or 78.7 cM from the top of LG13.
Format: CSV Size: 22KB Download file
For each mutation we crossed mutant carriers against the polymorphic reference line WIK which was established in our lab for this purpose . Brother-sister matings were performed in the F1 and the F2 progeny was sorted by phenotype. DNA was prepared on 96-well plates, and aliquots of 36 – 48 mutant F2 individuals and their wildtype siblings were pooled. Genome scanning was performed by PCR of the mutant and sibling pools with the markers of the G4 marker set, and the band intensities on agarose gels were quantified semi-automatically using NIH Image software as well as visually assessed to identify potential linkages. Mutant and sibling pools representing up to 24 different mutations were tested in parallel. Verification of the best potential linkages (up to six) for each mutation was then attempted by performing PCR of the respective marker with the individual mutants and siblings that had been used for pooling, and counting the recombinant genotypes (for the genotype data see 2). Siblings were always included on the same gel as a control to confirm that the marker is polymorphic and the two polymorphic bands appear at the proper frequency. If no potential linkage could be verified for a mutation and sufficient material was available, the procedure was repeated once with the G4 marker set, and another two times with the H2 marker set. If possible, DNA was prepared from a different F1 pair for each genome scan, since the Tübingen and WIK lines used are not isogenic and markers that show no usable polymorphism in progeny of one F2 pair are therefore sometimes usable in progeny of another one.
Additional file 2. Genotypes. This file lists the scoring results of individual F2 fish that were used to determine the map positions of the mutations. Row: template plate row, identifies the individuals scored; one row of the file corresponds to two rows of wells on a microtiter plate. Abbr: mutant abbreviation. Allele: mutant allele. Marker: marker name. Genotypes: 24 genotypes, encoded as follows: "1", homozygous for the upper band (which may be either Tü or WIK, dependent on the marker and F1 cross), "2", homozygous for the lower band, "3", heterozygous, "0" or "-", not determined (bad gel lane or microtiter well, respectively).
Format: CSV Size: 104KB Download file
A potential linkage was considered confirmed if it had a two-point LOD score equal or greater than 3. The individuals were then genotyped for all polymorphic markers from the same marker set and chromosomal region in order to identify, if possible, a pair of markers flanking the mutation, and if that was not possible, the two closest markers on one side of the mutation. Occasionally additional markers not in the chosen marker set were also included in the genotyping. Decisions on whether or not a mutation was flanked by two markers were based on whether recombinations with the markers occurred independently. For details of the mapping procedure and the calculation of map positions see the Methods section and .
In total, mapping was attempted for 486 mutations from the Tübingen I screen and subsequent screens of the mutant collection and successful for 389, giving a success rate of 80 %. 12 of these could be mapped only with the H2 set. Unsuccessful mapping experiments were due to difficulties in obtaining sufficient F2 individuals and to PCR problems as well as to a lack of polymorphic markers in our marker set. Among the mutations to be mapped, a group of 63 was prioritized based on interest in their phenotypes. For each of these several additional mapcrosses were set up (data not shown). 56 mutations of this group, or 89 % were successfully mapped, providing a lower limit for the percentage of mutations that our marker sets and methodology is capable of mapping if sufficient F2 individuals are available. The biggest distance to markers on either side at which we could confirm linkage was 31.9 cM (for the mutation spt), approaching the theoretical cutoff of 36 cM.
Chromosomal distribution of mutant loci
Between 1,400 and 2,400 zebrafish genes have been estimated to have visible mutant phenotypes in embryonic and early larval development [1,11]. Therefore the loci reported in this work represent at least one eighth and possibly as much as quarter of all the loci that can be mutated to give a visible phenotype.
The number of mapped loci assigned to each chromosome is between 6 and 32 (on average 12.8 ± 5.8) (Figure 1). These numbers are not significantly correlated with the number of mutant loci per chromosome identified by insertional mutagenesis in the laboratory of N. Hopkins ([11,12] and unpublished data, available from ZFIN ) (R2 = 0.02 assuming a linear regression relationship) or with the number of Ensembl genes per chromosome in the Ensembl Zv6 assembly  (R2 = 0.19); by comparison, the values of Amsterdam et al. have a slightly stronger correlation to the number of Ensembl transcripts (R2 = 0.28). Because mapping with our methodology was successful for 80% of all mutations for which it was attempted, possible deficiencies of the mapping method cannot fully account for this low correlation. Rather, it probably reflects an uneven distribution of genes with specific, visible phenotypes in embryos or early larvae as identified in ENU mutagenesis screening, and the absence of such selectivity in the insertional mutagenesis experiment, demonstrating that both types of mutagenesis experiments complement each other in their coverage of their genome. Moreover, we cannot rule out region-specific differences in ENU mutagenesis efficiency.
Figure 1. Distribution of mapped mutations among the zebrafish chromosomes. Light blue, ENU mutations mapped in the present work. Purple, insertional mutations from the laboratory of N. Hopkins ( and unpublished data, available from ZFIN ), shown for comparison. Numbers for insertional mutations were obtained by searching ZFIN for mutations with a "hi" designation assigned to each linkage group and eliminating multiple hits of the same gene as well as mutations with ambiguous chromosomal assignments. Yellow, Ensembl genes predictions for each chromosome (× 100) (Ensembl release Zv6, available from Ensembl ) The number of mapped mutations or genes is indicated on the vertical axis, the linkage group (LG) number on the horizontal axis.
Assessment of mapping quality
In order to assess the quality of our mapping data we looked at the 42 mutant loci that were cloned by other researchers. For 21 of these independently derived map positions of the affected gene are publicly available on ZMAP (an integrated map produced by intercalating data from several mapping panels into the MGH genetic map, available from ZFIN ; Allen Day, Tom Conlin and John H. Postlethwait, unpublished) (Table 2).
Table 2. Comparison of mutant and gene positions
A comparison of the linkage group assignments shows that two of the 21 genes (frs/slc25a and ovl/ift88) are assigned to a different linkage group by ZMAP, in both cases based on results from the Heat Shock (HS) panel [14-16]. However, several published linkages to genetic markers support our linkage group assignment of frs/slc25a  while our assignment of ovl/ift88 is supported by the T51 panel (as shown on the ZFIN website) and by the latest version of the HS map . In conclusion, none of our linkage group assignments is conclusively contradicted by gene mapping.
Next we compared the map positions of the mutations with those of the genes on ZMAP (using the median of the ZMAP positions if a gene was placed on more than one mapping panel). If we assume the gene positions to be correct, we obtain a standard error of our mutant map positions of 6.1 cM. Further assuming a normal distribution of errors, we can predict that approximately 95 % of the genes should be within 12.2 cM (two standard errors) of the rough mapping position of the mutation. Indeed, 17 out of the 19 genes mapped on the same chromosome (90 %) are within two standard errors of the mutation, and 16 out of 19 (84 %) within one standard error. Actually both mutation and gene mapping contribute to the observed errors to an unknown degree, so that 6.1 cM merely represents an upper limit for the standard error of our mapping procedure.
We have obtained rough map positions for over 300 zebrafish mutants with an accuracy of approximately 6 cM and thereby validated the usefulness of our methodology for rapid, systematic and inexpensive microsatellite mapping of zebrafish mutations. The dataset that we have produced is a first step towards identification of the genes affected by the 277 mutations that are not yet cloned.
In candidate gene approaches, our data can substantially narrow down the number of candidate genes, since on the order of 99 % of the genome are outside the two-standard-errors confidence limit of our map positions. Positional cloning approaches in the absence of obvious candidate genes will still require fine mapping by genotyping of additional individuals and identification of more closely linked markers, using the flanking markers identified by us as starting points. Particularly thorough fine-mapping is required in centromeric regions because the genetic recombination rate is often several-fold reduced in such regions , an effect that can be easily observed in the zebrafish by comparing the genetic map and the radiation hybrid map . Nevertheless, we expect our map positions to be useful even without knowledge of the affected genes, as they can suggest allelism of mutations with a similar phenotype identified in future screens.
We have found that a relatively small number of microsatellite markers is sufficient to scan almost the entire genome and that the experimental procedures are robust and easy to perform. Other methods that have been proposed for the mapping of mutant loci in the zebrafish include half-tetrad analysis with microsatellite markers, genome scanning with SNPs and microarray based SNP mapping. While half-tetrad analysis requires only 25 markers to obtain a linkage group assignment [20-22], it has the disadvantage that gynogenetic diploid fish must be generated first, which makes this approach less convenient for high-throughput analysis. In the course of the ongoing zebrafish genome project, more than 50,000 SNPs have been identified  offering an enticing alternative to microsatellite markers, but SNP genotyping is far more costly than the agarose based method employed by us. Genotyping of SNPs in a bulked segregant panel is also possible by microarray hybridization . However, the SNPs identified to date are specific to the strains they were developed from and may not be informative in mapcrosses performed with different strains (such as ours). Furthermore such a microarray experiment replaces only two steps in our mapping procedure, namely the pooled PCR and its associated gel run, which represent only a minor part of the total mapping effort, as compared to fish breeding, sorting of F2 embryos and confirmation of the bulked segregant results by genotyping of F2 individuals. Future microarray based approaches may make it possible to dispense with the genotyping of individuals entirely, provided that a very large number of SNPs can be multiplexed in a single microarray hybridization such that it immediately provides a reliable map position. Meanwhile, genome scanning with microsatellite markers remains the method of choice as it is equally suitable for the mapping of individual mutations by laboratories with limited genomics resources, and for high throughput projects such as ours.
Mapcrosses were set up between mutant carriers and the laboratory reference line WIK  and brother-sister matings were performed between F1 individuals following standard laboratory procedures .
F2 embryos were sorted by phenotype and stored in Eppendorf tubes with 100 % MeOH at -70°C until use. Single embryos were arrayed on a 96-well microtiter plate with a glass Pasteur pipette. The MeOH was evaporated on a PCR block at 70°C and 25 μl of 1.7 mg/ml Proteinase K in 1 × TE was added to each well. The plate was covered with sealing film and heated to either 55°C or 70°C for 240 min and to 94°C for 10 min in a thermocycler. 10 μl of each of the sibling and mutant lysates was pooled and 45 μl sterile ddH2O was added to the remainder. Plates were stored at -20°C.
PCR was initially performed on mutant and sibling pools for genome scanning, and subsequently on the individuals that had been used for the pooling in order to confirm potential linkages to specific markers. 20 μl PCR reactions were set up from 14.28 μl of reaction mix (2 μl of 10 × PCR buffer, 0.04 μl each of 100 mM dATP, dCTP, dGTP and dTTP, 12.12 μl water), 0.16 μl each of 20 mM forward and reverse primer, 0.4 μl of 5U/μl Taq polymerase, and 5 μl of template DNA. 10 × PCR buffer contained 100 mM Tris-HCl (pH 8.3), 500 mM KCl, 15 mM MgCl2 and 0.1 % (w/v) gelatin. All pipetting was done with a Biomek 2000 robot. Cycling was carried out by initially denaturating at 94°C for 2 min, 35 cycles of denaturation at 94°C for 30 sec, annealing at 60°C for 30 sec and extension at 73°C for 1 min, and a final extension at 73°C for 5 min. 5 μl of 6 × loading buffer were added to each sample, and electrophoresis was carried out at 200 V for 45 min in 1 × TBE buffer, on 2 % agarose gels. Gels were imaged and scored semi-automatically with NIH Image and a set of custom-designed macros.
Calculation of map positions
Distances between mutations and markers were calculated by determining the recombination fraction in the mutant F2 individuals and applying the Kosambi mapping function. Linkages with a two-point LOD score equal or greater 3 were regarded as significant.
In order to place a mutation in the genetic interval between the closest marker and another linked marker we determined whether recombinations for both of them were correlated. For this purpose we considered only single recombinants for the closest marker, i.e. heterozygotes. If the majority of these were heterozygous for the second marker we regarded the recombinations as uncorrelated and placed the mutation in the interval between the markers. Otherwise we placed the mutation outside the interval in the direction opposite from the second marker.
Assuming complete meiotic interference, i.e. only a single recombination event per chromosome, all recombinants for the first marker should be either non-recombinant for the second marker if the markers flank the mutation, or heterozygous if both markers are on the same side of the mutation. In our data approximately half of the mutations gave results in between these extremes. This may be due to occasional contaminations of the PCR assays but also to less than complete meiotic interference, which would allow a second recombination in the same individual. We therefore did not eliminate any contradictory individuals from the calculation of genetic distances as they may represent a genuine second recombination.
If a mutation could be placed in an interval between two markers, a map position was calculated by scaling the observed distances between the mutation and the markers so as to fit into the published distance between the markers. In the remaining cases only the distance to the closest marker was used to calculate the map position. A FileMaker Pro 5 database was used to store the scoring data and perform the calculations . The latest version of the MGH map, available through ZFIN , was used as a reference for calculating map positions.
RG implemented the mapping approach, supervised the project, analysed the results and drafted the manuscript. CNV initiated and supported the project. The remaining authors contributed equally to fish breeding and sorting by phenotype, PCR reactions and electrophoresis and scoring of gel images. GJR, BR and ER also evaluated the polymorphism of microsatellite markers for the selection of marker sets. All authors read and approved the final manuscript.
Note added in proof
For the following mutations, still listed as uncloned in 1, the corresponding genes have been reported by other researchers: beo, blu, hap, leo, obe, san, stu. For references see the ZFIN database .
This paper is dedicated to the memory of two deceased colleagues: Pascal Haffter, who conceived the large-scale mapping approach and initially supervised the project; and Ulrike Martyn, who contributed to the experimental work. For contributions to the experimental work we are also indebted to Claudia Bernardo de Oliveira, Katy Hingst, Tüzer Kalkan and Jeremy Keenan. We would like to thank Hans-Georg Frohnhöfer (Tübingen zebrafish stockcenter) for providing mutant carriers for mapcrosses and for information on allele nomenclature and availability. This work was supported by the German Human Genome Project (DHGP Grant 01 KW 9627 and 01 KW 9919).
Haffter P, Granato M, Brand M, Mullins MC, Hammerschmidt M, Kane DA, Odenthal J, van Eeden FJ, Jiang YJ, Heisenberg CP, Kelsh RN, Furutani-Seiki M, Vogelsang E, Beuchle D, Schach U, Fabian C, Nüsslein-Volhard C: The identification of genes with unique and essential functions in the development of the zebrafish, Danio rerio.
Haffter P, Odenthal J, Mullins MC, Lin S, Farrell MJ, Vogelsang E, Haas F, Brand M, van Eeden FJM, Furutani-Seiki M, Granato M, Hammerschmidt M, Heisenberg CP, Jiang YJ, Kane DA, Kelsh RN, Hopkins N, Nüsslein-Volhard C: Mutations affecting pigmentation and shape of the adult zebrafish.
Dev Genes Evol 1996, 206:260-276. Publisher Full Text
Michelmore RW, Paran I, Kesseli RV: Identification of markers linked to disease-resistance genes by bulked segregant analysis: a rapid method to detect markers in specific genomic regions by using segregating populations.
Proc Natl Acad Sci U S A 1991, 88(21):9828–32. Publisher Full Text
Poult Sci 1997, 76(2):386-391. PubMed Abstract
Knapik EW, Goodman A, Atkinson OS, Roberts CT, Shiozawa M, Sim CU, Weksler-Zangen S, Trolliet MR, Futrell C, Innes BA, Koike G, McLaughlin MG, Pierre L, Simon JS, Vilallonga E, Roy M, Chiang PW, Fishman MC, Driever W, Jacob HJ: A reference cross DNA panel for zebrafish (Danio rerio) anchored with simple sequence length polymorphisms.
Proc Natl Acad Sci U S A 2004, 101(35):12792–7. Publisher Full Text
Golling G, Amsterdam A, Sun Z, Antonelli M, Maldonado E, Chen W, Burgess S, Haldi M, Artzt K, Farrington S, Lin SY, Nissen RM, Hopkins N: Insertional mutagenesis in zebrafish rapidly identifies genes essential for early vertebrate development.
Kelly PD, Chu F, Woods IG, Ngo-Hazelett P, Cardozo T, Huang H, Kimm F, Liao L, Yan YL, Zhou Y, Johnson SL, Abagyan R, Schier AF, Postlethwait JH, Talbot WS: Genetic linkage mapping of zebrafish genes and ESTs.
Shaw GC, Cope JJ, Li L, Corson K, Hersey C, Ackermann GE, Gwynn B, Lambert AJ, Wingert RA, Traver D, Trede NS, Barut BA, Zhou Y, Minet E, Donovan A, Brownlie A, Balzan R, Weiss MJ, Peters LL, Kaplan J, Zon LI, Paw BH: Mitoferrin is essential for erythroid iron assimilation.
Kwok C, Korn RM, Davis ME, Burt DW, Critcher R, McCarthy L, Paw BH, Zon LI, Goodfellow PN, Schmitt K: Characterization of whole genome radiation hybrid mapping resources for non-mammalian vertebrates.
Methods Cell Biol 1999, 60:287-302. PubMed Abstract
Geisler R, Rauch GJ, Baier H, van Bebber F, Broß L, Dekens MP, Finger K, Fricke C, Gates MA, Geiger H, Geiger-Rudolph S, Gilmour D, Glaser S, Gnugge L, Habeck H, Hingst K, Holley S, Keenan J, Kirn A, Knaut H, Lashkari D, Maderspacher F, Martyn U, Neuhauss S, Haffter P, et al.: A radiation hybrid map of the zebrafish genome.
Hukriede NA, Joly L, Tsang M, Miles J, Tellis P, Epstein JA, Barbazuk WB, Li FN, Paw B, Postlethwait JH, Hudson TJ, Zon LI, McPherson JD, Chevrette M, Dawid IB, Johnson SL, Ekker M: Radiation hybrid mapping of the zebrafish genome.
Postlethwait JH, Yan YL, Gates MA, Horne S, Amores A, Brownlie A, Donovan A, Egan ES, Force A, Gong Z, Goutel C, Fritz A, Kelsh R, Knapik E, Liao E, Paw B, Ransom D, Singer A, Thomson M, Abduljabbar TS, Yelick P, Beier D, Joly JS, Larhammar D, Talbot WS, et al.: Vertebrate genome evolution and the zebrafish gene map.