Skip to main content

Identification of quantitative trait loci associated with leaf rust resistance in rye by precision mapping

Abstract

Background

Leaf rust (LR) is among the most destructive fungal diseases of rye (Secale cereale L.). Despite intensive research using various analytical and methodological approaches, such as quantitative trait locus (QTL) mapping, candidate gene expression analysis, and transcriptome sequencing, the genetic basis of the rye immune response to LR remains unclear.

Results

A genome-wide association study was employed to detect QTLs controlling the immune response to LR of rye. A mapping population, G38A, was constructed by crossing two inbred lines: 723 (susceptible to LR) and JKI-NIL-Pr3 (a donor of the LR resistance gene Pr3). For genotyping, SNP-DArT and silico-DArT markers were used. Resistance phenotyping was conducted by visual assessment of the infection severity in detached leaf segments inoculated with two isolates of Puccinia recondita f. sp. secalis, namely, 60/17/2.1 (isolate S) in the main experiment and 86/n/2.1_5x (isolate N) in the validation experiment, at 10 and 17 days post-infection (dpi), respectively.

In total, 42,773 SNP-DArT and 105,866 silico-DArT markers were included in the main analysis including isolate S, of which 129 and 140 SNP-DArTs and 767 and 776 silico-DArTs were significantly associated (p ≤ 0.001; − log10(p) ≥ 3.0) with the immune response to LR at 10 and 17 dpi, respectively. Most significant markers were mapped to chromosome 1R. The number of common markers from both systems and at both time points occupying common chromosomal positions was 37, of which 21 were positioned in genes, comprising 18 markers located in exons and three in introns. This gene pool included genes encoding proteins with a known function in response to LR (e.g., a NBS-LRR disease resistance protein-like protein and carboxyl-terminal peptidase).

Conclusion

This study has expanded and supplemented existing knowledge of the genetic basis of rye resistance to LR by (1) detecting two QTLs associated with the LR immune response of rye, of which one located on the long arm of chromosome 1R is newly detected, (2) assigning hundreds of markers significantly associated with the immune response to LR to genes in the ‘Lo7’ genome, and (3) predicting the potential translational effects of polymorphisms of SNP-DArT markers located within protein-coding genes.

Peer Review reports

Background

Although rye (Secale cereale L.) is considered to be among the cereal crops most tolerant to environmental stresses, more than 30 diseases, including leaf rust (LR), infect this species. LR, which is caused by Puccinia recondita Roberge ex Desm. f. sp. secalis Miedaner, Klocke, Flath, Geiger & Weber (Prs), is among the most damaging diseases of rye, causing up to 40% yield losses [1].

Different molecular methods have been applied to examine the genetic background of the rye immune response to LR. Using a linkage mapping approach, 16 dominant Pr genes have been identified, namely Pr1–5, Pr-d–f, Pr-i–l, Pr-n, Pr-p, Pr-r, and Pr-t [2,3,4]. The genes Pr1–Pr5, Pr-d–f, Pr-n, Pr-p, and Pr-r are associated with resistance at both the seedling and the adult stages, indicating that they are all-stage resistance genes [3, 5]. Of the mentioned genes, Pr1–5, Pr-d–f, Pr-n, and Pr-r confer resistance to a broad range of single-pustule isolates [3, 6].

The majority of rye Pr genes have been mapped to individual chromosomes: Pr3, Pr4, Pr5, Pr-i, Pr-k, and Pr-n to chromosome 1R, Pr-d and Pr-f to 2RS, Pr-j and Pr-l to 4R, Pr1 and Pr-e to 6R, and Pr2 to 7RL [2, 3, 6,7,8,9]. The chromosomal localization of the genes Pr-p, Pr-r, and Pr-t, found in populations from Argentina, USA, and Russia, could not be resolved [3]. The results presented by Milczarski et al. [10] are not entirely consistent with those described above. Based on a mapping population derived from the recombinant inbred lines 541 (one parental component of the mapping population used in the present study) and Ot1-3, with the use of diversity array technology (DArT) markers, the authors identified ten LR resistance-related QTLs, of which four were mapped on chromosome 1R (two distributed on the short arm [1RS] and two on the long arm [1RL]), and the remaining six QTLs were mapped on chromosomes 3R (three QTLs; one on 3RS, one in the centromeric region, and the third QTL on 3RL), 4R (one QTL, in the centromeric region), and 5R (two QTLs, both in the centromeric region). The QTLs mapped to chromosome 5R have the strongest impact on resistance to LR. The authors emphasize, however, that each of the LR-QTLs was detected only in one year and in one location, which indicates that their effectiveness is highly dependent on the environment.

The results of the aforementioned studies were generated on the basis of interval mapping. Using a different approach, namely, genome-wide association study (GWAS; a strategy increasingly used for detection of resistance genes [11]), Vendelbo et al. [12, 13] mapped five LR resistance-associated QTLs on chromosome arms 1RS, 1RL, 2RL, 5RL, and 7RS using Gülzow-based elite hybrid rye breeding germplasm. Two QTLs located on chromosome arms 1RS and 7RS were of particular importance. The most important resistance-associated marker on chromosome arm 1RS was physically co-localized with molecular markers delimiting the previously characterized Pr3 gene. The region on chromosome arm 7RS contained a large number of nucleotide-binding leucine-rich repeat (NLR) genes, one of which, provisionally denoted Pr6, was similar (at the protein level) to the wheat LR resistance gene Lr1 situated on wheat chromosome arm 5DL. Rakoczy-Trojanowska et al. [14] used GWAS to identify the single DArT sequencing (DArTseq) marker (3363612|F|0–17:G > C-17:G > C) on chromosome 2R that was stably associated with LR resistance under field conditions.

In addition to the genes described above, we recently identified hundreds of other LR-related, differentially expressed genes using RNA-sequencing analysis of three rye inbred lines infected with compatible and incompatible strains of Prs. Among these genes were four wheat Lr gene orthologs (identified previously in the ‘Lo7’ genome [15]), namely, ScLr1_3, ScLr1_4, ScLr1_8, and ScRga2_6; the former two genes were located on chromosome 7R, whereas the latter two genes were of unknown location [16]. For now, however, it is not known whether any ScLr1 variants are counterparts to the genes Pr2 or Pr6, previously also assigned to chromosome 7R.

In the present study, which aimed to locate QTLs associated with LR resistance in rye, we used the DArTseq marker system for genotyping of the mapping population and a linear mixed model was developed based on observations included in the work of Bedo et al. [17] describing statistical machine learning (SML) for validation of QTLs. In principle, SML is unlike the interval mapping approach most commonly used for this purpose. The SML approach, first published in 2008 by Bedo et al. [17], relies on estimating the generalization performance of a QTL model. This is achieved by dividing the data into independent training and testing subsets. The training set is employed for model development, while the testing set is used for model evaluation. The authors, using a mapping population consisting of 94 F1-derived doubled-haploid plants from a cross between the barley cultivars ‘Steptoe’ and ‘Morex’, showed that this algorithm produces superior estimates of QTLs (for α-amylase, diastatic power, grain protein content, malt extract, heading date, height, lodging, and yield) than interval mapping and identifies QTLs with greater precision. Moreover, SML allows for identification of markers linked to QTLs without the need to construct a genetic map and, just as importantly, for reduction of the false-discovery rate. The machine learning approach has been repeatedly used and positively verified to identify QTLs [18] and expression QTLs [19], and as the basis for the AutoQTL analytical procedure [20] in plants.

The DArTseq marker system used for genotyping in the present work evolved from the microarray-based DArT platform developed more than 20 years ago [21] by combining restriction enzyme digestion with next-generation sequencing [22]. The DArTseq procedure generates two types of data: scores for presence/absence, called silico-DArTs (dominant markers, analogous to microarray DArTs, but extracted in silico from sequences obtained from genomic representations), and single-nucleotide polymorphisms (SNPs) in fragments present in the genomic representations (SNP-DArTs, co-dominant markers; https://www.diversityarrays.com/services/dartseq/dartseq-data-types/). To date, silico-DArTs and SNP-DArTs have been used in multiple studies of rye for development of SNP markers associated with selected agronomically important traits [14], construction of genetic maps and QTL identification [10, 23,24,25], targeting gene space [26], and determination and verification of phylogenetic relationships in the genus Secale [23, 27].

The aim of the present work was to better understand the genetic basis of rye resistance to LR by using a novel approach combining the principles of GWAS and linkage mapping.

Methods

Biological materials

The plant material consisted of (1) 329 plants of the F2/F3 mapping population G38A, the parents of which were the S7 inbred line 723 (bred at the West Pomeranian University of Technology, Szczecin, Poland), which is susceptible to LR, and the inbred line JKI-NIL-Pr3 (bred at the Julius Kühn-Institute Federal Research Centre for Cultivated Plants, Germany; kindly provided by Dr. Peter Wehling and Dr. Steffen Roux), a donor of the LR resistance gene Pr3; (2) the two parental lines; and (3) the LR-susceptible Polish rye cultivar ‘Konto’.

To obtain single-spore isolates of Prs that did not break the Pr3 gene-based resistance (infected plants lacked this gene), 53 populations of LR were first collected from eight provinces in Poland in 2015–2017. Among them, nine populations (60/17/2.1, 67/17/2.2.1, 76/17/2_3x, 58/17/5.1, 60/17/2_3x, 34i/17/2/5.1, 76/17/2.1, 77/17/2.3, and 86/n/2.1_5x) were proved to be incompatible with donors of Pr3 gene-based resistance to LR. From these nine populations, single-spore isolates were derived. Two of these nine isolates, namely, 60/17/2.1 and 86/n/2.1_5x (hereinafter referred to as isolates S and N, respectively) were characterized by segregation most similar to the Mendelian 1:2:1 ratio. The S isolate was used in the main experiment (involving the entire mapping population), whose data were used in QTL mapping, and the N isolate was used to verify the immune response of selected individuals (the verification experiment).

Detached leaf test

The F3 seeds of the F2 G38A mapping population were sown on pallets in such a manner that one well contained the progeny of a single F2 plant. Ten-day-old seedlings were cut into approximately 1.5 cm fragments, which were then plated on four-well plates (Greiner Bio-One GmbH, Germany) filled with medium of the following composition: 3 g agar, 35 mg benzimidazole, and redistilled water up to 1 dm3. Each F2 plant was represented by 25–30 F3 seedlings. Leaf fragments of the LR-sensitive cultivar ‘Konto’ were placed around the lined leaf fragments of the G38A mapping population as a positive control for infection process (Fig. S1, S2).

Immediately after leaf fragments of the G38A mapping population and ‘Konto’ were placed on the plates, they were inoculated with isolates S and N. The isolate N was used only in the verification experiment. Spores were suspended in Novec™ 7100 engineered fluid (1 mg*cm−3). Each plate was sprayed four times. After inoculation, the plates were placed in a growth chamber under controlled conditions (18 °C, 16 h/8 h [light/dark] photoperiod, 50% relative humidity, and illumination intensity of 60 µmol m−2 s−1).

Phenotyping of rye–LR interaction

Evaluation of the specificity of the plant–pathogen interaction was performed twice, at 10 days post-inoculation (dpi) and 17 dpi, using the 0–5 Murphy scale [28], where 0 = immune (no visible reaction), 1 = very resistant (chlorotic and necrotic flecking), 2 = resistant (minute uredinia, surrounded by chlorosis or necrosis), 3 = resistant to moderately resistant (small to medium-size uredinia, surrounded by chlorosis or necrosis), 4 = moderately resistant to moderately susceptible (medium to large uredinia, surrounded by chlorosis), and 5 = susceptible (large uredinia without chlorosis). The results of the disease-symptom evaluation are presented as the percentage of leaf fragments showing a given reaction. The phenotypes were described by “values used in the association analysis”, VAA (Table S1).

The disease-symptoms associated with a given Prs isolate and a given time point were treated as separate traits and analyzed separately. Thus, the following four traits were included in the mapping analysis: reaction to isolate S, 10 dpi (S_10) and to isolate S, 17 dpi (S_17) in the main experiment; and reaction to isolate N, 10 dpi (N_10) and to isolate N, 17 dpi (N_17) in the verification experiment.

Genotype determination

Determination of the genotype of the plants included in the mapping population, i.e., whether a given plant was a dominant homozygote, recessive homozygote, or heterozygote, was based on the percentage of leaf fragments with a certain type of immune response in the 0–5 Murphy scale. Plants with a predominant proportion of leaf fragments in classes 0, 1, and/or 2 were classified as dominant homozygotes; plants with a predominant proportion of leaf fragments in classes 4 and/or 5 were categorized as recessive homozygotes; and plants in which leaf fragments were distributed in all or almost all immune response classes were denoted as heterozygotes.

Molecular marker system

Genomic DNA for SNP-DArT (DArTseq) and silico-DArT genotyping was isolated from 329 plants of the G38A mapping population and the two parental lines using the Mag-Bind Plant DNA 96 Kit (Omega Bio-Tek, USA) in accordance with the manufacturer’s instructions. One hundred milligrams of ground leaf tissue was used for each analyzed plant. The isolated DNA was dissolved in 100 μl elution buffer. The DNA yield and purity were estimated using a spectrophotometer (NanoDrop 2000, Thermo Fisher Scientific, USA) and electrophoresed in 1% agarose gel stained with SimplySafe (Eurx, Poland). The DNA concentration was adjusted to 100 ng and then samples were sent to the Diversity Arrays Technology Pty Ltd. (Bruce, Australia), where genotyping was performed employing the DArTseq 1.0 technology developed by Cruz et al. [29] (Table S2 and S3).

Marker annotation was performed by mapping the sequences and chromosome positions to the Lo7 rye genome [15]. All gene names mentioned in the text are taken from Rabanus-Wallace et al. [15].

GWAS

The values used in the association analysis were calculated using the following formula:

6 × value in column B/100 + 5 × value in column C/100 + 4 × value in column D/100 + 3 × value in column E/100 + 2 × value in column F/100 + 1 × value in column G/100. The values calculated are included in Table S1.

The linear mixed model (LMM) was formulated as follows:

$${{\varvec{y}}}_{{\varvec{i}}}={\varvec{\mu}}+{{\varvec{g}}}_{{\varvec{i}}}+{{\varvec{a}}}_{{\varvec{i}}}+{{\varvec{\varepsilon}}}_{{\varvec{i}}}$$

where y i is the phenotype value of the ith line, µ is the overall mean, g j is the fixed effect of the jth marker, a i is the random effect of the ith line, and ε i is the random error.

In the matrix notation, this model is as follows:

$$\mathbf{y}=\mathbf{X}{\varvec{\upbeta}}+\mathbf{Z}\mathbf{a}+{\varvec{\upvarepsilon}}$$

where y is the vector of the phenotypes, β = (β0, βg)’ is the vector of fixed effects, where β0 is the mean and βg is the marker effect, X is the incidence matrix of fixed effects, which relates records to the marker genotypes (g) and mean, a is the vector of random line effects, Z is the incidence matrix of random effects, which relates records to the line effects, and ε is the vector of random errors.

This is a classic linear mixed model. The variable a models the genetic background of each line as a random effect with variance var(a) =\(\boldsymbol A\sigma_a^2\)  where A is the relationship matrix that can be estimated from the markers.

The ‘GWAS’ function in the R package ‘rrBLUP’ (https://cran.r-project.org/web/packages/rrBLUP/rrBLUP.pdf) implements the mixed model and was used to perform a genome-wide association analysis. The ‘GWAS’ function is equivalent to EMMAX [30] and calculates the score − log10(p-value) for each marker for the trait. If a marker had − log10(p-value) = 3, it was considered to be likely significantly associated with the trait.

A Manhattan plot for each trait was generated after applying the mixed model. The Manhattan plot represents the significance of the association between a marker and the trait being measured. The Y-axis is − log10(p-value), which represents the strength of association; the higher the point on the scale, the more strongly significant the association with the trait.

The mixed model approach results in the generation of a large number of correlated p-values (tests). Given the many single-point association analyses in a GWAS, adjustment of the p-values to correct for multiple testing is always a challenge, and was the focus of a further analysis step.

The GWAS analysis was implemented on the DArT KDCompute platform.

Prediction of translational effects

The translational effects of SNPs (DArTseq markers) were predicted with the Ensembl Variant Effect Predictor [31].

Results

Distribution of phenotypic classes in the mapping population

In the main experiment involving the entire mapping population and the S isolate of Prs, the distribution of disease-symptom phenotypes deviated from the normal distribution, which resulted from under-representation of F2/F3 plants in the Values used in the Association Analysis (VAA) class II, both at 10 and 17 dpi. At the former time point, plants were most numerous in the class IV (VAA-4.01 ÷ 5), whereas at 17 dpi plants were most frequent in the class III (VAA—3.05 ÷ 4); (Fig. 1, Table S1).

Fig. 1
figure 1

Distribution of phenotypic classes (described by VAA values) in G38A mapping population. Classes for S_10: I - 1.0 ÷ 2.0; II – 2.03 ÷ 2.97; III - 3.02 ÷ 4.0; IV – 4.01 ÷ 5.0; V – 5.06 ÷ 6.0. Classes for S_17: I - 1.0 ÷ 2.0; II – 2.14 ÷ 3.0; III - 3.05 ÷ 4.0; IV – 4.01 ÷ 5.0; V – 5.01 ÷  5.85 (Table S1)

The verification experiment performed using isolate N (which was of similar pathogenicity to that of the S isolate) confirmed the phenotypic assessments for all evaluated individuals of the mapping population. All results of the validation experiment are presented in Tables S1, S8, S9, S10, S11 and Figures S3, S4.

Identification of SNP-DArT and silico-DArT markers associated with response to LR

A majority of markers included in the main analysis involving isolate S were assigned to Lo7 chromosomes (Table 1, S4, S5, S6, S7).

Table 1 Results of the genome-wide association analysis for the S isolate of Puccinia recondita f. sp. secalis

With regard to SNP-DArT markers, 365 and 373 markers were significantly (0.001 > p ≤ 0.05; 3 <  − log10(p) ≥ 1.301029996) associated with the response to LR at 10 and 17 dpi, respectively. In the case of silico-DArT markers, these values were 1941 and 1876, respectively. Applying a more stringent selection criterion, i.e., p ≤ 0.001, − log10(p) ≥ 3.0, the numbers of markers significantly associated with each trait were 129 (SNP-DArTs at 10 dpi), 140 (SNP-DArTs at 17 dpi), 767 (silico-DArTs at 10 dpi) and 776 (silico-DArTs at 17 dpi).

By far, the greatest number of markers significantly associated with all traits, both in the case of SNP-DArT and silico-DArT markers, were assigned to chromosome 1R; the proportion ranged from 48.56% for S_17 at 0.001 > p ≤ 0.05 to 82.95% for S_10 at p ≤ 0.001 (Table 2).

Table 2 Percentage of markers assigned to rye chromosome 1R

For further detailed analysis, only markers assigned to chromosome 1R and significant at p ≤ 0.001 (hereinafter referred to as s_SNP-DArT and s_silico-DArT markers) were selected (Table S12). The results for markers with a p-value between 0.05 and 0.001 are shown in Table S13.

Detailed characteristics of markers mapped to chromosome 1R—s_SNP-DArT markers

In the Manhattan plot generated based on the SNP-DArT markers, at 10 dpi two clearly separated peaks/QTLs spanning from 53,241,837 to 164,974,087 (which corresponded to 1RS) and from 291,247,877 to 411,193,503 (which corresponded to 1RL), respectively, were observed (Fig. 2a, Tables S4, S12).

Fig. 2
figure 2

Manhattan plot visualizing SNP-DArT markers significantly associated with the immune response to Prs isolate S. a. 10 dpi. b. 17 dpi

In general, the composition of QTLs at both time points was extremely similar. Of the 107 (2.56%) s_SNP-DArT markers associated with the S_10 trait, 64 were mapped to rye Lo7 genes [15]; most of these markers (53) were located in exons (Table 1, S4, S12). In the first peak/QTL, three s_SNP-DArT markers characterized by the highest − log10(p) values (≥ 20) were located in Lo7 genes encoding a NBS-LRR disease resistance protein-like protein (marker 5034809|F|0–32:G > C-32:G > C located in an exon), a desiccation-related protein PCC13-62 (marker 3365190|F|0–64:C > G-64:C > G located in an exon), and a phenazine biosynthesis PhzC/PhzF family protein (marker 3363069|F|0–46:G > C-46:G > C located in an exon). In the second peak/QTL, two s_SNP-DArT markers, namely, 3907484|F|0–52:C > T-52:C > T and 3360254|F|0–23:C > T-23:C > T, with the highest − log10(p) values (> 5) were located in rye Lo7 genes encoding a long-chain-alcohol oxidase (marker located in an exon) and folylpolyglutamate synthase (marker located in an intron), respectively. In addition to the 65 markers located in genes, six markers (four within the first QTL and two within the second QTL) were mapped to transposable elements (Table S12).

A total of 115 (2.75%) s_SNP-DArT markers were associated with the S_17 trait, including 68 mapped to the Lo7 genome; most of these markers (56) were located in exons (Table 1, S5, S12). The GWAS for S_17 showed the presence of two peaks/QTLs on chromosome 1R, covering the regions from 53,241,837 to 164,974,087 (identical as for S_10) and from 247,907,747 to 426,929,972 (partially overlapping with the second QTL for S_10, but with a range larger by more than 59 Mb) (Fig. 2b, Table S12).

In the first peak/QTL two markers characterized by the highest − log10(p) values (≥ 20) were located within exons of genes encoding a desiccation-related pcC13-62 protein (marker 3365190|F|0–64:C > G-64:C > G) and a pathogenesis-related thaumatin family protein (marker 3594357|F|0–29:A > C-29:A > C). A third marker with − log10(p) ≥ 20, associated with a NBS-LRR disease resistance protein-like protein at 10 dpi, in the case of S_17 was characterized by a slightly lower, but still extremely high, − log10(p) value of 19.59605099 (Table S12). In the second peak/QTL, two s_SNP-DArT markers, namely, 5788995|F|0–62:T > G-62:T > G and 3363086|F|0–40:T > C-40:T > C, were associated most strongly with genes encoding folylpolyglutamate synthase (the marker was located in an intron) and carboxypeptidase (the marker was located in an exon), respectively (Table S12).

As in the case of S_10, the same six markers were located within transposable elements (Table S12).

In addition to the aforementioned markers associated with the immune response at extremely high levels of significance, at both time points and in both QTLs, the presence of markers associated with the analyzed traits was detected at slightly lower (but still high) levels of significance in genes typical for the immune response, such as those encoding a receptor-like kinase, endoglucanase, cellulose synthase-like protein, and a MYB protein or myosin (Table S12).

Ninety-seven s_SNP-DArT markers were common to both time points, whereas 7 and 14 markers were unique for 10 dpi and for 17 dpi, respectively (Fig. 3a, Tables S12, S14). Within the first peak/QTL, 69 s_SNP-DArT markers were common to both time points of which four were unique for 10 dpi and no markers were unique for 17 dpi (Fig. 3b, Tables S12, S14). In the case of the second QTL, 28 markers were common to both time points of which three and 14 markers were unique for 10 dpi and 17 dpi, respectively (Fig. 3c, Tables S12, S14).

Fig. 3
figure 3

Common and unique significant SNP-DArT markers. Venn diagrams represents the number of common and unique s_SNP-DArT markers for 10 and 17 days post-inoculation (dpi). a Both QTLs on chromosome 1R; b the first QTL on chromosome 1R; c the second QTL on chromosome 1R. The diagrams were generated using an online tool (https://bioinformatics.psb.ugent.be/webtools/Venn/). S_10 all referred to s_SNP-DArTseq markers in both QTLs, 10 dpi; S_17 all referred to s_SNP-DArTseq markers in both QTLs, 17 dpi; S_10/I referred to s_SNP-DArTseq markers in the first QTL, 10 dpi; S_17/I referred to s_SNP-DArTseq markers in the first QTL, 17 dpi; S_10/II referred to s_SNP-DArTseq markers in the second QTL, 10 dpi; S_17/II referred to s_SNP-DArTseq markers in the second QTL, 17 dpi

Detailed characteristics of markers mapped to chromosome 1R—s_silico-DArT markers

As for the s_SNP-DArT markers, two QTLs on chromosome 1R (in the ranges from 30,347,446 to 185,495,011 and 204,544,065 to 720,560,882) were associated with silico-DArT markers at 10 dpi, but the distance between the markers was less pronounced than for the s_SNP-DArT markers (Fig. 4a, Tables S6, S12).

Fig. 4
figure 4

Manhattan plot visualizing silico-DArT markers significantly associated with the immune response to Prs isolate S. a. 10 dpi. b. 17 dpi

In total, 552 (7.87%) of the s_silico-DArT markers were associated with the S_10 trait of which 220 markers were located in Lo7 genes and 172 of these in exons (Tables S6, S12). The gene-assigned s_silico-DArT markers with the highest − log10(p) values (> 20) for the first QTL were as follows: 3,342,055, 3,342,250, 3,342,695, 3,342,810, 3,342,996, 3,343,110, and 3,343,673 were located in exons of genes encoding a NBS-LRR disease resistance protein-like protein, pathogenesis-related thaumatin family protein, cytochrome P450, serine/threonine-protein kinase ATM, replication protein A 70 kDa DNA-binding subunit, invertase/pectin methylesterase inhibitor family protein, and CHUP1 protein, respectively. Most of these genes were associated with the immune response to fungal pathogens. Of the markers for the second QTL, several were located within genes associated with the response to pathogens; these included, e.g., genes encoding a myosin heavy chain-related protein (marker 5,044,425 located in an intron), MYB family transcription factor-like protein (marker 7,062,938 located in an exon; and marker 5,043,320 at 377,042,108 located in an intron), and carboxypeptidase (marker 5,039,638 located in an intron) (Table S12).

At the second time point, two QTLs (in the same ranges as those for 10 dpi) were generated by the GWAS analysis (Fig. 4b). In total, 519 (7.40%) silico_SNP-DArT markers were associated with the S_17 trait, comprising 207 markers located in genes of which 162 markers were located in exons (Tables S7, S12).

In total, 464 s_silico-DArT markers were common for both time points, and 57 and 22 markers were unique for 10 dpi and for 17 dpi, respectively (Fig. 5a, Tables S12, S14). In the first QTL, 251 s_silico-DArT markers were common for both time points, and 17 and 16 markers were unique for 10 dpi and for 17 dpi, respectively (Fig. 5b, Tables S12, S14). For the second QTL, 213 markers were common for both time points, and 40 and 6 markers were unique for 10 dpi and for 17 dpi, respectively (Fig. 5c, Tables S12, S14).

Fig. 5
figure 5

Common and unique significant silico-DArT markers. Venn diagrams representing numbers of common and unique s_silico-DArT markers for 10 and 17 dpi. a. Both QTLs on chromosome 1R; b. the first QTL on chromosome 1R; c. the second QTL on chromosome 1R. The diagrams were gener,ed with an online tool (https://bioinform.ics.psb.ugent.be/webtools/Venn/). S_10 all referred to s_silico-DArT markers in both QTLs, 10 dpi; S_17 all referred to s_silico-DArT markers in both QTLs, 17 dpi; S_10/I referred to s_silico-DArT markers in the first QTL, 10 dpi; S_17/I referred to s_silico-DArT markers in the first QTL, 17 dpi; S_10/II referred to s_silico-DArT markers in the second QTL, 10 dpi; S_17/II referred to s_silico-DArT markers in the second QTL, 17 dpi

Common positions of significant SNP-DArT and silico-DArT markers

Thirty-nine markers from both marker systems and at both time points occupied common chromosomal positions on chromosome 1R. Of these markers, 21 were located in genes (hereinafter referred to as the “common 21 pool”), comprising 18 markers located in exons and three markers located in introns. In addition, two common markers were assigned to transposable elements. The remainder of the markers were located in intergenic regions (Fig. 6, Table 3, S12, S14).

Fig. 6
figure 6

Common significant SNP-DArTs for 10 dpi and 17 dpi for both QTLs. Venn diagram represents the numbers of common and unique s_SNP-DArT and s_silico-DArT markers for 10 and 17 dpi. The diagram was generated using an online tool (https://bioinformatics.psb.ugent.be/webtools/Venn/). SNP_10 referred to s_SNP-DArTseq markers, 10 dpi; SNP_17 referred to s_SNP-DArTseq markers, 17 dpi; silico_10 referred to s_silico-DArT markers, 10 dpi; silico_17 referred to s_silico-DArT markers, 17 dpi

Table 3 Markers occupying identical chromosomal positions at both time points: 10 and 17 dpi. Table presents “common 21 pool” of significant SNP-DArT and silico-DArT markers that shared common chromosomal positions for two time points.*) First, reference allele; second, SNP allele. **) Marker genotype in LR-resistant lines (VAA ≥ 5; Table S1) and LR-susceptible lines (VAA ≤ 2.65; Table S1)

The highest − log10(p) values (> 20) were observed for the following markers: s_silico-DArT at 10 dpi within genes encoding an invertase/pectin methylesterase inhibitor family protein, CHUP1 protein, NBS-LRR disease resistance protein-like protein, and pathogenesis-related thaumatin family protein; s_SNP-DArT at 10 dpi within the gene encoding a NBS-LRR disease resistance protein-like protein; and s_SNP-DArT at 17 dpi within the gene encoding a pathogenesis-related thaumatin family protein. Except for two genes in which markers for the second QTL were localized (SECCE1Rv1G0027270.1 and SECCE1Rv1G0028640.1 coding for folylpolyglutamate synthase and a kinase family protein, respectively), in all remaining genes markers for the first QTL were localized. Among the 18 markers localized in gene exons, two markers (3349900|F|0–20:T > C-20:T > C and 3344187|F|0–12:G > C-12:G > C; SECCE1Rv1G0015900.1) were located within the same gene, namely, SECCE1Rv1G0015900.1 coding for a receptor-like kinase. Two intron-localized markers (3730018|F|0–57:A > G-57:A > G and 3899895|F|0–56:G > A-56:G > A) were located in the first intron of genes coding for a protein phosphatase 2C-like protein and a MYB-related transcription factor, respectively, whereas the marker 3360254|F|0–23:C > T-23:C > T was located in the second exon of the gene encoding folylpolyglutamate synthase.

In the case of nine gene-localized markers, the homozygote for the reference allele determined the LR-resistant phenotype, whereas the remaining 11 markers were associated with the homozygote for the alternative allele (Table 3). Data for the complete set of genes are provided in Table S12.

Translational effect of SNPs located in genes

The ten SNPs located in exons of 18 genes from the “common 21 pool” had a low translational impact (i.e., synonymous protein variant formation without an amino acid change), seven SNPs had a moderate impact (mis-sense protein variant formation that resulted from different codons and, therefore, translation of a different amino acid), and only one SNP had a high impact with loss of the stop codon. This specific polymorphism (A/C) was located in the exon of the gene SECCE1Rv1G0014450.1 encoding a pathogenesis-related thaumatin family protein; the effect of stop-codon loss was determined by the mutant genotype, CC, present in the LR-resistant rye lines. All three intron variants had modifier effects, i.e., the splicing process was not affected (Tables 3 and 4).

Table 4 Translational effect of single-nucleotide polymorphisms (SNPs) located in the genes from “common 21 pool”

Among the remaining genes containing s_SNP-DArTseq markers, only the SNP T/C (in the marker 3363086|F|0–40:T > C-40:T > C), located in the SECCE1Rv1G0016320.1 gene encoding a haloacid dehalogenase-like hydrolase superfamily protein, may have a strong translational effect. Similar to the SECCE1Rv1G0014450.1 gene, the effect of stop-codon loss was determined by the recessive allele present in LR-resistant lines (Tables S13, S15).

The search for rye genes conferring resistance to LR has been ongoing for more than 20 years, but our current knowledge, and especially the number of identified and characterized resistance genes, remains limited. Much of the research in this field published to date has aimed to identify regions/genes on genetic maps [6,7,8]. Of the 16 genes identified in this manner, of profound importance is the Pr3 locus on chromosome 1R, which carries many genes controlling agronomically important traits but mainly involved in tolerance to abiotic and biotic stresses, including LR [32, 33].

Identification of LR associated QTLs

To verify previous results and, above all, expand on existing knowledge, we identified QTLs for the immune response to LR of rye, but using a novel approach. The present study employed a combination of two highly effective tools: the DArTseq platform and LMM procedure based on SML described by Bedo et al. [17]. While DArTseq markers have been used in previous studies on rye (as outlined in the Introduction), including identification of QTLs and SNP markers associated with LR resistance [10, 14], the effectiveness of SML-like method has not been verified previously in this species. Nevertheless, machine learning has been shown to be a powerful and extremely precise tool in many fields. Bedo et al. [17] compared multiple methods for detection of QTLs in barley (for the following traits: malting quality traits, heading date, plant height, lodging, pubescent leaves, grain protein content, and yield), namely, composite interval mapping, Bayesian interval mapping, and single marker regression. The authors reported that SML generates narrower peaks than the other methods, thereby allowing QTL identification with greater precision. We also performed a comparative analysis for 105 DArTseq markers from chromosome 1R using the CIM method, which allowed the identification of only one peak spanning a very wide range – 161,174 cM, corresponding to 717,370,020 bp and not two QTLs covering much narrower areas, than when using LMM (from 53,241,837 bp to 164,974,087 bp and from 291,247,877 bp to 411,193,503 bp, respectively); Table S16.

Characteristics of QTLs from chromosome 1R

In the present study, we focused on the rye genomic region that includes the Pr3 gene. We used a F2/F3 mapping population that was developed by crossing two inbred lines: JKI-NIL-Pr3, a donor of the Pr3 gene; and 723, an inbred line that lacks the Pr3 gene. Phenotypic segregation deviated from the expected normal distribution owing to under-representation of lines in the VAA class 2.01–3 but slight overrepresentation of lines in the class 1–2. This might be caused by two factors: (1) imprecise phenotyping in some cases caused by superinfections with other pathogens, and (2) the parents of the mapping population were not characterized by extreme values for the S_10 and S_17 traits.

Based on previous research findings [12, 13] and taking into account that the inbred line JKI-NIL-Pr3 carried the Pr3 gene, we expected to identify one region on chromosome 1R with markers located around the NBS gene cluster. We detected two distinct QTLs in all four GWAS analyses performed; the first QTL was detected on 1RS, and the second QTL on 1RL, of chromosome 1R. More than half (in the case of s_SNP-DArTs) and almost half (in the case of s_silico-DArTs) of the markers were located in genes, and mostly in exons. Although the gene-located s_silico-DArT markers were present in both peaks, they were substantially more abundant (by more than five times) in the first peak. More details on the selected genes containing markers significantly associated with the S_10 and S_17 traits for both QTLs are presented later in the discussion.

Could SECCE1Rv1G0014220.1 gene be considered a good candidate for Pr3 candidate?

Interestingly, in the Lo7 genomic block spanning from 101 to 117 Mb that showed the strongest association with LR resistance in the GWAS analysis and in which Vendelbo et al. [13] detected as many as five NBS-NLR genes, we located only one NBS-LRR-encoding gene (SECCE1Rv1G0014220.1) containing s_SNP-DArTseq and s_silico-DArT markers located in position 111,147,729 Mb (Table S12). This position was identical to or very close to the position of one of two NBS-NLR paralogs in the Lo7 genome that showed 83.5%–85.1% sequence similarity to RenRS5_3, a candidate LR resistance gene detected by Vendelbo et al. [13]. In the present study, however, SNP markers located in the gene SECCE1Rv1G0014220.1 were associated with LR resistance with extremely high probability (− log10(p) > 19), thus much higher than in the study of Vendelbo et al. [13] in which none of the markers exhibited a p-value above the Bonferroni significance threshold of − log10(p) = 6.72 (nevertheless, it should be taken into account that they used a natural population rather than a biparental population, which may explain the difference between their results and ours). Therefore, in our opinion, SECCE1Rv1G0014220.1 should be considered to be among the best Pr3 gene candidates conferring resistance to LR. However, the role of this gene may be related to down-regulation rather than up-regulation of its expression, as suggest results of preliminary RT-qPCR analyzes employing four rye inbred lines – two susceptible (Lo7 and L318) and two resistant (SE8 and SE212) to LR, infected with 1.1/6 isolate of Prs (Fig. S5a,b; Table S17). Also in our previous work, similar relationship was observed for several variants of the ScLr1 gene belonging to the NBS-LRR family, which were down-regulated after infection with Prs [16].

The results of the present analysis differ almost entirely from those reported by Milczarski et al. [10] who detected four QTLs on chromosome 1R, two on 1RS (at positions 39.4 Mbp and 54.8 Mbp) and two on 1RL (at positions 154.7 Mbp and 240.1 Mbp). The marker 5225086|F|0–7:T > G-7:T > G (associated with LR resistance at a relatively low but still statistically significant − log10(p) value of 1.96457369) was the only SNP-DArTseq marker that was located near the second QTL on chromosome 1RS detected by Milczarski et al. [10] (Table S13). Unfortunately, this marker has not been mapped to a Lo7 gene. The differences between the present results and those of Milczarski et al. mainly reflect the disparate experimental conditions: first, Milczarski et al. assessed LR resistance under field conditions, and second, the causative agent of LR used by Milczarski et al. was a mixture of spores of many undefined Prs strains. It is also worth noting that the QTLs for LR resistance identified by Milczarski et al. [10] in 2013 did not overlap with those detected in the following year (2014).

As noted above, apart from the gene SECCE1Rv1G0014220.1, none of the other four NBS-LRR genes identified by Vendelbo et al. [13] were detected in the present analysis, neither in the block spanning from 101 to 117 Mb showing the strongest association with LR resistance in the GWAS analysis, nor in in the region between 96.7 and 137.6 Mb flanked by markers co-segregating with the Pr3 gene. Instead, we identified in this area many other markers associated with LR resistance with extremely high probabilities, located in genes important from the perspective of the immune response to diseases caused by rust fungi; some of these genes contained s_SNP-DArT and s_silico-DArT markers located in positions common to all four GWAS analyses (highlighted in bold in Table 3).

Markers located in Lo7 genes

The pool of both types of markers located in common positions significantly associated with the S_10 and S_17 traits amounted to 39 markers, 21 of which were located in genes, including 18 markers within exons. Notably, two markers forming a second QTL, SECCE1Rv1G0027270.1 and SECCE1Rv1G0028640.1, were located in genes coding for folylpolyglutamate synthase and a kinase family protein, respectively; the former marker was located in an intron and the latter marker in an exon. Many of these 21 genes, such as those coding for a NBS-LRR disease resistance protein-like protein, leucine-rich repeat receptor-like protein kinase family protein, pathogenesis-related thaumatin family protein, phenylalanine ammonia-lyase, endoglucanase, MYB-related transcription factor, kinase (serine/threonine) family protein, and GDSL esterase/lipase ([34,35,36,37,38,39,40,41], respectively), have been proven to play a role in the immune response to rust fungi. One such gene, SECCE1Rv1G0013110.1, encoding an endoglucanase (to which marker 3596652|F|0–30:A > G-30:A > G has been mapped), has previously been observed to be differentially expressed (highly up-regulated; log2 estimated fold change = 4.7) in the rye inbred line L318 infected with compatible Prs strains [41]. Given the high scores for the marker (p-value) and the gene (log2 estimated fold change), we suggest that SECCE1Rv1G0013110 is a valuable candidate gene conferring resistance to LR and the marker 3596652|F|0–30:A > G-30:A > G is a valuable tool for marker-assisted selection. Apart from this gene, none of the other aforementioned genes were identified as differentially expressed genes in our parallel transcriptome analyses. Nonetheless, many other genes encoding all of the above-mentioned antifungal proteins have been detected [41]. Other genes in this pool, for example, genes encoding an invertase/pectin methylesterase inhibitor family protein, GDSL esterase/lipase, and CHUP1 protein ([42,43,44], respectively), are associated with fungal pathogens but their role in the response to LR remains unclear. The gene SECCE1Rv1G0012070, encoding a dehydrogenase (including glucose and ribitol dehydrogenase) that is categorized among stress-defensive proteins, has been identified inter alia in wheat [45]. As many as five of the above-mentioned genes – those coding for an endoglucanase, NBS-LRR disease resistance protein-like protein, pathogenesis-related thaumatin family protein, phenylalanine ammonia-lyase, and MYB-related transcription factor – are located in the region between 96.7 and 137.6 Mb flanked by markers co-segregating with the Pr3 gene.

However, some genes in this pool (e.g., genes coding for a carboxyl-terminal peptidase, peptide chain release factor 1 or RNA polymerase II subunit B1 CTD phosphatase RPAP2, folylpolyglutamate synthase, and DNA repair helicase), although involved in diverse biological processes, have not previously been assigned a clear function in defense against stresses, including LR. In several genes in this pool that have as yet unconfirmed roles in the immune response to LR, namely, those encoding an invertase/pectin methylesterase inhibitor family protein, CHUP1 protein, RNA polymerase II subunit B1 CTD phosphatase RPAP2, and peptide chain release factor 1, s_DArTseq markers characterized by an extremely high p-value are located. Therefore, these genes are potential candidates conferring resistance to LR in rye, although experimental verification is required.

Apart from three markers located in introns of genes encoding a protein phosphatase 2C-like protein, MYB-related transcription factor, and folylpolyglutamate synthase, the remaining markers were located in exons, which in the case of SNPs might cause the synthesis of a distinct protein variant or an inactive protein.

Predicted translational effects of gene-located SNPs

To predict the potential impact of polymorphisms on the structure (and properties) of proteins, we performed an Ensembl Variant Effect Predictor [31] analysis, which showed that most polymorphisms within exons have low impact on protein structure and properties, and thus cause harmless or insignificant changes. A portion of the SNPs, however, were assessed as causing moderate effects that may result in non-disruptive variant formation. Such variants might change the protein effectiveness. The only polymorphism that was predicted to have a strong translational effect, through loss of the stop codon, was detected in the marker 3594357|F|0–29:A > C-29:A > C located in the gene SECCE1Rv1G0014450.1, which codes for a pathogenesis-related thaumatin family protein. The polymorphism A/C results in the change of the TAG stop codon to the GAG codon for glutamic acid and, therefore, elongation instead of termination of the polypeptide chain. This, in turn, must result in loss of protein function. Given that the DArTseq marker genotype in resistant lines is CC, the loss-of-function of an important protein can be assumed from the perspective of the immune response, resulting in increased resistance to the disease. This is a completely unexpected and never previously reported effect. However, although with strong support, verification experiments are required to confirm this hypothesis.

Among markers from the “common 21 pool”, all SNPs in non-coding regions were predicted to have modifier effects without affecting the splicing process.

Excluding the “common 21 pool”, we subjected the remaining s_SNP-DArT markers located in Lo7 genes to an Ensembl Variant Effect Predictor analysis. Similar to the “common 21 pool”, most SNPs had a low translational impact and only one SNP was evaluated to have a high translational impact, namely, that in the SECCE1Rv1G0016320.1 gene encoding a haloacid dehalogenase-like hydrolase superfamily protein. Similar to the SECCE1Rv1G0014450.1 gene encoding a pathogenesis-related thaumatin family protein, the effect of stop-codon loss was determined by the recessive allele present in LR-resistant lines. The role of this enzyme in the immune response of plants to fungal pathogens is not yet known. However, such proteins are known to be involved in plant growth and abiotic stress response [46].

Unlike SNPs in non-coding regions in the previous pool, we observed three categories of influence in non-coding sequences, namely, intron variants, downstream gene variants, and upstream gene variants. Thus, there is a broader potential spectrum of possibilities for regulating the expression of genes within which s_SNP-DArT markers are located.

In this study, a GWAS approach, using DArT marker genotyping and detached leaf assay phenotyping of the mapping population constructed by crossing rye lines resistant and susceptible to LR, allowed the detection of two QTLs for the immune response to LR on chromosome 1R. Each QTL was composed of markers strongly associated with the trait of interest; more than half of the markers were located in Lo7 genes, including genes with known functions (among them, SECCE1Rv1G0013110.1 and SECCE1Rv1G0014220.1 encode an endoglucanase and NBS-LRR disease resistance protein-like protein, respectively) and genes of currently unknown function (e.g., SECCE1Rv1G0013840.1 and SECCE1Rv1G0013960.1 encode an invertase/pectin methylesterase inhibitor family protein and CHUP1 protein, respectively). Most of the predicted translational effects of markers located in gene exons were of low or moderate impact, and those located in introns had modifier effects. Only two SNPs, located in genes that encode a pathogenesis-related thaumatin family protein and a haloacid dehalogenase-like hydrolase superfamily protein, caused loss of the stop codon and were categorized as having a high translational impact, resulting in translation of non-functional proteins. Given the present results and those of other authors, we propose that the gene SECCE1Rv1G0014220.1, which encodes a NBS-LRR protein, is the most likely candidate for the Pr3 gene. In conclusion, by using an approach combining the principles of GWAS and linkage mapping, the present results expand on existing knowledge of the genetic basis of the immune response to LR in rye.

Availability of data and materials

The raw RNA-seq (fastq) data are deposited in the NCBI database (BioProject: PRJNA888031). The remainder of the data used in this work is included in the supplementary files.

References

  1. Wehling P, Linz A, Hackauf B, Roux SR, Ruge B, Klocke B. Leaf rust resistance in rye (Secale cereale L.). 1. Genetic analysis and mapping of resistance genes Pr1 and Pr2. Theor Appl Genet. 2003;107:432–8. https://doi.org/10.1007/s00122-003-1263-7.

    Article  CAS  PubMed  Google Scholar 

  2. Solodukhina OV, Kobylyanski VD. Problems of winter rye breeding for resistance to leaf and stem rusts. Plant Breed Seed Sci. 2003;48:87–97.

    Google Scholar 

  3. Roux SR, Hackauf B, Ruge-Wehling B, et al. Exploitation and comprehensive characterization of leaf-rust resistance in rye. Proc Int Symp on Rye Breed and Genet, Rostock (Germany), June 28–30, 2006 Vortr Pflanzenzüchtg. 2007;71:144–50.

    Google Scholar 

  4. Klocke B. Virulenzstruktur und dynamik des Roggenbraunrostes (Puccinia recondita f. sp. secalis) in der Bundesrepublik Deutschland. Halle-Wittenberg: Martin Luther University; 2004. p. 1–141.

  5. Wu H, Kang Z, Li X, Li Y, Li Y, Wang S, Liu D. Identification of wheat leaf rust resistance genes in Chinese wheat cultivars and the improved germplasms. Plant Dis. 2020;104:2669–80. https://doi.org/10.1094/PDIS-12-19-2619-RE.

    Article  CAS  PubMed  Google Scholar 

  6. Roux SR, Hackauf B, Linz A, Ruge B, Klocke B, Wehling P. Leaf rust resistance in rye ( Secale cereale L.). 2. Genetic analysis of resistance genes Pr3, Pr4 , and Pr5. Theor Appl Genet. 2004;110:192–201. https://doi.org/10.1007/s00122-004-1807-5.

    Article  CAS  PubMed  Google Scholar 

  7. Ruge B, Roux SR, Linz A, et al. Erschließung und molekulare Charakterisierung von Resistenzen gegen Braunrost bei Roggen (Secale cereale L.). Vortr Pflanzenzüchtg. 1999;46:169–76.

    Google Scholar 

  8. Roux SR, Ruge B, Linz A, Wehling P. Leaf rust resistance in rye – evaluation, genetic analysis and molecular mapping. Acta Phytopathol Entomol Hung. 2000;35:65–73.

    CAS  Google Scholar 

  9. Solodukhina OV. Genetic characterization of rye accessions with regard to leaf rust resistance. Russian J Genet. 2002;38:399–407. https://doi.org/10.1023/A:1015202303392.

    Article  CAS  Google Scholar 

  10. Milczarski P, Masojć P, Krajewski P, Stochmal A, Kowalczyk M, Angelov M, Ivanova V, Schollenberger M, Wakuliński W, Banaszak Z, Banaszak K, Rakoczy-Trojanowska M. QTL mapping for benzoxazinoid content, preharvest sprouting, α-amylase activity, and leaf rust resistance in rye (Secale cereale L.). PLoS One. 2017;12:e0189912.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Liu HJ, Yan J. Crop genome-wide association study: a harvest of biological relevance. Plant J. 2019;97:8–18. https://doi.org/10.1111/tpj.14139.

    Article  CAS  PubMed  Google Scholar 

  12. Vendelbo NM, Mahmood K, Sarup P, Hovmøller MS, Justesen AF, Kristensen PS, Orabi J, Jahoor A. Discovery of a novel leaf rust (Puccinia recondita) resistance gene in rye (Secale cereale L.) using association genomics. Cells. 2022;11:64. https://doi.org/10.3390/cells11010064.

    Article  CAS  Google Scholar 

  13. Vendelbo NM, Mahmood K, Steuernagel B, Wulff BBH, Sarup P, Hovmøller MS, Justesen AF, Kristensen PS, Orabi J, Jahoor A. Discovery of resistance genes in rye by targeted long-read sequencing and association genetics. Cells. 2022;11:1273.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Rakoczy-Trojanowska M, Krajewski P, Bocianowski J, Schollenberger M, Wakuliński W, Milczarski P, Masojć P, Targońska-Karasek M, Banaszak Z, Banaszak K, Brukwiński W, Orczyk W, Kilian A. Identification of single nucleotide polymorphisms associated with brown rust resistance, α-amylase activity and pre-harvest sprouting in rye (Secale cereale L.). Plant Mol Biol Rep. 2017;35(3):366–78. https://doi.org/10.1007/s11105-017-1030-6.

    Article  CAS  Google Scholar 

  15. Rabanus-Wallace MT, Hackauf B, Mascher M, et al. Chromosome-scale genome assembly provides insights into rye biology, evolution and agronomic potential. Nat Genetics. 2021;53:564–73. https://doi.org/10.1038/s41588-021-00807-0.

    Article  CAS  PubMed  Google Scholar 

  16. Krępski T, Olechowski M, Samborska-Skutnik I, Święcicka M, Grądzielewska A, Rakoczy-Trojanowska M. Identification and characteristics of wheat Lr orthologs in three rye inbred lines. PLoS One. 2023;18(7):e0288520.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Bedo J, Wenzl P, Kowalczyk A, Kilian A. Precision-mapping and statistical validation of quantitative trait loci by machine learning. BMC Genet. 2008;9:35.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Lin F, Fan J, Rhee SY. QTG-Finder: a machine-learning based algorithm to prioritize causal genes of quantitative trait loci in Arabidopsis and rice. G3 (Bethesda). 2019;9(10):3129–38. https://doi.org/10.1534/g3.119.400319.

    Article  CAS  PubMed  Google Scholar 

  19. Chen J, Nodzak C. Statistical and machine learning methods for eQTL analysis. In: Shi X, editor. eQTL analysis. Methods in molecular biology, vol. 2082. New York: Humana; 2020. https://doi.org/10.1007/978-1-0716-0026-9_7.

    Chapter  Google Scholar 

  20. Freda PJ, Ghosh A, Zhang E, et al. Automated quantitative trait locus analysis (AutoQTL). BioData Mining. 2023;16:14. https://doi.org/10.1186/s13040-023-00331-3.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Jaccoud D, Peng KM, Feinstein D, Kilian A. Diversity arrays: a solid state technology for sequence information independent genotyping. Nucleic Acids Res. 2001;29(4):e25. https://doi.org/10.1093/nar/29.4.e25.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Kilian A, Wenzl P, Huttner E, Carling J, Xia L, Blois H. Diversity arrays technology: a generic genome profiling technology on open platforms. Methods Mol Biol. 2012;888:67–89. https://doi.org/10.1007/978-1-61779-870-2_5.

    Article  PubMed  Google Scholar 

  23. Bolibok-Brągoszewska H, Heller-Uszyńska K, Wenzl P, Uszyński G, Kilian A, Rakoczy-Trojanowska M. DArT markers for the rye genome - genetic diversity and mapping. BMC Genomics. 2009;10:578.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Milczarski P, Bolibok-Brągoszewska H, Myśków B, Stojałowski S, Heller-Uszyńska K, Góralska M, Brągoszewski P, Uszyński G, Kilian A, Rakoczy-Trojanowska M. A high density consensus map of rye (Secale cereale L.) based on DArT markers. PLoS One. 2011;6(12):e28495.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Stojałowski SA, Milczarski P, Hanek M, Bolibok-Brągoszewska H, Myśków B, Kilian A, Rakoczy-Trojanowska M. DArT markers tightly linked with the Rfc1 gene controlling restoration of male fertility in the CMS-C system in cultivated rye (Secale cereale L.). J Appl Genetics. 2011;52:313–8. https://doi.org/10.1007/s13353-011-0049-x.

    Article  CAS  Google Scholar 

  26. Gawroński P, Pawełkowicz M, Tofil K, Uszyński G, Sharifova S, Ahluwalia S, Tyrka M, Wędzony M, Kilian A, Bolibok-Brągoszewska H. DArT markers effectively target gene space in the rye genome. Front Plant Sci. 2016;7:1600. https://doi.org/10.3389/fpls.2016.01600.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Targońska-Karasek M, Bolibok-Brągoszewska H, Oleniecki T, Sharifova S, Kopania M, Rakoczy-Trojanowska M. Verification of taxonomic relationships within the genus Secale (Poaceae: Pooideae: Triticeae) based on multiple molecular methods. Phytotaxa. 2018;383(2):14. https://doi.org/10.11646/phytotaxa.383.2.1.

    Article  Google Scholar 

  28. Murphy HC. Physiologic specialization in Puccinia coronata avenae. US Dep Agric Tech Bull. 1935;433:48.

    Google Scholar 

  29. Cruz VMV, Kilian A, Dierig DA. Development of DArT marker platforms and genetic diversity assessment of the US collection of the new oilseed crop lesquerella and related species. PLoS One. 2013;8:e64062.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Kang HM, Sul JH, Service SK, Zaitlen NA, Kong S-Y, Freimer NB, Sabatti CH, Eskin E. Variance component model to account for sample structure in genome-wide association studies. Nat Genet. 2010;42:348–54. https://doi.org/10.1038/ng.548.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. McLaren W, Gil L, Hunt SE, Riat HS, Ritchie GR, Thormann A, Flicek P, Cunningham F. The ensembl variant effect predictor. Genome Biol. 2016;17:122. https://doi.org/10.1186/s13059-016-0974-4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Crespo-Herrera LA, Smith CM, Singh RP, Åhman I. Resistance to multiple cereal aphids in wheat–alien substitution and translocation lines. Arthropod Plant Interact. 2013;7(5):535–45. https://doi.org/10.1007/s11829-013-9267-y.

    Article  Google Scholar 

  33. Rakoczy-Trojanowska M, Bolibok-Brągoszewska H, Myśków B, et al. Genetics and genomics of stress tolerance. In: Rabanus-Wallace MT, Stein N, editors., et al., The Rye genome. Compendium of plant genomes. Cham: Springer; 2021. https://doi.org/10.1007/978-3-030-83383-1_11.

    Chapter  Google Scholar 

  34. DeYoung BJ, Innes RW. Plant NBS-LRR proteins in pathogen sensing and host defense. Nat Immunol. 2006;7:1243–9. https://doi.org/10.1038/ni1410.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Zou S, Tang Y, Xu Y, Ji J, Lu Y, Wang H, Li Q, Tang D. TuRLK1, a leucine-rich repeat receptor-like kinase, is indispensable for stripe rust resistance of YrU1 and confers broad resistance to multiple pathogens. BMC Plant Biol. 2022;22:280. https://doi.org/10.1186/s12870-022-03679-6.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Zhang J, Wang F, Liang F, Zhang Y, Ma L, Wang H, Liu D. Functional analysis of a pathogenesis-related thaumatin-like protein gene TaLr35PR5 from wheat induced by leaf rust fungus. BMC Plant Biol. 2018;18:76. https://doi.org/10.1186/s12870-018-1297-2.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Liu R, Lv X, Wang X, Yang L, Cao J, Dai Y, Wu W, Wu Y. Integrative analysis of the multi-omics reveals the stripe rust fungus resistance mechanism of the TaPAL in wheat. Front Plant Sci. 2023;14:1174450.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Neugebauer KA, Bruce M, Todd T, Trick HN, Fellers JP. Wheat differential gene expression induced by different races of Puccinia triticina. PLoS One. 2018;13(6):e0198350.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Hawku MD, He F, Bai X, Islam MA, Huang X, Kang Z, Guo J. A R2R3 MYB transcription factor, TaMYB391, is positively involved in wheat resistance to Puccinia striiformis f. sp. tritici. Int J Mol Sci. 2022;23(22):14070. https://doi.org/10.3390/ijms232214070.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Dmochowska-Boguta M, Kloc Y, Zielezinski A, Werecki P, Nadolska-Orczyk A, Karlowski WM, Orczyk W. TaWAK6 encoding wall-associated kinase is involved in wheat resistance to leaf rust similar to adult plant resistance. PLoS One. 2020;15(1):e0227713.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Krępski T, Piasecka A, Święcicka M, Kańczurzewska M, Sawikowska A, Dmochowska-Boguta M, Rakoczy-Trojanowska M. Leaf rust ( Puccinia recondita f. sp. secalis ) triggers substantial changes in rye ( Secale cereale L.) at the transcriptome and metabolome levels. BMC Plant Biol. 2024;24:107.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Coculo D, Lionetti V. The plant invertase/pectin methylesterase inhibitor superfamily. Front Plant Sci. 2022;13:863892.

    Article  PubMed  PubMed Central  Google Scholar 

  43. Gao M, Yin X, Yang W, Lam SM, Tong X, Liu J, Wang X, Li Q, Shui G, He Z. GDSL lipases modulate immunity through lipid homeostasis in rice. PLoS Pathog. 2017;13(11):e1006724.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  44. Irieda H. Emerging roles of motile epidermal chloroplasts in plant immunity. Int J Mol Sci. 2022;23(7):4043.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Lu Y, Wu J, Wang R, Yan Y. Identification of stress defensive proteins in common wheat-Thinopyron intermedium translocation line YW642 developing grains via comparative proteome analysis. Breed Sci. 2020;70(5):517–29. https://doi.org/10.1270/jsbbs.19133.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. Zan X, Zhou Z, Wan J, et al. Overexpression of OsHAD3, a member of HAD superfamily, decreases drought tolerance of rice. Rice. 2023;16:31. https://doi.org/10.1186/s12284-023-00647-y.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

We thank Robert McKenzie, PhD, from Edanz (www.edanz.com/ac) for editing a draft of this manuscript. We thank Tomasz Krępski, Marek D. Koter, Izabela Samborska-Skutnik, Ewa Siedlecka, Mateusz Olechowski, and Bartek Biernacik for their technical support in detached leaf tests. We also thank Meric Deniz Inan for technical support in annotation of DArTseq and DArT-silico markers.

Funding

This study was financially supported by National Science Centre, grant no. 2018/31/B/NZ9/00439.

Author information

Authors and Affiliations

Authors

Contributions

AG – designing and performing DLT; MM – mapping and annotation of DArTseq and DArT-silico markers, co-contribution in writing the manuscript; MM and ADI – performing RT-qPCR analysis; MŚ and AO – responsible for plant materials, isolating DNA and preparing samples for DArT analyses; MM and MoM – performing analysis of SNP translational effects; JS – performing Linear Mixed Model (LMM) analysis; AK – performing DArT analyses, supervision of LMM analysis; MRT – conceptualization and supervision of the experiments, co-contribution in writing the manuscript

Corresponding author

Correspondence to Rakoczy-Trojanowska Monika.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mateusz, M., Agnieszka, G., Magdalena, Ś. et al. Identification of quantitative trait loci associated with leaf rust resistance in rye by precision mapping. BMC Plant Biol 24, 291 (2024). https://doi.org/10.1186/s12870-024-04960-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12870-024-04960-6

Keywords