One of the challenges in insect chemical ecology is to understand how insect pheromones are synthesised, detected and degraded. Genome wide survey by comparative sequencing and gene specific expression profiling provide rich resources for this challenge. A. ipsilon is a destructive pest of many crops and further characterization of the genes involved in pheromone biosynthesis and transport could offer potential targets for disruption of their chemical communication and for crop protection.
Here we report 454 next-generation sequencing of the A. ipsilon pheromone gland transcriptome, identification and expression profiling of genes putatively involved in pheromone production, transport and degradation. A total of 23473 unigenes were obtained from the transcriptome analysis, 86% of which were A. ipsilon specific. 42 transcripts encoded enzymes putatively involved in pheromone biosynthesis, of which 15 were specifically, or mainly, expressed in the pheromone glands at 5 to 120-fold higher levels than in the body. Two transcripts encoding for a fatty acid synthase and a desaturase were highly abundant in the transcriptome and expressed more than 40-fold higher in the glands than in the body. The transcripts encoding for 2 acetyl-CoA carboxylases, 1 fatty acid synthase, 2 desaturases, 3 acyl-CoA reductases, 2 alcohol oxidases, 2 aldehyde reductases and 3 acetyltransferases were expressed at a significantly higher level in the pheromone glands than in the body. 17 esterase transcripts were not gland-specific and 7 of these were expressed highly in the antennae. Seven transcripts encoding odorant binding proteins (OBPs) and 8 encoding chemosensory proteins (CSPs) were identified. Two CSP transcripts (AipsCSP2, AipsCSP8) were highly abundant in the pheromone gland transcriptome and this was confirmed by qRT-PCR. One OBP (AipsOBP6) were pheromone gland-enriched and three OBPs (AipsOBP1, AipsOBP2 and AipsOBP4) were antennal-enriched. Based on these studies we proposed possible A. ipsilon biosynthesis pathways for major and minor sex pheromone components.
Our study identified genes potentially involved in sex pheromone biosynthesis and transport in A. ipsilon. The identified genes are likely to play essential roles in sex pheromone production, transport and degradation and could serve as targets to interfere with pheromone release. The identification of highly expressed CSPs and OBPs in the pheromone gland suggests that they may play a role in the binding, transport and release of sex pheromones during sex pheromone production in A. ipsilon and other Lepidoptera insects.
Lepidoptera sex pheromones are primarily C10-C18 long straight chain unsaturated alcohols, aldehydes or acetate esters , biosynthesised and released mainly from pheromone glands located between the 8th and 9th abdominal segments of the female moths. Usually the females use a mixture of compounds in a unique ratio to attract conspecific males . The extremely high specificity and sensitivity of species-specific pheromones make them potential biological control agents for population monitoring, mass trapping and reducing pesticide use in integrated pest management (IPM) programs [3-5]. Further use of pheromones in such strategies would be aided by an understanding of the pathways involved in pheromone biosynthesis and transport.
Most sex pheromone blends of Lepidoptera insects are synthesised de novo via modified fatty acid biosynthesis pathways [2,6,7] and gland-specific enzymes are involved in desaturation, chain shortening, reduction and acetylation [1,2]. Different species use different combinations of these reactions to produce unique species-specific pheromone blends. The first step is the synthesis of saturated fatty acid precursors malonyl-CoA from acetyl-CoA by acetyl-CoA carboxylase (ACC) and fatty acid synthetase (FAS) [8,9]. Labeling studies conducted with acetate indicated that malonyl-CoA and NADPH are used by FAS to produce mainly saturated stearic acid (18:0) and palmitic acid (16:0) with 18 and 16 carbon atoms and no double bonds, respectively, as precursors [10-12]. Modification of the fatty acid chain includes the introduction of a double bond by desaturases specific to pheromone biosynthesis followed by chain shortening using specific β–oxidation enzymes [13,14]. So far, several types of desaturases have been extensively studied through gene characterization and expression analysis, including Δ5 , Δ9 [16,17], Δ10 , Δ11 [19,20], and Δ14  desaturases. Once unsaturated pheromone precursor with a specific chain-length is produced, the carboxyl carbon is modified to form one of functional groups (aldehyde, alcohol or acetate ester). These modifications require the enzymes fatty acid reductase to produce the alcohols from the fatty acyl precursor , which in some species may be oxidized to aldehydes serving as pheromone components , and to acetate esters (OAc) by acetyltransferase . Recently, a few members of the reductase gene family have been discovered and functionally characterized in several Lepidoptera species, including Ostrinia scapulalis, Heliothis virescens, Heliothis subflexa, Helicoverpa armigera, Helicoverpa assulta, Ostrinia nubilalis, Yponomeuta evonymellus (L.), Yponomeuta padellus (L.) and Yponomeuta rorellus (Hübner) . A number of pheromone gland-specific enzymes have been identified and their essential functions in pheromone production demonstrated in vitro as well as in vivo. For example, using RNA interference, Matsumoto and colleagues showed that two pheromone gland-specific enzymes (acyl-CoA desaturase and a fatty-acyl reductase) are responsible for pheromone production in the silk moth Bombyx mori[29-31].
After production and release of the sex pheromone components by female moths the males detect the pheromone and respond for mating. It is commonly accepted that pheromone molecules are captured and transported to the pheromone receptors on the dendrites of pheromone-sensitive neurons by olfactory binding proteins, including odorant binding proteins (OBPs) and chemosensory proteins (CSPs) [32-34]. Pheromone binding proteins (PBPs) bind to sex pheromone components and classified into a subclass of OBPs . After activation of the pheromone receptors the olfactory signals must be degraded rapidly to prevent from prolonged neuronal excitation . This may involve pheromone degrading enzymes (PDEs) capable of degrading the pheromone molecules .
The black cutworm Agrotis ipsilon is a destructive polyphagous insect pest of many crops and for a strain from China the female sex pheromone blend comprises five main acetate components: (Z)-11-hexadecenyl acetate (Z11-16:OAc), (Z)-9-tetradecenyl acetate (Z9-14:OAc), (Z)-7-dodecenyl acetate (Z7-12:OAc), (Z)-8-dodecenyl acetate (Z8-12:OAc) and (Z)-5-decenyl acetate (Z5-10:OAc) . These components indicate the involvement of different desaturases and ß-oxidases during the sex pheromone biosynthesis. However, the genes/proteins and their specific function in mediating A. ipsilon pheromone production, transport and degradation have not been characterized. Over the last few years, the next generation sequencing such as 454 pyrosequencing technique provides an easy and effective method for the discovery of novel genes. In present study, using the Roche GS FLX Titanium sequencing platform, we report a genetic database of the genes expressed in the pheromone glands of A. ipsilon and the identification of genes with putative roles in pheromone biosynthesis, degradation and transport as well as their tissue expression profiles.
Results and discussion
454 sequencing and unigene assembly
Sequencing of a cDNA library prepared from mRNAs of the pheromone glands of A. ipsilon gave a total of 631,425 raw reads with an average length of 517 base pairs (bp). After trimming adaptor sequences and removing low quality sequences, 629,273 clean reads remained with an average length of 496 bp. The size distribution of the clean reads is shown in Additional file 1. The sequences of all reads have been deposited in the NCBI SRA database with the accession number SRX189143.
Format: DOCX Size: 275KB Download file
The 629,273 clean reads were assembled into 23,473 unigenes, including 20,541 contigs (87.5%) and 2,932 singletons (12.5%), the largest transcriptome dataset so far from moth sex pheromone glands. An overview of the sequencing and assembly results is presented in Table 1. The length of the assembled unigenes ranged from 100 bp to 21842 bp with an average length of 770 bp. Among the unigenes, 22,035 (93.9%) are between 200 bp and 2000 bp long with an average length of 649 bp. These unigenes are in fact transcripts in the A. ipsilon pheromone gland cDNA library. Therefore we refer them as transcripts. All sequences of the unigenes used in the current study are provided in Additional file 2.
Table 1. Summary of A. ipsilon pheromone gland unigene sequences and assembly
Analysis of the transcripts from the A. ipsilon pheromone gland
BLASTx and BLASTn were used to compare each A. ipsilon transcript with a cut-off E-value of 1.0E-5 against GeneBank entries. 12,989 transcripts (55%) had BLASTx hits in the non-redundant protein (nr) databases and 9,392 (40%) had BLASTn hits in the non-redundant nucleotide sequence (nt) databases. This is consistent with a previous report of H. virescens pheromone gland ESTs . Some of the A. ipsilon transcripts were homologous to those from more than one species but in general most were homologous to other Lepidoptera species taking up 2,379 in the 9,392 BLASTn hits, including 1,124 (12%) to B. mori entries. The second highest hits were to Dipteran species with 343 hits to D. melanogaster and 279 and 221 hits to the mosquitoes Anopheles gambiae and Aedes aegypti, respectively. The lowest hits were to the wasp Nasonia vitripennis (190 hits), the beetle Tribolium castaneum (147 hits) and the pea aphid Acyrthosiphon pisum (136 hits). The top 15 insect species that have significant BLASTn hits are shown in Figure 1.
Figure 1. Top 15 insect species that have significant BLASTn hits. All A. ipsilon pheromone gland unigenes were used in BLASTn searches against the GenBank entries. The significant hits with an E-value >=1.0E-5 for each query were grouped according to species and the number of the unigenes that had significant homology is indicated after the specie name.
Gene Ontology of the genes expressed in the A. ipsilon pheromone gland
The 23,473 assembled transcripts were annotated into different functional groups according to Gene Ontology (GO) analysis. Some transcripts were annotated into more than one GO category. Of the 22,473 transcripts, 7,546 (32%) could be assigned to a GO category (Additional file 3). The “cellular process” and “metabolic process” GO categories were most abundantly represented with 4,056 (17.3%) and 3,361 (14.3%) transcripts, respectively, within the biological process GO ontology. In the “cellular components” GO ontology the transcripts were mainly distributed in cell (18.8%) (4,415 transcripts) and cell part (17.6%) (4,133 transcripts). The GO analysis also showed that in the molecular function ontology 3,271 transcripts (13.9%) were annotated as having binding functions and 3,484 (14.8%) to have catalytic activity.
Additional file 3. Gene Ontology (GO) classifications of the 23473 A. ipsilon pheromone gland unigenes according to their involvement in biological processes, cellular component and molecular function.
Format: DOCX Size: 146KB Download file
Comparative analysis of transcripts in Lepidoptera pheromone glands
In order to compare the A. ipsilon pheromone gland transcriptome with those from other Lepidoptera and to identify A. ipsilon transcripts with potential involvement in sex pheromone production and transport we downloaded the pheromone gland ESTs of three other Lepidoptera A. segetum, B. mori and H. virescens from the dbEST database of NCBI and previously published pheromone gland transcriptome of H. virescens. After assembling these ESTs we obtained 925 unigenes from A. segetum, 3943 from B. mori and 8202 from H. virescens with an average length of 384 bp, 692 bp and 474 bp, respectively. These are much lower numbers than that obtained by the current study through the 454 sequencing of the A. ipisilon pheromone gland, demonstrating that our pheromone gland transcriptome is currently the largest transcriptome resource for an insect pheromone gland.
When comparing the pheromone gland transcripts pairwise using best bidirectional hits, we found that there were 461 homologous transcripts between A. ipsilon and A. segetum, 1110 homologous transcripts between A. ipsilon and B. mori, and 2106 homologous transcripts between A. ipsilon and H. virescens (Figure 2). A large portion of A. ipsilon transcripts (86.4%) (20,274 out of 23,473) had no homologous ESTs in the available pheromone gland EST libraries of the other 3 species. This may be due to the larger dataset (23,473 unigenes) for A. ipsilon and lower coverage in the other studies. Nevertheless, it was shown that 309 transcripts, 5,755 transcripts and 2,556 transcripts are only found in A. segetum, H. virescen and B. mori, respectively, in our comparison (Figure 2).
Figure 2. Comparative analysis of A. ipsilon pheromone gland unigenes with other insects. This shows the overlap of blast homology in genes expressed in pheremone glands in four species of Lepidoptera. The comparative analyses of A. ipsilon, H. virescens, B. mori and A. segetum pheromone gland unigenes were performed based on the Best Bidirectional Hits results (reciprocal BLASTn, E-value less than 1.0E-6).
Transcript abundance in the A. ipsilon pheromone gland
The pheromone gland mRNA samples used for constructing the cDNA library were non-normalized and non-amplified by PCR, so the reads in the sequencing dataset most likely represent the relative abundance of each assembled transcript in the pheromone gland as summarized in Table 2. The most abundant transcripts include vitellogenin, a major reproductive protein in insects (2,925 reads per kilobase per million mapped reads (RPKM); 2.2% reads), the precursor of egg yolk proteins for insect egg production  and genes involved in PBAN stimulated pheromone production such as lipase 3  (4,731 RPKM; 0.8% reads) and in sex pheromone biosynthesis such as acyl-CoA desaturase (1,206 RPKM; 0.3% reads) and in lipid transport such as apolipophorin III (2894 RPKM 0.4% reads). Another highly abundant transcript (Unigene_721) with 1,365 RPKM encodes a CSP with a 76% protein identity to the H. virescens CSP (Protein ID: ACX53806) and 41% to the ejaculatory bulb-specific protein 3 of D. melanogaster (Protein ID: Q9W1C9).
Table 2. The most prevalent mRNAs in A. ipsilon sex pheromone gland
Candidate genes in the A. ipsilon pheromone gland with putative functions in pheromone production, transport and degradation
The overall enzymatic steps during pheromone biosynthesis in A. ipsilon are likely to be similar to those in other moth species, which include fatty acid synthesis, desaturation, chain shortening, reduction and acetylation [1,2,6]. By homologous searches we identified members of gene subfamilies in the A. ipsilon pheromone gland transcriptome putatively involved in these biosynthetic processes and pheromone production, including transcripts putatively encoding 3 synthases (2 actyl-CoA carboxylase and 1 fatty acid synthase), 5 desaturases, 13 acyl-CoA reductases, 5 alcohol oxidases and 5 acetyltransferases as well as 11 aldehyde reductases (Table 3); 17 transcripts encoding putative pheromone degradation enzymes (Table 4); 8 transcripts encoding putative CSPs and 7 transcripts encoding putative OBPs (Table 5). Their abundances in the pheromone gland transcriptome are shown in Figures 3 and 4. We further validated and characterized the expression level and the tissue distribution of these genes by RT-PCR and qRT-PCR and summarised below. There is a clear agreement between the transcript abundance estimated by the transcriptome sequencing and transcript expression level in the pheromone gland as measured by RT-PCR and qRT-PCR.
Table 3. Putative pheromone biosynthesis related genes in the A. ipsilon pheromone gland
Table 4. Candidate esterase genes likely involved in A. ipsilon pheromone degradation
Table 5. Candidate olfactory genes involved in A. ipsilon pheromone reception
Figure 3. The abundance of the unigenes encoding the sex pheromone synthase in the A. ipsilon transcriptome dataset presented as normalized read count in reads per kilobase per million mapped reads (RPKM). The putative enzyme names are indicated as gene abbreviations followed by Genbank accession numbers. ACC Acetyl-CoA carboxylase, AOX Alcohol oxidase, AR Aldehyde reductase, ATF Acetyltransferase, DES Desaturase, FAR Fatty acyl reductase, FAS Fatty acid synthase.
Figure 4. The abundance of unigenes encoding chemosensory proteins (CSPs), odorant-binding proteins (OBPs) and esterase (EST) in the A. ipsilon transcriptome dataset presented as normalized reads in reads per kilobase per million mapped reads (RPKM).
Receptor for the pheromone biosynthesis activating neuropeptide (PBAN)
PBAN is released from the suboesophagal ganglion in the brain and goes to the hemolymph, where it binds to the PBAN receptor in the membrane of the pheromone gland and triggers the pheromone production [42,43]. Although there was no PBAN receptor found in the pheromone gland transcriptome of H. virescens we found one transcript (Unigene_3821) encoding a protein highly homologous to PBAN receptor isoform B. It has very low abundance in the A. ipsilon transcriptome (31 RPKM) but high amino acid identity of 97% to H. virescens PBAN receptor in GenBank (Protein IDs: ABU93813) .
Acetyl-CoA carboxylase (ACC)
Saturated long chain fatty acids are the precursors of sex pheromones in most moth species. Their biosynthesis is started by ACC catalysing the production of malonyl-CoA from acetyl-CoA in the first committed biosynthesis step [8,9]. In the A. ipsilon pheromone gland we found two transcripts (ACC-JX989149 and ACC-JX989150) encoding ACCs. ACC-JX989149 with an open reading frame (ORF) of 5841 bp encodes for a ACC with 67% amino acid identity with the ACC of T. castaneum (Protein ID: XP_969851) and ACC-JX989150 encodes a protein with 56% amino acid identity with the ACC of H. virescens (Protein ID: ACX53705) (Table 3). The RT-PCR and qRT-PCR revealed that both ACC-JX989149 and ACC-JX989150 are highly expressed in the pheromone gland as compared to the body (Figure 5 and Figure 6). However, they have very low abundance (81 and 21 RPKM) in the transcriptome (Figure 3).
Figure 5. RT-PCR results showing the relative expression of the A. ipsilon pheromone biosynthesis-related genes in pheromone gland (PG) and the body (BO). The genes that are more highly expressed in the pheromone gland are labeled with red pentagram. β-actin was used as internal reference gene to test the integrity of each cDNA templates; the similar intensity of β-actin bands between the pheromone gland and the body part indicate the use of equal template concentrations.
Figure 6. qRT-PCR results showing the relative expression levels of the A. ipsilon pheromone biosynthesis related genes between the pheromone gland (PG) and the body (BO). The putative enzyme names are indicated as gene abbreviations followed by Genbank accession numbers. ACC Acetyl-CoA carboxylase, FAS Fatty acid synthase, DES Desaturase, FAR Fatty acyl reductase, AOX alcohol oxidase, AR Aldehyde reductase, ATF Acetyltransferase. The internal control β-actin and ribosomal protein S3 were used to normalize transcript levels in each sample. This figure was presented using β-actin as reference gene to normalize the target gene expression and correct sample-to-sample variation; similar results were also obtained with ribosomal protein S3 as reference gene. The standard error is represented by the error bar, and the different letters (a, b) above each bar denote significant differences (p >0.05).
Fatty acid synthase (FAS)
FAS has been shown to catalyse the conversion of malonyl-CoA and NADPH to produce saturated fatty acids . We identified one putative FAS transcript (FAS-JX989151) in the A. ipsilon pheromone gland (Table 3), containing an ORF of 7176 bp and encoding a FAS with 57% amino acid identity to the FAS of T. castaneum (Protein ID: XP_970417). The RT-PCR and qRT-PCR revealed that FAS-JX989151 is highly expressed in the pheromone gland (40-fold higher than in the body, Figure 5 and Figure 6) and also has a high abundance (343 RPKM) in the transcriptome (Figure 3).
Pheromone-specific desaturases introduce double bond(s) into the fatty acids at specific positions along the chain. Five putative sex pheromone components extracted from A. ipsilon sex pheromone gland are unsaturated fatty acids with acetate as the functional group and 16 or less carbons . At least three active pheromone components (Z7-12:OAc, Z9-14:OAc and Z11-16:OAc) have been identified in A. ipsilon strains from China , North America , France  and Japan . It is reasonable to propose that the saturated fatty acid precursor of A. ipsilon sex pheromones would be palmitic acid (16:0) which is desaturated by ∆11-desaturase to form the precursor Z11-16:acyl-CoA for the production of two major (Z7-12:OAc and Z9-14:OAc) and two minor (Z11-16:OAc and Z5-10:Ac) pheromone components (Figure 7). It is not clear how the minor pheromone component (Z8-12:OAc) is synthesized in A. ipsilon, which should involve a ∆12-desaturase. Other studies in Lepidoptera species support a ∆11-desaturase acting on palmitic acid and leading to the production of the sex pheromone components [19,20,48]. In the A. ipsilon pheromone gland transctiptome 5 transcripts have high homology to genes encoding desaturases (Table 3). DES-JX989152 is homologous to a gene encoding an acyl-CoA ∆9-desaturase in M. brassicae (Protein ID: ABX90048) with an amino acid identity of 96%. ∆9-desaturase makes oleic acid from stearic acid (18:0) and possibly palmitoleic acid from palmitic acid [16,17,49]. It would not participate in the biosynthesis of A. ipsilon sex pheromones. DES-JX989153 encodes a protein with 87% amino acid identity with the acyl-CoA ∆11 desaturase of M. brassicae (Protein ID: ABX90049). DES-JX989154, DES-JX989155 and DES-JX989156 encode proteins, respectively, with 94% amino acid identity to the acyl-CoA desaturase from H. assulta (Protein ID: AF482909), 64% amino acid identity to a S. littoralis desaturase (Protein ID: AAQ74260) and 93% amino acid identity to an acyl-CoA desaturase of S. exigua (Protein ID: AAM28510). These transcripts could possibly encode ∆12-desaturases in A. ipsilon in formation of the minor pheromone component Z8-12:OAc from the precursor Z12-16:acyl-CoA. However, they could also function as ∆9-desaturase. Further study on their enzyme activity could confirm their role in the sex pheromone biosynthesis. The RT-PCR and qRT-PCR results indicated that DES-JX989153 and DES-JX989154 are highly expressed in the A. ipsilon pheromone gland compared with the body (85 and 63 fold higher, respectively) (Figure 5 and Figure 6). One of the transcripts (DES-JX989154) is also highly abundant (1206 RPKM) in the pheromone gland transcriptome (Figure 3), suggesting a possible role in A. ipsilon sex pheromone biosynthesis.
Figure 7. Putative biosynthesis pathways of the sex pheromones in Agrotis ipsilon. The saturated fatty acid precursor palmitic acid (16:0) is desaturated by ∆11-desaturase to form the precursor Z11-16:acyl-CoA for the production of three major and one minor pheromone components (adapted from [2,6,12,13,50]).
Fatty acyl-CoA reductase (FAR)
Once a specific Δ11 and possibly Δ12 double bond is introduced into fatty acid precursors to form a fatty acyl-CoA precursor, the chain of the precursors is then shortened sequentially by ß–oxidation to form different shorter chain fatty acyl-CoA precursors . These precursors are further reduced individually by fatty acyl reductase (FAR) to form corresponding fatty alcohols [26,28,51]. In the A. ipsilon pheromone gland transcriptome there are 13 transcripts homologous to putative FAR genes (Table 3). Among them, 5 transcripts encode proteins with 59%-80% amino acid identity to the fatty-acyl CoA reductases of Ostrinia nubilalis (Protein IDs: ADI82776, ADI82777, ADI82778 and ADI82779). Other FAR transcripts are homologous to the fatty acyl-CoA reductase from a wide range of insect species including H. virescens, N. vitripennis, Danaus plexippus, Bombus terrestris and Apis mellifera with amino acid identities of about 60% (Table 3). The RT-PCR and qRT-PCR results indicated that three transcripts (FAR-JX989157, FAR-JX989162 and FAR-JX989164) are highly expressed in the pheromone gland (Figure 5 and Figure 6). The other ten transcripts seem equally expressed in the pheromone gland and the body or highly expressed in the body. All FAR transcripts except two (FAR-JX989157 and FAR-JX989159) have low abundance (from 81 and 16 RPKM) in the pheromone gland transcriptome (Figure 3).
Alcohol oxidase/dehydrogenase (AOX)
Fatty alcohols can be used as pheromone components in many moth species, and they are also pheromone intermediates to produce aldehyde pheromones by the alcohol oxidases [52,53]. In the A. ipsilon PG 5 homologous genes of alcohol oxidase/dehydrogenase were identified, the BLASTx results revealed three unigenes (AOX-KC007341, AOX-KC007342 and AOX-KC007344) are with the amino acid identity of 43%, 55% and 64%, respectively, to a putative alcohol dehydrogenase of D. plexippus (Protein ID: EHJ70611), and one unigene (AOX-KC007345) are homologous to another putative alcohol dehydrogenase of D. plexippus (Protein ID: EHJ73729 ) with the amino acid identity of 68%. AOX-KC007343 showed 78% amino acid identity with the alcohol dehydrogenase of H. virescens (Protein ID: ACX53694). The RT-PCR and qRT-PCR results indicated that AOX-KC007341 and AOX-KC007343 showed a higher expressed level in the PG than in the body (Figure 5 and Figure 6).
Aldehyde reductase (AR)
Aldehyde reductases are members of the aldo-ketoreductase superfamily and could be used to reduce long-chain acyl-CoA to form alcohol intermediates . In the A. ipsilon pheromone gland we identified 11 transcripts with homology to the aldo-ketoreductases of Papilio dardanus, B. mori, H. armigera, D. plexippus, Culex quinquefasciatus, H. virescens and Papilio xuthus (Table 3). The derived protein sequences of these 11 transcripts show 53%-88% amino acid identity with their homologs in other insects. The RT-PCR and qRT-PCR results indicated that AR-KC007350 and AR-KC007351 are mainly expressed in the pheromone gland, while the other 9 putative aldehyde reductase transcripts have equal expression levels between the pheromone gland and the body or a higher expression level in the body (Figure 5 and Figure 6). All aldehyde reductase transcripts are present at low abundance (from 67 to 10 RPKM) in the pheromone gland transcriptome (Figure 3). The involvement of aldehyde reductase in sex pheromone biosynthesis has not been demonstrated in moth species.
The fatty acid alcohols are used as pheromone components in many moth species. In A. ipsilon whose sex pheromone blends comprise only acetates, they are intermediates and acetylated to pheromone components as acetate esters by actyltransferases . In the A. ipsilon pheromone gland transcriptome 5 acetyltransferase homologous transcripts were identified (Table 3), 3 of them (ATF-KC007357, ATF-KC007360 and ATF-KC007361) encode proteins that are homologous to the acetyltransferase of D. plexippus (Protein IDs: EHJ65205, EHJ65977 and EHJ68573) with relatively high amino acid identities (<70%), one (ATF-KC007358) encodes a protein with 90% amino acid identity to H. virescens acetyltransferase (Protein ID: ACX53812) and one (ATF-KC007359) encodes a protein with 86% amino acid identity with the acetyltransferase of B. mori (Protein ID: NP_001182381). The RT-PCR and qRT-PCR revealed that three transcripts (ATF-KC007358, ATF-KC007360 and ATF-KC007357) are mainly expressed in the pheromone gland (Figure 5 and Figure 6) and have a relative high abundance of 195, 155 and 71 RPKM, respectively in the pheromone gland transcriptome (Figure 3).
Genes encoding candidate pheromone degrading enzymes in the A. ipsilon pheromone gland
It would be potentially harmful to insects if pheromone molecules and other odorants remained on the olfactory receptors after they had stimulated the olfactory receptor neurons (ORNs). It is therefore thought that there are mechanisms to protect the ORNs by odorant degrading enzymes (ODEs)  including esterases [54,55], aldehyde oxidases [56-58], cytochromes P450 [59-61], carboxyl esterase , and glutathione S-transferase (GST) . In this study, we identified 17 transcripts predicted to encode esterases in the A. ipsilon pheromone gland, and the BLASTx results showed that all have very high amino acid identities with the antennal esterases of S. littoralis (Table 4), we named them as AipsCXE1-AipsCXE16 and AipsCXE20 following the nomenclature in S. littoralis. Our qRT-PCR results revealed that 7 of the transcripts (AipsCXE3, AipsCXE7, AipsCXE8, AipsCXE9, AipsCXE11, AipsCXE14 and AipsCXE20) are antennal-enriched, 3 (AipsCXE5, AipsCXE10 and AipsCXE15) are both antennal- and pheromone gland-enriched and the remaining 7 (AipsCXE1, AipsCXE2, AipsCXE4, AipsCXE6, AipsCXE12, AipsCXE13 and AipsCXE16) have similar expression levels in antennae, body and pheromone gland, suggesting they are not pheromone specific (Figure 8).
Figure 8. qRT-PCR results showing the expression of A. ipsilon unigenes encoding the putative esterase (CXE) identified in the pheromone gland in the male antennae (MA), the female antennae (FA), the body (BO) and the pheromone gland (PG). The standard error is represented by the error bar, and the different letters (a, b, c) above each bar denote significant differences (p > 0.05).
Genes encoding candidate pheromone carrier proteins in the A. ipsilon pheromone gland
Moth sex pheromones are synthesised and protected from degradation until being released from the female pheromone gland and it has been proposed that OBPs and CSPs could participate in this process. In this study we have identified transcripts of 7 OBPs and 8 CSPs from the A. ipsilon pheromone gland (Table 5), all of these have the typical insect OBP sequence motif C1-X15-39-C2-X3-C3-X21-44-C4-X7-12-C5-X8-C6 [35,64] or CSP sequence motif C1-X6-8-C2-X16-21-C3-X2-C4. One CSP transcript, AipsCSP2 seems to be gland-specific and has an extremely high expression level (<100 folds) in the pheromone glands compared with the antennae and body and a relative high abundance in the pheromone gland transcriptome. AipsCSP8 shows a higher expression level in the pheromone gland (10-fold higher than in body) (Figure 9) and is extremely abundant with 1,364 RPKM in the pheromone gland transcriptome (Figure 4).
There is one OBP transcript (AipsOBP6) which is highly expressed in the pheromone gland (more than 3-fold higher than in the antennae), and 3 OBPs (AipsOBP1, AipsOBP2 and AipsOBP4) are highly expressed in the antennae (Figure 10). This high expression of OBPs and CSPs in the pheromone gland is interesting because it suggests a possible involvement in carrying and releasing sex pheromones as demonstrated for the antennal OBPs and CSPs. However, the molecular mechanisms that connect these proteins with the involvement of pheromone production needs further investigation. No ORs, IRs and SNMPs are identified in the A. ipsilon pheromone gland.
Figure 9. qRT-PCR results showing the relative expression of the A. ipsilon unigenes encoding putative chemosensory proteins (CSP) identified in the pheromone gland in the male antennae (MA), the female antennae (FA), the body (BO) and the pheromone gland (PG). The standard error is represented by the error bar, and the different letters (a, b) above each bar denote significant differences (p >0.05).
Figure 10. qRT-PCR results showing the relative expression of the A. ipsilon unigenes encoding putative odorant binding proteins (OBP) identified in the pheromone gland in the male antennae (MA), the female antennae (FA), the body (BO) and the pheromone gland (PG). The standard error is represented by the error bar, and the different letters (a, b, c) above each bar denote significant differences (p > 0.05).