- Research article
- Open access
- Published:
Non-consensus GLI binding sites in Hedgehog target gene regulation
BMC Molecular Biology volume 11, Article number: 2 (2010)
Abstract
Background
The GLI transcription factors, mediators of the hedgehog signal bind with high affinity to the consensus sequence GACCACCCA. The affinity of variant single substitutions in GLI binding sites has been measured systematically, but the affinities of the variant binding sites appears low compared to the frequency of occurrence of variant sites in known GLI target gene promoters.
Results
We quantified transcriptional activation by GLI using PTCH1 promoter based luciferase reporters containing all single substitutions of the GLI consensus binding site. As expected variants with very low affinity did not activate the reporter. Many lower affinity binding sequences are, however, functional in the presence of moderate GLI concentration. Using two natural non-consensus GLI site promoters we showed that substitution of the variant sequences by consensus leads to comparable activity.
Conclusions
Variant GLI binding sites with relatively low affinity can within natural promoters lead to strong transcriptional activation. This may facilitate the identification of additional direct GLI target genes.
Background
Sequence specific binding of transcription factors in response to diverse cellular input signals is a major determinant in the regulation of transcription. Binding sequences for many factors have been identified by experiment and/or by a wealth of prediction methods (reviewed in [1]). Consensus binding sites were classically determined by SELEX experiments and verified by EMSA while more recently affinity measurements by methods better suited to moderate to large scale experimentation like microarray binding experiments have been used [2]. Experimentally determined affinities or frequencies for each base at every position of a binding site can be represented as position weight matrices or sequence logos, which can be used for prediction of new binding sites [3, 4]. It is well known that not all sequences, which a transcription factor strongly binds to in vitro will also be bound in an in vivo context [5]. Global chromatin immunoprecipitation can identify the sequences bound by a transcription factor within the cellular context but does not indicate whether the binding site is functional, i.e. whether the presence of a given TF at this site affects expression of the target gene. For this, additional information usually derived from microarray data, sequencing or promoter studies is required [6, 7].
Relative binding affinity is a good indicator of transcriptional activation or repression in an artificial system as shown for example by Kang et al for the Zif268 DNA binding domain joined to repressor or activator domains [8]. A detailed description of the quantitative relationship between affinity and activation potential in the cell is difficult since in vivo activation depends on the presence of co-factors, additional transcription factors and the epigenetic state of the chromatin. On the other hand, a single high affinity binding site in combination with a minimal promoter frequently does not produce strong target gene activation and reporter constructs therefore usually contain several repeats of consensus binding sites to enhance reporter activity. In the analysis of specific promoters attention is usually first focussed on consensus sites though the functionality of variant sites for many transcription factors has been shown in vivo and in reporter gene assays. The effect of variation in a single site on activation and specificity has extensively been investigated in E coli[9]. Within specific mammalian promoters the influence of variant sites on transcriptional activation has not been explored systematically.
The three GLI transcription factors, mediators of the hedgehog signal, comprise a DNA binding domain of five zinc fingers, which are very highly homologous in the three GLIs. Two of the five fingers are responsible for all but one of the protein-DNA base contacts [10]. The GLIs can function as activators and/or repressors and regulate target genes in a highly context specific way. The consensus binding sequence GACCACCCA was first determined by Kinzler et al [11] and many direct GLI target genes have been identified. Hallikas et al [12] determined the affinities of all single base substitutions in the GLI consensus binding sequence using a fusion of luciferase with the GLI-DNA binding domain in an in vitro assay. These data together with information on species conservation were used in the novel EEL prediction program to identify GLI regulated genes within the mouse and human genome. These predictions were successful in identifying new target genes though some known target genes were not represented in the original version. This emphasizes the need to characterise in more detail the relationship between affinity and functionality of GLI binding sites in functional assays.
We therefore set out to investigate the activity of all single site variants of the consensus GLI binding site in a luciferase assay. Frequently GLI transcriptional activity is measured in an artificial construct containing multiple copies of the consensus site. Here we use a construct based on the PTCH1 promoter, which is functional in many different cell types and should approximate a "normal" control of gene expression. Using relatively low GLI concentration to enhance specificity we found that a rather large number of variant GLI binding sites was able to activate transcription within the PTCH1 promoter. We then proceeded to turn variant binding sites into consensus within two unrelated natural promoters containing essential non-consensus GLI binding sites and found that activity was not significantly enhanced.
Results and Discussion
A PTCH1 reporter system to measure the functionality of variant GLI binding sites
The hedgehog receptor PTCH1 is a well characterised direct GLI target gene and its elevated expression is indicative of Hh pathway activation. PTCH1 expression is driven from several alternative transcription start sites [13]. The PTCH1 promoter region upstream of exon 1B (Figure 1A) has been shown to contain a GLI consensus site (BS2, -704) [14] essential for activation by GLI. We localised a second GLI binding site (BS1, GACCTCCCA) with a single substitution compared to consensus upstream of BS2 at -1033. The presence of BS1 only is not sufficient for promoter activation by GLI in a luciferase assay, but it enhances transcriptional activation in the presence of BS2 (Figure 1C). We chose to use the essential BS2 site in the PTCH1 promoter to investigate the influence of all 27 possible single base substitutions in the consensus sequence on transcriptional activation. To facilitate the exchange of consensus by the variant binding site we replaced the consensus site with a linker sequence permitting the test sequence to be quickly inserted into the PTCH1 luciferase reporter construct (PTCH1_VAR) (Figure 1A). Together with the variant sequence, a HindIII site was inserted to allow fast identification of plasmids containing the variant sequence. The base C in position 14 relative to the start of the consensus sequence has previously been shown to positively affect GLI binding affinity [12, 15] and is included in the construct as part of the HindIII site.
We then tested the functionality of the luciferase reporter system by comparing the ability of GLI2act to activate the reporter constructs containing the linker with the consensus sequence (PTCH1_VAR_(cons)) to the unmodified PTCH1 promoter luciferase reporter construct (PTCH1_WT) (Figure 1B). All results presented here were obtained with GLI2act, which is a strong activator. When GLI1 was used comparable results were obtained though activity was lower (data not shown, CS unpublished). As shown in Figure 1B both wild type (PTCH1_WT) and modified PTCH1 promoter construct (PTCH1_VAR(Cons)) were strongly induced in response to GLI2act with only slightly lower activation for the modified PTCH1 promoter (Figure 1B). As expected, the inactive variant 6G7G (GACCAGG CA) (Figure 1B) in PTCH1_WT as well as in PTCH1_VAR resulted in strongly reduced reporter activity. No activation was observed with PTCH1_VAR, with no inserted sequence. Thus, the modified PTCH1 reporter system is functional and can be used to systematically measure the effect of variation in GLI binding sequence on GLI target gene activation.
The effect of GLI binding site variants on PTCH1 promoter activation
To determine GLI activity for all single site variants of the 9 bp consensus binding sequence we co-transfected each PTCH1_VAR luciferase reporter together with GLI2act into HaCaT cells (Figure 2A). As a negative control we used PTCH1_VAR(6G7G) (Figure 1B). To exploit the dynamic range of the reporter system, all assays were performed under optimal transcriptional activation conditions using moderate GLI2act levels. The boxplot (Figure 2A) shows the range of activities measured at each position, statistical significance compared to negative control and to consensus is shown in Figure 2B. At first view it is striking that many sequence variants result in reporter activation similar to the consensus GLI binding site. Especially in position 5 there is no significant difference in the transcriptional activities between consensus and any non-consensus bases (Figure 2A, B). In contrast, any substitution in position 4 or 6 leads to loss of activity, consistent with affinity measurements showing complete loss of GLI binding if these critical positions are altered (CS unpublished). There are several positions where the identity of the substituted base shows a pronounced effect on transcriptional activation: in position 7 (C in consensus), G and T do not lead to reporter gene transcription while A reproducibly equals or even appears to exceed the level of activation by consensus. A number of variants results in activities intermediate between consensus and background. Taking into account the variability inherent in biological replicates it is not possible to attach significance to relatively small differences in activity. To exclude the possibility that the linker sequence, which surrounds the binding site differentially affects the activation of the various reporter constructs, we also tested a small number of binding site variants directly within the unmodified PTCH1 promoter construct by introducing site-specific mutations (Figure 3). No major discrepancies were observed, suggesting in summary that many variant GLI binding sites are functional and can substitute for the consensus. We then compared the transcriptional activation (Figure 2A) to the affinity profile described by Hallikas et al [12] and found that a large number of substitutions, which have quite low affinity significantly activate the luciferase reporter. This may be due to the fact that the nonlinear normalization applied to the raw data very strongly emphasizes the consensus site [16]. Conventional competitive EMSA measurements on selected binding sequences with linear normalization showed several single substitutions with Kd values within a factor of 10 of the consensus (CS, unpublished), which are compatible with the results of the luciferase reporter activity found. This is also consistent with the existence of many single and several double substitutions in the GLI consensus sequence of promoters with known GLI dependent function in vivo (Table 1), which failed to be retrieved in genome-wide in silico searches for GLI target genes [12] e.g. BCL2, IL1R2, FST, TGM3 (Table 1).
Though not perfectly representing the context of chromatin, luciferase reporter assays can be used to distinguish between potentially functional GLI binding sites and apparent binding sites, which do not activate reporter gene activity within their sequence context. This can be demonstrated clearly for the TGM3 promoter, which contains three potential GLI binding sites: one consensus sequence and two variants with a C to A substitution in position 7 (7A) (Figure 4). Mutation of the consensus sequence to nonbinding 6G7G does not affect reporter activity nor is the consensus site bound by GLI in a ChIP experiment (Figure 4B, C). In contrast, the variant sites are bound by GLI and mutation of either variant site abolishes transcriptional activation (Figure 4B, C).
Non-consensus GLI binding sites in GLI target gene promoters
To further explore the influence of binding affinity on transcriptional activation in a natural promoter context other than PTCH1 we chose the JUN and GLI1 promoters, both containing functional non-consensus GLI binding sites, for further analysis
The human JUN (JUNpromWT2G5C) [17] and human proximal GLI1 promoter (GLI1prom WT9G) [18] both contain only one functional GLI binding site thus eliminating possible interactions between nearby GLI binding sites (Figure 5A, B). Either binding site variant has been shown to be essential for activation by GLI and both have significantly lower affinity than the consensus site (9G coefficient according to Hallikas et al binding profile 0.004 for GLI1 (0.982 for consensus) and GLI3 (0.937 for consensus), 0.000 for GLI2 (0.982 for consensus) (see Table S1 in [12]), double substitutions as found in the JUN promoter were not tested under identical conditions, [12, 17]. To compare the activity of the variants to the consensus, we applied site directed mutagenesis to change the wild type variant sites to the consensus sequence (GLI1promCons, JUNpromCons) (Figure 5). The luciferase reporter constructs (GLI1promCons, JUNpromCons) were then tested for the response to GLI2act in HaCaT cells and luciferase activity was compared to the respective wild type promoter constructs (GLI1promWT9G, JUNpromWT2G5C). We detected no significant difference between GLI consensus and non consensus wild type sequences in the context of either promoter. These results indicate that relatively low binding site affinity does not prevent activation by GLI in a luciferase assay at optimal GLI concentration.
Recent observations show that lower affinity binding sites for transcription factors can be identified by global ChIP [7] and occur quite frequently. Large scale affinity measurements as described in [2] showed that a large selection of transcription factors recognises many variations of the primary motifs and that even secondary motifs exist, which may possibly affect changes in transcriptional specificity. A visible influence of low affinity sites on gene expression in yeast has been described pointing to their potential relevance for modulating gene expression [19]. Vokes et al [15] identified a number of GLI promoters/enhancers, which behave in a tissue specific way and are influenced by nuclear GLI concentration. In a more global study, groups of sites with high and lower affinities to REST repressor were shown to cluster into groups responsible for activation of target genes expressed commonly, specifically or uniquely in different cell lines [20]. These data imply an important role for lower affinity sites in the context dependent control of transcription and point to the need for more detailed investigation of their function.
Conclusion
The results presented here specifically focus on the activation potential of binding sites of the GLI transcription factors, the mediators of the hedgehog signal. We measured activation in a standardised luciferase assay in the context of the PTCH1 promoter testing all single site mutations of the GLI consensus binding sequence. A rather large number of substitutions was shown to be active, which is consistent with the existence of many known GLI target gene promoters containing variant sites with lower binding affinity. Taking into account the contribution of a larger subset of binding sites with significant affinity the results presented in this study are likely to be helpful in the prediction and experimental validation of more direct GLI target genes.
Methods
Cloning
Numbering of base positions was according to [14] for the PTCH1 promoter, to [17] for the JUN promoter and to [18] for the GLI1 promoter. The GLI consensus site orientation used is 5'GACCACCCA3' [11]. The wild type PTCH1 promoter (-1022 to +211) was amplified from BAC #RP11/43505 (obtained from Children's Hospital Oakland Research Institute (CHORI)) and cloned into the NheI and BglII sites of pGL3 basic vector (Promega, Madison, USA). For the PTCH1_VAR construct GLI binding site BS2 (-704) was replaced with a 29 bp linker sequence containing the restriction sites NsiI and XhoI. Oligonucleotides representing all variant GLI binding sites and including a HindIII restriction site for quick screening of positive clones were inserted into the pGL3_PTCH1_linker construct. (Figure 1A). To mutate GLI binding sites in wild type promoters we used QuickChange site-directed mutagenesis kit (Stratagene, La Jolla, USA) according to the manufacturer's protocol and verified changes by sequencing. For primers and oligos see Table 2.
Cell culture, transfection and luciferase reporter assays
HaCaT cells and GLI2actHaCaT [21] were cultured in Dulbecco's modified Eagle medium (high glucose, PAA, Pasching, Austria) with 10% fetal calf serum (PAA, Pasching, Austria) supplemented with streptomycin/penicillin (Pen/Strep100x stock solution, PAA, Pasching, Austria) at 37°C, 5% CO2. Cells were grown to 80% confluence in 24-well plates and transfected in triplicate with the pGL3 luciferase reporter plasmids, the GLI2act expression (80 ng/transaction sample) construct [21] or pcDNA4/TO as negative control using Superfect Transfection reagent (Quiagen Inc., Valencia, CA). LacZ expression plasmid (400 ng/transfection sample) was used for normalization of transfection efficiency. Cells were harvested 48 h after transfection, and luciferase activity measured with a LucyII luminometer (Anthos Labtec, Cambridge, UK) using Luciferase Assay Substrate (Promega, Madison, USA).
Chromatin immunoprecipitation
ChIP from GLI2actHaCaT was done as described in [22]. Antibodies used were: polyclonal goat-anti-GLI2 (GLI2-N20) (Santa Cruz Biotechnology) for specific precipitation and species matched normal IgGs (Santa Cruz Biotechnology) for unspecific control. PCR primer sequences are listed in Table 2.
Abbreviations
- EMSA:
-
Electrophoretic Mobility Shift Assay
- TSS:
-
Transcription Start Site
- SELEX:
-
Systematic Evolution of Ligands by Exponential Enrichment
- TF:
-
Transcription Factor
- ChIP:
-
Chromatin immunoprecipitation
- Hh:
-
Hedgehog
- RLU:
-
Relative Light Unit.
References
Bulyk ML: Computational prediction of transcription-factor binding site locations. Genome Biol. 2003, 5: 201- 10.1186/gb-2003-5-1-201
Badis G, Berger MF, Philippakis AA, Talukder S, Gehrke AR, Jaeger SA, Chan ET, Metzler G, Vedenko A, Chen X, et al: Diversity and complexity in DNA recognition by transcription factors. Science. 2009, 324: 1720-1723. 10.1126/science.1162327
Schneider TD, Stephens RM: Sequence logos: a new way to display consensus sequences. Nucleic Acids Res. 1990, 18: 6097-6100. 10.1093/nar/18.20.6097
Stormo GD: DNA binding sites: representation and discovery. Bioinformatics. 2000, 16: 16-23. 10.1093/bioinformatics/16.1.16
Liu X, Lee CK, Granek JA, Clarke ND, Lieb JD: Whole-genome comparison of Leu3 binding in vitro and in vivo reveals the importance of nucleosome occupancy in target site selection. Genome Res. 2006, 16: 1517-1528. 10.1101/gr.5655606
Barrera LO, Ren B: The transcriptional regulatory code of eukaryotic cells--insights from genome-wide analysis of chromatin organization and transcription factor binding. Curr Opin Cell Biol. 2006, 18: 291-298. 10.1016/j.ceb.2006.04.002
Birney E, Stamatoyannopoulos JA, Dutta A, Guigo R, Gingeras TR, Margulies EH, Weng Z, Snyder M, Dermitzakis ET, Thurman RE, et al: Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature. 2007, 447: 799-816. 10.1038/nature05874
Kang JS: Correlation between functional and binding activities of designer zinc-finger proteins. Biochem J. 2007, 403: 177-182. 10.1042/BJ20061644
Redfield RJ, Cameron AD, Qian Q, Hinds J, Ali TR, Kroll JS, Langford PR: A novel CRP-dependent regulon controls expression of competence genes in Haemophilus influenzae. J Mol Biol. 2005, 347: 735-747. 10.1016/j.jmb.2005.01.012
Pavletich NP, Pabo CO: Crystal structure of a five-finger GLI-DNA complex: new perspectives on zinc fingers. Science. 1993, 261: 1701-1707. 10.1126/science.8378770
Kinzler KW, Vogelstein B: The GLI gene encodes a nuclear protein which binds specific sequences in the human genome. Mol Cell Biol. 1990, 10: 634-642.
Hallikas O, Palin K, Sinjushina N, Rautiainen R, Partanen J, Ukkonen E, Taipale J: Genome-wide prediction of mammalian enhancers based on analysis of transcription-factor binding affinity. Cell. 2006, 124: 47-59. 10.1016/j.cell.2005.10.042
Kogerman P, Krause D, Rahnama F, Kogerman L, Unden AB, Zaphiropoulos PG, Toftgard R: Alternative first exons of PTCH1 are differentially regulated in vivo and may confer different functions to the PTCH1 protein. Oncogene. 2002, 21: 6007-6016. 10.1038/sj.onc.1205865
Agren M, Kogerman P, Kleman MI, Wessling M, Toftgard R: Expression of the PTCH1 tumor suppressor gene is regulated by alternative promoters and a single functional Gli-binding site. Gene. 2004, 330: 101-114. 10.1016/j.gene.2004.01.010
Vokes SA, Ji H, McCuine S, Tenzen T, Giles S, Zhong S, Longabaugh WJ, Davidson EH, Wong WH, McMahon AP: Genomic characterization of Gli-activator targets in sonic hedgehog-mediated neural patterning. Development. 2007, 134: 1977-1989. 10.1242/dev.001966
Hallikas O, Taipale J: High-throughput assay for determining specificity and affinity of protein-DNA binding interactions. Nat Protoc. 2006, 1: 215-222. 10.1038/nprot.2006.33
Laner-Plamberger S, Kaser A, Paulischta M, Hauser-Kronberger C, Eichberger T, Frischauf AM: Cooperation between GLI and JUN enhances transcription of JUN and selected GLI target genes. Oncogene. 2009, 28: 1639-1651. 10.1038/onc.2009.10
Ikram MS, Neill GW, Regl G, Eichberger T, Frischauf AM, Aberger F, Quinn A, Philpott M: GLI2 is expressed in normal human epidermis and BCC and induces GLI1 expression by binding to its promoter. J Invest Dermatol. 2004, 122: 1503-1509. 10.1111/j.0022-202X.2004.22612.x
Tanay A: Extensive low-affinity transcriptional interactions in the yeast genome. Genome Res. 2006, 16: 962-972. 10.1101/gr.5113606
Bruce AW, Lopez-Contreras AJ, Flicek P, Down TA, Dhami P, Dillon SC, Koch CM, Langford CF, Dunham I, Andrews RM, et al: Functional diversity for REST (NRSF) is defined by in vivo binding affinity hierarchies at the DNA sequence level. Genome Res. 2009, 19: 994-1005. 10.1101/gr.089086.108
Regl G, Kasper M, Schnidar H, Eichberger T, Neill GW, Philpott MP, Esterbauer H, Hauser-Kronberger C, Frischauf AM, Aberger F: Activation of the BCL2 promoter in response to Hedgehog/GLI signal transduction is predominantly mediated by GLI2. Cancer Res. 2004, 64: 7724-7731. 10.1158/0008-5472.CAN-04-1085
Eichberger T, Kaser A, Pixner C, Schmid C, Klingler S, Winklmayr M, Hauser-Kronberger C, Aberger F, Frischauf AM: GLI2-specific transcriptional activation of the bone morphogenetic protein/activin antagonist follistatin in human epidermal cells. J Biol Chem. 2008, 283: 12426-12437. 10.1074/jbc.M707117200
Kasper M, Schnidar H, Neill GW, Hanneder M, Klingler S, Blaas L, Schmid C, Hauser-Kronberger C, Regl G, Philpott MP, et al: Selective modulation of Hedgehog/GLI target gene expression by epidermal growth factor signaling in human keratinocytes. Mol Cell Biol. 2006, 26: 6283-6298. 10.1128/MCB.02317-05
Zhao M, Qiao M, Harris SE, Chen D, Oyajobi BO, Mundy GR: The zinc finger transcription factor Gli2 mediates bone morphogenetic protein 2 expression in osteoblasts in response to hedgehog signaling. Mol Cell Biol. 2006, 26: 6197-6208. 10.1128/MCB.02214-05
Sasaki H, Hui C, Nakafuku M, Kondoh H: A binding site for Gli proteins is essential for HNF-3beta floor plate enhancer activity in transgenics and can respond to Shh in vitro. Development. 1997, 124: 1313-1322.
Saitsu H, Komada M, Suzuki M, Nakayama R, Motoyama J, Shiota K, Ishibashi M: Expression of the mouse Fgf15 gene is directly initiated by Sonic hedgehog signaling in the diencephalon and midbrain. Dev Dyn. 2005, 232: 282-292. 10.1002/dvdy.20236
Komada M, Saitsu H, Shiota K, Ishibashi M: Expression of Fgf15 is regulated by both activator and repressor forms of Gli2 in vitro. Biochem Biophys Res Commun. 2008, 369: 350-356. 10.1016/j.bbrc.2008.02.015
Solecki DJ, Gromeier M, Mueller S, Bernhardt G, Wimmer E: Expression of the human poliovirus receptor/CD155 gene is activated by sonic hedgehog. J Biol Chem. 2002, 277: 25697-25702. 10.1074/jbc.M201378200
Acknowledgements
We thank Stefan Wegenkittl (University of Applied Sciences, Salzburg) for advice with the statistical evaluation of the results. This work was supported by the Austrian Genome Project GENAU "Ultra-sensitive Proteomics and Genomics II" to AMF and FA, FWF Project 16518-B14 to FA, and the University of Salzburg priority program "Biosciences and Health".
Author information
Authors and Affiliations
Corresponding author
Additional information
Authors' contributions
MW and CS participated in design of the experiments and reporter constructs, carried out and evaluated luciferase assays except experiments related to TGM3, which were done by SLP, AK carried out ChIP, FA contributed to the design of the project and the evaluation of the results, TE participated in the writing of the manuscript and the evaluation of the results, AMF contributed to the design of the project, the evaluation of the results and the writing of the manuscript. All authors read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Rights and permissions
Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Winklmayr, M., Schmid, C., Laner-Plamberger, S. et al. Non-consensus GLI binding sites in Hedgehog target gene regulation. BMC Molecular Biol 11, 2 (2010). https://doi.org/10.1186/1471-2199-11-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/1471-2199-11-2