BMC Evolutionary Biology

official impact factor 3.70

Open Access Research article

Characterization of the tandem CWCH2 sequence motif: a hallmark of inter-zinc finger interactions

Minoru Hatayama1 and Jun Aruga1,2*

Author Affiliations

1 Laboratory for Behavioral and Developmental Disorders, RIKEN Brain Science Institute, Wako-shi, Saitama 351-0198, Japan

2 Saitama University Brain Science Institute, Saitama-shi, Saitama 338-8570, Japan

For all author emails, please log on.

BMC Evolutionary Biology 2010, 10:53 doi:10.1186/1471-2148-10-53

Published: 19 February 2010

Additional files

Additional file 1:

Neighboring residues of the tryptophans in tCWCH2 3D structure. (A) Distances of the tCWCH2-forming residues from the tCWCH2 ZF1 tryptophan (left) and the tCWCH2 ZF2 tryptophan (right). Black and red letters indicate residues in the ZF1 and the ZF2 respectively. Gothic letters, zinc chelating residue; *, ϕ1 and ϕ2 in Figure 5 and 6. Note that both tryptophan residues were close to the residues in the other C2H2 ZF unit. (B) Stereo view of the ZIC3, GLI1, and Zap1 tryptophan side chain adjacent residues within 5 Å. Tryptophan residues, zinc chelating histidine, ϕ1, and ϕ2 residues are labeled. For ZIC3 and Zap1, 20 NMR models in wire model are superimposed. For GLI1, cylinder model are shown. White, carbon; red, oxygen; blue, nitrogen; yellow; sulfur.

Format: PDF Size: 1.1MB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 2:

List of tandem CWCH2-containing genes. Gene ID of NCBI or Ensembl http://www.ensembl.org/ webcite or JGI Lottia gigantea genome http://genome.jgi-psf.org/Lotgi1/Lotgi1.home.html webcite database and organism names that we collected are listed and categorized according to gene group. The EDV29271 Trichoplax adhaerens Zic amino acid sequence was manually added from the genome database, and indicates as "EDV29271+edit". The inset in right bottom indicates the numbers of non-redundant sequences in each class.

Format: PDF Size: 63KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 3:

PHI-BLAST pattern validity. Percentage occurrence of the PHI-BLAST pattern in the tCWCH2 library (A). Most frequent amino acid sequence for tCWCH2 in the PHI-BLAST result. A loose search pattern increases the number of BLAST hits, and increases the percentages of recovery of library (B). The tCWCH2 sequence motif collection contains 551 sequences from the NCBI protein database and 36 sequences from the NCBI nucleotide or other databases.

Format: PDF Size: 51KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 4:

The sequence of the conserved domains of Arid2 and Rsc9. The ARID domain (A) and the ZF domain (B) are conserved in Arid2 and Rsc9. Core amino acid residues of the ARID domain are indicated by asterisks (*) in (A). The structure of the tCWCH2 motifs and flanking sequence are shown in (B). Conserved sequence [IxL (S/T) AxL (I/V) L (K/R) N (I/L) x(K/R)] are indicated by asterisks (*). Hs, Homo sapiens; Dr, Danio rerio; Bm, Brugia malayi; Dm, Drosophila melanogaster; Nc Neurospora crassa; Spo, Schizosaccharomyces pombe.

Format: PDF Size: 23KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 5:

Zinc finger domains of Mizf. We found novel ZFs in Mizf. A pseudo ZF structure was found between ZF4 and ZF5 (indicated as ZF?). Hs, Homo sapiens; Dr, Danio rerio; Ta, Trichoplax adhaerens; Ci, Ciona intestinalis; Dm, Drosophila melanogaster; Ce, Caenorhabditis elegans.

Format: PDF Size: 70KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 6:

Sequence alignment of Zap1 and ZafA. Zinc homeostasis genes Zap1 and ZafA have structural similarity. Conservation of ZF8 was weak. Um, Ustilago maydis; Cn, Cryptococcus neoformans; Afu, Aspergillus fumigatus; Mg, Magnaporthe grisea; Yl, Yarrowia lipolytica; Ca, Candida albicans; Sc, Saccharomyces cerevisiae.

Format: PDF Size: 66KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 7:

Sequence alignment of zinc finger domain of Fungl, human GLI3, and Fungl-4ZF genes. We propose the common gene name, Fungl: Fungus Gli like. Afu, Aspergillus fumigatus; Nc, Neurospora crassa; Yl, Yarrowia lipolytica; Mgl, Malasszia globosa; Um, Ustilago maydis; Ro, Rhizopus oryzae; Ecu, Encephalitozoon cuniculi; Ca, Candida albicans; Lbi, Laccaria bicolor; Eb, Enterocytozoon bieneusi; Alo, Antonospora locustae; Cn, Cryptococcus neoformans; Hs, Homo sapiens; Bde, Batrachochytrium dendrobatidis.

Format: PDF Size: 60KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 8:

Sequence alignment of Twincl zinc finger domain. Based on its structural features, we named this gene family Twincl (Tandem CWCH2 protein C-terminal). Afu, Aspergillus fumigatus; Nf, Neosartorya fischeri; Cim, Coccidioides immitis; Pn, Phaeosphaeria nodorum; Gz, Gibberella zeae; Mgr, Magnaporthe grisea; Nc, Neurospora crassa.

Format: PDF Size: 41KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 9:

Sequence alignment of Clr1 zinc finger domain. Spo, Schizosaccharomyces pombe; Sj, Schizosaccharomyces japonicus; Cn, Cryptococcus neoformans.

Format: PDF Size: 40KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 10:

Sequence alignment of Monosiga, Dictyostelium, and plant gene. (A) Zinc finger domain alignment of Dictyostelium discoideum hypothetical protein (DdHp) and Monosiga brevicollis tCWCH2. DdHp2 contains two tCWCH2 domains {ZF12 (ZF1-ZF2) and ZF34 (ZF3-ZF4)}. (B) Plant REF6 Physcomitrella patens (XP_001771543) gene has tCWCH2 sequence. It has strong homology to Arabidopsis REF6 (NP_680116), Oryza sativa gene (NP_001045137) and Vitis vinifera (CAO64178) gene, but the tryptophan residue is not conserved in Arabidopsis and other plants. An alignment of zinc finger domain of these genes is shown. Amino acid identity between sequences is shown by a dot (.).

Format: PDF Size: 58KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 11:

Sequence alignment for phylogenic tree analysis. Conserved cysteine and histidine residues are indicated with an asterisk (*). Afr, Artemia franciscana; Afu, Aspergillus fumigatus; Aga, Anopheles gambiae; Alo, Antonospora locustae; Ape, Asterina pectinifera; Bde, Batrachochytrium dendrobatidis; Bfl, Branchiostoma floridae; Cin, Ciona intestinalis; Cn, Cryptococcus neoformans; Co, Corbicula sp.; Da, Dicyema acuticephalum; Dd, Dictyostelium discoideum; Dj, Dugesia japonica; Dm, Drosophila melanogaster; Hr, Halocynthia roretzi; Hs, Homo sapiens; Hv, Hydra vulgaris; Lbl, Loligo bleekeri; Mms, Mus musculus; Nf, Neosartorya fischeri; Nve, Nematostella vectensis; Oo, Octopus ocellatus; Pi, Pandinus imperator; Ro, Rhizopus oryzae; Sc, Saccharomyces cerevisiae; Sm, Schistosoma mansoni; Sp, Strongylocentrotus purpuratus; Sso, Spisula solidissima; Ssu, Scolionema suvaense; Ta, Trichoplax adhaerens; Tt, Tubifex tubifex; Um, Ustilago maydis; Xl, Xenopus laevis; Yl, Yarrowia lipolytica.

Format: PDF Size: 38KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 12:

BI tree of tCWCH2+1ZF. BI tree analysis was performed on 8 × 106 generations using WAG model. Average standard deviation of split frequencies was lower than 0.001 at the end of the analysis. Abbreviations are the same as those in Additional file 11.

Format: PDF Size: 20KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 13:

ML tree of tCWCH2+1ZF. ML tree analysis was performed using WAG model with "empirical base frequencies", "maximum likelihood search", and "estimate proportion of invariable sites" options of RAxML [75]. 100 replicates were set for the bootstrap analysis. Abbreviations are the same as those in Additional file 11.

Format: PDF Size: 20KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 14:

NJ tree of tCWCH2+1ZF. NJ tree analysis was performed using JTT model and "pair wise deletion", "Rate among site = different (gamma distribution = 0.62)" options of MEGA4 (http://www.megasoftware.net/ webcite, [73,74]). The gamma distributions alpha parameter was calculated by "Tree-Puzzle" (http://www.tree-puzzle.de/ webcite, [72]). 1000 replicates were set for the bootstrap analysis. Abbreviations are the same as those in Additional file 11.

Format: PDF Size: 20KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data