A zinc finger is a small protein structural motif that is characterized by the coordination of one or more zinc ions in order to stabilize the fold. Originally coined to describe the finger-like appearance of a hypothesized structure from Xenopus laevis transcription factor IIIA, the zinc finger name has now come to encompass a wide variety of differing protein structures. Xenopus laevis TFIIIA was originally demonstrated to contain zinc and require the metal for function in 1983, the first such reported zinc requirement for a gene regulatory protein.
Proteins that contain zinc fingers (zinc finger proteins) are classified into several different structural families. Unlike many other clearly defined supersecondary structures such as Greek keys or β hairpins, there are a number of unique types of zinc fingers, each with a unique three-dimensional architecture. A particular zinc finger protein's class is determined by this three-dimensional structure, but it can also be recognized based on the primary structure of the protein or the identity of the ligands coordinating the zinc ion. In spite of the large variety of these proteins, however, the vast majority typically function as interaction modules that bind DNA, RNA, proteins, or other small, useful molecules, and variations in structure serve primarily to alter the binding specificity of a particular protein.
Since their original discovery and the elucidation of their structure, these interaction modules have proven ubiquitous in the biological world. In addition, zinc fingers have become extremely useful in various therapeutic and research capacities. Engineering zinc fingers to have an affinity for a specific sequence is an area of active research, and zinc finger nucleases and zinc finger transcription factors are two of the most important applications of this to be realized to date.
Zinc fingers were first identified in a study of transcription in the African clawed frog, Xenopus laevis. A study of the transcription of a particular RNA sequence revealed that the binding strength of a small transcription factor (transcription factor IIIA; TFIIIA) was due to the presence of zinc-coordinating finger-like structures. Amino acid sequencing of TFIIIA revealed nine tandem sequences of 30 amino acids, including two invariant pairs of cysteine and histidine residues. Extended x-ray absorption fine structure confirmed the identity of the zinc ligands: two cysteines and two histidines. The DNA-binding loop formed by the coordination of these ligands by zinc were thought to resemble fingers, hence the name. More recent work in the characterization of proteins in various organisms has revealed the importance of zinc ions in polypeptide stabilization.
Zinc finger (Znf) domains are relatively small protein motifs that contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not, instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein, and/or lipid substrates. Their binding properties depend on the amino acid sequence of the finger domains and on the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g., some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organization, epithelial development, cell adhesion, protein folding, chromatin remodeling, and zinc sensing, to name but a few. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.
Initially, the term zinc finger was used solely to describe DNA-binding motif found in Xenopus laevis, however it is now used to refer to any number of structures related by their coordination of a zinc ion. In general, zinc fingers coordinate zinc ions with a combination of cysteine and histidine residues. Originally, the number and order of these residues was used to classify different types of zinc fingers ( e.g., Cys2His2, Cys4, and Cys6). More recently, a more systematic method has been used to classify zinc finger proteins instead. This method classifies zinc finger proteins into "fold groups" based on the overall shape of the protein backbone in the folded domain. The most common "fold groups" of zinc fingers are the Cys2His2-like (the "classic zinc finger"), treble clef, and zinc ribbon.
The following table shows the different structures and their key features:
|Fold Group||Representative structure||Ligand placement|
|Cys2His2||150px||Two ligands form a knuckle and two more form the c terminus of a helix.|
|Gag knuckle||150px||Two ligands form a knuckle and two more form a short helix or loop.|
|Treble clef||Two ligands form a knuckle and two more form the N terminus of a helix.|
|Zinc ribbon||150px||Two ligands each form two knuckles.|
|Zn2/Cys6||150px||Two ligands form the N terminus of a helix and two more form a loop.|
|TAZ2 domain like||Two ligands form the termini of two helices.|
|Zinc finger, C2H2 type|
The Cys2His2-like fold group is by far the best-characterized class of zinc fingers and are extremely common in mammalian transcription factors. These domains adopt a simple ββα fold and have the amino acid Sequence motif:
This class of zinc fingers can have a variety of functions such as binding RNA and mediating protein-protein interactions, but is best known for its role in sequence-specific DNA-binding proteins such as Zif268 (Egr1). In such proteins, individual zinc finger domains typically occur as tandem repeats with two, three, or more fingers comprising the DNA-binding domain of the protein. These tandem arrays can bind in the major groove of DNA and are typically spaced at 3-bp intervals. The α-helix of each domain (often called the "recognition helix") can make sequence-specific contacts to DNA bases; residues from a single recognition helix can contact 4 or more bases to yield an overlapping pattern of contacts with adjacent zinc fingers.
This fold group is defined by two short β-strands connected by a turn (zinc knuckle) followed by a short helix or loop and resembles the classical Cys2His2 motif with a large portion of the helix and β-hairpin truncated.
The retroviral nucleocapsid (NC) protein from HIV and other related retroviruses are examples of proteins possessing these motifs. The gag-knuckle zinc finger in the HIV NC protein is the target of a class of drugs known as zinc finger inhibitors.
The treble-clef motif consists of a β-hairpin at the N-terminus and an α-helix at the C-terminus that each contribute two ligands for zinc binding, although a loop and a second β-hairpin of varying length and conformation can be present between the N-terminal β-hairpin and the C-terminal α-helix. These fingers are present in a diverse group of proteins that frequently do not share sequence or functional similarity with each other. The best-characterized proteins containing treble-clef zinc fingers are the nuclear hormone receptors.
The zinc ribbon fold is characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites.
|Fungal Zn(2)-Cys(6) binuclear cluster domain|
The canonical members of this class contain a binuclear zinc cluster in which two zinc ions are bound by six cysteine residues. These zinc fingers can be found in several transcription factors including the yeast Gal4 protein.
File:PDB 1pxe EBI.jpg|
solution structure of a cchhc domain of neural zinc finger factor-1
File:PDB 1pxe EBI.jpg|
solution structure of a cchhc domain of neural zinc finger factor-1
Various protein engineering techniques can be used to alter the DNA-binding specificity of zinc fingers and tandem repeats of such engineered zinc fingers can be used to target desired genomic DNA sequences. Fusing a second protein domain such as a transcriptional activator or repressor to an array of engineered zinc fingers that bind near the promoter of a given gene can be used to alter the transcription of that gene. Fusions between engineered zinc finger arrays and protein domains that cleave or otherwise modify DNA can also be used to target those activities to desired genomic loci. The most common applications for engineered zinc finger arrays include zinc finger transcription factors and zinc finger nucleases, but other applications have also been described. Typical engineered zinc finger arrays have between 3 and 6 individual zinc finger motifs and bind target sites ranging from 9 basepairs to 18 basepairs in length. Arrays with 6 zinc finger motifs are particularly attractive because they bind a target site that is long enough to have a good chance of being unique in a mammalian genome.
Zinc finger nucleases
Engineered zinc finger arrays are often fused to a DNA cleavage domain (usually the cleavage domain of FokI) to generate zinc finger nucleases. Such zinc finger-FokI fusions have become useful reagents for manipulating genomes of many higher organisms including Drosophila melanogaster, Caenorhabditis elegans, tobacco, corn, zebrafish, various types of mammalian cells, and rats. Targeting a double-strand break to a desired genomic locus can be used to introduce frame-shift mutations into the coding sequence of a gene due to the error-prone nature of the non-homologous DNA repair pathway. If a homologous DNA "donor sequence" is also used then the genomic locus can be converted to a defined sequence via the homology directed repair pathway. An ongoing clinical trial is evaluating Zinc finger nucleases that disrupt the CCR5 gene in CD4+ human T-cells as a potential treatment for HIV/AIDS.
Methods of engineering zinc finger arrays
The majority of engineered zinc finger arrays are based on the zinc finger domain of the murine transcription factor Zif268, although some groups have used zinc finger arrays based on the human transcription factor SP1. Zif268 has three individual zinc finger motifs that collectively bind a 9 bp sequence with high affinity. The structure of this protein bound to DNA was solved in 1991 and stimulated a great deal of research into engineered zinc finger arrays. In 1994 and 1995, a number of groups used phage display to alter the specificity of a single zinc finger of Zif268. There are two main methods currently used to generate engineered zinc finger arrays, modular assembly, and a bacterial selection system, and there is some debate about which method is best suited for most applications.
The most straightforward method to generate new zinc finger arrays is to combine smaller zinc finger "modules" of known specificity. The structure of the zinc finger protein Zif268 bound to DNA described by Pavletich and Pabo in their 1991 publication has been key to much of this work and describes the concept of obtaining fingers for each of the 64 possible base pair triplets and then mixing and matching these fingers to design proteins with any desired sequence specificity. The most common modular assembly process involves combining separate zinc fingers that can each recognize a 3-basepair DNA sequence to generate 3-finger, 4-, 5-, or 6-finger arrays that recognize target sites ranging from 9 basepairs to 18 basepairs in length. Another method uses 2-finger modules to generate zinc finger arrays with up to six individual zinc fingers. The Barbas Laboratory of The Scripps Research Institute used phage display to develop and characterize zinc finger domains that recognize most DNA triplet sequences while another group isolated and characterized individual fingers from the human genome. A potential drawback with modular assembly in general is that specificities of individual zinc finger can overlap and can depend on the context of the surrounding zinc fingers and DNA. A recent study demonstrated that a high proportion of 3-finger zinc finger arrays generated by modular assembly fail to bind their intended target with sufficient affinity in a bacterial two-hybrid assay and fail to function as zinc finger nucleases, but the success rate was somewhat higher when sites of the form GNNGNNGNN were targeted.
A subsequent study used modular assembly to generate zinc finger nucleases with both 3-finger arrays and 4-finger arrays and observed a much higher success rate with 4-finger arrays. A variant of modular assembly that takes the context of neighboring fingers into account has also been reported and this method tends to yield proteins with improved performance relative to standard modular assembly.
Numerous selection methods have been used to generate zinc finger arrays capable of targeting desired sequences. Initial selection efforts utilized phage display to select proteins that bound a given DNA target from a large pool of partially randomized zinc finger arrays. This technique is difficult to use on more than a single zinc finger at a time, so a multi-step process that generated a completely optimized 3-finger array by adding and optimizing a single zinc finger at a time was developed. More recent efforts have utilized yeast one-hybrid systems, bacterial one-hybrid and two-hybrid systems, and mammalian cells. A promising new method to select novel 3-finger zinc finger arrays utilizes a bacterial two-hybrid system and has been dubbed "OPEN" by its creators. This system combines pre-selected pools of individual zinc fingers that were each selected to bind a given triplet and then utilizes a second round of selection to obtain 3-finger arrays capable of binding a desired 9-bp sequence. This system was developed by the Zinc Finger Consortium as an alternative to commercial sources of engineered zinc finger arrays. It is somewhat difficult to directly compare the binding properties of proteins generated with this method to proteins generated by modular assembly as the specificity profiles of proteins generated by the OPEN method have never been reported.
- Suppressor of tumourigenicity protein 18 (ST18)
- Klug A, Rhodes D (1987). "Zinc fingers: a novel protein fold for nucleic acid recognition". Cold Spring Harbor Symposia on Quantitative Biology 52: 473–82. PMID 3135979. doi:10.1101/sqb.1987.052.01.054.
- Hanas, JS; Hazuda, DJ; Bogenhagen, DF; Wu, FY; Wu, CW (1983). "Xenopus transcription factor A requires zinc for binding to the 5 S RNA gene.". J Biol Chem 258 (23): 14120–14125. PMID 6196359.
- Berg, Jeremy (1990). "Zinc fingers and other metal-binding domains. Elements for interactions between macromolecules.". J Biol Chem 265 (12): 6513–6. PMID 2108957.
- Miller J, McLachlan AD, Klug A (Jun 1985). "Repetitive zinc-binding domains in the protein transcription factor IIIA from Xenopus oocytes". The EMBO Journal 4 (6): 1609–14. PMC 554390. PMID 4040853.
- Klug A (2010). "The discovery of zinc fingers and their applications in gene regulation and genome manipulation". Annual Review of Biochemistry 79: 213–31. PMID 20192761. doi:10.1146/annurev-biochem-010909-095056.
- Miller Y, Ma B, Nussinov R (May 2010). "Zinc ions promote Alzheimer Abeta aggregation via population shift of polymorphic states". Proceedings of the National Academy of Sciences of the United States of America 107 (21): 9490–5. Bibcode:2010PNAS..107.9490M. PMC 2906839. PMID 20448202. doi:10.1073/pnas.0913114107.
- Low LY, Hernández H, Robinson CV, O'Brien R, Grossmann JG, Ladbury JE et al. (May 2002). "Metal-dependent folding and stability of nuclear hormone receptor DNA-binding domains". Journal of Molecular Biology 319 (1): 87–106. PMID 12051939. doi:10.1016/S0022-2836(02)00236-X.
- Klug A (Oct 1999). "Zinc finger peptides for the regulation of gene expression". Journal of Molecular Biology 293 (2): 215–8. PMID 10529348. doi:10.1006/jmbi.1999.3007.
- Hall TM (Jun 2005). "Multiple modes of RNA recognition by zinc finger proteins". Current Opinion in Structural Biology 15 (3): 367–73. PMID 15963892. doi:10.1016/j.sbi.2005.04.004.
- Brown RS (Feb 2005). "Zinc finger proteins: getting a grip on RNA". Current Opinion in Structural Biology 15 (1): 94–8. PMID 15718139. doi:10.1016/j.sbi.2005.01.006.
- Gamsjaeger R, Liew CK, Loughlin FE, Crossley M, Mackay JP (Feb 2007). "Sticky fingers: zinc-fingers as protein-recognition motifs". Trends in Biochemical Sciences 32 (2): 63–70. PMID 17210253. doi:10.1016/j.tibs.2006.12.007.
- Matthews JM, Sunde M (Dec 2002). "Zinc fingers--folds for many occasions". IUBMB Life 54 (6): 351–5. PMID 12665246. doi:10.1080/15216540216035.
- Laity JH, Lee BM, Wright PE (Feb 2001). "Zinc finger proteins: new insights into structural and functional diversity". Current Opinion in Structural Biology 11 (1): 39–46. PMID 11179890. doi:10.1016/S0959-440X(00)00167-6.
- Krishna SS, Majumdar I, Grishin NV (Jan 2003). "Structural classification of zinc fingers: survey and summary". Nucleic Acids Research 31 (2): 532–50. PMC 140525. PMID 12527760. doi:10.1093/nar/gkg161.
- Pabo CO, Peisach E, Grant RA (2001). "Design and selection of novel Cys2His2 zinc finger proteins". Annual Review of Biochemistry 70: 313–340. PMID 11395410. doi:10.1146/annurev.biochem.70.1.313.
- Jamieson AC, Miller JC, Pabo CO (May 2003). "Drug discovery with engineered zinc-finger proteins". Nature Reviews. Drug Discovery 2 (5): 361–368. PMID 12750739. doi:10.1038/nrd1087.
- Liu Q, Segal DJ, Ghiara JB, Barbas CF (May 1997). "Design of polydactyl zinc-finger proteins for unique addressing within complex genomes". Proceedings of the National Academy of Sciences of the United States of America 94 (11): 5525–30. Bibcode:1997PNAS...94.5525L. PMC 20811. PMID 9159105. doi:10.1073/pnas.94.11.5525.
- Shukla VK, Doyon Y, Miller JC, DeKelver RC, Moehle EA, Worden SE et al. (May 2009). "Precise genome modification in the crop species Zea mays using zinc-finger nucleases". Nature 459 (7245): 437–41. Bibcode:2009Natur.459..437S. PMID 19404259. doi:10.1038/nature07992.
- Reynolds IJ, Miller RJ (Dec 1988). "[3H]MK801 binding to the N-methyl-D-aspartate receptor reveals drug interactions with the zinc and magnesium binding sites". The Journal of Pharmacology and Experimental Therapeutics 247 (3): 1025–31. PMID 2849655.
- Carroll D (Nov 2008). "Progress and prospects: zinc-finger nucleases as gene therapy agents". Gene Therapy 15 (22): 1463–8. PMC 2747807. PMID 18784746. doi:10.1038/gt.2008.145.
- Geurts AM, Cost GJ, Freyvert Y, Zeitler B, Miller JC, Choi VM et al. (Jul 2009). "Knockout rats via embryo microinjection of zinc-finger nucleases". Science 325 (5939): 433. Bibcode:2009Sci...325..433G. PMC 2831805. PMID 19628861. doi:10.1126/science.1172447.
- Tebas P, Stein D (2009). "Autologous T-Cells Genetically Modified at the CCR5 Gene by Zinc Finger Nucleases SB-728 for HIV". ClinicalTrials.gov.
- Christy B, Nathans D (Nov 1989). "DNA binding site of the growth factor-inducible protein Zif268". Proceedings of the National Academy of Sciences of the United States of America 86 (22): 8737–41. Bibcode:1989PNAS...86.8737C. PMC 298363. PMID 2510170. doi:10.1073/pnas.86.22.8737.
- Pavletich NP, Pabo CO (May 1991). "Zinc finger-DNA recognition: crystal structure of a Zif268-DNA complex at 2.1 A". Science 252 (5007): 809–17. Bibcode:1991Sci...252..809P. PMID 2028256. doi:10.1126/science.2028256.
- Rebar EJ, Pabo CO (Feb 1994). "Zinc finger phage: affinity selection of fingers with new DNA-binding specificities". Science 263 (5147): 671–3. Bibcode:1994Sci...263..671R. PMID 8303274. doi:10.1126/science.8303274.
- Jamieson AC, Kim SH, Wells JA (May 1994). "In vitro selection of zinc fingers with altered DNA-binding specificity". Biochemistry 33 (19): 5689–95. PMID 8180194. doi:10.1021/bi00185a004.
- Choo Y, Klug A (Nov 1994). "Toward a code for the interactions of zinc fingers with DNA: selection of randomized fingers displayed on phage". Proceedings of the National Academy of Sciences of the United States of America 91 (23): 11163–7. Bibcode:1994PNAS...9111163C. PMC 45187. PMID 7972027. doi:10.1073/pnas.91.23.11163.
- Wu H, Yang WP, Barbas CF (Jan 1995). "Building zinc fingers by selection: toward a therapeutic application". Proceedings of the National Academy of Sciences of the United States of America 92 (2): 344–8. Bibcode:1995PNAS...92..344W. PMC 42736. PMID 7831288. doi:10.1073/pnas.92.2.344.
- Kim JS, Lee HJ, Carroll D (Feb 2010). "Genome editing with modularly assembled zinc-finger nucleases". Nature Methods 7 (2): 91; author reply 91–2. PMC 2987589. PMID 20111032. doi:10.1038/nmeth0210-91a.
- Joung JK, Voytas DF, Cathomen T (February 2010). "Reply to "Genome editing with modularly assembled zinc-finger nucleases"". Nat. Methods 7 (2): 91–2. PMID 20111031. doi:10.1038/nmeth0210-91b.
- Segal DJ, Dreier B, Beerli RR, Barbas CF (Mar 1999). "Toward controlling gene expression at will: selection and design of zinc finger domains recognizing each of the 5'-GNN-3' DNA target sequences". Proceedings of the National Academy of Sciences of the United States of America 96 (6): 2758–63. Bibcode:1999PNAS...96.2758S. PMC 15842. PMID 10077584. doi:10.1073/pnas.96.6.2758.
- Dreier B, Fuller RP, Segal DJ, Lund CV, Blancafort P, Huber A et al. (Oct 2005). "Development of zinc finger domains for recognition of the 5'-CNN-3' family DNA sequences and their use in the construction of artificial transcription factors". The Journal of Biological Chemistry 280 (42): 35588–97. PMID 16107335. doi:10.1074/jbc.M506654200.
- Dreier B, Beerli RR, Segal DJ, Flippin JD, Barbas CF (Aug 2001). "Development of zinc finger domains for recognition of the 5'-ANN-3' family of DNA sequences and their use in the construction of artificial transcription factors". The Journal of Biological Chemistry 276 (31): 29466–78. PMID 11340073. doi:10.1074/jbc.M102604200.
- Bae KH, Kwon YD, Shin HC, Hwang MS, Ryu EH, Park KS et al. (Mar 2003). "Human zinc fingers as building blocks in the construction of artificial transcription factors". Nature Biotechnology 21 (3): 275–80. PMID 12592413. doi:10.1038/nbt796.
- Ramirez CL, Foley JE, Wright DA, Müller-Lerch F, Rahman SH, Cornu TI et al. (May 2008). "Unexpected failure rates for modular assembly of engineered zinc fingers". Nature Methods 5 (5): 374–5. PMID 18446154. doi:10.1038/nmeth0508-374.
- Kim HJ, Lee HJ, Kim H, Cho SW, Kim JS (Jul 2009). "Targeted genome editing in human cells with zinc finger nucleases constructed via modular assembly". Genome Research 19 (7): 1279–88. PMC 2704428. PMID 19470664. doi:10.1101/gr.089417.108.
- Sander JD, Dahlborg EJ, Goodwin MJ, Cade L, Zhang F, Cifuentes D et al. (Jan 2011). "Selection-free zinc-finger-nuclease engineering by context-dependent assembly (CoDA)". Nature Methods 8 (1): 67–69. PMC 3018472. PMID 21151135. doi:10.1038/nmeth.1542.
- Greisman HA, Pabo CO (Jan 1997). "A general strategy for selecting high-affinity zinc finger proteins for diverse DNA target sites". Science 275 (5300): 657–61. PMID 9005850. doi:10.1126/science.275.5300.657.
- Maeder ML, Thibodeau-Beganny S, Osiak A, Wright DA, Anthony RM, Eichtinger M et al. (Jul 2008). "Rapid "open-source" engineering of customized zinc-finger nucleases for highly efficient gene modification". Molecular Cell 31 (2): 294–301. PMC 2535758. PMID 18657511. doi:10.1016/j.molcel.2008.06.016.
- Smith AT, Tucker-Samaras SD, Fairlamb AH, Sullivan WJ (Dec 2005). "MYST family histone acetyltransferases in the protozoan parasite Toxoplasma gondii". Eukaryotic Cell 4 (12): 2057–65. PMC 1317489. PMID 16339723. doi:10.1128/EC.4.12.2057-2065.2005.
- Akhtar A, Becker PB (Feb 2001). "The histone H4 acetyltransferase MOF uses a C2HC zinc finger for substrate recognition". EMBO Reports 2 (2): 113–8. PMC 1083818. PMID 11258702. doi:10.1093/embo-reports/kve022.
- Kim JG, Armstrong RC, v Agoston D, Robinsky A, Wiese C, Nagle J et al. (Oct 1997). "Myelin transcription factor 1 (Myt1) of the oligodendrocyte lineage, along with a closely related CCHC zinc finger, is expressed in developing neurons in the mammalian central nervous system". Journal of Neuroscience Research 50 (2): 272–90. PMID 9373037. doi:10.1002/(SICI)1097-4547(19971015)50:2<272::AID-JNR16>3.0.CO;2-A.
- Jandrig B, Seitz S, Hinzmann B, Arnold W, Micheel B, Koelble K et al. (Dec 2004). "ST18 is a breast cancer tumor suppressor gene at human chromosome 8q11.2". Oncogene 23 (57): 9295–302. PMID 15489893. doi:10.1038/sj.onc.1208131.
- C2H2 family at PlantTFDB: Plant Transcription Factor Database
- McDowall J. "Protein of the Month: Zinc Fingers". European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI). Retrieved 2008-01-13.
- Goodsell DS. "Molecule of the Month: Zinc Fingers". Research Collaboratory for Structural Bioinformatics (RCSB) Protein Data Bank (PDB). Retrieved 2008-01-13.
- The double helix between the zinc finger
- Zinc Finger Tools design and information site
- Human KZNF Gene Catalog
- Zinc finger C2H2-type domain in PROSITE
- Entry for zinc finger class C2H2 in the SMART database
- The Zinc Finger Consortium
- ZiFiT- Zinc Finger Design Tool
- Zinc Finger Consortium Materials from Addgene
- Predicting DNA-binding Specificities for C2H2 Zinc Finger Proteins