当前位置: 首页 > 期刊 > 《核酸研究》 > 2004年第3期 > 正文
编号:11371498
Sequence saturation mutagenesis (SeSaM): a novel method for directed e
http://www.100md.com 《核酸研究医学期刊》
     International University Bremen (IUB), Campus Ring 1, 28759 Bremen, Germany 1 Fine Chemicals and Biocatalysis Research, GVF/D-B009, 67056 Ludwigshafen, Germany

    *To whom correspondence should be addressed. Tel: +49 421 200 3543; Fax: +49 421 200 3229; Email: u.schwaneberg@iu-bremen.de

    ABSTRACT

    Sequence saturation mutagenesis (SeSaM) is a conceptually novel and practically simple method that truly randomizes a target sequence at every single nucleotide position. A SeSaM experiment can be accomplished within 2–3 days and comprises four steps: generating a pool of DNA fragments with random length, ‘tailing’ the DNA fragments with universal base using terminal transferase at 3'-termini, elongating DNA fragments in a PCR to the full-length genes using a single-stranded template and replacing the universal bases by standard nucleotides. Random mutations are created at universal sites due to the promiscuous base-pairing property of universal bases. Using enhanced green fluorescence protein as the model system and deoxyinosine as the universal base, we proved by sequencing 100 genes the concept of the SeSaM method and achieved a random distribution of mutations with the mutational bias expected for deoxyinosine.

    INTRODUCTION

    Inspired by Darwinian evolution in nature, directed evolution methods have been developed for tailoring proteins to our needs and elucidating structure–function relationships (1). An important step in a directed evolution experiment is to efficiently explore sequence space through random mutagenesis. The sequence space of a truly randomized library is, by its nature, not limited by a preselection for function under physiological conditions and allows us to adapt proteins to non-natural environments such as biotechnological processes.

    Among random mutagenesis methods, error-prone PCR (epPCR) methods based on inaccurate amplification of genes have been very successful and are generally employed in directed enzyme evolution experiments due to their simplicity and versatility (1). EpPCR methods can be divided into (a) methods that reduce the fidelity of the polymerase by unbalancing nucleotides concentrations and/or adding of chemical compounds such as manganese chloride (2–4), (b) methods that employ nucleotide analogs (5,6), (c) methods that utilize ‘mutagenic’ polymerases (7) and (d) combined methods (8).

    Though versatile and simple, all known epPCR methods are significantly limited in their ability to create diversity on the gene level (9). Instead of changing one amino acid to all other 19 possible amino acids, an average of less than five other amino acids can be obtained with current epPCR methods. The reasons lie mainly in the redundant genetic code and the biased mutational spectra of DNA polymerases (7). Additionally, most directed evolution experiments are performed at low to moderate mutation frequencies in order to avoid a large portion of inactive clones. Thus only 1–3 nt have been mutated per 1000 bp (7). Statistically these mutations rarely occur next to each other. Furthermore, mutational bias of the employed DNA polymerases significantly reduces the diversity of the nucleotide substitutions and their random incorporation in the target gene. Taq DNA polymerase and Mutazyme (Stratagene), the commonly used polymerases in epPCRs, are biased toward transitions. For instance 40.9% of all observed substitutions employing Taq polymerase under epPCR conditions are AT or TA substitutions (7). The incorporation of nucleotide analogs occurs in a PCR at preferential sites and results in reduced diversity . An additional drawback of these methods is the inefficient incorporation of nucleotide analogs by polymerases (10), resulting in limited mutation frequencies (11) and often low product yields.

    As a truly random mutagenesis method, sequence saturation mutagenesis (SeSaM) overcomes limitations caused by biased polymerases. Moreover, SeSaM can target a nucleotide species of the selected sequence and each nucleotide species can be exchanged in a controlled manner since SeSaM regulates the mutational spectra through a universal base.

    MATERIALS AND METHODS

    All chemicals used were of analytical reagent grade or higher quality and purchased from Sigma-Aldrich Chemie (Taufkirchen, Germany), Applichem (Darmstadt, Germany) and Carl Roth (Karlsruhe, Germany). pEGFP plasmid was purchased from BD Biosciences (Heidelberg, Germany).

    A thermal cycler (Mastercycler gradient; Eppendorf, Hamburg, Germany) and thin-wall PCR tubes (Multi-Ultra tubes; 0.2 ml; Carl Roth, Germany) were used in all PCRs. The PCR volume was always 50 μl; larger volumes were prepared in multiple 50 μl PCRs. The amount of DNA was quantified using a NanoDrop photometer (NanoDrop Technologies, Delaware, USA).

    Preparatory step 1: single-stranded EGFP template preparation

    For each PCR (94°C for 3 min, 1 cycle; 94°C for 1 min/59.5°C for 1 min/72°C for 75 s, 31 cycles; 72°C for 10 min, 1 cycle), 2.5 U Taq DNA polymerase (Qiagen, Hilden, Germany), 0.2 mM dNTP mix (New England Biolabs, Frankfurt, Germany), 20 pmol forward primer (5'-fluorescein-CCAATACGCAAACCGCCTCTCC-3'), 20 pmol 5'-biotinylated reverse primer (5'-CGTATCACGAGGCCCTTAAGG-3') and 250 ng pEGFP were used. After PCR, biotinylated reverse strands were isolated using Dynabeads MyOne Streptavidin (Dynal Biotech, Oslo, Norway) at room temperature. Fifty microliters of Dynabeads (10 mg/ml) were washed twice using 100 μl B&W buffer 1 (10 mM Tris–HCl pH 7.5, 1.0 mM EDTA, 2.0 M NaCl) each time. Washed Dynabeads were resuspended in 100 μl B&W buffer 1 and 100 μl of PCR product was added. After incubating for 20 min, the biotinylated DNA was immobilized on the Dynabeads, and the Dynabeads were washed with 100 μl B&W buffer 1. Non-biotinylated DNA fragments were released after incubating in 100 μl DNA melting solution (0.1 M NaOH) at 37°C for 10 min and bead washing . The single-stranded template was released from Dynabeads according to the manual by boiling in 0.1% SDS. Template DNA was desalted using a Microcon YM-10 centrifugal filter device (Millipore, Eschborn, Germany). The 50 μl sample was applied to a Microcon YM-10 sample reservoir and spun at 14 000 g for 10 min. Then, 100 μl EB buffer (1:10 dilution with water; Qiagen) was added and spun at 14 000 g for an additional 15 min. The sample reservoir was inverted and spun at 14 000 g for 3 min to recover the desalted DNA.

    Preparatory step 2: cloning vector preparation

    Twenty-five micrograms of plasmid pEGFP (QIAprep, Qiagen) was digested with 30 U EcoRI (New England Biolabs) in a reaction mixture of 100 μl. The reaction mixture was incubated at 37°C for 3 h. The linearized plasmid was purified with the NucleoSpin Extract kit (Machery-Nagel, Düren, Germany) (elution volume of 50 μl for a 100 μl reaction mixture). Fifteen micrograms of linearized plasmid pEGFP was subjected to a second digestion with 20 U AgeI (New England Biolabs) in a total reaction volume of 50 μl. After incubation at 37°C for 3 h, the double-digested plasmid was purified using the NucleoSpin Extract kit (Machery-Nagel).

    Step 1a: PCR with dATPS

    For each PCR with dATPS (94°C for 3 min, 1 cycle; 94°C for 1 min/59.5°C for 1 min/72°C for 75 s, 31 cycles; 72°C for 10 min, 1 cycle) the reaction mixture included 2.5 U Taq DNA polymerase (Qiagen), 0.2 mM dNTP mix (New England Biolabs), 0.2 mM Sp-dATPS (Biolog Life Science Institute, Bremen, Germany), 12.6 pmol 5'-biotinylated forward primer (5'-GACCATGATTACGCCAAGCTTGC-3'), 12.6 pmol reverse primer (5'-GACCGGCGCTCAGTTGGAATTCTAG-3') and 250 ng plasmid pEGFP.

    Step 1b: iodine cleavage of phosphorothioate backbone

    The phosphorothioate bond was cleaved with iodine (dissolved in ethanol; final concentration in the PCR tube, 2 μM). The mixture was incubated at room temperature for 1 h.

    Step 1c: preparation of single-stranded DNA fragments with a length distribution

    Biotinylated DNA fragments from the forward primer were isolated using Dynabeads MyOne Streptavidin (DYNAL Biotech) by a method identical to preparatory step 1, but without the desalting step.

    Step 1d: SDS salt removal from eluted single-stranded DNA fragments

    Desalination was done using the NucleoTrap kit (Macherey-Nagel). Four hundred microliters of buffer NT2 was added to 100 μl of eluted DNA followed by 15 μl of NucleoTrap suspension. The mixture was incubated at room temperature for 10 min with gentle shaking every 2–3 min. The sample was then centrifuged at 10 000 g for 30 s and, after discarding the supernatant, the pelleted silica matrix was washed twice with 500 μl of buffer NT3 followed by air-drying for 15 min at 37°C to remove residual ethanol. The silica matrix was resuspended in 55 μl Tris–HCl buffer (5 mM, pH 8.5), and incubated at 50°C for 5 min to elute the DNA from the silica. The suspension was centrifuged with NucleoSpin Microfilter (Machery-Nagel) at 10 000 g for 30 s to separate silica matrix from the DNA-containing solution.

    Step 2: enzymic elongation of DNA fragments with the universal base deoxyinosine

    Elongation of DNA fragments was performed at 37°C for 30 min in a total reaction volume of 50 μl. Each reaction mixture consisted of: 5 U terminal transferase (New England Biolabs), 0.25 mM CoCl2, 0.4 μM dITP (Amersham Biosciences Europe GmbH, Freiburg, Germany) and 650 ng of desalted DNA fragments from Step 1d. Terminal transferase was subsequently heat inactivated at 70°C for 10 min and the product was purified using QIAquick Nucleotide Removal Kit (Qiagen).

    Step 3: full-length gene synthesis

    The full-length gene was synthesized by PCR (94°C for 3 min, 1 cycle; 94°C for 1 min/52.7°C for 1 min/72°C for 75 s, 30 cycles; 72°C for 10 min, 1 cycle) using the following reaction mixture: 2.5 U Taq DNA polymerase (Qiagen), 0.2 mM dNTP mix (New England Biolabs), 500 ng elongated DNA fragments, 20 pmol reverse primer (5'-GACCGGCGCTCAGTTGGAATTCTAG-3') and 60 ng single-stranded reverse template (preparatory step 1). Full-length genes were purified using the NucleoSpin Extract kit (Machery-Nagel).

    Step 4: universal base replacement

    Universal base replacement was achieved by PCR (94°C for 3 min, 1 cycle; 94°C for 1 min/52.7°C for 1 min/72°C for 75 s, 30 cycles; 72°C for 10 min, 1 cycle) employing 2.5 U Taq DNA polymerase (Qiagen), 0.2 mM dNTP mix (New England Biolabs), 20 pmol forward primer (5'-GACCATGATTACGCCAAGCTTGC-3'), 20 pmol reverse primer (5'-GACCGGCGCTCAGTTGGAATTCTAG-3') and 30 ng full-length gene (Step 3). The PCR products were purified using the NucleoSpin Extract kit (Machery-Nagel).

    Cloning of the mutated EGFP library

    Two micrograms of purified PCR product (Step 4) was digested with 30 U EcoRI (New England Biolabs) for 3 h at 37°C. Next, 20 U of AgeI (New England Biolabs) was added to the digest and incubated for an additional 3 h at 37°C. The digested product was purified using the NucleoSpin Extract kit (Machery-Nagel). Ligation of the digested PCR product into the linearized pEGFP cloning vector (Preparatory step 2) was performed at room temperature for 1 h using T4 DNA ligase (Roche, Mannheim, Germany). The transformation into Escherichia coli XL2 Blue (Stratagene, Amsterdam, Netherlands) cells was performed using the TSS method (12). The cells were plated and grown on LBamp plates.

    RESULTS

    Principle of SeSaM

    SeSaM is a four-step method that can saturate every single nucleotide position of the target sequence with all four standard nucleotides (Fig. 1). In the first step a pool of DNA fragments with a random length distribution is generated. In the second step, DNA fragments are ‘tailed’ at 3'-termini with universal bases using terminal transferase. In the third step the elongated DNA fragments are extended in PCR to the full-length genes by using a single-stranded template and a reverse primer. In the fourth step a concluding PCR is used to replace the universal bases with standard nucleotides.

    Figure 1. Scheme of the SeSaM method that comprises four steps. Step 1: creating a pool of DNA fragments with a random size distribution. Step 2: enzymic elongation of the DNA fragments with a universal base. Step 3: full-length gene synthesis. Step 4: universal base replacement by standard nucleotides.

    Step 1: creating a DNA fragment pool with length distribution

    In the first step (Fig. 2), a PCR is performed using a biotinylated forward primer and a non-biotinylated reverse primer in the presence of both standard nucleotides and -phosphothioate nucleotides. -Phosphothioate nucleotides are similar to standard nucleotides, except that a sulfur atom replaces an oxygen atom at the -phosphate. The commercially available S-isomer of the -phosphothioate is randomly incorporated by DNA polymerases while the R-isomer is not accepted as substrate (13,14). The phosphothioate bond is susceptible to iodine cleavage in alkaline solution (15), and cleaving of the PCR product containing -phosphothioate nucleotides generates a library of fragments that stop at every single nucleotide. Separation of this library from cleaved products is achieved by isolating biotinylated forward primer using magnetic particles with a streptavidin-coated surface in a DNA melting solution. For the success of this method, it is important to fully remove nicked complementary strands in this purification step. Subsequent release of the biotinylated DNA fragments from the solid support is achieved by boiling in 0.1% SDS solution. In order to generate a homogeneous library, the concentration of dATPS was carefully adjusted to generate a homogeneous smear (DNA fragment pool) that spans through the whole gene length (Fig. 2B).

    Figure 2. Step 1 of SeSaM: creating a DNA fragment pool with length distribution. (A) scheme of Step 1. (B) Gel pictures: PCR products before (left lane) and after (right lane) iodine cleavage. The PCRs were performed in the presence of the stated dATPS concentration. Bottom gel picture: DNA fragment size distribution after DNA melting and purification of the cleaved PCR products.

    Step 2: enzymic elongation of DNA fragments with the universal base deoxyinosine

    In the second step (Fig. 3), DNA fragments are ‘tailed’ by terminal transferase with the universal base deoxyinosine (11,16). Deoxyinosine is a well-studied universal base with a strong preference for base pairing with cytosine (11,16). It is used as a replacement for guanine in DNA sequencing (17,18). Terminal transferase is also known to incorporate universal bases with unbiased mutational spectra such as 3-nitropyrrole and 5-nitroindole (19,20). The biased deoxyinosine was selected to ensure that the incorporation of universal bases occurs and that the mutations found can be attributed to bias of deoxyinosine.

    Figure 3. Step 2 of SeSaM: enzymic elongation of DNA fragments with the universal base deoxyinosine.

    Step 3: full-length gene synthesis

    The next step is extending the elongated fragment library to full length without amplifying the wild-type gene (Fig. 4) and this has been achieved by using a long single-stranded template. After this elongation step the reverse primer will anneal to the newly synthesized full-length strand and initiate the synthesis of the complementary strand. The double-stranded DNA synthesized thus contains one strand harboring a nucleotide analog and a complementary strand containing at this position a randomly replaced nucleotide. Using a single-stranded template is crucial for the success of this step. In the case of a double-stranded template, the reverse primer will bind to its complementary template strand, resulting in PCR products that do not contain a nucleotide analog. A large fraction of non-mutated wild-type genes would consequently be generated in Step 4.

    Figure 4. Step 3 of SeSaM: full-length gene synthesis by a PCR employing a single-stranded template and a reverse primer that amplifies the newly synthesized strand.

    Step 4: universal base replacement

    To replace nucleotide analogs with standard nucleotides (Fig. 5), a concluding PCR is required where full-length genes containing nucleotide analogs act as a template. In this PCR, nucleotide analogs are replaced by standard nucleotides due to promiscuous base-pairing of the universal bases.

    Figure 5. Step 4 of SeSaM: the universal base deoxyinosine is replaced by one of the four standard nucleotides in a PCR.

    Mutational spectra of SeSaM

    Several libraries of 50–60 clones were sequenced for developing the SeSaM and sequencing of 100 clones validated the current protocol. Mutations are randomly distributed along the target gene EGFP (Fig. 6). Sequencing results and mutational spectra of SeSaM are summarized in Figure 7 and Table 1. As expected, most of the mutations (49%) occur at adenine positions because only dATPS was used to create the DNA fragment pool; 95.8% of the AX substitutions are AG substitutions, reflecting well the base-pair properties of deoxyinosine (11,16).

    Figure 6. Mutational spectra of the SeSaM method employing dATPS and deoxyinosine: In, insertion; De, deletion.

    Figure 7. Distribution of the mutations generated by the SeSaM method.

    Table 1. Mutational spectra of SeSaM

    DISCUSSION

    In order to validate the SeSaM method we have to prove that nucleotide substitutions occur at positions of Sp-dNTPS incorporation, and that universal bases are incorporated and responsible for nucleotide substitutions. For showing the site-specific cleavage we decided to use Sp-dATPS. Mutations generated by the SeSaM method therefore have to occur preferably at adenine positions in the gene sequence. Equal targeting of all nucleotides in a gene sequence can be achieved by four PCRs (Step 1) using in each a different Sp-dNTPS and balancing the DNA amount of each fragment library in Step 2.

    Figure 7 shows that 49.0% of the mutations occur at adenine positions of the sequence, 16.3% occur at ‘adenine’ positions in the complementary strand and 24.5% occur at 1 or 2 nt downstream of an adenine position. Substitutions at adenine positions in the complementary strand occur mainly due to nicked DNA that has not been completely removed in the washing step (Fig. 2A). Improvements in the washing procedure reduced the total percentage of these mutations significantly to 16.3%. In order to minimize wild-type formation, conditions were chosen in which statistically more than one degenerated base is incorporated (Fig. 3; Manual of New England Biolabs). We assume that many mutations next to an adenine position are caused by the incorporation of more than one nucleotide analog and that the nucleotide analogs at the adenine positions were replaced by the ‘original’ adenine nucleotide in the subsequent PCR (Step 4; Fig. 5). Elongation of DNA fragments containing more than one universal base incorporated at the 3'-terminus (Step 3; Fig. 4) is, however, a rare event owing to the weak two hydrogen bonds between each deoxyinosine and the template nucleotide. As a result the 3'-end is bending and difficult to be amplified by polymerases. In total up to 88% of the mutations are likely attributed to Sp-dATPS incorporation.

    We selected deoxyinosine as the universal base to prove that incorporation and nucleotide replacement can be attributed to the incorporated universal base. Deoxyinosine has a strong bias to base-pair with cytosine in the complementary strand (11,16). We therefore expect mainly AG substitutions and the mutational bias found confirms that more than 95% of the AX substitutions are AG substitutions. Indeed, these findings are in very good agreement with the reported bias (11,16).

    DNA fragments smaller than 67 nt were not mutagenized (Fig. 7). The absence of mutations at this starting region is caused by the DNA extraction and purification procedures of the commercial purification kits employed (Machery Nagel NucleoTrap, Qiagen QIAquick) in which small fragments are recovered inefficiently. For library constructions with SeSaM we therefore used a forward primer that binds 70 bp upstream of the EGFP gene.

    Compared with currently used epPCR methods, SeSaM has the advantage of being completely independent of the mutational bias of DNA polymerases. In addition, the fragment distribution of a DNA library is controllable by using different concentrations of the individual Sp-dNTPS or a combination of them. Universal or degenerate bases with different base-pairing preferences can in future be used to tune the mutational spectra. Consequently, we believe that SeSaM is a very beneficial method for the field of directed evolution as it represents a superior tool to truly explore sequence space. Currently, we are extending the scope of SeSaM to randomize every nucleotide position in a target sequence by employing unbiased universal bases and to generate multiple subsequent mutations.

    ACKNOWLEDGEMENTS

    We thank Dr H.-G. Genieser (Biolog Life Science Institute, Bremen, Germany) for dNTPS. We thank BASF AG and Dr M. Breuer for financial support and DNA sequencing.

    REFERENCES

    Brakmann,S. (2001) Discovery of superior enzymes by directed molecular evolution. Chem. Biochem., 2, 865–871.

    Cadwell,R.C. and Joyce,G.F. (1992) Randomization of genes by PCR mutagenesis. PCR Methods Appl., 2, 28–33.

    Cadwell,R.C. and Joyce,G.F. (1994) Mutagenic PCR. PCR Methods Appl., 3, 136–140.

    Lin-Goerke,J.L., Robbins,D.J. and Burczak,J.D. (1997) PCR-based random mutagenesis using manganese and reduced dNTP concentration. Biotechniques, 23, 409–412.

    Zaccolo,M., Williams,D.M., Brown,D.M. and Gherardi,E. (1996) An approach to random mutagenesis of DNA using mixtures of triphosphate derivatives of nucleoside analogues. J. Mol. Biol., 255, 589–603.

    Kuipers,O.P. (1996) Random mutagenesis by using mixtures of dNTP and dITP in PCR. Methods Mol. Biol., 57, 351–356.

    Cline,J. and Hogrefe,H.H. (2000) Randomize gene sequences with new PCR mutagenesis kit. Strategies, 13, 157–161.

    Xu,H., Petersen,E.I., Petersen,S.B. and el-Gewely,M.R. (1999) Random mutagenesis libraries: optimization and simplification by PCR. Biotechniques, 27, 1102–1108.

    Patrick,W.M., Firth,A.E. and Blackburn,J.M. (2003) User-friendly algorithms for estimating completeness and diversity in randomized protein-encoding libraries. Protein Eng., 16, 451–457.

    Innis,M.A., Myambo,K.B., Gelfand,D.H. and Brow,M.A. (1988) DNA sequencing with Thermus aquaticus DNA polymerase and direct sequencing of polymerase chain reaction-amplified DNA. Proc. Natl Acad. Sci. USA, 85, 9436–9440.

    Spee,J.H., de Vos,W.M. and Kuipers,O.P. (1993) Efficient random mutagenesis method with adjustable mutation frequency by use of PCR and dITP. Nucleic Acids Res., 21, 777–778.

    Chung,C.T., Niemela,S.L. and Miller,R.H. (1989) One-step preparation of competent Escherichia coli: transformation and storage of bacterial cells in the same solution. Proc. Natl Acad. Sci. USA, 86, 2172–2175.

    Romaniuk,P.J. and Eckstein,F. (1982) A study of the mechanism of T4 DNA polymerase with diastereomeric phosphorothioate analogues of deoxyadenosine triphosphate. J. Biol. Chem., 257, 7684–7688.

    Eckstein,F. and Thomson,J.B. (1995) Phosphate analogs for study of DNA polymerases. Methods Enzymol., 262, 189–202.

    Gish,G. and Eckstein,F. (1988) DNA and RNA sequence determination based on phosphorothioate chemistry. Science, 240, 1520–1522.

    Kawase,Y., Iwai,S., Inoue,H., Miura,K. and Ohtsuka,E. (1986) Studies on nucleic acid interactions. I. Stabilities of mini-duplexes (dG2A4XA4G2-dC2T4YT4C2) and self-complementary d(GGGAAXYTTCCC) containing deoxyinosine and other mismatched bases. Nucleic Acids Res., 14, 7727–7736.

    Motz,M., Paabo,S. and Kilger,C. (2000) Improved cycle sequencing of CC-rich templates by a combination of nucleotide analogs. Biotechniques, 29, 268–270.

    Dierick,H., Stul,M., Dekelver,W., Marynen,P. and Cassiman,J.J. (1993) Incorporation of dITP or 7-deaza dGTP during PCR improves sequencing of the product. Nucleic Acids Res., 21, 4427–4428.

    Smith,C.L., Simmonds,A.C., Hamilton,A.L., Martin,D.L., Lashford,A.G., Loakes,D., Hill,F. and Brown,D.M. (1998) Use of 5-nitroindole-2'-deoxyribose-5'-triphosphate for labelling and detection of oligonucleotides. Nucl. Nucl., 17, 555–564.

    Loakes,D. (2001) Survey and summary: the applications of universal DNA base analogues. Nucleic Acids Res., 29, 2437–2447.(Tuck Seng Wong, Kang Lan Tee, Berhard Ha)