当前位置: 首页 > 期刊 > 《核酸研究》 > 2005年第1期 > 正文
编号:11368804
Destabilization of tetranucleotide repeats in Haemophilus influenzae m
http://www.100md.com 《核酸研究医学期刊》
     Molecular Infectious Diseases Group, Department of Paediatrics, Weatherall Institute for Molecular Medicine, University of Oxford, John Radcliffe Hospital Headington, Oxford OX3 9DU, UK

    *To whom correspondence should be addressed. Tel: +44 1865 222344; Fax: +44 1865 222626; Email: cbayliss@hammer.imm.ox.ac.uk

    ABSTRACT

    A feature of Haemophilus influenzae genomes is the presence of several loci containing tracts of six or more identical tetranucleotide repeat units. These repeat tracts are unstable and mediate high frequency, reversible alterations in the expression of surface antigens. This process, termed phase variation (PV), enables H.influenzae to rapidly adapt to fluctuations in the host environment. Perturbation of lagging strand DNA synthesis is known to destabilize simple sequence repeats in yeast and Escherichia coli. By using a chromosomally located reporter construct, we demonstrated that the mutation of an H.influenzae rnhA (encoding RnaseHI) homologue increases the mutation rates of tetranucleotide repeats 3-fold. Additionally, deletion of the Klenow domain of DNA polymerase I (PolI) resulted in a 35-fold increase in tetranucleotide repeat-mediated PV rates. Deletion of the PolI 5'>3' exonuclease domain appears to be lethal. The phenotypes of these mutants suggest that delayed or mutagenic Okazaki fragment processing destabilizes H.influenzae tetranucleotide repeat tracts.

    INTRODUCTION

    Replication of double-stranded DNA genomes is semi-discontinuous with DNA replication being continuous on the leading strand and discontinuous on the lagging strand. Lagging strand DNA synthesis initiates from RNA primers of 8–12 nt in length, proceeds for 100–2000 nt and terminates in response to signals that include the 5' end of an RNA primer (prokaryotes) or DNA flaps (eukaryotes) (1,2). Processing of these DNA fragments, termed Okazaki fragments, requires removal of the RNA primer/DNA flap, filling of the gap between the two DNA fragments by DNA synthesis and DNA ligation. In eukaryotes, perturbation of Okazaki fragment processing can destabilize simple sequence repeats (3–5).

    Okazaki fragment processing in Escherichia coli is initiated by RnaseHI, which removes or shortens the RNA primer (6). Polymerase I (PolI) then extends the 3' end of the upstream Okazaki fragment, displaces a single nucleotide of the 5' end of the downstream fragment, cleaves the displaced single-strand DNA and thus creates a substrate for DNA ligase (7). PolI has three structural domains with different activities: 5'>3' exonuclease, 3'>5' exonuclease and 5'>3' DNA polymerase (8). The latter two domains form the Klenow fragment. In E.coli, polA (i.e. polI) mutants lacking either polymerizing (polA1) or 5'>3' exonuclease (polA, F' Klenow) activity exhibit elevated mutation frequencies for dinucleotide repeat tracts (9). While activation of an SOS response was partially responsible for the increase, these results indicate that accurate Okazaki fragment processing is required to maintain the stability of dinucleotide repeat tracts in E.coli.

    Haemophilus influenzae is a common commensal of the upper respiratory tract of humans. Characteristically, the genomes of this bacterial species contain multiple long tetranucleotide repeat tracts (10,11). Long repeat tracts with alternate unit sizes (e.g. mono or dinucleotide repeats) are infrequent. These tetranucleotide repeat tracts have high-mutation rates and are responsible for PV, i.e. a high frequency of reversible switches in expression, of surface antigens such as lipopolysaccharide epitopes and haemoglobin–haptoglobin binding proteins (12,13). The mutation rates of these repeat tracts are influenced by the number of repeats in the tract but are not subjected to control by mismatch repair, recA-mediated recombination or transcription-coupled repair (14,15). Interestingly, an insertion/deletion mutation in the H.influenzae polI gene destabilized tetranucleotide, but not dinucleotide, repeat tracts independently of activation of an SOS response suggesting a critical role for Okazaki fragment processing in determining the stability of repeats in this bacterial species (14). In this study, we have extended these observations by constructing a mutation in an rnhA (encoding RnaseHI) homologue and a deletion of the PolI Klenow domain of H.influenzae, and show that both these mutations destabilize tetranucleotide repeat tracts.

    MATERIALS AND METHODS

    Strains

    H.influenzae strain RM118, termed Rd herein, is a derivative of strain KW-20 (16). Strains RdGZ38R and RdGZ17R were described previously (15). H.influenzae strains were grown in brain heart infusion (BHI) supplemented with either haemin (10 μg/ml) and NAD (2 μg/ml) for liquid media or Levinthal supplement for solid media. E.coli strains DH5 or GM48 were used to propagate plasmids and were grown in Luria–Bertani broth supplemented with ampicillin (100 μg/ml), kanamycin (50 μg/ml), tetracycline (12 μg/ml) or chloramphenicol (30 μg/ml).

    Construction of mutations in polI and rnhA of H.influenzae

    Here, we describe the construction of a number of plasmids that were used in attempts to inactivate genes or to delete specific regions of genes on the H.influenzae chromosome (see Figure 1 for a depiction of the inserts in these plasmids and Table 1 for the DNA sequences of the oligonucleotides utilized for the cloning of these inserts). The designations of these constructs are as follows: a gene preceded by indicates that the gene was inactivated by the insertion of an antibiotic cassette (deletion of a portion of a gene may also occur in these constructs); a gene followed by a and then a domain(s) name indicates the deletion of a particular domain(s) of a gene and that the antibiotic cassette is in an adjacent gene (in these constructs it is assumed that the other domains of the protein will be expressed). The designations for the domains of polI (see above and Figure 1) are as follows: 5' to 3' exonuclease, 5'>3' exo; 3' to 5' exonuclease, 3'>5' exo; 3' to 5' exonuclease and DNA polymerase, Klen.

    Figure 1 Construction of mutations in polI and rnhA of H.influenzae. (a) Representations of the native H.influenzae polI locus (top diagram) and of deletions/insertions in this locus that were constructed in plasmids are shown. The ‘hi’ numbers are taken from the annotation of the Rd genome sequence in the TIGR Microbial database (www.tigr.org). Genes are represented by open boxes and non-coding DNA by a thick dark line. The three domains of PolI are shown as filled boxes: dots, 5'>3' exonuclease domain; filled box, 3'>5' proof-reading exonuclease domain; and vertical stripes, polymerase domain (this analysis is based on data available in the TIGR Microbial database). The direction of translation is indicated by arrows. Plasmids pUCpolI and pUChi0855 were described previously (14). Antibiotic cassettes were inserted into these plasmids in either the HindIII (pUCpolI and pUCpolI-large) or EcoRV sites. The parentheses indicate the polI domains that could be expressed in H.influenzae constructs derived using plasmids pUCpolIKlen, pUCpolI5'>3'exo and pUCpolI5'>3'exo/3'>5'exo. (b) The construction of an insertional mutation in rnhA is shown. The top diagram represents the native locus and indicates the positions of two primers used to clone a 1700 bp fragment containing the rnhA gene. A tetracycline cassette (tetR) was inserted into a BclI site located in the centre of rnhA.

    Table 1 Oligonucleotides utilized in construction of plasmids

    In construct pUCpolI (14), 666 bp of the H.influenzae polI gene were deleted and replaced by a tetracycline cassette. Plasmid pUCpolI-large was constructed in order to extend this deletion into the PolI 5'>3' exo domain. This plasmid was made by amplifying from H.influenzae strain Rd chromosomal DNA a fragment that included sequences upstream of polI and part of the 5'>3' exo domain, using primers POL9E and POL8H, which, respectively, have EcoRI and HindIII sites at the 5' end. This fragment was digested with EcoRI and HindIII and used to replace the EcoRI–HindIII fragment of pUCpolI (Figure 1a). Recombination of the insert of this plasmid into the H.influenzae chromosome would result in deletion of the entire 3'>5' exo domain and part of the 5'>3' exo domain.

    Plasmid, pUCpolIKlen, was constructed to enable deletion of the entire Klenow domain and efficient expression of the 5'>3' exo domain. The entire 5'>3' exo domain and some upstream sequences were amplified with primers POL10RV (binds downstream of the PolI 5'>3' exo domain) and POL9E. The former primer has an EcoRV site at the 5' end and inserts two 5'TAA stop codons in frame with the polI reading frame, enabling efficient termination of translation. This fragment was digested with EcoRV and EcoRI and used to replace the EcoRV/EcoRI fragment of plasmid pUChi0855 (14). Recombination of the insert of this plasmid into the H.influenzae chromosome would result in deletion of the entire Klenow domain and the 5' end of the downstream gene, hi0855.

    A full-length clone of the H.influenzae polI gene was constructed by amplifying chromosomal DNA with primers POL9E and POL4B (binds within polI downstream of the HindIII site), digesting the product with EcoRI and HindIII (a native site within polI), and using this fragment to replace the EcoRI/HindIII fragment of pUCpolI. The resultant plasmid was digested with HindIII and BamHI and the larger fragment was ligated to two fragments generated from pUChi0855 by digestion with either HindIII and EcoRV or EcoRV and BamHI. The resultant plasmid, pUCpolI-full, contains the entire polI gene and promoter region plus an interrupted version of gene hi0855.

    Plasmids, pUCpolI5'>3'exo and pUCpolI5'>3'exo/3'>5'exo, were constructed to enable deletion of either the 5'>3' exo domain alone or both the 5'>3' and 3'>5' exo domains. DNA fragments lacking these domains were generated by PCR amplification using plasmid DNA of pUCpolI-full and either POL12KLEN (binds at the beginning of the PolI 3'>5' exo domain) and POL7B (binds downstream of hi0855) or POL13POL (binds at the beginning of the PolI polymerase domain) and POL7B primers. These fragments were digested with NcoI (present at the 5' ends of POL12KLEN and POL13POL) and EcoRV. PCR amplification of chromosomal DNA was then performed with primers POL14E (binds upstream of POL9E and has an EcoRI site at the 5' end) and POL11NC (binds adjacent to but upstream of the native polI start codon and has an NcoI site at the 5' end). This fragment was digested with EcoRI and NcoI and together with one of the NcoI–EcoRV fragments was used to replace the EcoRI/EcoRV fragment of pUCpolI-full. Recombination of the inserts of the resultant plasmids, pUCpolI5'>3'exo and pUCpolI5'>3'exo/3'>5'exo, into the H.influenzae chromosome would result in the expression of either the Klenow or polymerase domains of PolI, respectively.

    To enable selection for these mutations in H.influenzae, antibiotic cassettes were then inserted into each of these plasmids. A tetracycline cassette was inserted in the HindIII site of pUCpolI-large and into the EcoRV site of pUCpolIKlen, pUCpolI5'>3'exo and pUCpolI5'>3'exo/3'>5'exo (note that in the latter three constructs hi0855 will be inactivated). In addition, a chloramphenicol cassette was inserted into the EcoRV site of pUCpolI5'>3'exo and pUCpolI5'>3'exo/3'>5'exo.

    Plasmid, pUCrnhA-tet, was constructed to permit inactivation of the H.influenzae rnhA gene. The rnhA gene and flanking regions were amplified by PCR from chromosomal DNA of H.influenzae strain Rd using primers RNASE1B and OMP2EXT. This product was digested with BamHI and EcoRI and ligated to pUC19 cut with the same enzymes. This plasmid was transformed into E.coli strain GM48 to allow restriction with BclI, which is sensitive to methylation. A tetracycline cassette was ligated into the unique BclI site, in the centre of rnhA, of this plasmid.

    These plasmids were linearized with either SalI or BamHI and transformed into competent H.influenzae. Transformants were selected on BHI plates containing 4 μg/ml tetracycline or 2 μg/ml chloramphenicol. Transformants were checked by PCR amplification using primers spanning the deletion/antibiotic cassette insertion sites and by Southern-blot analysis using probes, specific for polI or rnhA, which were generated by PCR amplification and labelled using the dioxigenin system (Boehringer Mannheim).

    Growth and cell division assays

    Growth rates and numbers of colony forming units (c.f.u.) were measured for strains grown in liquid culture. Cultures (5 ml) were inoculated with 0.3 ml of an overnight culture of each strain and grown to an optical density (OD) at 490 nm of 0.4. Cultures were then standardized by diluting back to an OD of 0.1. Optical densities were measured at 490 nm every 2 h. Serial dilutions of samples, taken at different intervals throughout the growth curve, were plated and used to estimate the total number of c.f.u. Some samples were also examined by phase contrast microscopy. Doubling times were estimated by plotting Log N (where N is the number of c.f.u.; OD490 units were converted into cfu using the average value for strain Rd of 1.6 x 109 c.f.u./OD490 unit) against time and fitting a trendline to the linear portion of the growth curve.

    Phase variation rate assays

    PV rates and mutational patterns were determined as described previously (15). Briefly, H.influenzae mutants were transformed with linearized DNA of plasmids pGZ38R, pGZ17R and pGZAT20, and transformants were isolated on BHI plates containing 10 μg/ml of kanamycin. These plasmids recombine into the H.influenzae chromosome and express a phase variable Mod–LacZ fusion protein whose PV is driven by, respectively, 38 5'AGTC, 17 5'AGTC or 20 5'AT repeats. PV rates were determined for two transformants of each combination of mutant strain/reporter construct. Serial dilutions of multiple colonies of a strain were plated on BHI plates containing 40 μg/ml of Xgal (5-bromo-4-chloro-3-indoyl-?-D-galactopyranoside) and these plates were used to estimate the total number of variants and cells in a colony. PV frequencies were calculated from these values and then PV rates were derived using the median frequency either according to Drake (19) or to Saunders et al. (20). Alterations in repeat tracts were determined by PCR amplification from parental and variant progeny colonies using fluorescently labelled primers which span the repeat tract. The sizes of PCR products were determined by electrophoresis using an ABIprism 377 autosequencer and analysis of gels with the ABI GeneScan 3.1 program (PerkinElmer).

    The increase in ON-to-OFF switching of strain RdpolIKlen was separated into increases in shifts of differing unit sizes using the observed proportions of each shift size (i.e. 1, 2 or >2 units). The increase in deletions and insertions for each unit size was then calculated using the observed ratio of deletions:insertions in ON-to-OFF switching (i.e. a ratio of 30:1 implies that 1/31 of the 1 unit shifts are insertions). These values were then used to estimate the increases in OFF-to-ON switching (i.e. the increase in –1 switching is the combined values for the increase in –1, +2, >–2 and >+2 shifts).

    Statistics

    Statistical analyses were performed with the program Instat 2.0. For PV frequencies, comparisons of the PV frequencies for a pair of strains were performed using a Mann–Whitney non-parametric rank sum test. For proportions of insertions and deletions, pairs of strains were compared using a 2 x 2 contingency table and a Fisher's exact test where the two-sided P-value tests the null hypothesis that the proportion of deletions and insertions is identical in each strain.

    RESULTS

    Construction of a deletion of the polI Klenow domain and inactivation of rnhA of H.influenzae

    Mutant RdpolI was constructed by interrupting the H.influenzae polI gene with an antibiotic cassette inserted into the region encoding the Klenow domain of PolI (see pUCpolI in Figure 1). While this mutant exhibited an elevation of tetranucleotide repeat-mediated PV rates (14), it was unclear whether this increase was due to complete loss of PolI activity (i.e. both 5'>3' exonuclease and polymerase activity) or just the absence of PolI polymerase activity. The deletion in pUCpolI was therefore extended to include part of the 5'>3' exo domain. The resultant plasmid, pUCpolI-large, was used to transform H.influenzae strain Rd. All of the transformants obtained had undergone a single cross-over event and retained an intact polI gene. This result suggested that inactivation of the entire polI gene was lethal as observed for E.colipolA mutants grown on rich media (21).

    We then constructed a series of plasmids that would permit deletion of the entire Klenow domain (pUC polIKlen), the 5'>3' exo domain (pUC polI5'>3'exo) or the 5'>3' and 3'>5' exo (pUC polI5'>3'exo/3'>5'exo) domains (see Figure 1). A tetracycline cassette was inserted into an artificial EcoRV site in hi0855, the gene downstream of polI whose inactivation was known not to destabilize tetranucleotide repeat tracts (14). Transformants were obtained with each construct but only those transformed with pUC polIKlen had undergone double cross-over events and retained the deletion. These transformants were designated RdpolIKlen. This result demonstrated that deletion of the Klenow domain of H.influenzae PolI was possible but suggested that deletion of the 5'>3' exo domain was lethal.

    A major function of PolI is processing of Okazaki fragments. To test whether inactivation of other proteins involved in this process also destabilizes tetranucleotide repeat tracts, a mutation was constructed in the H.influenzae rnhA homolog. The rnhA gene and flanking region was cloned from H.influenzae strain Rd and a tetracycline cassette was inserted into a native BclI site in the centre of the gene. This construct was used to transform H.influenzae strain Rd and transformants were generated containing an interrupted rnhA gene. These transformants were designated RdrnhA.

    Growth characteristics of mutants

    Growth of the PolI-Klenow and RnaseHI mutants was examined for RdpolIKlen, RdrnhA and Rd in liquid media (Figure 2a). The mutants exhibited doubling times of 51 (±0.8) and 76 (±3.3) min, respectively, whereas the doubling time of strain Rd was 50 (±4.6). These data indicate that the loss of RnaseHI activity has a much greater effect on growth than loss of Klenow activity. Filament formation was investigated by plating samples from different points along the growth curves and calculating the numbers of c.f.u./ml present in the culture. To enable comparison between the different time points these values were divided by the number of OD units (Figure 2b). Strain Rd exhibited a small increase in the number of c.f.u. per OD unit as the cultures progressed into stationary phase, possibly indicating that during early stages of the growth curve DNA replication/cell growth occurred faster than cell division. In three cases, these curves also exhibited a significant reduction in the number of c.f.u. after overnight growth likely due to cells dying. Both the RdpolIKlen and RdrnhA mutants exhibited large reductions in the number of c.f.u. per OD unit with the average figures being 8- or 14-fold lower, respectively, than the average figure for strain Rd (1.6 x 109 c.f.u./OD490 unit, ±8 x 108). Examination of the mutants by phase contrast microscopy revealed the presence of filaments of varying sizes in the liquid cultures (data not shown), which may partly or wholly explain the observed decreases in c.f.u. per OD unit. Similarly, reductions in the number of c.f.u. per colony were also noted when the mutants were grown on solid media, once again indicating the presence of filaments.

    Figure 2 Growth rates and plating efficiencies of polI and rnhA mutants. Growth curves were performed with strains Rd, RdpolIKlen and RdrnhA using BHI-broth supplemented with NAD and haemin at 37°C. Four independent cultures were performed for each strain. (a) Optical density at 490 nm plotted against time. Average values and standard deviations are shown for each strain. (b) Samples were taken at different time points, serially diluted and plated on BHI-agar supplemented with Levinthal's supplement. The number of colony forming units (c.f.u.) per ml was then divided by the OD490 of the sample and these values were plotted on a log scale against OD490. The four independent experiments are shown separately as samples were taken at different optical density values in each experiment. (a and b) Filled shapes, Rd; open shapes, RdpolIKlen; and alternative symbols, RdrnhA.

    Phase variation rates of polI mutants

    PV rates were investigated by inserting a lacZ reporter construct containing 5'AGTC tetranucleotide repeats into the chromosome of the mutant strains. Median PV frequencies were determined for each mutant using reporter constructs containing 38 or 17 repeats (Table 2). Rates were determined for both directions of switching i.e. ON-to-OFF (blue to white colonies) and OFF-to-ON (white to blue colonies). Note that in these constructs there is only one ON reading frame for every two OFF reading frames and that the majority of mutations in strain Rd involve loss or gain of a single repeat unit. Thus, an ON construct with 38 repeats will usually switch to OFF variants containing either 39 or 37 repeats. Meanwhile OFF-to-ON switching takes two forms. In –1 switching of strain Rd, the majority of ON variants are generated by loss of a repeat unit (e.g. 39 to 38 repeats) while in +1 switching gain of a repeat unit is the commonest event (e.g. 37 to 38 repeats). Obviously ON variants can also be generated by larger events but these occur infrequently (e.g. 37 to 35 or 32 repeats). Mutation rates were derived for these constructs either according to Drake (19), to facilitate comparisons to our previous data, or to Saunders et al. (20) (Table 2). This latter method includes a calculation of the back-mutation rate and may, therefore, provide a more accurate estimation of PV rate. Table 2 indicates that the Drake equation can underestimate high-mutation rates by up to 1.3-fold and can overestimate low-mutation rates by up to 1.5-fold but in general provides a reasonably accurate estimation of PV rate.

    Table 2 Influence of pol-klen and rnhA mutations on H.influenzae PV rates

    PV rates for RdpolIKlen were increased relative to strain Rd by 31- to 40-fold for ON-to-OFF switching and 30- to 36-fold for –1 OFF-to-ON switching but only 5- to 17-fold for +1 OFF-to-ON switching. These data indicated that tetranucleotide repeat tracts are destabilized in RdpolIKlen and the greater effect on OFF-to-ON switching of –1 versus +1 constructs suggested that there was a higher proportional increase in deletions relative to insertions.

    Analyses of the types of mutations occurring in variant colonies of RdpolIKlen reporter constructs were performed and compared to analyses performed on strain Rd and Rdhi0855 (Figure 3). For ON-to-OFF switching in RdpolIKlen, no insertions were seen in a construct with 17 5'AGTC repeats while the ratio of deletions to insertions for a 38 repeat construct was 30:1. This figure was significantly higher than the ratios of 2.4:1 and 2.3:1 observed with reporter strains of Rd and Rdhi0855, respectively, containing similar numbers of repeats (by Fisher's exact test P-values of 0.016 and 0.006 were obtained). For –1 OFF-to-ON switching in which a –1 deletion produces an ON phenotype, the ratio was slightly higher for RdpolIKlen, 21:1 (combined data for 36 and 15 5'AGTC repeats), than for strain Rd, 7:1 (combined data for 24 and 18 5'AGTC repeats), but the difference between these ratios was not significant (P = 0.56). For +1 OFF-to-ON switching in which a +1 insertion produces an ON phenotype, the ratio for RdpolIKlen was 5:1 (for both 37 and 16 5'AGTC repeats), which was significantly higher than the ratio for strains Rd, 1:4 (for 37 5'AGTC, P = 0.017), and Rdhi0855, 1:33 (for 37 5'AGTC, P 0.0001).

    Figure 3 Influence of pol-klen and rnhA mutations on the types of alterations occurring in 5'AGTC tetranucleotide repeat tracts. Repeat tracts from phase variants were amplified by PCR and either sequenced or sized by a gene scan protocol. Alterations were classified as insertions or deletions of 1, 2 or >2 repeat units and the number of each mutational type was then represented as a percentage of the total number of tracts analysed. The genotype of each strain is indicated below the columns (wt is strain Rd) and the direction of switching above the columns. The number of repeats in the parental strain is indicated in parentheses while the total number of tracts analysed is in curly brackets. Data for wt 17, wt 37 and wt 24/18 were published previously (15).

    The relative proportions of 1:2:>2 repeat units shifts (i.e. 1 repeat unit shifts are the combined values for –1 and +1 shifts, etc.) was similar between RdpolIKlen and strain Rd, 79.5:15.5:5 and 78.5:17:4.5, respectively (derived from the combined data for 35/17 and 38/17 repeat reporter constructs), indicating that the sizes of alterations in repeat number had not changed. Using these values and a 30:1 ratio of deletions to insertions for ON-to-OFF switching in RdpolIKlen, it can be calculated (see Materials and Methods) that the 35-fold increase in ON-to-OFF switching was due to increases of 33.8-fold for deletions and 1.2-fold for insertions. This analysis would predict that –1 OFF-to-ON switching would increase 30-fold while +1 OFF-to-ON switching would increase 8-fold. These values are similar to the observed values, supporting the notion that the elevated PV rates of RdpolIKlen were due to a major increase in the number of deletions.

    Phase variation rates of rnhA mutants

    PV rates of reporter constructs containing either tetranucleotide (5'AGTC) or dinucleotide (5'AT) repeats were examined in rnhA mutants of H.influenzae (Table 2). PV rates mediated by tetranucleotide repeats were elevated for both directions of switching by 2.4-fold for long tracts (37 and 38 repeats) and 4.4-fold for short tracts (17 and 18 repeats). These differences were significant as shown by the non-overlapping 95% confidence intervals and P-values of <0.001 obtained in Mann–Whitney (non-parametric) rank sum tests comparing PV frequencies of parental and mutant strains. The increase (1.9-fold) for an RdrnhA mutant containing 16 5'AGTC repeats was not significant (P-value of 0.17) but this may due to the comparison being made with an 18 5'AGTC reporter construct, an equivalent strain Rd reporter construct not being available. Contrastingly, PV rates for the dinucleotide repeats were elevated only 1.3- and 1.4-fold for ON-to-OFF and OFF-to-ON switching, respectively. The majority of these increases were not significant as shown by overlapping 95% confidence intervals and P values of 0.1 and 0.61 for the 20 and 19 5'AT repeat constructs, respectively. The construct with 5'AT 18 repeats had a significantly (P-value of 0.009) elevated PV rate but the importance of this result is unclear as this rate was higher than that of an RdrnhA mutant reporter construct containing 19 5'AT repeats.

    The types of alterations occurring in repeat tracts of RdrnhA mutants were also examined. The pattern of mutations occurring in both the 5'AGTC (Figure 2) and 5'AT (data not shown) repeat tracts were similar to that observed with strain Rd.

    Lethality of H.influenzae 5'>3' exonuclease mutation

    Transformation with the pUCpolI5'>3'exo-tet and pUCpolI5'>3'exo/3'>5'exo-tet plasmids (see above) frequently generated transformants that had undergone recombination between the tetracycline cassette and the deletion (i.e. between the NcoI and EcoRV sites, see pUCpolI5'>3'exo and pUCpolI5'>3'exo/3'>5'exo in Figure 1). In order to force recombination to occur at the ends of the constructs (i.e. upstream of the NcoI site and downstream of the EcoRV site), we inserted a chloramphenicol cassette into the EcoRV site and used these constructs to transform the tetracycline-resistant RdpolIKlen mutants described above. This transformant lacks the DNA sequences between the EcoRV site and the deletion (compare pUCpolIKlen and pUCpolI5'>3'exo, see Figure 1), such that recombination can occur only at the ends of the plasmids. Only a small number of transformants were obtained and all these transformants had undergone only single cross-over events. The ability of RdpolIKlen to incorporate foreign DNA into the chromosome was tested by transformation with a plasmid carrying an unrelated H.influenzae gene (i.e. lgtC). The mutant exhibited a transformation frequency similar to that of strain Rd (data not shown). These results provide a further indication that deletion of the H.influenzae PolI 5'>3' exo domain is lethal.

    DISCUSSION

    Multiple cis- and trans-acting factors govern the stability of microsatellites in both prokaryotes and eukaryotes (22). Microsatellites or simple sequence repeats mediate high frequencies of reversible changes in the expression of virulence factors in a number of bacterial species (13,23–25). To fully understand the contribution of these repeats to bacterial adaptation, an in-depth analysis of the factors controlling the occurrence of mutations in these repeats is required. In this study, we have demonstrated that deletion of the entire PolI Klenow domain destabilizes tetranucleotide repeats 35-fold while inactivation of RnaseHI, another enzyme involved in processing of Okazaki fragments, also destabilizes tetranucleotide repeats although to a lesser extent. In addition, we provide evidence that the 5'>3' exonuclease domain of PolI is required for viability.

    PolI polymerase activity is not required for H.influenzae viability

    In E.coli, viable polA mutants lacking either the 5'>3' exonuclease or Klenow domains have been generated (21). Survival of these mutants demonstrates that the exonuclease and polymerase activities of PolI are not essential for replication of the E.coli genome and that other enzymes/proteins can complement these activities. The failure to construct a 5'>3' exonuclease deletion mutant of H.influenzae may suggest that this activity of PolI is essential for viability of this bacterial species. It should be noted, however, that the E.coli exonuclease mutant was constructed by first expressing the Klenow domain on an F plasmid and then deleting the entire polA gene. Interestingly, it was possible to construct a viable H.influenzae mutant lacking the entire Klenow domain of PolI. H.influenzae differs from E.coli in that it lacks homologs of polB, dinB and umuC/umuD (26,27) whose products are specialized DNA polymerases (i.e. PolII, PolIV and PolV) that could be responsible for survival of the E.coli polA1 mutant, which is known to be deficient in the Klenow domain. Thus, unless H.influenzae encodes an additional DNA polymerase not detectable by homology to currently known families of DNA polymerases, polymerization of the 1.8 Mb genome of this bacterial species is performed by PolIII alone. Notably, this includes the DNA polymerization required for Okazaki fragment joining.

    Growth characteristics of polI-klen and rnhA mutants

    While the growth rates of H.influenzae polIKlen mutants were similar to the parental strain, the mutant cells were forming filaments (Figure 2). This suggests that growth and cell division were uncoupled which in part may result from perturbation of DNA replication. Filament formation may also be a result of induction of an SOS response. H.influenzae has a LexA-inducible operon similar to that of E.coli although lacking a sulA homolog (W.A. Sweetman, E.R. Moxon and C.D. Bayliss, manuscript in preparation), which is responsible for SOS-induced filament formation in E.coli (28). H.influenzae rnhA mutants also exhibited filamentous growth but with a slightly more severe phenotype (14 cells/filament) and a reduction in growth rate (Figure 2). This result is surprising and suggests either that an inability to shorten the RNA primers at the ends of Okazaki fragments engenders a severe block to replication of the genome or that RnaseHI is required for other functions important for growth .

    PV rates of polI-klen and rnhA mutants

    Previously, we constructed a mutation in the H.influenzae polI gene and demonstrated that this mutation elevated tetranucleotide, but not dinucleotide, repeat-mediated PV rates by 40-fold (14). This mutation involved partial deletion of the Klenow domain and insertion of an antibiotic cassette into the H.influenzae polI gene. Thus, it was unclear whether expression of the 5' portion of the PolI protein, encompassing the 5'>3' exonuclease domain, was perturbed and whether any residual expression of the PolI polymerase domain was retained. The polI-klen mutant described in this study resolves these issues by inserting artificial stop codons at the end of the exonuclease domain of polI, allowing efficient translation of this fragment, and deleting the entire Klenow domain so no residual PolI polymerase activity can be produced.

    Tetranucleotide PV rates were elevated 35-fold in RdpolIKlen indicating that loss of PolI DNA polymerization activity alone is responsible for destabilization of these repeats. The data presented herein indicates that this instability is due to a major increase in the numbers of deletions. It has been proposed that deletions could result from the following pathway: spontaneous or UvrD-mediated displacement of the 5' end of the downstream Okazaki fragments located within the repeat tract, removal of the RNA primer, re-annealing of the DNA flap adjacent to the 3' end of the upstream Okazaki fragment and ligation of the DNA ends without DNA polymerization (see Figure 4). Note that this process produces a deletion whereas insertions would result from loops forming in the displaced DNA flap and would require DNA synthesis prior to ligation (14). One unclear step was the identity of the proteins responsible for the removal of ribonucleotides at the 5' end of the downstream Okazaki fragment. We now propose that these nucleotides are removed by cleavage of the DNA flaps by the 5'>3' exonuclease activity contained in the N-terminal PolI fragment that is retained in the RdpolIKlen mutant (Figure 4). A similar elevation of deletions and frameshift mutations has been noted for E.coli polA mutants deficient in the Klenow domain (29). Nagata et al. (29) proposed that mismatch bulges in the template strand are recognized by the Klenow domain of PolI and processed by the 3'>5' exonuclease activity of this domain. However, the mechanistic basis for recognition of these mismatches was not elucidated.

    Figure 4 Models of slippage in H.influenzae repeat tracts in the absence of RnaseHI and PolI DNA polymerase activity. These diagrams depict the processing of two Okazaki fragments, terminated within a tetranucleotide repeat tract (repeats are indicated by filled oblongs), in the absence of either RnaseHI (a) or the PolI Klenow fragment (c) or in non-mutant cells (b). In non-mutant cells (b), the RNA primer (open oblongs) is rapidly removed by the action of RnaseHI (RnhI) with the gap between the fragments being polymerized by DNA Polymerase III (PolIII). The last ribonucleotide (open ovals) is removed by PolI 5'>3' exonuclease activity (PolIexo), the gap filled by PolI polymerase activity (PolIKlen) and the two fragments joined by DNA ligase (Lig). These processes occur rapidly and only rarely (indicated by thin dashed lines) are flaps generated that could lead to the formation of slippage mutations (i.e. insertion or deletions of repeats). In the absence of RnaseHI (a) removal of RNA primers occurs slowly resulting in more frequent generation of DNA flaps (and indeed generation of a flap may facilitate removal of the primer). Realignment of the DNA strands can occur during annealing of the flaps leading to loops in either the newly synthesized or template strands (resulting in insertions or deletions of repeats, respectively). For simplicity, only an insertion has been depicted in (a) but it should be noted that the numbers of both insertions and deletions are elevated in rnhA mutants. In mutants lacking the PolI Klenow fragment (c) the last ribonucleotide can be efficiently removed from the 5' end of the Okazaki fragment by the action of the 5'>3' exonuclease activity contained in the N-terminal fragment of PolI. However, the gap between the two fragments cannot be efficiently filled in the absence of PolI polymerase activity allowing for frequent juxtapositioning of the ends of the two Okazaki fragments by the formation of loops in the template strand. This permits joining of the two fragments by DNA ligase without the need for further DNA synthesis but results in frequent production of deletions.

    The destabilization of tetranucleotide repeats by mutations in polI implicated Okazaki fragment processing as having a major influence on the stability of such repeats. In order to further investigate the role of this process, a mutation was constructed in rnhA, which encodes the enzyme responsible for shortening of RNA primers during completion of lagging DNA synthesis. RdrnhA mutants exhibited a significant increase in instability of tetra- but not dinucleotide repeats as observed for RdpolI. However, the increase was only 3-fold and there was no increase in the number of deletions relative to insertions, indicating that the mutational process was mechanistically different in RdrnhA mutants. The increased PV rate of these mutants may be due to persistence of Okazaki fragments resulting in more time for the formation of DNA flaps which would then re-anneal resulting in a higher frequency of displacements of repeats in both the template and nascent DNA strands and thus a higher rate of PV (Figure 4). Alternatively, flap formation may be required for the removal of RNA primers leading to a higher frequency of displacement of repeats but with no bias towards deletions as PolI can efficiently polymerize the gaps between fragments. The absence of an effect on dinucleotide repeat tracts may be due to correction of additional mutations in these tracts by MMR.

    In summary, the Klenow fragment of PolI is a major factor controlling the stability of tetranucleotide repeats in H.influenzae while RnaseHI is a minor factor. Thus, accurate Okazaki fragment processing may be an important factor for preventing slippage in a subset of prokaryotic simple sequence repeats.

    ACKNOWLEDGEMENTS

    This work was supported by a Wellcome Trust programme grant. We would like to thank Andrew Wright and Peter Power for their careful reading of this manuscript. Funding to pay the Open Access publication charges for this article was provided by the Wellcome Trust.

    REFERENCES

    Li, X. and Marians, K.J. (2000) Two distinct triggers for cycling of the lagging strand polymerase at the replication fork J. Biol. Chem., 275, 34757–34765 .

    MacNeill, S.A. (2001) DNA replication: partners in the Okazaki two-step Curr. Biol., 11, R842–R844 .

    Kokoska, R.J., Stefanovic, L., Tran, H.T., Resnick, M.A., Gordenin, D.A., Petes, T.D. (1998) Destabilization of yeast micro- and minisatellite DNA sequences by mutations affecting a nuclease involved in Okazaki fragment processing (rad27) and DNA polymerase delta (pol3-t) Mol. Cell. Biol., 18, 2779–2788 .

    Callahan, J.L., Andrews, K.J., Zakian, V.A., Freudenreich, C.H. (2003) Mutations in yeast replication proteins that increase CAG/CTG expansions also increase repeat fragility Mol. Cell. Biol., 23, 7849–7860 .

    Schweitzer, J.K. and Livingston, D.M. (1998) Expansions of CAG repeat tracts are frequent in a yeast mutant defective in Okazaki fragment maturation Hum. Mol. Genet., 7, 69–74 .

    Ogawa, T. and Okazaki, T. (1984) Function of RNase H in DNA replication revealed by RNase H defective mutants of Escherichia coli Mol. Gen. Genet., 193, 231–237 .

    Xu, Y., Grindley, N.D., Joyce, C.M. (2000) Coordination between the polymerase and 5'-nuclease components of DNA polymerase I of Escherichia coli J. Biol. Chem., 275, 20949–20955 .

    Lehman, I.R. and Uyemura, D.G. (1976) DNA polymerase I: essential replication enzyme Science, 193, 963–969 .

    Morel, P., Reverdy, C., Michel, B., Ehrlich, S.D., Cassuto, E. (1998) The role of SOS and flap processing in microsatellite instability in Escherichia coli Proc. Natl Acad. Sci. USA, 95, 10003–10008 .

    Hood, D.W., Deadman, M.E., Jennings, M.P., Bisercic, M., Fleischmann, R.D., Venter, J.C., Moxon, E.R. (1996) DNA repeats identify novel virulence genes in Haemophilus influenzae Proc. Natl Acad. Sci. USA, 93, 11121–11125 .

    van Belkum, A., Scherer, S., van Leeuwen, W., Willemse, D., van Alphen, L., Verbrugh, H.A. (1997) Variable number of tandem repeats in clinical strains of Haemophilus influenzae Infect. Immun., 65, 5017–5027 .

    Bayliss, C.D., Field, D., Moxon, E.R. (2001) The simple sequence contingency loci of Haemophilus influenzae and Neisseria meningitidis J. Clin. Invest., 107, 657–662 .

    Hood, D.W. and Moxon, E.R. (1999) Lipopolysaccharide phase variation in Haemophilus and Neisseria In Brude, H., Opal, S.M., Vogel, S.N., Morrison, D.C. (Eds.). Endotoxin in Health and Disease, NY Marcel Dekker Inc. pp. 39–54 .

    Bayliss, C.D., van de Ven, T., Moxon, E.R. (2002) Mutations in polI but not mutSLH destabilize Haemophilus influenzae tetranucleotide repeats EMBO J., 21, 1465–1476 .

    De Bolle, X., Bayliss, C.D., Field, D., van de Ven, T., Saunders, N.J., Hood, D.W., Moxon, E.R. (2000) The length of a tetranucleotide repeat tract in Haemophilus influenzae determines the phase variation rate of a gene with homology to type III DNA methyltransferases Mol. Microbiol., 35, 211–222 .

    Fleischmann, R.D., Adams, M.D., White, O., Clayton, R.A., Kirkness, E., Kerlavage, A., Bult, C., Tomb, J., Dougherty, B., Merrick, J. (1995) Whole-genome random sequencing and assembly of Haemophilus influenzae Rd Science, 269, 496–512 .

    Danner, D.B. and Pifer, M.L. (1982) Plasmid cloning vectors resistant to ampicillin and tetracycline which can replicate in both E.coli and Haemophilus cells Gene, 18, 101–105 .

    Kraib, A., Schlor, S., Reidl, J. (1998) In vivo transposon mutagenesis in Haemophilus influenzae Appl. Environ. Microbiol., 64, 4697–4702 .

    Drake, J.W. (1991) A constant rate of spontaneous mutation in DNA-based microbes Proc. Natl Acad. Sci. USA, 88, 7160–7164 .

    Saunders, N.J., Moxon, E.R., Gravenor, M.B. (2003) Mutation rates: estimating phase variation rates when fitness differences are present and their impact on population structure Microbiology, 149, 485–495 .

    Joyce, C.M. and Grindley, N.D.F. (1984) Method for determining whether a gene of Escherichia coli is essential: application to the polA gene J. Bacteriol., 158, 636–643 .

    Bayliss, C.D., Dixon, K.M., Moxon, E.R. (2004) Simple sequence repeats (microsatellites): mutational mechanisms and contributions to bacterial pathogenesis. A meeting review FEMS Immunol. Med. Microbiol., 40, 11–19 .

    Yogev, D., Browning, G., Wise, K.S. In Razin, S. and Herrmann, R. (Eds.). Molecular Biology and Pathogenicity of Mycoplasma, (2002) NY Kluwer Academic/Plenum Publishers pp. 417–443 .

    Saunders, N.J., Peden, J., Hood, D., Moxon, E.R. (1998) Simple sequence repeats in the Helicobacter pylori genome Mol. Microbiol., 27, 1091–1098 .

    Parkhill, J., Wren, B.W., Mungall, K., Ketley, J.M., Churcher, C., Basham, D., Chillingworth, T., Davies, R.M., Feltwell, T., Holroyd, S., et al. (2000) The genome sequence of the food-borne pathogen Campylobacter jejuni reveals hypervariable sequences Nature, 403, 665–668 .

    Eisen, J.A. and Hanawalt, P.C. (1999) A phylogenomic study of DNA repair genes, proteins, and processes Mutat. Res., 435, 171–213 .

    McKenzie, G.J. and Rosenberg, S.M. (2001) Adaptive mutations, mutator DNA polymerases and genetic change strategies of pathogens Curr. Opin. Microbiol., 4, 586–594 .

    Huisman, O. and D'Ari, R. (1981) An inducible DNA replication-cell division coupling mechanism in E.coli Nature, 290, 797–799 .

    Nagata, Y., Mashimo, K., Kawata, M., Yamamoto, K. (2002) The roles of Klenow processing and flap processing activities of DNA polymerase I in chromosome instability in Escherichia coli K12 strains Genetics, 160, 13–23 .

    Bayliss, C.D., van de Ven, T., Moxon, E.R. (2002) Corrigenda. Mutations in polI but not mutSLH destabilize Haemophilus influenzae tetranucleotide repeats EMBO J, 21, 4391 .(Christopher D. Bayliss*, Wendy A. Sweetm)