Previous Article | Next Article ![]()
Infection and Immunity, April 2005, p. 1927-1938, Vol. 73, No. 4
0019-9567/05/$08.00+0 doi:10.1128/IAI.73.4.1927-1938.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Andrew P. Tomaras,
Eric R. Rhodes, and
Luis A. Actis*
Department of Microbiology, Miami University, Oxford, Ohio
Received 1 July 2004/ Returned for modification 21 September 2004/ Accepted 22 November 2004
|
|
|---|
|
|
|---|
About 20 years after this initial outbreak, the molecular mechanisms responsible for the massive vascular damage seen in infected patients have not been elucidated. Progress in this field has been slow, mainly due to the lack of and difficulties in developing appropriate genetic systems that could allow the identification and characterization of potential bacterial virulence factors involved in the pathogenesis of this disease. With this in mind, the first approach used in our laboratory to identify and study genes unique to the BPF strain F3031 has been a genome subtraction hybridization procedure utilizing total DNA from the invasive F3031 strain and the noninvasive biogroup aegyptius strain F1947 (40). These are two related strains isolated in Brazil that belong to the same genus and species but differ in their rRNA gene restriction and sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) protein patterns, as well as their restriction pattern of the 24-MDa plasmid that is found in both of them (7). The genome subtraction hybridization approach yielded small DNA fragments that are specific to the BPF prototype strain F3031, several of which appear to have been acquired from unrelated microbial sources and encode unknown functions. Similar results were obtained by Li et al. (23) when they compared the BPF strain F3028 with the conjunctivitis strain F3043.
Complete and partial phages have been described for different strains of H. influenzae (2, 10, 19, 38, 52). It is known that two Mu-like phages are found in the sequenced strain of H. influenzae Rd, which are referred to as the prophage FluMu (29) and the cryptic prophage
flu (19). The prophage FluMu contains most of the genes present in Mu and is found integrated within a gene encoding molybdenum transport functions. It also contains genes encoding products involved in transposition, has only one proposed tail fiber structural protein, and contains no invertible gene segments (29). In addition to FluMu, H. influenzae has two other temperate phages called HP1 (10) and HP2 (52), of which HP1 was the first phage identified for H. influenzae Rd (10, 17). The HP1 genome has 41 genes that encode typical structural proteins such as a tail sheath, tail fibers, a tail collar, and it also encodes proteins involved in integration and excision of the viral genome from that of the bacterial host (10). The recently described HP2 genome is related to HP1 with the important caveat that it infects nontypeable Haemophilus strains. Importantly, comparisons between Haemophilus phages suggest that extensive recombination events have occurred within this group of phages and also between the phages and resident host chromosomes (52). Some of these events within genes encoding tail fiber proteins have been proposed to explain differences in host specificities (39, 52).
Recently, Chang et al. (8) have described a genetic island that is specific to the H. influenzae type b serotype and contains a mosaic collection of genes, some of which appear to be of phage origin, whereas others have homologs to genes in other bacteria. Although the contribution of this genetic island to virulence is not known, the propensity of finding this 16-kb region in type b strains, coupled with the finding that numerous characteristics for pathogenicity islands are contained in this region, suggests that it may play a role in virulence (8). In addition to this island, other type b-specific genetic islands, in relationship to H. influenzae Rd KW20, have been described (3). Some of these islands contain genes whose products play a known role in virulence, capsular and fimbrial gene clusters, whereas the role in virulence of other gene products contained in these islands remains cryptic, such as tail fiber proteins and outer membrane proteins of undefined function (3). We describe here a genomic region in the BPF clone F3031 of biogroup aegyptius that is not found in its noninvasive counterpart F1947. This 34-kb region, which was found in other BPF isolates but not in non-BPF biogroup aegyptius strains and nontypeable or typeable H. influenzae strains, with the exception of a region present in the type b Eagan strain, is similar to the genetic island 1 of the type b strains of H. influenzae in that it contains numerous genes encoding either putative phage or bacteriocin functions and hypothetical proteins with unknown functions.
|
|
|---|
(Gibco-BRL, Gaithersburg, Md.) and EPI300 (Epicentre, Madison, Wis.), which were used as hosts in routine molecular biology experiments, were cultured under standard conditions in Luria-Bertani medium (37) containing the appropriate antibiotics. |
View this table: [in a new window] |
TABLE 1. Bacterial strains and plasmids used in this study
|
-32P]dCTP (Perkin-Elmer, Boston, Mass.) (11). The radioactive bands were detected either with X-ray film or a Storm 860 scanner (Molecular Dynamics, Piscataway, N.J.).
![]() View larger version (7K): [in a new window] |
FIG. 1. Diagram of DNA fragments used in cloning and analysis of the genomic island. Fragments of F3031 DNA that were cloned in vectors are indicated by pMU numbers. PCR#1 (9,519 bp) and PCR#2 (8,885 bp) were not cloned but were used as probes in Southern blot hybridization analyses. BPF33 is a BPF locus that has been described (40). The length of the thick black lines show the sizes of the F3031 fragments used in experiments as indicated by the size scale, in kilobases, located on top. The stippled line at the bottom shows the location of the genomic island.
|
45-kb that encompasses the genomic island found in the F3031 BPF strain. The construct pMU363 was derived from pMU263 by digesting this plasmid with BamHI, gel isolating an
7-kb fragment, and subsequently ligating the purified fragment into BamHI-digested pCC1FOS. Transcriptional analysis of gene expression. Expression of some genes located within the F3031 genomic island was evaluated by reverse transcription-PCR (RT-PCR) analysis with total RNA isolated as described previously (53) from bacteria grown in 35 ml of sBHI to an optical density at 600 nm of 0.7. The RNA samples were treated with RNase-free DNase I (Roche, Indianapolis, Ind.), and the presence of selected transcripts was detected with a RT-PCR commercial kit (Qiagen) under the conditions suggested by the supplier. The primers used in the amplification reactions are listed in Table 2. The cDNA products were analyzed by agarose gel electrophoresis. PCR of total RNA with Pfu DNA polymerase without reverse transcription was used to test for DNA contamination of RNA samples.
|
View this table: [in a new window] |
TABLE 2. Oligonucleotide primers used in RT-PCR experiments
|
Nucleotide sequence accession number. The nucleotide sequence of the biogroup aegyptius F3031 genomic island described in the present study was deposited in GenBank under accession number AY647244.
|
|
|---|
5-kb gap existed between the fragments cloned in these two constructs. In addition, the DNA sequence of the insert in pMU69 could be assembled with BPF33 (Fig. 1), a BPF-specific locus we described before (40).
Attempts to isolate high-copy-number genomic clones harboring the
5-kb gap between the inserts of pMU69 and pMU71 (Fig. 1) or to clone it by PCR amplification in vectors such as pCR-BluntII-TOPO (Invitrogen) failed to produce the appropriate derivatives. However, the screening of a fosmid library made by cloning large DNA fragments in pCC1FOS yielded the derivative pMU263, which includes the fragments cloned in both pUC18-based genomic clones and the previously described BPF33 locus (Fig. 1). Sequencing and nucleotide analysis of the 42-kb insert showed that carotovoricin Er-related genes are part of a 34,252-bp fragment that disrupts a homolog of the H. influenzae Rd HI1706 gene (Fig. 2), which is predicted to encode a high-affinity choline transport protein (13). The last 1,720 bp and the first 240 bp of the F3031 HI1706 homolog flank the 5' and 3' ends of the disrupting fragment, respectively. Based on similarity, the entire HI1706 homolog is not represented in pMU263. The sequence 5'-TTTAAAATGAGAA-3' (IRAL and IRAR) is found repeated twice in inverted orientation, with one copy located 17 nucleotides downstream of the interruption site of the 5' fragment of the HI1706 homolog and the other copy found 321 nucleotides upstream of the interruption site of the 3' fragment of this homolog (Fig. 2).
![]() View larger version (14K): [in a new window] |
FIG. 2. Graphical representation of the F3031 genomic island and surrounding regions. Horizontal arrows represent ORFs with the direction of transcription shown by the arrows orientation. The black rectangle and arrow represent the 5' and 3' fragments of HI1706, respectively, that were produced by the insertion of a genomic island. ORFs contained within the island are numbered, and those outside the island are indicated with HI locus numbers according to their homologs in H. influenzae Rd KW20. The stippled bars show the subtracted fragments that were previously isolated (40), and the black boxes show subtracted fragments isolated by Li et al. (23). Vertical arrows designate uptake signal sequence sites. The solid and open double-headed arrows highlight the inverted repeats IRA and IRB, respectively. Half-site copies of the inverted repeats are designated as left (L) or right (R). The numbers in the size scales are kilobases.
|
Based on the analysis of the putative products encoded by these genes (Table 3), this seems to be a genomic island that can be functionally divided into three regions in which the first initiates immediately after the 3' end of the 5' fragment of HI1706 and continues through ORF 15. The second region encompasses ORF 16 through ORF 28, and the third region extends from ORF 29 to the 5' end of the 3' fragment of the interrupted HI1706 homolog.
|
View this table: [in a new window] |
TABLE 3. Description of ORFs contained in the BPF genomic island
|
The product of ORF 8 is related to a hypothetical protein of the Y. enterocolitica temperate bacteriophage PY54 (20), which is significantly related to the putative products of genes found in the lambdoid phage 434 (1) and the enterobacteria phage HK022 (21). The E16 Mu-like protein encoded by the HD0110 gene of H. ducreyi showed the highest similarity to the predicted product of ORF 10, which is also related to the FluMu protein gp16 (HI1488) of H. influenzae Rd (13). Between ORF 8 and ORF 10 (Fig. 2), there is a 118-amino-acid coding sequence that is related to the H. ducreyi proteins HD0495 and HD0960. ORF 11 codes for a product that is related to several putative bacteriophage transcriptional regulators, with the highest match to the ß-proteobacterium Chromobacterium violaceum CV2165 protein (4). Significant homology was also detected with the DNA-binding protein RdgB of the plant pathogen Erwinia carotovora subsp. carotovora Er, which together with RdgA regulates the production of pectin lyase in response to DNA-damaging agents (24). This regulator is also known to affect expression of the phage tail-like bacteriocin carotovoricin Er in a temperature- and recA-dependent manner (32). The predicted product of ORF 12 did not show any significant match in the GenBank database. The ORF 13 codes for a predicted protein that is highly related to the products of the HD0112 and HI1494 genes found in the genomes of H. ducreyi and H. influenzae, respectively. These putative Haemophilus proteins show strong similarity to the phage T3 and T7 N-acetylmuramoyl-L-alanine amidases that are involved in hydrolysis of N-acetylmuramoyl residues and certain L-amino acid residues in the cell wall of bacteria (29). ORFs 14 and 15 code for predicted proteins with the highest similarity to the H. ducreyi HD0114 conserved hypothetical protein (genome accession reference NC_002940) and the H. influenzae HI1496 hypothetical protein (13), respectively.
Analysis of region II. After the region containing the mosaic FluMu-like genes comes a series of 13 ORFs where the similarity shifts from predominantly Haemophilus-like sequences to genes whose predicted encoded products are similar to proteins found in Salmonella enterica subsp. enterica serovar Typhi CT18 (35), the soil- and waterborne bacterium C. violaceum ATCC 12472 (4), P. luminescens subsp. laumondii TT01 (P. luminescens TT01) (9), and E. carotovora subsp. carotovora (accession reference AB017338) (Fig. 2 and 3). Starting in this region, the genes found in F3031 and the above-mentioned strains are highly similar both in sequence and organization. The ORF 16 product is similar to a membrane protein in serovar Typhi CT18 (STY1612), but it also has significant similarity to the product encoded by a P. luminescens hypothetical gene (Plu3445), and both hits occur in genomic regions containing multiple hypothetical and Mu-like genes (Fig. 3). Proteins encoded by ORFs 17 and 18 have similarities to the products of two genes in the same region of the serovar Typhi CT18 (STY1613 and STY1614) and Photorhabdus (Plu3444 and Plu3443) genomes. Although the function of these ORFs is not known, Plu3443 does have similarity to gp27 of Mu, the small subunit of a putative terminase (29). ORF 19 has the highest G+C content in this region and the highest similarity to the product of the Deinococcus radiodurans R1 ORF DRA0094 (50), which has recently been described as a component of the Mu-like RadMu prophage most likely encoding the large subunit of the terminase (29).
![]() View larger version (30K): [in a new window] |
FIG. 3. Graphical representation of loci with synteny to the F3031 genomic island. Loci from several organisms are depicted with individual arrows representing ORFs with direction of transcription shown. ORFs are drawn to scale within organism sections but are not drawn to scale between organisms. ORFs contained within the genomic island in F3031 are numbered, and those outside the island are shown as an open red arrow and rectangle (split HI1706 ORF) or open light blue arrows. ORFs in other bacteria are numbered according to conventions set by finished sequences and are only numbered every five ORFs for clarity. ORFs are not always numbered serially in finished sequences as illustrated by the absence of HD0125 in the H. ducreyi annotation. Diamonds in the S. enterica subsp. enterica Typhi CT18 locus show a series of invertible sequence repeats. Colors and shades of ORFs designate similar sequences among organisms. Numbers in the size scales are kilobases.
|
Analysis of region III in the genomic island. This region consists of ORFs 29 to 44, and almost all of them encode functions putatively related to the production of a bacteriocin (Table 3). Quite surprisingly, proteins with similarity to all of the region III products are highly conserved in serovar Typhi CT18 and in P. luminescens with the exception of ORF 34. ORFs 31 and 34 are not annotated in the E. carotovora Er sequence either. In C. violaceum, the order of genes is conserved from ORF 31 to ORF 36 and then the sequence diverges (Fig. 3).
The products of ORFs 29 and 30 have significant similarity to the tail sheath and core structural components, respectively, of a phage tail-like bacteriocin called carotovoricin Er from E. carotovora subsp. carotovora Er, which kills other Erwinia strains through an undefined mechanism (31, 33). The carotovoricin sheath is a contractile structure that surrounds the stable tail core and, interestingly, these two ORFs are represented partially in the subtracted fragment E3 (Fig. 2) (40). The product of ORF 31 is the only one in the genomic island that has similarity to the product of a P. aeruginosa gene, PAO624 (44), although ORF 31 is 57-bp shorter than PAO624 and the similarity is almost below the cutoff for significance, as is ORF 31's similarity to the ATP synthase alpha subunit. ORF 32 is the largest in the genomic island at 2,592 bp, and the putative protein product is similar to the largest component in the E. carotovora bacteriocin, ORF 5 (accession reference AB017338), and the P. luminescens ORF Plu3429 (Fig. 3). Despite having such a large coding region and two predicted transmembrane domains, a function could not be attributed to this ORF even with extensive in silico analyses. The product of ORF 33 is similar to protein products that are described as hypothetical, such as serovar Typhi CT18 STY1630. ORF 34 is the first of three ORFs in region III that have similarity at the protein level to bacteriophage tail fiber products. The translated products of both ORF 34 and ORF 39 have the most similarity to serovar Typhi CT18 encoded tail fibers, whereas the putative protein encoded by ORF 35 is related to predicted bacteriophage regulatory proteins, with the highest similarity to the product of CV2118 of C. violaceum ATCC 12472. Interestingly, the second highest match is the product of the BcepMu47 found in the Burkholderia cenocepacia phage BcepMu, which appears to contain genes encoding potential virulence factors (GenBank accession number YP_024720).
The first analysis showed that the protein encoded by ORF 40 has the most similarity to the resident H. influenzae Rd lysogenic phage HP1 tail fiber ORF 31 (10). However, upon closer examination it was noticed that the putative tail fiber encoded by ORF 40 was hybrid in nature. The N terminus has highest similarity to the protein product of the serovar Typhi CT18 gene STY1637, and the C terminus shows similarity to the encoded product from ORF 31 of the bacteriophage HP1 (Fig. 4A). Even though the average G+C content of this coding region is 39.2%, its 5' and 3' regions have values of 41 and 38.6% (Fig. 4B), respectively, giving evidence for the potential chimeric structure of this predicted gene. In between the genes encoding tail fibers are three ORFs (ORF 36 to ORF 38) that putatively encode phage baseplate proteins. ORF 38 has the highest G+C content in this region III of the genomic island, suggesting that it was acquired horizontally from an unrelated source. It should be noted that all three ORFs code for products with similarity to products encoded by P. luminescens TT01, serovar Typhi CT18, and E. carotovora (Fig. 3). It would seem from the sequence similarity that all of the structural components needed for the bacteriocin could be encoded from the already presented ORFs. Based on the organization in the E. carotovora carotovoricin Er locus (accession reference AB017338), expression of ORFs 11, 29, 30, 32, 33, 35, 36, 37, 38, 39, and 40 in the genomic island would be sufficient for expression of a carotovoricin-like bacteriocin. This conclusion is also supported by the drastic drop-off in synteny seen between the remaining ORFs in the genomic island and the serovar Typhi CT18 (35), C. violaceum (4), and E. carotovora loci (accession reference AB017338) (Fig. 3). Still, ORF 41 has the potential to encode another virus-like component since it has similarity to the product of ORF 32 from phage HP2 (52), which is thought to encode a tail collar protein. This HP2 phage is known to infect nontypeable H. influenzae strains, unlike HP1 that has only been described to infect H. influenzae Rd strains (52). This ORF also has similarity to the encoded product Plu3421 from P. luminescens. ORF42 differs from the other components in this region since it codes for a protein product with similarity to the H. influenzae R2866 enoyl-coenzyme A hydratase/carnithine racemase (accession reference ZP_00156541) and also has similarity, although to a lesser extent, to H. somnus ORF113401 (accession reference ZP_00132835), for which no function has been defined. The Mu-like prophage protein Com has similarity to the product of ORF 43, which has the lowest G+C content in region III at 34.2%. Com has been described previously as a regulator to the Mu gene mom, which is involved in DNA modification of adenine bases (29), although no such mom homolog is present in the F3031 genomic island. The final ORF in the genomic island resumes similarity to Haemophilus genes with matches to the H. influenzae Rd KW20 protein from HI1523 (13) and the protein produced from the H. ducreyi ORF HD0156 (accession reference NC_002940), whose functions have not been described.
![]() View larger version (19K): [in a new window] |
FIG. 4. Illustration of the chimeric structure of the genomic island ORF 40. (A) ORF 40 is graphically depicted by the large arrow with direction of transcription shown. Portions of the encoded product from this ORF that have similarity to proteins from serovar Typhi CT18 and H. influenzae HP1 phage are shown as lines. The region in ORF 40 that shares similarity to both proteins is enlarged to show the amino acid sequence for the ORF 40 product. Amino acids underscored with a double-headed arrow have similar sequences in the serovar Typhi CT18 ORF STY1637, portions that are similar to HP1 ORF 31 are shown as underscored hashmarks, and the region in ORF40 that is common to both proteins is underlined with a solid line. (B) The G+C content from the F3031 ORF 40 is shown along the entire length of the gene with a window of 100 nucleotides. The minimum, maximum, and mean of the G+C content are shown on the right side. The bars on the bottom illustrates, to scale, the ORF 40 with the separate regions of homology with the encoded products of serovar Typhi CT18 STY1637 and HP1 ORF 31 depicted as in panel A. The individual G+C percent values for the separate domains are also indicated. The open arrows indicate the location of the primers 40-5'S, 40-5'H, and 40-3' used in the RT-PCRs.
|
![]() View larger version (57K): [in a new window] |
FIG. 5. Prevalence of genomic island homologous sequences in the invasive and noninvasive isolates. Total DNA from F3031 (lanes labeled 31) and F1947 (lanes labeled 47) was digested with either EcoRI (E) or RsaI (R) and blotted to nitrocellulose after agarose gel electrophoresis. Lambda DNA ( ) cut with HindIII was used as a size marker. The probe used to test each blot is indicated at the bottom of each panel. All blots were also probed with radiolabeled DNA.
|
Expression analysis of selected ORFs within the F3031 genomic island. The transcriptional expression was verified for some of the ORFs in the genomic island by using RT-PCR. Figure 6 shows that all tested ORFs were expressed in F3031 cells cultured under routine conditions, although not all of them to the same levels. The transcription products of ORF 21, ORF 24, ORF 26, ORF 30, and ORF 36 to ORF 40 are readily detectable. In contrast, the transcription product of ORF 29, which is predicted to code for a protein similar to the E. carotovora tail sheath protein (Table 3), is the weakest. This is an interesting observation because Nguyen and others have recently reported that expression of the E. carotovora gene encoding the tail sheath is minimal at 37°C when treated with mitomycin C but increases in expression at lower temperatures (32). This expression is regulated, in part, by the homolog of rdgB, which has a counterpart in the F3031 genomic island in ORF 11. We have attempted purification of these phage-tail-like particles from F3031 cultures grown at 37°C after UV treatment, and we have failed to see these carotovoricin-like particles in electron microscope preparations. Also, we have never detected zones of killing when testing for bacteriocin activity in the usual agar plate diffusion assays when incubated overnight at 37°C (33). The lack of activity at higher temperatures has also been seen with the Yersinia enterocolitica bacteriocin enterocoliticin and, therefore, may be a general feature of this class of bacteriocins (45). We are currently investigating expression of the F3031 carotovoricin Er-like bacteriocin at lower temperatures.
![]() View larger version (57K): [in a new window] |
FIG. 6. RT-PCR of selected ORFs from the genomic island in F3031. Lambda DNA digested with HindIII ( ) was used as a size marker. Transcripts from each ORF, identified by the numbers on top of each lane, were reverse transcribed and PCR amplified with the primers listed in Table 1. The amplicons shown in the lanes labeled 40-5'S and 40-5'H were obtained by using the primer 40-3' in combination with 40-5'S and 40-5'H (Table 1 and Fig. 4B), respectively. The arrows identify the weak bands corresponding to the predicted sizes according to the sequence analysis of each ORF. The images of the ethidium bromide-stained agarose gels were inverted with Photoshop for more clarity of the images.
|
Further comparison of the F3031 genomic island with other bacterial loci. The conservation between this F3031 region and the regions in serovar Typhi CT18, P. luminescens TT01, H. ducreyi 35000HP, E. carotovora, and C. violaceum ATCC 12472 is striking (Fig. 3), considering that these bacteria are found in different ecological niches. The shared region between F3031 and S. enterica serovar Typhi CT18 is the most impressive in regards to conservation (Fig. 3). The F3031 genomic island-like sequence in serovar Typhi CT18 covers the region from STY1601 to STY1637 (35). The serovar Typhi CT18 region from STY1638 to STY1640, which flanks the genomic island in this bacterium, contains numerous invertible sequences, suggesting that the entire or part of the island could be inverted. This possibility is supported by the observation that the predicted product from STY1604 has similarity to an integrase, whereas the predicted protein produced from STY1643 is most similar to an invertase/recombinase. In contrast, no such invertible sequences and invertase/recombinase genes were found in the regions flanking either ends of the F3031 genomic island, suggesting that this genetic element could be more stable compared to that of serovar Typhi CT18. When we compared the S. enterica subsp. enterica serovar Typhi Ty2 (S. enterica Ty2) and CT18 genomes, in relation to the described STY locus, we found that the strains only differed in a single ORF, STY1641. However, comparison of the CT18 and serovar Typhimurium LT2 genomes (24) showed that the entire locus from STY1600 to STY1643 was unique to the enteropathogens serovar Typhi CT18 and S. enterica Ty2, an observation that suggests that functions encoded within this genomic region may play a role in disease (36). All of these facts suggest that some of the genes located within the F3031 genomic island may play a role in the unusual virulence attributes of the F3031 BPF isolate of biogroup aegyptius.
Another interesting observation is that a gene with similarity, at the protein level, to arginine/ornithine antiporters is located at STY1645 and a two-component response regulator is present at STY1646 and STY1647 (35). A similar arrangement is found in the area adjacent to the 3' end of the F3031 genomic island, where two ORFs (HI1707 and HI1708) encoding potential two-component regulators were mapped after the 3' fragment of the interrupted choline transport homolog HI1706 (Fig. 3). This would suggest that transporter-encoding genes, particularly those next to two-component regulatory genes, are targets for the insertion of genomic island elements. This possibility is supported by the observation that the C. violaceum genomic region from CV2115 to CV2165, which has sequences similar to F3031 genomic island sequences (Fig. 3), is next to an ORF (CV2167) with an encoded product similar to a sodium/malate symporter (4).
The production of a bacteriocin in Haemophilus is not novel (30). It has been known for some time that only type b strains can express haemocin, although the importance of this strict relationship is not known. However, no phage tail-like bacteriocin similar to that found in the plant pathogen E. carotovora has been described previously for a Haemophilus species, and it is interesting that the genome of biogroup aegyptius F3031 contains BPF-specific genes predicted to encode this type of bacteriocin. Equally interesting is the observation that a noncontiguous region similar to the F3031 genomic island, which covers from Plu3421 to Plu3490, is found in P. luminescens TT01. This is a symbiont of nematodes that acts as a vector for transmission to certain insects, which has the highest number of proposed toxin genes of any sequenced bacterium to date (9, 12). Upon infection of an insect, bacteria quickly overwhelm and kill the insect, presumably through the expression of a combination of virulence factors, including the production of a R-type bacteriocin, photorhabdicin, which was seen when cells were grown at 28°C (14). Analogous structures in P. luminescens W14 has shown killing of other Photorhabdus strains, as well as of E. coli (12). Plu3461 encodes a pectin lyase-like regulator and Plu3414 contains an endolysin- and lysozyme-like ORF, which is analogous to the predicted N-acetylmuramoyl-L-alanine amidase encoded by the ORF 13 of F3031 genomic island. The protein product of this ORF in Photorhabdus is presumably involved in release of the bacteriocin into the external environment (12). Based on these observations, it is tempting to speculate that F3031 may encode a phage-like bacteriocin that remains to be detected and characterized. The finding that ORF 40, predicted to encode a phage tail protein, has a hybrid structure, including a domain with similarity to a tail gene from HP1, helps to account for the evolutionary significance of the F3031 region harboring bacteriocin-like genes. Bacteriocins are usually specific for killing strains that are closely related to the producing bacterium, and the importance of this BPF region would be diminished if we could not explain how this bacteriocin could affect more closely related strains other than Erwinia- or Photorhabdus-like bacteria. Instead, this fusion could ensure that killing activity can be directed against other Haemophilus species via the HP1-like tail gene.
Another startling finding in this F3031-specific genomic island is the number of subtracted fragments that map to this location. In our previous work, we isolated 13 fragments specific to the BPF strain F3031 (40), 8 of which are located in this genomic island. In addition, another subtraction procedure done by Li and others (23), who performed a subtraction procedure similar to ours but with different strains, has three of their fragments map to this location in close proximity to each other. It is thus evident from two studies that these subtraction procedures did not merely produce short pieces of DNA found scattered in the genome with no relationship to each other but rather signify a region to be investigated more closely. Many of the ORFs in this region have no previously described function and therefore could be important in explaining what makes the F3031 strain so virulent compared to the F1947 strain. The fact that all of the genes tested in this region have an mRNA product substantiates that these genes need to be investigated further, especially now that it is known that they are genetically linked.
In summary, the present study describes the largest genomic island found to date in a Haemophilus strain, a finding that indicates that members of this genus can acquire large DNA fragments from unrelated sources as it has been described for other human pathogens. Our data also support the hypothesis that the integration of FluMu- and HP1-like phages played a role in the integration of foreign DNA within a gene coding for transport functions. However, it seems that at least one additional process reshaped this genomic region of the BPF strains. The conservation of sequences from region II and III in the genomic island to similar regions in serovar Typhi and P. luminescens supports this hypothesis. Based on the composition of region III, it is also possible to speculate that a phage-like DNA-containing bacteriocin, similar to that produced by P. aeruginosa (22), could have been involved in the reshaping process. The fact that this island also has extensive conservation with regions found in the genome of the soil bacterium C. violaceum and the insect pathogen P. luminescens suggests that the Brazilian F3031 isolate has acquired DNA material from unrelated sources in the environment, a process that has not been described for any Haemophilus species. Clearly, the role that horizontal gene transfer has played in the evolution Haemophilus is substantial and is only starting to be appreciated.
We thank D. J. Brenner (CDC) and A. Lesse (VAMC, Buffalo, N.Y.) for providing the biogroup aegyptius isolates. We thank L. Bakaletz (Ohio State University) for the nontypeable H. influenzae otitis isolates and S. H. Goodgal (University of Pennsylvania) for providing H. influenzae strain Eagan. We thank M. Fields (Miami University) for providing the primers D1F and 1540R. We are grateful to C. Wood, Miami University Center of Bioinformatics and Functional Genomics, for support and assistance with automated DNA sequencing and nucleotide sequence analysis.
Present address: Columbus Children's Research Institute, Department of Pediatrics, Ohio State University College of Medicine and Public Health, Columbus, Ohio. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»