ISCce1andISCce2TwoNovelInsertionSequen

ISCce1 and ISCce2 Two Novel Insertion Sequences in Clostridium cellulolyticum

http://www.100md.com 《细菌学杂志》2003年第3期

     Laboratoire de Bioénergétique et Ingénierie des Protéines, UPR 9036-CNRS, 13402 Marseille Cedex 20,¹ Université de Provence, 13331 Marseille Cedex 03, France²3s, http://www.100md.com

    Received 29 July 2002/ Accepted 4 November 20023s, http://www.100md.com

    ABSTRACT3s, http://www.100md.com

    Two new insertion sequences, ISCce1 and ISCce2, were found tobe inserted into the cipC gene of spontaneous mutants of Clostridiumcellulolyticum. In these insertional mutants, the cipC genewas disrupted either by ISCce1 alone or by both ISCce1 and ISCce2.ISCce1 is 1,292 bp long and has one open reading frame. Theopen reading frame encodes a putative 348-amino-acid proteinwith significant levels of identity with putative proteins havingunknown functions and with some transposases belonging to theIS481 and IS3 families. Imperfect 23-bp inverted repeats werefound near the extremities of ISCce1. ISCce2 is 1,359 bp long,carries one open reading frame, and has imperfect 35-bp invertedrepeats at its termini. The open reading frame encodes a putative398-amino-acid protein. This protein shows significant levelsof identity with transposases belonging to the IS256 family.Upon transposition, both ISCce1 and ISCce2 generate 8-bp directrepeats of the target sequence, but no consensus sequences couldbe identified at either insertion site. ISCce1 is copied atleast 20 times in the genome, as assessed by Southern blot analysis.ISCce2 was found to be mostly inserted into ISCce1. In addition,as neither of the elements was detected in seven other Clostridiumspecies, we concluded that they may be specific to the C. cellulolyticumstrain used.

    INTRODUCTIONyee, 百拇医药

    Insertion sequences (ISs) are small mobile genetic elementsthat are between 0.7 and 3.5 kb long and are found in the genomesof numerous bacteria. They contain only genes involved in theirtransposition. They usually have inverted repeats (IRs) at theirtermini and duplicate a sequence consisting of several basepairs at the target site upon transposition. Based on the homologybetween their transposase sequences and common structural features,these sequences have been classified in various IS families(for reviews see references 8 and 32). Insertion of an IS elementcan cause gene disruption or activation due to the creationor insertion of upstream promoters, and this contributes significantlyto the plasticity of the host cell genome. Mobile elements arecommonly associated with the virulence functions of many pathogens,such as Escherichia coli (11), Vibrio cholerae (43), Yersiniapestis (15), and Clostridium perfringens (6). IS elements arefrequently used as markers in restriction fragment length polymorphismstudies for epidemiological purposes, like those performed withSalmonella enterica serovar Typhimurium (IS200) (42) and Mycobacteriumtuberculosis (IS6110) (27). In addition, these sequences arevaluable tools for identifying relevant genes and the functionsin which they are involved.

    Only a few IS elements have been described so far in clostridia;four have been reported in Clostridium perfringens (6), onehas been reported in Clostridium beijerinckii NCIMB 8052 (30),and one has been reported in the cellulolytic bacterium Clostridiumthermocellum (39). Clostridium cellulolyticum is a mesophilicanaerobic cellulolytic bacterium which secretes enzymatic complexescalled cellulosomes (5, 17). These complexes are composed ofseveral enzymes, most of which are cellulases (Cel proteins);these enzymes are anchored to a large scaffolding protein (160kDa) that lacks catalytic activity, designated CipC (17, 36).Many of the cel genes form a large cluster spanning 24 kb beginningwith the cipC gene (3, 38). Functional studies of the cellulosomeshave been restricted so far to biochemical studies of recombinantsubunits overproduced in E. coli (4, 16, 19, 38). Gene transfertechniques were recently developed for C. cellulolyticum (25,44) and were used to modify its fermentation pathways (22).However, no description of a mutagenic system allowing randomor targeted mutagenesis has been described so far for this bacterium.Naturally occurring ISs would therefore be valuable tools fordeveloping a transposon-based mutagenesis system.

    In this paper, we describe two different IS elements which werefound in the cipC gene of various isolated clones of C. cellulolyticumATCC 35319. The features of these sequences, which are designatedISCce1 and ISCce2, are described below, and their membershipin various IS families is discussed.?[, 百拇医药

    MATERIALS AND METHODS?[, 百拇医药

    Bacterial strains, plasmids, media, and growth conditions. lists all the bacterial strains and plasmids used inthis study. E. coli DH5 was used as the recipient strain forthe recombinant plasmids (derivatives of pUC18, pUC19, or pGEM-T-Easy).It was grown at 37°C in Luria-Bertani medium supplementedwith ampicillin (100 µg/ml) (23).?[, 百拇医药

    fig.ommitted?[, 百拇医药

    T Bacterial strains and plasmids?[, 百拇医药

    C. cellulolyticum ATCC 35319 and mutant strains cipCMut1 andcipCMut2 were grown anaerobically at 32°C on basal medium(20) supplemented with either cellobiose (2 g/liter; Sigma-Aldrich)or MN300 cellulose (5 g/liter; Serva) as the carbon and energysource. Colonies of the mutant strains were isolated on solidmedium (basal medium supplemented with 2 g of cellobiose perliter and 15 g of agar per liter) under the anaerobic atmospherein a glove box (N₂-H₂, 95:5 [vol/vol]). Plates were incubatedin anaerobic jars under 2 x 10⁵ Pa of an N₂-CO₂ atmosphere (80:20,vol/vol).

    The other Clostridium strains were grown as previously described(10, 24, 26, 29, 31, 41).4q, 百拇医药

    DNA manipulations. Chromosomal DNA was obtained from the various Clostridium strainsby using a genomic DNA purification kit (Promega). DNA fromClostridium cellulovorans was a generous gift from R. H. Doi(University of California, Davis). Large-scale plasmid purificationfrom E. coli and small-scale plasmid purification from E. coliwere performed by using kits from Qiagen and Promega. Restrictionenzymes and DNA-modifying enzymes were purchased from Promegaand Roche Applied Science and were used as recommended by themanufacturers. DNA sequencing was performed by Genome Express(Grenoble, France).4q, 百拇医药

    Primers and probes. Primers were purchased from MWGAG-Biotech (Courtaboeuf, France). Primers c1 and c2 were used to amplify sequencesthat disrupt the cipC gene in the cipCMut1 and the cipCMut2strains. Primers A, B, C, D, E, F, G, and H were used in inversePCR experiments to analyze insertion sites of the two IS elements. The various primers were also used forsequencing ISCce1 and ISCce2.

    fig.ommitted&|, 百拇医药

    Primer sequences&|, 百拇医药

    fig.ommitted&|, 百拇医药

    Maps of the cipC gene in the wild-type strain (A) and mutant strains cipCMut2 (B) and cipCMut1 (C). orf1 (solid box) and orf2 (gray box) encode the putative transposases of ISCce1 and ISCce2, respectively. The vertical boxes represent insertion sites of ISCce1 in the cipC gene (cross-hatched box) and of ISCce2 in ISCce1 (solid box). The positions of primers A, B, C, D, E, F, G, H, c1, and c2 are indicated by arrows. Probe 2 and probe 3 are internal probes of ISCce1 and ISCce2, respectively. Restriction sites: EV, EcoRV; P, PstI; N, NdeI; HIII, HindIII; EI, EcoRI.&|, 百拇医药

    Probe 1 was obtained by PCR performed with primers M13-upwardand M13-downward by using the pS29 plasmid as thetemplate . Primers G and B and primers D and E wereused to synthesize probes 2 and 3, respectively. The cipC probewas synthesized by using primers c1 and c2 with wild-type DNA.&|, 百拇医药

    fig.ommitted&|, 百拇医药

    Discovery of insertion elements in C. cellulolyticum. (A) Map of the cipC gene disrupted by an insertion element (shaded box). The encoded domains are indicated above the gene (SS, signal sequence; CBM3, carbohydrate binding module of family 3; X2, unknown function module of family 2; C1 to C8, cohesin modules). (B) Southern blot analysis of PvuII-digested genomic DNA from various strains. The blot was probed with PCR digoxigenin-labeled probe 1 (part 1) and with the cipC probe (part 2). Lane WT, wild type; lane 1, cipCMut1; lane 2, cipCMut2. Sizes (in kilobase pairs) are indicated on the left.

    Southern blot analysis. DNAs from Clostridium strains were cut with PvuII or EcoRI andseparated by electrophoresis in a 0.7% agarose gel. DNA fragmentswere transferred by Southern blotting onto a nylon membrane(Roche Applied Science) and hybridized to the PCR-generateddigoxigenin-labeled probe at 68°C (in the case of C. cellulolyticumDNA) and at 68 or 55°C (in the case of heterologous ClostridiumDNAs). Targets were detected by chemiluminescence by using aDIG luminescent detection kit (Roche Applied Science). The probewas removed after each experiment by incubating the blot twicefor 20 min in a 0.2 M NaOH-0.1% sodium dodecyl sulfate solutionat 37°C in order to hybridize one blot successively withmany probes.j, 百拇医药

    Inverse PCR. DNA sequences flanking the IS elements in the genome of C. cellulolyticumwere amplified by inverse PCR (34). Total chromosomal DNA ofthe cipCMut1 strain was digested by a restriction enzyme cuttingthe IS element once near the unknown sequence. To determinethe sequences flanking ISCce1 at its left junction, DNA wasdigested with PstI or NdeI. The resulting fragmentswere ligated and used as templates for PCR amplification withdivergent primers A and B. The inverse PCR products were thenpurified by using a Qiaex II gel purification kit (Qiagen) andwere ligated to linearized pGEM-T-Easy vector. Ligation mixtureswere used to transform competent E. coli DH5 cells. Ampicillin-resistantcolonies were isolated. Plasmid DNA was purified and subjectedto restriction analysis. Depending on the orientation of theinsert, the T7 or SP6 primer was used to sequence the junction.The same protocol was used to determine the right junctionsof ISCce1, but in this case the DNA was digested with NdeI andthe PCR was carried out with primers G and H . In orderto find the right junctions of combined ISs, the DNA was digestedwith EcoRI or HindIII, and the PCR was performed with primersE and H . Fragments flanking ISCce2 at its left junctionswere synthesized with primers C and D from ligated EcoRV DNAfragments. Right junctions of ISCce2 were analyzed from inversePCR products obtained with primers E and F by using ligatedEcoRI or HindIII fragments as the templates.

    Computer analysis. Nucleotide sequences were analyzed with the DNASIS program,version 2.1. The BLAST program (1) was used for a homology searchof the nucleotide and protein sequences in the GenBank and IS(www-is.biotoul.fr) databases. The DNA binding motifs in theproteins were predicted by using the Helix-Turn-Helix program(13). Multiple-sequence alignments, obtained with ClustalW,version 1.7 (45), were used to construct phylogenetic treeswith Phylo_win (18).71z?f#0, 百拇医药

    Nucleotide sequence accession numbers. The nucleotide sequences of the IS elements described here,ISCce1 and ISCce2, have been deposited in the GenBank databaseunder accession numbers AY130778 and AY130779, respectively.71z?f#0, 百拇医药

    RESULTS71z?f#0, 百拇医药

    Discovery of insertion elements in C. cellulolyticum. A pUC18-PvuII genomic DNA library of C. cellulolyticum was previouslyconstructed in E. coli DH5 and screened by colony hybridizationwith a 285-bp probe complementary to the 3' end of cipC (35).The 3.8-kb PvuII fragment inserted into the pH62 recombinantplasmid of one of the selected clones was found to contain aninternal part of the cipC gene interrupted by a 2,659-bp sequence.This sequence contained two open reading frames (ORFs) encodingproteins which showed significant levels of identity with transposases.The cipC gene was disrupted at the beginning of the sequenceencoding cohesin 7 of the scaffolding protein CipC.

    A liquid culture of the strain used to construct the DNA librarywas plated onto solid medium. DNA was extracted from 17 isolatedcolonies, digested with PvuII, and subjected to a Southern blotanalysis by using probe 1 . Based on comparisons betweenthe various patterns obtained, three major groups were distinguished.The patterns of the first group were comparable to the patternobtained with the DNA purified from the reference strain (ATCC35319). Probe 1 hybridized with many fragments , indicating that many copies of this DNA sequencewere inserted at various loci on the chromosome. In the secondgroup, an additional fragment was detected in the DNA; thisfragment was 3.9 kb long (a representative example is shownin , part 1, lane 1). In the third group, the additionalfragment was 2.5 kb long , part 1, lane 2). When thecipC probe was used, 1.2- and 1.8-kb fragments were detectedin the DNA of the wild-type strain ; the probehybridized with two PvuII fragments of the cipC gene. The 1.8-kbfragment was also detected in lanes containing DNA from strains1 and 2, but the 1.2-kb fragment was not detected. Instead ofthe latter fragment, 3.9- and 2.5-kb fragments were detectedin strains 1 and 2, respectively , which indicatedthat the cipC gene had been disrupted in these two strains (whichwere designated cipCMut1 and cipCMut2).

    Structural analysis of the ISs. Genomic DNAs from strains cipCMut1 and cipCMut2 were used toamplify the sequences inserted into cipC. The PCR fragmentswere synthesized by using primers c1 and c2 designed from thecohesin 6- and C-terminal X2 module-encoding sequences, respectively). The sequences were analyzed after cloning into thepGEM-T-Easy vector by using primers c1, c2, B, C, D, and E . A 2,659-bp sequence was inserted into cipC in cipCMut1DNA; this sequence was identical to the copy found in the PvuIIfragment of pH62 and inserted at the same place . Another insertion element, which was 1,292 bp long, wasfound in the same place in cipCMut2 DNA. This element correspondsto the 2,659-bp sequence with its internal part deleted .g, 百拇医药

    The 1,292-bp DNA sequence contained only one ORF (orf1), whichwas 1,047 bp long and spanned almost the entire element; it was flanked by 23-bp IRs with six mismatches. The leftand right IRs were found at 50 and 26 bp of the extremities,respectively. Many characteristics typical of an IS were observed:(i) insertion of the 1,292-bp sequence yielded an 8-bp directrepeat (DR) footprint in the target sequence and (ii) the largeORF encoded a 348-amino-acid protein (40.2 kDa), designatedTnpA1, which exhibited significant levels of identity with ahypothetical protein (designated ORF1Ap [see below]) from Actinobacilluspleuropneumoniae (57%) (2), with a putative transposase (TnpWe)from a Wolbachia endosymbiont of Drosophila simulans (57%) (accessionnumber AAK69114), and with the putative proteins ID317 (55%)(21) and ChnZ (62%) (9) from Bradyrhizobium japonicum and Acinetobactersp. strain SE19, respectively. It also exhibited some identitywith the transposases encoded by many ISs belonging to the IS481family (20% to 38%) and with one IS (ISPg5 [7]) belonging tothe IS3 family (28%) (8). A multiple alignment of some of theseproteins enlightened many stretches of conserved aminoacids, including three aspartic residues and two glutamic residues.Three of these amino acids might constitute the DDE catalytictriad . Protein structure predictions suggested thatan -helix-turn-helix (HTH) DNA binding motif was present atthe N terminus of TnpA1 . Based on all these criteria,the element was designated ISCce1, although it does not haveany canonical IRs at its extremities.

    fig.ommittedo, 百拇医药

    Nucleotide sequence of ISCce1 and predicted amino acid sequence of transposase TnpA1. The putative ribosome binding site sequence is enclosed in a box. The ORF encoding transposase TnpA1 starts with an ATG codon at position 93 (boldface type) and ends with a TAA stop codon at position 1137 (asterisks). The deduced amino acid sequence is indicated under the corresponding nucleotide sequence. The putative ribosome binding site sequence is boxed. A palindromic sequence is overlined with arrows. The 8-bp duplicated sequence at the insertion site of ISCce2 into ISCce1 is underlined. Imperfect terminal IRs are indicated by incomplete arrows, with mismatches indicated by interruptions. The potential HTH DNA binding motif in the TnpA1 amino acid sequence is indicated by boldface type.o, 百拇医药

    fig.ommittedo, 百拇医药

    Alignment of TnpA1(from ISCce1) with ORF1Ap (A. pleuropneumoniae), ID317 (B. japonicum), TnpWe (Wolbachia endosymbiont of D. simulans), ChnZ (Acinetobacter sp. strain SE19), and proteins encoded by IS1121 (Clavibacter michiganensis) and ISPg5 (Porphyromonas gingivalis). A black background indicates identical amino acids, a dark gray background indicates very similar amino acids, and a light gray background indicates weakly similar amino acids. Conserved aspartic acid (D) and glutamic acid (E) residues are indicated below the alignment.

    The 1,359-bp DNA sequence that was found in the large 2,659-bpelement and was inserted into ISCce1 was designated ISCce2.The sequence surrounding a large ORF (orf2; length, 1,195 bp)was found to fulfill all the criteria required for an IS: an8-nucleotide target sequence was found to be duplicated on bothsides, and the extremities of the IS contained 35-bp imperfectIRs with 13 mismatches . The 398-amino-acid protein(45.9 kDa) encoded by orf2 exhibited significant levels of sequenceidentity (20 to 30%) with the transposases encoded by ISs belongingto the IS256 family (8). This protein, designated TnpA2, alsocontains a potential HTH DNA binding motif and the DDE triadof the catalytic domain .\7], 百拇医药

    fig.ommitted\7], 百拇医药

    Nucleotide sequence of ISCce2 and predicted amino acid sequence of the transposase TnpA2. The ORF encoding transposase TnpA2 starts with an ATG codon at position 109 (boldface type) and ends with a TAA stop codon at position 1303 (asterisks). The putative ribosome binding site sequence is enclosed in a box. The potential HTH DNA binding motif and the potential DDE catalytic triad motif in the TnpA2 amino acid sequence are indicated by boldface type and by circled residues, respectively. Terminal IRs are indicated by incomplete arrows, with mismatches indicated by interruptions.

    tnpA1 and tnpA2 are preceded by purine-rich sequences indicativeof potential ribosome binding sites . The G+Ccontents of ISCce1 and ISCce2 were 42 and 40%, respectively;these values are similar to the G+C content of the C. cellulolyticumgenome (40%). The genetic code usage in tnpA1 and tnpA2 correspondsto the usage which was previously observed in a set of functionalC. cellulolyticum genes (unpublished data).-kp, 百拇医药

    Insertion sites of ISCce1 and ISCce2 in the genome of C. cellulolyticum. To determine the sequences flanking ISCce1 in the genome ofC. cellulolyticum, chromosomal DNA of the cipCMut1 strain wasdigested with NdeI, EcoRI, HindIII, or PstI. Ligated fragmentswere used as templates for inverse PCR performed with primersA and B for the left junctions and primers G and H or primersE and H for the right junctions . PCR products werecloned into the pGEM-T-Easy vector and then analyzed by sequencing.Fourteen plasmids harboring fragments different from the cipCgene and disrupted by ISCce1 were obtained. Three of the ISCce1copies were combined with an ISCce2 copy. The ISCce1 targetsequences found were AT rich, but no consensus sequence couldbe identified

    fig.ommittedwi, http://www.100md.com

    Frequency of base occurrence at each position of the ISCce1 insertion siteswi, http://www.100md.com

    To determine the insertion sites of ISCce2, EcoRI-, HindIII-,or EcoRV-digested and ligated DNA was used in inverse PCR performedwith primers C and D to amplify the left junctions and withprimers E and F to amplify the right junctions . Onlyfour different recombinant plasmids harboring ISCce2 junctionswere obtained. In one of them, ISCce2 was inserted into an ISCce1copy. This combined IS might be one of those found when we searchedfor ISCce1 junctions (see above). No consensus target sequencewas identified from the four insertion sites obtained, ACATGCTT(in ISCce1), CATAATAA, CAGCACTT, and GCTTTTAT.wi, http://www.100md.com

    In other respects, partial sequences determined for variouscopies of each IS were found to be exactly identical to thesequences of the copies initially found in the cipC gene (datanot shown). Furthermore, the noncanonical location of IRs, whichwere found near the end of ISCce1, was confirmed by lookingat other copies cloned from genomic DNA.

    Close physical link between ISCce1 and ISCce2. ISCce2 was initially found within ISCce1 in the cipCMut1 strain.In addition, the sequence analysis of the IS junctions showedthat at least four of the seven ISCce2 copies found in thisstrain were inserted into ISCce1. In order to examine this unusualassociation in another way, the Southern blot which was previouslyprobed with a combined IS element was reprobed with probe 2to detect only ISCce1 and then with probe 3 to detectonly ISCce2 . The four bands detected in the wild-typelane with probe 3 could be superimposed on the bands revealedwith probe 2 . Furthermore, the pattern obtained withthe cipCMut1 DNA (when probe 3 was used) contained three bandswhich were absent in the wild-type DNA lane; two ofthese bands could be superimposed with those detected in thesame lane with probe 2. These findings suggest thatmany PvuII fragments contain both IS elements, although thepossibility that some of the common bands may have resultedfrom comigration of two different fragments, each containingone of the two ISs, cannot be ruled out. Nevertheless, of theseven ISCce2 copies studied, at least four were found to beinserted into ISCce1 (see above). Taken together, these resultsstrongly suggest that ISCce1 is a hot spot for the transpositionof ISCce2.

    fig.ommitted(^7, 百拇医药

    Distribution and association of ISCce1 and ISCce2 in C. cellulolyticum: Southern blot analysis of PvuII-digested genomic DNAs of the wild type (lane WT), cipCMut1 (lane 1), and cipCMut2 (lane 2) and of EcoRI-digested cipCMut1 DNA (lane 1E). Blots were hybridized with the ISCce1 probe (A) and with the ISCce2 probe (B). Superimposable bands detected in both hybridization experiments for all strains are indicated in panel B by arrowheads. The bands indicated by circles are superimposable bands obtained only with the mutant strains. Sizes (in kilobase pairs) are indicated on the left and on the right.(^7, 百拇医药

    ISCce1 and ISCce2 distribution among Clostridium species. Southern blotting of the PvuII-digested DNAs from C. cellulolyticumstrains probed with ISCce1 revealed between 10 and 14 bands.When the DNAs were probed with ISCce2, four to seven bands weredetected, depending on the strain . To determine thenumber of ISCce1 copies more exactly, an EcoRI digestion wasperformed. Up to 20 bands were detected, leading to the conclusionthat there were about 20 copies of ISCce1 .

    To determine whether ISCce1 and ISCce2 were present in otherclostridia, DNAs extracted from some selected stains were digestedby PvuII, electrophoresed, and hybridized with probe 2 or probe3. No fragments homologous to ISCce1 were detected in any ofthe strains tested with probe 2 in hybridization experimentscarried out at 68 or 55°C (data not shown). However, probe3 hybridized with one or two DNA fragments from Clostridiumcellobioparum, Clostridium papyrosolvens, Clostridium termitidis,and C. cellulovorans in the experiments carried out at 55°C. These fragments may therefore have low levels of sequencesimilarity with ISCce2.+0l@2|[, 百拇医药

    fig.ommitted+0l@2|[, 百拇医药

    Distribution of ISCce2 in Clostridium strains: Southern blotting of PvuII-digested DNAs of clostridia hybridized with probe 3 at 55°C. Lane 1, C. cellobioparum; lane 2, C. papyrosolvens; lane 3, C. termitidis; lane 4, C. cellulovorans; lane 5, C. saccharobutylicum; lane 6, C. thermocellum; lane 7, C. acetobutylicum.+0l@2|[, 百拇医药

    Our results indicate that ISCce1 and ISCce2 might be ISs specificto C. cellulolyticum.

    DISCUSSION{y}3d0@, 百拇医药

    In this paper, we report the discovery and characterization,in the cellulolytic bacterium C. cellulolyticum, of novel 1,292-and 1,359-bp IS elements, designated ISCce1 and ISCce2, respectively.These two elements were found to be frequently associated. Inat least four cases, ISCce2 was found to be located within ISCce1itself; this situation is comparable to that reported for twoISs in Pseudomonas syringae (40) and two other ISs in Sinorhizobiummeliloti (28).{y}3d0@, 百拇医药

    The nucleotide sequence of ISCce1 has only one long ORF, whichcodes for a putative transposase designated TnpA1. The deducedamino acid sequence showed significant levels of identity withproteins ORF1Ap (2), ID317 (21), TnpWe (accession number AAK69114),and ChnZ (9) and with some transposases encoded by ISs belongingto the IS481 family (33) and by ISPg5 belonging to the IS3 family(8) .{y}3d0@, 百拇医药

    To investigate the evolutionary relationships between TnpA1and these proteins, phylogenetic trees were drawn. The Phylo_Winprogram (18) was applied to the multiple-sequence alignmentof TnpA1, ORF1Ap, ID317, TnpWe, and 11 transposases encodedby elements belonging to the IS481 family (8), which was obtainedwith the program CLUSTAL W (45). The resulting tree shows thatTnpA1 (ISCce1), ORF1Ap, ID317, and TnpWe may constitute a group. This group is separated from the 11 members of theIS481 family, and the existence of a relationship between thetwo groups was not confirmed by a high bootstrap confidencelevel. A similar analysis was carried out with TnpA1 (ISCce1),ORF1Ap, ID317, TnpWe, ChnZ, and six transposases encoded byISs belonging to the IS3 family (all of which belong to theIS3 group) (8). As ChnZ is only 219 amino acids long, the treewas generated from part of the multiple-sequence alignment.As described above, TnpA1 (ISCce1), ORF1Ap, ID317, TnpWe, andChnZ constitute a group separated from the group formed by theIS3 family members . Again, the relationship betweenthe ISCce1 group and the IS3 group was not confirmed by a highbootstrap confidence level. Like TnpA1, the other proteins inthe ISCce1 group also show some homology with the transposasesof the IS481 family and with some members of the IS3 family(data not shown).

    fig.ommitted;jrz9p, 百拇医药

    Phylogenetic trees showing relationships between ISCce1 and members of the IS481 and IS3 families. The trees were constructed from a multiple-sequence alignment (ClustalW) of transposases of ISs and proteins showing identity with TnpA1 (ISCce1) by using the neighbor-joining method (Phylo_Win) ). The circled numbers are the percentages of support (bootstrap values) for individual nodes in the tree obtained by performing 100 replicate searches. Only values higher than 65% are indicated. A percentage of accepted mutation distance is indicated above each clade. (A) Tree constructed from the entire multiple-sequence alignment of TnpA1 (ISCce1), 11 members of the IS481 family, and other homologous proteins. The accession numbers for the members of the IS481 family used are as follows: ISA0963_6, AE000986; ISSco2, AL10949; ISMav2, AF286339; ISVch1, AF034434; ISAni1, X97015; IS1121, AF079817; IS1652, AL109949; IS1002, Z54268; ISBm3, AF047478; IS481, M22031; and ISCgl1, U85507. The accession numbers for the other homologous proteins used are as follows: TnpWe, AAK69114; ID317, AAG60838; and ORF1Ap, S27482. (B) Tree constructed from part of the multiple-sequence alignment of TnpA1, five members of the IS3 family, and other homologous proteins. The accession numbers for the members of the IS3 family used are as follows: ISPg5, AF224744; IS1520, AJ250598; IS981, M33933; IS600, X05992; and IS_LL6, U23813. The accession numbers for the other homologous proteins used (TnpWe, ID317 and ORF1Ap) are as described above; in addition, ChnZ (accession number AAG10024) was used.

    orf1Ap from A. pleuropneumoniae, like ISCce1 and ISCce2, wasfound when spontaneous mutants of the strain were characterized(2). This sequence is flanked by 26-bp IRs with four mismatches.Because the published sequence resulted from a recombinationevent, it was impossible to identify eventual DRs at the extremitiesof the sequence. The sequences flanking chnZ and id317, availablein the GenBank database, were analyzed. Only id317 was flankedby 43-bp imperfect IRs (with 13 mismatches), but no DRs werefound near these IRs. The absence of DRs might be due to genomicrearrangements, as in the case of the orf1Ap from A. pleuropneumoniae.No data are available on IRs and DRs in the flanking sequencesof the region encoding TnpWe.\%9ri], 百拇医药

    These results and those of the phylogenetic analysis suggestthat the unknown protein from A. pleuropneumoniae (2) (ORF1Ap),protein ID317 from B. japonicum (21), ChnZ from Acinetobactersp. strain SE19 (9), and the putative transposase from a Wolbachiaendosymbiont of D. simulans (accession number AAK96114) mightbe encoded by complete or truncated ISs. These sequences, alongwith ISCce1, might form a new group of ISs, which is probablyrelated to the IS481 and the IS3 families. In this group, ISswould (i) exhibit IRs, but not at the extremities of the element;(ii) generate the formation of DRs in the target upon transposition,although this feature could be identified only for ISCce1; and(iii) contain only one ORF encoding a putative transposase.The strict conservation of several D and E residues stronglysuggests that the catalytic mechanism of these transposasesinvolves a DDE triad.

    The nucleotide sequence of ISCce2 has one large ORF (tnpA2)that putatively encodes a transposase (TnpA2). This proteinhas significant levels of identity with many transposases belongingto the IS256 family. A phylogenetic tree was generated for 18IS elements belonging to the IS256 family or showing some identitywith members of this family . This tree shows that ISCce2might be a member of the IS256 family, but as it is locatedon a separate clade of the tree, it does not have any closerelatives belonging to this family. ISCce2 has all the featuresof the IS256 family (8): (i) it has IRs at its extremities (35bp in ISCce2); (ii) it duplicates an 8-bp target site sequenceupon transposition; (iii) TnpA2 has a DDE motif that has extendedregions similar to that of the transposases of the IS256 family;and (iv) TnpA2 exhibits similarities with the putative MurAgene product of the autonomous mutator element of Zea mays,MuDR (14).#, 百拇医药

    fig.ommitted#, 百拇医药

    Phylogenetic tree of some members of the IS256 family. This tree was constructed from a multiple-sequence alignment (ClustalW) (http://npsa-pbil.ibcp.fr/cgi-bin/npsa_automat.pl?page=npsa_clustalw.html) of transposases of ISs and proteins showing identity with TnpA2 (ISCce2) by using the neighbor-joining method (Phylo_Win) (http://biom1.univ-lyon1.fr/software/phylowin.html). The circled numbers are the percentages of support (bootstrap values) for individual nodes on the tree obtained by performing 100 replicate searches. Only values higher than 65% are indicated. A percentage of accepted mutation distance is indicated above each clade. The accession numbers for the proteins and ISs are as follows: TnpSm, BAB07803; ISRo1, U70364; IS1245, L33879; IS1553I, NP_338287; Tnp1250b, AF024666; IS1601-A, AAD44203; IS1081, X61270; IS1408, U62766; IS1407, X97307; IS1164, D67027; IS1512, U95314; IS16, U35366; IS256, M18086; IS406, M83145; ISRm5, U08627; ISRm3, M60971; and IS905A, L20851.

    ISCce1 and ISCce2 seem to be specific to C. cellulolyticum.Indeed, they were not found in any of the species that are phylogeneticallyclosely related to C. cellulolyticum, such as C. papyrosolvens,C. cellobioparum, and C. termitidis, or in the distantly relatedspecies, such as C. cellulovorans, Clostridium saccharobutylicum,C. thermocellum, and Clostridium acetobutylicum (12).!:ll;2h, 百拇医药

    Since ISCce1 and ISCce2 were isolated after they were insertedinto the cipC gene, they are therefore transpositionally active.The use of these ISs for construction of mutagenic tools isinteresting. Such tools should allow identification of new relevantgenes involved in cellulolysis.!:ll;2h, 百拇医药

    ACKNOWLEDGMENTS!:ll;2h, 百拇医药

    We thank R. H. Doi for kindly providing C. cellulovorans genomicDNA and G. Fichant for her help with the Phylo_Win program.We are grateful to O. Valette for her expert technical assistanceand to J. Blanc for revising the English in the manuscript.We thank A. Bélaich, H. P. Fièrobe, and S. Pagésfor helpful discussions.

    We acknowledge the financial support received from the CentreNational de la Recherche Scientifique and Universitéde Provence, from Conseil Général des Bouchesdu Rhône, and from Région Provence-Alpes-Côtesd'Azur. H. Maamar received a fellowship from the Tunisian Government.g!6!/&, http://www.100md.com

    REFERENCESg!6!/&, http://www.100md.com

    Altschul, S. F., T. L. Madden, A. A. Schaffer, J. Zhang, Z. Zhang, W. Miller, and D. J. Lipman. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25:3389-3402.g!6!/&, http://www.100md.com

    Anderson, C., A. A. Potter, and G. F. Gerlach. 1991. Isolation and molecular characterization of spontaneously occurring cytolysin-negative mutants of Actinobacillus pleuropneumoniae serotype 7. Infect. Immun. 59:4110-4116.g!6!/&, http://www.100md.com

    Bagnara-Tardif, C., C. Gaudin, A. Belaich, P. Hoest, T. Citard, and J. P. Belaich.1992. Sequence analysis of a gene cluster encoding cellulases from Clostridium cellulolyticum. Gene 119:17-28.g!6!/&, http://www.100md.com

    Belaich, A., G. Parsiegla, L. Gal, C. Villard, R. Haser, and J. P. Belaich. 2002. Cel9M, a new family 9 cellulase of the Clostridium cellulolyticum cellulosome. J. Bacteriol. 184:1378-1384.

    Belaich, J. P., C. Tardif, A. Belaich, and C. Gaudin. 1997. The cellulolytic system of Clostridium cellulolyticum. J. Biotechnol. 57:3-14.:3, http://www.100md.com

    Brynestad, S., B. Synstad, and P. E. Granum. 1997. The Clostridium perfringens enterotoxin gene is on a transposable element in type A human food poisoning strains. Microbiology 143:2109-2115.:3, http://www.100md.com

    Califano, J. V., T. Kitten, J. P. Lewis, F. L. Macrina, R. D. Fleischmann, C. M. Fraser, M. J. Duncan, and F. E. Dewhirst. 2000. Characterization of Porphyromonas gingivalis insertion sequence-like element ISPg5. Infect. Immun. 68:5247-5253.:3, http://www.100md.com

    Chandler, M., and J. Mahillon. 2002. Insertion sequences, p. 305-366. In N. L. Craig, R. Craigie, M. Gellert, and A. M. Lambowitz (ed.), Mobile DNA II. ASM Press, Washington, D.C.:3, http://www.100md.com

    Cheng, Q., S. M. Thomas, K. Kostichka, J. R. Valentine, and V. Nagarajan. 2000. Genetic analysis of a gene cluster for cyclohexanol oxidation in Acinetobacter sp. strain SE19 by in vitro transposition. J. Bacteriol. 182:4744-4751.

    Chung, K. T. 1976. Inhibitory effects of H₂ on growth of Clostridium cellobioparum. Appl. Environ. Microbiol. 31:342-348.${5, http://www.100md.com

    Collins, C. M., and D. M. Gutman. 1992. Insertional inactivation of an Escherichia coli urease gene by IS3411. J. Bacteriol. 174:883-888.${5, http://www.100md.com

    Collins, M. D., P. A. Lawson, A. Willems, J. J. Cordoba, J. Fernandez-Garayzabal, P. Garcia, J. Cai, H. Hippe, and J. A. Farrow. 1994. The phylogeny of the genus Clostridium: proposal of five new genera and eleven new species combinations. Int. J. Syst. Bacteriol. 44:812-826.${5, http://www.100md.com

    Dodd, I. B., and J. B. Egan. 1990. Improved detection of helix-turn-helix DNA-binding motifs in protein sequences. Nucleic Acids Res. 18:5019-5026.${5, http://www.100md.com

    Eisen, J. A., M. I. Benito, and V. Walbot. 1994. Sequence similarity of putative transposases links the maize mutator autonomous element and a group of bacterial insertion sequences. Nucleic Acids Res. 22:2634-2636.${5, http://www.100md.com

    Fetherston, J. D., and R. D. Perry. 1994. The pigmentation locus of Yersinia pestis KIM6⁺ is flanked by an insertion sequence and includes the structural genes for pesticin sensitivity and HMWP2. Mol. Microbiol. 13:697-708.

    Gal, L., C. Gaudin, A. Belaich, S. Pages, C. Tardif, and J. P. Belaich. 1997. CelG from Clostridium cellulolyticum: a multidomain endoglucanase acting efficiently on crystalline cellulose. J. Bacteriol. 179:6595-6601.$dg?0n, 百拇医药

    Gal, L., S. Pages, C. Gaudin, A. Belaich, C. Reverbel-Leroy, C. Tardif, and J. P. Belaich. 1997. Characterization of the cellulolytic complex (cellulosome) produced by Clostridium cellulolyticum. Appl. Environ. Microbiol. 63:903-909.$dg?0n, 百拇医药

    Galtier, N., M. Gouy, and C. Gautier. 1996. SeaView and Phylo_win, two graphic tools for sequence alignment and molecular phylogeny. Comput. Applic. Biosci. 12:543-548.$dg?0n, 百拇医药

    Gaudin, C., A. Belaich, S. Champ, and J. P. Belaich. 2000. CelE, a multidomain cellulase from Clostridium cellulolyticum: a key enzyme in the cellulosome? J. Bacteriol. 182:1910-1915.$dg?0n, 百拇医药

    Giallo, J., C. Gaudin, J. P. Belaich, E. Petitdemange, and F. Caillet-Mangin. 1983. Metabolism of glucose and cellobiose by cellulolytic mesophilic Clostridium sp. strain H10. Appl. Environ. Microbiol. 45:843-849.

    Gottfert, M., S. Rothlisberger, C. Kundig, C. Beck, R. Marty, and H. Hennecke. 2001. Potential symbiosis-specific genes uncovered by sequencing a 410-kilobase DNA region of the Bradyrhizobium japonicum chromosome. J. Bacteriol. 183:1405-1412.\:/'#)., http://www.100md.com

    Guedon, E., M. Desvaux, and H. Petitdemange. 2002. Improvement of cellulolytic properties of Clostridium cellulolyticum by metabolic engineering. Appl. Environ. Microbiol. 68:53-58.\:/'#)., http://www.100md.com

    Hanahan, D. 1985. Techniques for transformation of E. coli, p. 109-135. In D. M. Glover (ed.), DNA cloning, a practical approach, vol. 1. IRL Press, Oxford, United Kingdom.\:/'#)., http://www.100md.com

    Hethener, P., A. Brauman, and J. L. Garcia. 1992. Clostridium termitidis sp. nov., a cellulolytic bacterium from the gut of the wood-feeding termite, Nasutitermes lujae. Syst. Appl. Microbiol. 15:52-58.\:/'#)., http://www.100md.com

    Jennert, K. C., C. Tardif, D. I. Young, and M. Young. 2000. Gene transfer to Clostridium cellulolyticum ATCC 35319. Microbiology 12:3071-3080.\:/'#)., http://www.100md.com

    Keis, S., R. Shaheen, and D. T. Jones. 2001. Emended descriptions of Clostridium acetobutylicum and Clostridium beijerinckii, and descriptions of Clostridium saccharoperbutylacetonicum sp. nov. and Clostridium saccharobutylicum sp. nov. Int. J. Syst. Evol. Microbiol. 51:2095-2103.

    Kivi, M., X. Liu, S. Raychaudhuri, R. B. Altman, and P. M. Small. 2002. Determining the genomic locations of repetitive DNA sequences with a whole-genome microarray: IS6110 in Mycobacterium tuberculosis. J. Clin. Microbiol. 40:2192-2198.1\?*d-, 百拇医药

    Laberge, S., A. T. Middleton, and R. Wheatcroft. 1995. Characterization, nucleotide sequence, and conserved genomic locations of insertion sequence ISRm5 in Rhizobium meliloti. J. Bacteriol. 177:3133-3142.1\?*d-, 百拇医药

    Lamed, R., E. Setter, and E. A. Bayer. 1983. Characterization of a cellulose-binding, cellulase-containing complex in Clostridium thermocellum. J. Bacteriol. 156:828-836.1\?*d-, 百拇医药

    Liyanage, H., P. Holcroft, V. J. Evans, S. Keis, S. R. Wilkinson, E. R. Kashket, and M. Young. 2000. A new insertion sequence, ISCb1, from Clostridium beijernickii NCIMB 8052. J. Mol. Microbiol. Biotechnol. 2:107-113.1\?*d-, 百拇医药

    Madden, R. H., M. J. Bryder, and N. J. Poole. 1982. Isolation and characterization of an anaerobic, cellulolytic bacterium, Clostridium papyrosolvens sp. nov. Int. J. Syst. Bacteriol. 32:87-91.

    Mahillon, J., and M. Chandler. 1998. Insertion sequences. Microbiol. Mol. Biol. Rev. 62:725-774.)8}-2f., 百拇医药

    McPheat, W. L., and T. McNally. 1987. Isolation of a repeated DNA sequence from Bordetella pertussis. J. Gen. Microbiol. 133:323-330.)8}-2f., 百拇医药

    Ochman, H., M. M. Medhora, D. Garza, and D. L. Hartl. 1990. Amplification of flanking sequences by inverse PCR, p. 219-227. In M. A. Innis, D. H. Gelfand, J. J. Sninsky, and T. J. White (ed.), PCR protocols. Academic Press, Inc., New York, N.Y.)8}-2f., 百拇医药

    Pages, S., A. Belaich, C. Tardif, C. Reverbel-Leroy, C. Gaudin, and J. P. Belaich. 1996. Interaction between the endoglucanase CelA and the scaffolding protein CipC of the Clostridium cellulolyticum cellulosome. J. Bacteriol. 178:2279-2286.)8}-2f., 百拇医药

    Pages, S., A. Belaich, H. P. Fierobe, C. Tardif, C. Gaudin, and J. P. Belaich. 1999. Sequence analysis of scaffolding protein CipC and ORFXp, a new cohesin-containing protein in Clostridium cellulolyticum: comparison of various cohesin domains and subcellular localization of ORFXp. J. Bacteriol. 181:1801-1810.

    Petitdemange, E., F. Caillet, J. Giallo, and C. Gaudin. 1984. Clostridium cellulolyticum sp. nov., a cellulolytic, mesophilic species from decayed grass. Int. J. Syst. Bacteriol. 34:155-159.clz7v, http://www.100md.com

    Reverbel-Leroy, C., A. Belaich, A. Bernadac, C. Gaudin, J. P. Belaich, and C. Tardif. 1996. Molecular study and overexpression of the Clostridium cellulolyticum celF cellulase gene in Escherichia coli. Microbiology 142:1013-1023.clz7v, http://www.100md.com

    Snedecor, B., E. Chen, and R. F. Gomez. 1983. In Proceedings of the IVth International Symposium on the Genetics of Industrial Microorganisms, p. 356-360.clz7v, http://www.100md.com

    Soby, S., B. Kirkpatrick, and T. Kosuge. 1993. Characterization of an insertion sequence (IS53) located within IS51 on the iaa-containing plasmid of Pseudomonas syringae pv. savastanoi. Plasmid 29:135-141.clz7v, http://www.100md.com

    Soucaille, P., and G. Goma. 1986. Acetonobutylic fermentation by Clostridium acetobutylicum ATCC 824: autobacteriocin production, properties, and effects. Curr. Microbiol. 13:163-169.clz7v, http://www.100md.com

    Stanley, J., N. Baquar, and E. J. Threlfall. 1993. Genotypes and phylogenetic relationships of Salmonella typhimurium are defined by molecular fingerprinting of IS200 and 16S rrn loci. J. Gen. Microbiol. 139:1133-1140.m, http://www.100md.com

    Stroeher, U. H., K. E. Jedani, B. K. Dredge, R. Morona, M. H. Brown, L. E. Karageorgos, M. J. Albert, and P. A. Manning. 1995. Genetic rearrangements in the rfb regions of Vibrio cholerae O1 and O139. Proc. Natl. Acad. Sci. USA 92:10374-10378.m, http://www.100md.com

    Tardif, C., H. Maamar, M. Balfin, and J. P. Belaich. 2001. Electrotransformation studies in Clostridium cellulolyticum. J. Ind. Microbiol. Biotechnol. 27:271-274.m, http://www.100md.com

    Thompson, J. D., D. G. Higgins, and T. J. Gibson. 1994. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680.(Hédia Maamar Pascale de Philip Jean-Pierre Bélaich and Chantal Tardif)

百拇医药网 http://www.100md.com/html/DirDu/2005/05/05/58/54/49.htm