Difference between revisions of "IS Families/IS6 family"

From TnPedia
Jump to navigation Jump to search
(Replaced content with "==General== There are at present nearly 160 family members in ISfinder (<nowiki>https://www-is.biotoul.fr/scripts/search-db.php</nowiki>) from nearly 80 bacterial and arch...")
Tags: Visual edit Replaced
Line 1: Line 1:
==General==
+
====General====
There are at present nearly 160 family members in ISfinder (<nowiki>https://www-is.biotoul.fr/scripts/search-db.php</nowiki>) from nearly 80 bacterial and archaeal species but this represents only a fraction of those present in the public databases. The family was named ��[1]� after the directly repeated insertion sequences in transposon Tn''6 ��''[2]''�'' to standardize the various names that had been attributed to identical elements (e.g. IS''15'', IS''26'', IS''46'', IS''140,'' IS''160'', IS''176'') ��[3–15]�, including one isolate, IS''15'', corresponding to an insertion of one iso-IS''6'' (IS''15''D) into another ��[4,5]�. More recently there has been some attempt to rename the family as the IS''26'' family (see ��[16]�), presumably because of accumulating experimental data from IS''26'' itself and the importance of this IS in accumulation and transmission of multiple antibiotic resistance, although this might potentially introduce confusion in the literature. IS''6'' family members have a simple organization (Fig. IS6.1) and generate 8bp direct target repeats on insertion. This family is very homogenous with an average length of about 800 bp for the majority (between 700 and 890 bp) and highly conserved short, generally perfect, IRs (Fig. IS6.1 and IS6.2). There are two examples of MITES ('''M'''iniature '''I'''nverted repeat '''T'''ransposable '''E'''lements composed of both IS ends and no intervening orfs; ��[17]� of 227 and 336 bp), 7 members between 1230 and 1460 bp and three members between 1710 and 1760 bp. One member, IS''15'', of 1648 bp represents and insertion of one IS into another ��[3,5]�. Many are found as part of compound transposons (called pseudo-compound transposons ��[1]� described below ��[16]�) invariably as flanking ''direct'' ''repeats'' (Fig. IS6.1) a consequence of their transposition mechanism ��[7,9,13,14,18–30]�.
+
There are at present nearly 160 family members in [https://isfinder.biotoul.fr/ ISfinder] from nearly 80 bacterial and archaeal species but this represents only a fraction of those present in the public databases. The family was named<ref name=":0">Galas DJ, Chandler M. Bacterial Insertion Sequences. In: Berg DE, Howe MM, editors. Mob DNA. Washington, D.C.: American Society for Microbiology; 1989. p. 109–162. </ref> after the directly repeated insertion sequences in transposon Tn''6'' <ref>Berg DE, Davies J, Allet B, Rochaix JD. Transposition of R factor genes to bacteriophage lambda. ProcNatlAcadSciUSA. 1975;72:3628–3632. </ref> to standardize the various names that had been attributed to identical elements (e.g. IS''15'', IS''26'', IS''46'', IS''140,'' IS''160'', IS''176'') <ref>Labigne-Roussel A, Courvalin P. IS15, a new insertion sequence widely spread in R plasmids of gram- negative bacteria. MolGenGenet. 1983;189:102–112.  
  
== Acknowledgements ==
+
<br /></ref><ref>Trieu-Cuot P, Courvalin P. Nucleotide sequence of the transposable element IS15. Gene. 1984;30:113–120. </ref><ref><nowiki><pubmed>2994132</pubmed></nowiki></ref><ref name=":6"><nowiki><pubmed>PMC326375</pubmed></nowiki></ref><ref name=":1"><nowiki><pubmed>PMC326375</pubmed></nowiki></ref><ref name=":7"><nowiki><pubmed>3003524</pubmed></nowiki></ref><ref name=":2"><nowiki><pubmed>PMC215669</pubmed></nowiki></ref><ref>Nucken EJ, Henschke RB, Schmidt FR. Nucleotide-sequence of insertion element IS15 delta IV from plasmid pBP11. DNA Seq. 1990;1:85–88.
We would like to thank Susu He (Nanjing University) for stimulating discussions concerning the transposition models and for supplying the IS''26'' transposase secondary structure predictions in Fig. IS6.2C).
+
 
 +
<br /></ref><ref><nowiki><pubmed>6304469</pubmed></nowiki></ref><ref><nowiki><pubmed>2999303</pubmed></nowiki></ref><ref name=":3">Colonna B, Bernardini M, Micheli G, Maimone F, Nicoletti M, Casalino M. The Salmonella wien virulence plasmid pZM3 carries Tn1935, a multiresistance transposon containing a composite IS1936- kanamycin resistance element. Plasmid. 1988;20:221–231.</ref><ref name=":4"><nowiki><pubmed>PMC162495</pubmed></nowiki></ref><ref><nowiki><pubmed>PMC305975</pubmed></nowiki></ref>, although more recently there has been some attempt to rename the family (see <ref><nowiki><pubmed>32871211</pubmed></nowiki></ref>), presumably because of accumulating experimental data from IS''26'' itself and the importance of this IS in accumulation and transmission of multiple anti biotic resistance, although this might potentially introduce confusion in the literature. IS''6'' family members have a simple organization ([[:File:IS6.1.png|Fig. IS6.1]]) and generate 8bp direct target repeats on insertion. This family is very homogenous with an average length of about 800 bp and highly conserved short, generally perfect, IRs ([[:File:IS6.2.png|Fig. IS6.2]]). Many are found as part of compound transposons (called pseudo-compound transposons <ref name=":0" /> described below) invariably as flanking ''direct'' ''repeats'' ([[:File:IS6.1.png|Fig. IS6.1]]) a consequence of their transposition mechanism <ref name=":1" /><ref name=":2" /><ref name=":3" /><ref name=":4" /><ref><nowiki><pubmed>PMC1196216</pubmed></nowiki></ref><ref><nowiki><pubmed>PMC3195058</pubmed></nowiki></ref><ref><nowiki><pubmed>PMC3587239</pubmed></nowiki></ref><ref><nowiki><pubmed>PMC219079</pubmed></nowiki></ref><ref name=":11"><nowiki><pubmed>PMC209129</pubmed></nowiki></ref><ref>Barberis-Maino L, Berger-Bachi B, Weber H, Beck WD, Kayser FH. IS431, a staphylococcal insertion sequence-like element related to IS26 from Proteus vulgaris. Gene. 1987;59:107–113. </ref><ref><nowiki><pubmed>PMC174916</pubmed></nowiki></ref><ref><nowiki><pubmed>3033719</pubmed></nowiki></ref><ref><nowiki><pubmed>2543009</pubmed></nowiki></ref><ref>Sundstrom L, Jansson C, Bremer K, Heikkila E, Olsson-Liljequist B, Skold O. A new dhfrVIII trimethoprim-resistance gene, flanked by IS26, whose product is remote from other dihydrofolate reductases in parsimony analysis. Gene. 1995;154:7–14. </ref><ref name=":8"><nowiki><pubmed>19074421</pubmed></nowiki></ref><ref><nowiki><pubmed>21393132</pubmed></nowiki></ref><ref><nowiki><pubmed>PMC284528</pubmed></nowiki></ref>. 
 +
[[File:IS6.1.png|border|center|thumb|600x600px|'''Fig. IS6.1.''' IS''6'' family organization. '''Top'''. Structure of IS''6'' family. Left (IRL) and right (IRR) terminal 14 bp IRs are shown as blue triangles. The 8 bp direct target repeats are shown as pink arrow heads. The transposase open reading frame is shown in purple. '''Bottom.''' A Pseudo-compound transposon (see text for explanation). IS''6'' family characteristics are as above.
 +
 
 +
Here, two directly repeated IS flank a passenger gene in green. ]]
 +
 
 +
 
 +
[[File:Fig. IS6.2.png|alt=|border|center|thumb|600x600px|'''Fig. IS6.2.''' The general characteristics of the IS6 family. '''Top''': Distribution of IS length (base pairs). The number of examples used in the sample is shown above each column. '''Bottom''': Left (IRL) and right IRR inverted terminal repeats are shown in WebLogo format (Crooks et al., 2004).]]
 +
 
 +
====Distribution and Phylogenetic Transposase Tree====
 +
A phylogenetic tree based on the transposase amino acid sequence of the [https://isfinder.biotoul.fr/ ISfinder] collection ([[:File:IS6.3.png|Fig. IS6.3]]) shows that the IS''6'' family members fall into a number of well-defined clades. This slightly more extensive set of IS corresponds well to the results of another wide-ranging phylogenetic analysis <ref name=":5"><nowiki><pubmed>PMC6807381</pubmed></nowiki></ref>. These clades include one which groups all archaeal IS''6'' family members (Fig. IS6.3a) composed mainly of ''[[wikipedia:Euryarchaeota|Euryarchaeota]]'' (''[[wikipedia:Haloarchaea|Halobacteria]]'' ; Fig. IS6.3ai-iii). '''Group''' '''aiv''' includes both ''[[wikipedia:Euryarchaeota|Euryarchaeota]]'' (''[[wikipedia:Thermococcales|Thermococcales]]'' and ''[[wikipedia:Methanococcales|Methanococcales]]'') and ''[[wikipedia:Crenarchaeota|Crenarchaeota]]'' (''[[wikipedia:Sulfolobales|Sulfolobales]]''). Of the 10 clades containing bacterial IS: clade b includes examples from the [[wikipedia:Alphaproteobacteria|Alpha]]-, [[wikipedia:Betaproteobacteria|Beta]]-, and [[wikipedia:Gammaproteobacteria|Gamma-''proteobacteria'']], ''[[wikipedia:Firmicutes|Firmicutes]]'', ''[[wikipedia:Cyanobacteria|Cyanobacteria]]'', ''[[wikipedia:Acidobacteria|Acidobacteria]]'' and [[wikipedia:Bacteroidetes|Bacteroidetes]] ; '''clade''' '''c''' is more homogenous and is composed of ''[[wikipedia:Alphaproteobacteria|Alphaproteobacteria]]'' (''[[wikipedia:Rhizobiaceae|Rhizobiaceae]]'' and ''[[wikipedia:Methylobacteriaceae|Methylobacteriaceae]]''); '''clade d''' includes some [[wikipedia:Actinobacteria|Actinobacteria]], [[wikipedia:Alphaproteobacteria|Alpha]]-, [[wikipedia:Betaproteobacteria|Beta]]-, and ''[[wikipedia:Gammaproteobacteria|Gamma-proteobacteria]]'' ; while '''clades e, f, g''' and '''h''' are composed exclusively of [[wikipedia:Firmicutes|Firmicutes]] (almost exclusively ''[[wikipedia:Lactococcus|Lactococci]]'' in the case of '''clades e and f'''). '''Clades I''' and '''j''' are more mixed. Clearly, the  [https://isfinder.biotoul.fr/ ISfinder] collection does not necessarily reflect the true IS''6'' family distribution and these grouping should be interpreted with care. For example, although many do not form part of the [https://isfinder.biotoul.fr/ ISfinder]database, IS''6'' family elements are abundant in archaea and cover almost all of the traditionally recognized archaeal lineages (methanogens, halophiles, thermoacidophiles, and hyperthermophiles <ref><nowiki><pubmed>PMC1847376</pubmed></nowiki></ref>.                               
 +
 
 +
[[File:IS6.3.png|border|center|thumb|860x860px|'''Fig. IS6.3.''' A dendrogram of IS''6'' family members. The figure shows 11 major clades. The surrounding colored circles and the insert indicate the clades identified by Harmer and Hall (2017). The insert shows the correspondence.]]       
 +
 
 +
====Terminal Inverted Repeats.====
 +
The division into clades is also underlined to some extent by the '''IR''' sequences. As shown in Fig. IS6.2 ('''bottom'''), in spite of the wide range of bacterial and archaeal species in which family members are found, there is a surprising sequence conservation. In particular, the presence of a G dinucleotide at the IS tips and '''cTGTt''' and '''caaa''' internal motifs. Sequence motifs are more pronounced when each clade is considered separately (Fig. IS6.4). 
 +
 
 +
=====Clade b=====
 +
(n=24; with [[wikipedia:Alphaproteobacteria|Alpha]]-, [[wikipedia:Betaproteobacteria|Beta]]-, and [[wikipedia:Gammaproteobacteria|Gamma-''proteobacteria'']], ''[[wikipedia:Firmicutes|Firmicutes]]'', ''[[wikipedia:Cyanobacteria|Cyanobacteria]]'', ''[[wikipedia:Acidobacteria|Acidobacteria]]'' and ''[[wikipedia:Bacteroidetes|Bacteroidetes]]'') maintains stronger traces of parts of these motifs (GG.. tcTGtt and CAaa).
 +
 
 +
=====Clade c=====
 +
(n= 14; ''[[wikipedia:Alphaproteobacteria|Alphaproteobacteria]]'': ''[[wikipedia:Rhizobiaceae|Rhizobiaceae]]'' and ''[[wikipedia:Methylobacteriaceae|Methylobacteriaceae]]'') shows considerable conservation of an extended motif (GGG... TGTCGCAAA) and some conservation further into both IRL and IRR, although these are different for each end.
 +
 
 +
=====Clade d=====
 +
(n=16; ''[[wikipedia:Actinobacteria|Actinobacteria]]'', ''[[wikipedia:Alphaproteobacteria|Alpha]]''-, ''[[wikipedia:Betaproteobacteria|Beta]]''-, and ''[[wikipedia:Gammaproteobacteria|Gamma-proteobacteria]]'') includes a well conserved GG..cTGTTGCAAA signature with little conservation further into each end.
 +
 
 +
=====Clade e=====
 +
(n=7) is composed entirely of IS from ''[[wikipedia:Lactococcus|Lactococcus]]'' and, as might be expected, exhibits fully conserved IR (GGTTCTGTTGCAAAGTTT) with significant conservation towards the IS interior, as would be expected in very closely related IS. However, some of the sequence conservation is present in both ends possibly indicating some functional role.
 +
 
 +
=====Clade f=====
 +
(n = 8; ''[[wikipedia:Lactococcus|Lactococcus]]'' and a single ''[[wikipedia:Leuconostoc|Leuconostoc]]'') also exhibits a highly conserved ggTTCTGTTGCAAAGTTT signature at IRL (with only a partially conserved GG at the tip due to just 2 members) which is less conserved in IRR. Again, there is some internal conservation.
 +
 
 +
=====Clade g=====
 +
(n = 8; composed of bacilli: ''[[wikipedia:Listeria|Lysteria]]'', ''[[wikipedia:Lactococcus|Lactococcus]]'', ''[[wikipedia:Enterococcus|Enterococcus]]'') also exhibits a similar signature (GGtTctgtTgcaAAgtTt) albeit less conserved. The left and has significant internal conservation.
 +
 
 +
=====Clade h=====
 +
(n = 11; largely ''[[wikipedia:Staphylococcus|Staphylococci]]'' with 2 ''[[wikipedia:Bacillus_thuringiensis|B. thuringiensis]]'') also exhibit the typical GGTTCTGTTGCAAAGTTt signature and some internal conservation in IRL.
 +
 
 +
=====Clade i=====
 +
(n = 1) is more heterogenous ([[wikipedia:Alphaproteobacteria|Alpha proteobacteria]]'': [[wikipedia:Methylobacterium|Methylobacterium]], [[wikipedia:Paracoccus|Paracoccus]], [[wikipedia:Roseovarius|Roseovarius]], [[wikipedia:Rhizobium|Rhizobium]], [[wikipedia:Bradyrhizobium|Bradyrhizobium]] ; [[wikipedia:Deinococcus–Thermus|Deinococci]]'' and ''[[wikipedia:Haloarchaea|Halobacteria]]''). It contains a poorly conserved IR sequence but does include a prominent gG dinucleotide tip and a poorly pronounced tgtcaagtt signature.
 +
 
 +
=====Clade j=====
 +
Finally, bacterial Clade j (n= 5) composed entirely of ''[[wikipedia:Firmicutes|Firmicutes]]'' (''[[wikipedia:Natranaerobius|Natranaerobius]]'', ''[[wikipedia:Clostridium|Clostridium]]'' and ''[[wikipedia:Thermoanaerobacter|Thermoanaerobacter]]'') exhibits a moderately well-defined internal signature TcTgTtAAgTt.
 +
 
 +
The archaeal-specific clades also generally exhibit well-defined consensus sequences.
 +
 
 +
=====Clade Ai=====
 +
(n = 12) is composed uniquely of ''[[wikipedia:Haloarchaea|Halobacterial]]'' ''[[wikipedia:Euryarchaeota|Euryarchaeota]]'' with a ggtaGTGTTcagatAaG signature and significant internal conservation which is different for each end.
 +
 
 +
=====Clade Aii=====
 +
(n = 5), again, composed entirely of ''[[wikipedia:Haloarchaea|Halobacterial]]'' ''[[wikipedia:Euryarchaeota|Euryarchaeota]]'' (''[[wikipedia:Haloarcula|Haloarcula]], [[wikipedia:Halomicrobium|Halomicrobium]], [[wikipedia:Natronomonas|Natronomonas]], [[wikipedia:Natronobacterium|Natronobacterium]], [[wikipedia:Natrinema|Natrinema]]'') also has well conserved ends, ggtcgTGTTTaGTT, and significant internal conservation which is different for each end.
 +
 
 +
=====Clade Aiii=====
 +
This is also the case for Clade Aiii, also composed of diverse [[wikipedia:Haloarchaea|''Halobacterial'']] ''species'' (''Halohasta, [[wikipedia:Haloferax|Haloferax]], [[wikipedia:Natrinema|Natrinema]], [[wikipedia:Natrialba|Natrialba]], [[wikipedia:Halogeometricum|Halogeometricum]], [[wikipedia:Natronomonas|Natronomonas]], [[wikipedia:Natronococcus|Natronococcus]],'' and ''[[wikipedia:Haloarcula|Haloarcula]]''): GgcACtGTCTAGtT.
 +
 
 +
=====Clade Aiv=====
 +
However, Clade Aiv (n = 9) which includes both ''[[wikipedia:Euryarchaeota|Euryarchaeota]]'' and ''[[wikipedia:Crenarchaeota|Crenarchaeota]]'', has poor conservation although on further analysis, an alignment shows significant conservation in the ''[[wikipedia:Sulfolobus|Sulfolobus]]'' and in the ''[[wikipedia:Pyrococcus|Pyrococcus]]'' groups with good interior conservation also in the 3 ''[[wikipedia:Pyrococcus|Pyrococcal]]'' members. It is possible that the IS ends in the ''[[wikipedia:Sulfolobus|Sulfolobus]]'' members have not been accurately identified.
 +
 
 +
The answer to the recent question:
 +
 
 +
“'''An analysis of the IS''6''/IS''26'' family of insertion sequences: is it a single family''' '''?'''”<ref name=":5" /> is therefore “'''Probably, yes'''”.
 +
 
 +
====Genomic Impact====
 +
Activity resulting in horizontal dissemination is suggested, for example, by the observation that copies identical to ''[[wikipedia:Mycobacterium_fortuitum|Mycobacterium fortuitum]]'' IS''6100'' <ref name=":12"><nowiki><pubmed>2163027</pubmed></nowiki></ref>('''Clade d''') occur in other bacteria: as part of a plasmid-associated catabolic transposon carrying genes for nylon degradation in ''[[wikipedia:Arthrobacter|Arthrobacter sp.]]'' <ref><nowiki><pubmed>PMC205175</pubmed></nowiki></ref>, from the ''[[wikipedia:Pseudomonas_aeruginosa|Pseudomonas aeruginosa]]'' plasmid R1003 <ref><nowiki><pubmed>PMC196970</pubmed></nowiki></ref>, and within the ''[[wikipedia:Xanthomonas_campestris|Xanthomonas campestris]]'' transposon Tn''5393b'' <ref name=":13"><nowiki><pubmed>PMC167566</pubmed></nowiki></ref>. Similar copies have also been reported in ''[[wikipedia:Salmonella_enterica|Salmonella enterica]]'' (typhimurium) <ref><nowiki><pubmed>10930753</pubmed></nowiki></ref>, and on plasmid pACM1 from ''[[wikipedia:Klebsiella_oxytoca|Klebsiella oxytoca]]'' ([https://www.ncbi.nlm.nih.gov/nuccore/AF107205.1/ AF107205]) <ref><nowiki><pubmed>25291385</pubmed></nowiki></ref>. 
 +
 
 +
A single member, IS''Dsp3'', present in single copy in [[wikipedia:Dehalococcoides|''Dehalococcoides sp.'' BAV1]] carries a passenger gene annotated as [[wikipedia:Hypothetical_protein|hypothetical protein]].
 +
 
 +
IS''257'' <ref>Rouch D, Skurray R. IS257 from Staphylococcus aureus member of an insertion sequence superfamily Gram-positive and Gram-negative bacteria. Gene. 1989;76:195–205. </ref>('''Clade h''') (also known as IS''431'') has played an important role in sequestering a variety of antibiotic resistance genes in clinical isolates of [[wikipedia:Methicillin|methicillin]] resistant ''[[wikipedia:Staphylococcus_aureus|Staphylococcus aureus]]'' (MRSA) (e.g.<ref name=":14">Rouch DA, Messerotti LJ, Loo SL, Jackson CA, Skurray RA. Trimethoprim resistance transposon Tn4003 from Staphylococcus aureus encodes genes for a dihydrofolate reductase and thymidylate synthetase flanked by three copies of IS257. Mol Microbiol. 1989;3:161–175. </ref><ref>Stewart PR, Dubin DT, Chikramane SG, Inglis B, Matthews PR, Poston SM. IS257 and small plasmid insertions in the mec region of the chromosome of Staphylococcus aureus. Plasmid. 1994;31:12–20. </ref><ref name=":9"><nowiki><pubmed>PMC101884</pubmed></nowiki></ref><ref name=":10"><nowiki><pubmed>PMC284724</pubmed></nowiki></ref>. It provides an outward oriented promoter which drives expression of genes located proximal to the left end. Moreover, both left and right ends appear to carry a –35 promoter component which would permit formation of hybrid promoters on insertion next to a resident –10 element <ref name=":9" /><ref><nowiki><pubmed>PMC107441</pubmed></nowiki></ref>. Insertion of can result in activation of a neighboring gene using both a hybrid promoter and an indigenous promoter <ref name=":9" />. IS''257'' is also involved in expression of ''[https://www.wikigenes.org/e/gene/e/2716475.html tetA]'' <ref><nowiki><pubmed>7830550</pubmed></nowiki></ref> and ''[https://www.wikigenes.org/e/gene/e/2598376.html dfrA]'' <ref name=":10" /> in  ''[[wikipedia:Staphylococcus_aureus|S. aureus]].''
 +
 
 +
IS''26'' <ref name=":6" /><ref name=":1" /><ref name=":7" />  ('''clade d''') is encountered with increasing frequency in plasmids of clinical importance where it is involved in expression of antibiotic resistance genes and plasmid rearrangements (see <ref name=":8" /><ref><nowiki><pubmed>PMC1913244</pubmed></nowiki></ref><ref><nowiki><pubmed>23330672</pubmed></nowiki></ref><ref><nowiki><pubmed>16870645</pubmed></nowiki></ref><ref><nowiki><pubmed>23169892</pubmed></nowiki></ref><ref><nowiki><pubmed>20093380</pubmed></nowiki></ref>). Its transposition mechanism contributes to its ability to assemble anti-bacterial resistance genes into clusters (e.g. <ref name=":15"><nowiki><pubmed>PMC4471558</pubmed></nowiki></ref>). It can also form hybrid promoters capable of driving different antibiotic resistance genes: ''[https://www.uniprot.org/uniprot/Q7WUG8 aphA7], bla<sub>S2A</sub>'' (''[[wikipedia:Klebsiella_pneumoniae|Klebsiella pneumoniae]]'' <ref name=":11" />), ''bla<sub>SHV-2a</sub>'' (''[[wikipedia:Pseudomonas_aeruginosa|Pseudomonas aeruginosa]]'' <ref><nowiki><pubmed>PMC89260</pubmed></nowiki></ref>) and [https://www.uniprot.org/uniprot/P14508 ''aphA''7] (''[[wikipedia:Pasteurella|Pasteurella piscicida]]'' <ref><nowiki><pubmed>27873653</pubmed></nowiki></ref>) as well as the wide spectrum [[wikipedia:Beta-lactam|beta-lactam]] resistance gene bla<sub>KPC</sub> ('''Table''' [[General Information/IS and Gene Expression|IS and Gene Expression]]).
 +
 
 +
The formation of hybrid promoters on insertion ('''Table''' [[General Information/IS and Gene Expression|IS and Gene Expression]]) is clearly a general property of members of the IS''6'' family <ref name=":11" /><ref name=":9" /><ref name=":10" /><ref><nowiki><pubmed>PMC2443897</pubmed></nowiki></ref><ref>Allmansberger R, Brau B, Piepersberg W. Genes for gentamicin-(3)-N-acetyl-transferases III and IV. II. Nucleotide sequences of three AAC(3)-III genes and evolutionary aspects. MolGenGenet. 1985;198:514–520. </ref>.
 +
 
 +
Another member, IS''6100'' <ref name=":12" /> ('''Clade d''')'','' often used as an aid in classifying mycobacterial isolates<ref><nowiki><pubmed>PMC268253</pubmed></nowiki></ref> <ref><nowiki><pubmed>PMC330226</pubmed></nowiki></ref><ref><nowiki><pubmed>PMC172007</pubmed></nowiki></ref> has been found to drive ''[https://www.wikigenes.org/e/gene/e/2716516.html strA] [https://www.wikigenes.org/e/gene/e/2716517.html strB]'' expression in [[wikipedia:Xanthomonas_campestris_pv._vesicatoria|''X. campestris'' pv. vesicatoria]], <ref name=":13" />.
 +
 
 +
This IS family is able to form transposons which resemble compound transposons with the flanking IS in direct repeat but, because of the particular transposition mechanism of IS''6'' family members (see below), were called pseudo-compound transposons <ref name=":0" />. These include Tn''610'' (flanked by IS''6100'' <ref name=":12" />), Tn''4003'' and others (flanked by IS''257'' <ref name=":14" />) and Tn''6023'' (flanked by IS''26'' <ref><nowiki><pubmed>21702681</pubmed></nowiki></ref>).
 +
 
 +
<br />
 +
====Clinical Importance of IS''26''.====
 +
In view of the particular importance of IS''26'' in sequestering antibiotic resistance genes and generating arrays of these genes in clinically important conjugative plasmids and in the host chromosome (see <ref name=":8" /><ref name=":15" />), it is worthwhile devoting a separate section to the contribution of this IS to the clinical landscape. Recognition of its place as an important player has derived from the large number of sequences now available of multiple antibiotic resistance plasmids and chromosomal segments such as '''G'''enomic '''R'''esistance '''I'''slands ('''GRI'''). It is now no longer practical to provide a complete analysis of the literature. At present (19<sup>th</sup> November 2020) a PubMed search using IS''26'' as the search term yielded nearly 450 citations. The references in the following are not exhaustive but simply provide examples.<br />
 +
 
 +
=====Arrays=====
 +
IS''6'' family members are often found in arrays (Fig. IS6.5 and Fig. IS6.6) in direct and inverted repeat in multiple drug resistant plasmids (e.g. ''S. typhimurium'' ��[27,60,61]�, ''Klebsiella quasipneumoniae'' ��[62]�, ''Acinetobacter baumannii'' ��[48,63]�, Proteus mirabilis ��[64]� and uncultured sewage bacteria ��[65]� among many others). These are often intercalated in or next to other transposable elements rather than neatly flanking ABR genes and can form units able to undergo tandem amplification.<br />
 +
 
 +
=====Amplification=====
 +
Shropshire et al ��[66]�, studying clinical isolates of non-carbapenemase-producing Carbapenem-Resistant Enterobacteria, non-CP-CRE, isolated from several patients with recurrent bacteraemia, observed an increase in carbapenem resistance partially due to IS''26''-mediated amplification of up to 10 fold of a cassette blaOXA-1 and blaCTX-M-1 which forms part of a larger chromosomal structure of IS''26'' arrays which they call TnMB1860 (Fig. IS6.6). It was unclear whether this cassette amplification was due to transposition activity or, as had been observed in similar, IS''1''-mediated, gene amplifications ��[67–72]�. Another example has been revealed by Hastak et al ��[73]� who analysed a multi resistant derivative of the clinically important, globally dispersed pathogenic, ''Escherichia coli'' ST131 subclade H30Rx,  isolated from a number of bacteraemic patients and revealed that increased piperacillin/tazobactam resistance was due to IS''26''-mediated amplification of blaTEM-1B. A similar type of limited (tandem dimer) amplification of an IS''26''-flanked blaSHV-5-carrying cassette found in plasmids from a number of geographically diverse enteric species was identified in a nosocomial ''E. cloacae'' strain ��[74]�. A more extensive amplification (>10 fold) was observed with the same cassette located in a different plasmid in a well-characterised laboratory strain of ''Escherichia coli'' and occurred in a ''recA''-independent manner ��[46]� and even higher levels of tandem amplification (~65 fold) of the ''aphA1'' gene in the IS''26''-based Tn''6020'' were identified in ''[[wikipedia:Acinetobacter_baumannii|Acinetobacter baumannii]]'' ��[75]�.<br />
 +
 
 +
=====Cointegrating plasmids.=====
 +
<br />
 +
 
 +
====Organization====
 +
 
 +
====Mechanism: the state of play====
 +
 
 +
====Cointegrate formation====
 +
<br />
 +
 
 +
====Circular transposon molecules: translocatable units (TU)====
 +
 
 +
====Targeted transposition.====
 +
 
 +
====Acknowledgements====
 +
We would like to thank [https://scholar.google.com/citations?user=0igAN8sAAAAJ&hl=en Susu He] ([https://www.nju.edu.cn/EN/ Nanjing University]) for stimulating discussions concerning the transposition models.
 +
 
 +
==Bibliography==
 +
<references />

Revision as of 16:58, 5 March 2021

General

There are at present nearly 160 family members in ISfinder from nearly 80 bacterial and archaeal species but this represents only a fraction of those present in the public databases. The family was named[1] after the directly repeated insertion sequences in transposon Tn6 [2] to standardize the various names that had been attributed to identical elements (e.g. IS15, IS26, IS46, IS140, IS160, IS176) [3][4][5][6][7][8][9][10][11][12][13][14][15], although more recently there has been some attempt to rename the family (see [16]), presumably because of accumulating experimental data from IS26 itself and the importance of this IS in accumulation and transmission of multiple anti biotic resistance, although this might potentially introduce confusion in the literature. IS6 family members have a simple organization (Fig. IS6.1) and generate 8bp direct target repeats on insertion. This family is very homogenous with an average length of about 800 bp and highly conserved short, generally perfect, IRs (Fig. IS6.2). Many are found as part of compound transposons (called pseudo-compound transposons [1] described below) invariably as flanking direct repeats (Fig. IS6.1) a consequence of their transposition mechanism [7][9][13][14][17][18][19][20][21][22][23][24][25][26][27][28][29].

Fig. IS6.1. IS6 family organization. Top. Structure of IS6 family. Left (IRL) and right (IRR) terminal 14 bp IRs are shown as blue triangles. The 8 bp direct target repeats are shown as pink arrow heads. The transposase open reading frame is shown in purple. Bottom. A Pseudo-compound transposon (see text for explanation). IS6 family characteristics are as above. Here, two directly repeated IS flank a passenger gene in green.


Fig. IS6.2. The general characteristics of the IS6 family. Top: Distribution of IS length (base pairs). The number of examples used in the sample is shown above each column. Bottom: Left (IRL) and right IRR inverted terminal repeats are shown in WebLogo format (Crooks et al., 2004).

Distribution and Phylogenetic Transposase Tree

A phylogenetic tree based on the transposase amino acid sequence of the ISfinder collection (Fig. IS6.3) shows that the IS6 family members fall into a number of well-defined clades. This slightly more extensive set of IS corresponds well to the results of another wide-ranging phylogenetic analysis [30]. These clades include one which groups all archaeal IS6 family members (Fig. IS6.3a) composed mainly of Euryarchaeota (Halobacteria ; Fig. IS6.3ai-iii). Group aiv includes both Euryarchaeota (Thermococcales and Methanococcales) and Crenarchaeota (Sulfolobales). Of the 10 clades containing bacterial IS: clade b includes examples from the Alpha-, Beta-, and Gamma-proteobacteria, Firmicutes, Cyanobacteria, Acidobacteria and Bacteroidetes ; clade c is more homogenous and is composed of Alphaproteobacteria (Rhizobiaceae and Methylobacteriaceae); clade d includes some Actinobacteria, Alpha-, Beta-, and Gamma-proteobacteria ; while clades e, f, g and h are composed exclusively of Firmicutes (almost exclusively Lactococci in the case of clades e and f). Clades I and j are more mixed. Clearly, the ISfinder collection does not necessarily reflect the true IS6 family distribution and these grouping should be interpreted with care. For example, although many do not form part of the ISfinderdatabase, IS6 family elements are abundant in archaea and cover almost all of the traditionally recognized archaeal lineages (methanogens, halophiles, thermoacidophiles, and hyperthermophiles [31].

Fig. IS6.3. A dendrogram of IS6 family members. The figure shows 11 major clades. The surrounding colored circles and the insert indicate the clades identified by Harmer and Hall (2017). The insert shows the correspondence.

Terminal Inverted Repeats.

The division into clades is also underlined to some extent by the IR sequences. As shown in Fig. IS6.2 (bottom), in spite of the wide range of bacterial and archaeal species in which family members are found, there is a surprising sequence conservation. In particular, the presence of a G dinucleotide at the IS tips and cTGTt and caaa internal motifs. Sequence motifs are more pronounced when each clade is considered separately (Fig. IS6.4).

Clade b

(n=24; with Alpha-, Beta-, and Gamma-proteobacteria, Firmicutes, Cyanobacteria, Acidobacteria and Bacteroidetes) maintains stronger traces of parts of these motifs (GG.. tcTGtt and CAaa).

Clade c

(n= 14; Alphaproteobacteria: Rhizobiaceae and Methylobacteriaceae) shows considerable conservation of an extended motif (GGG... TGTCGCAAA) and some conservation further into both IRL and IRR, although these are different for each end.

Clade d

(n=16; Actinobacteria, Alpha-, Beta-, and Gamma-proteobacteria) includes a well conserved GG..cTGTTGCAAA signature with little conservation further into each end.

Clade e

(n=7) is composed entirely of IS from Lactococcus and, as might be expected, exhibits fully conserved IR (GGTTCTGTTGCAAAGTTT) with significant conservation towards the IS interior, as would be expected in very closely related IS. However, some of the sequence conservation is present in both ends possibly indicating some functional role.

Clade f

(n = 8; Lactococcus and a single Leuconostoc) also exhibits a highly conserved ggTTCTGTTGCAAAGTTT signature at IRL (with only a partially conserved GG at the tip due to just 2 members) which is less conserved in IRR. Again, there is some internal conservation.

Clade g

(n = 8; composed of bacilli: Lysteria, Lactococcus, Enterococcus) also exhibits a similar signature (GGtTctgtTgcaAAgtTt) albeit less conserved. The left and has significant internal conservation.

Clade h

(n = 11; largely Staphylococci with 2 B. thuringiensis) also exhibit the typical GGTTCTGTTGCAAAGTTt signature and some internal conservation in IRL.

Clade i

(n = 1) is more heterogenous (Alpha proteobacteria: Methylobacterium, Paracoccus, Roseovarius, Rhizobium, Bradyrhizobium ; Deinococci and Halobacteria). It contains a poorly conserved IR sequence but does include a prominent gG dinucleotide tip and a poorly pronounced tgtcaagtt signature.

Clade j

Finally, bacterial Clade j (n= 5) composed entirely of Firmicutes (Natranaerobius, Clostridium and Thermoanaerobacter) exhibits a moderately well-defined internal signature TcTgTtAAgTt.

The archaeal-specific clades also generally exhibit well-defined consensus sequences.

Clade Ai

(n = 12) is composed uniquely of Halobacterial Euryarchaeota with a ggtaGTGTTcagatAaG signature and significant internal conservation which is different for each end.

Clade Aii

(n = 5), again, composed entirely of Halobacterial Euryarchaeota (Haloarcula, Halomicrobium, Natronomonas, Natronobacterium, Natrinema) also has well conserved ends, ggtcgTGTTTaGTT, and significant internal conservation which is different for each end.

Clade Aiii

This is also the case for Clade Aiii, also composed of diverse Halobacterial species (Halohasta, Haloferax, Natrinema, Natrialba, Halogeometricum, Natronomonas, Natronococcus, and Haloarcula): GgcACtGTCTAGtT.

Clade Aiv

However, Clade Aiv (n = 9) which includes both Euryarchaeota and Crenarchaeota, has poor conservation although on further analysis, an alignment shows significant conservation in the Sulfolobus and in the Pyrococcus groups with good interior conservation also in the 3 Pyrococcal members. It is possible that the IS ends in the Sulfolobus members have not been accurately identified.

The answer to the recent question:

An analysis of the IS6/IS26 family of insertion sequences: is it a single family ?[30] is therefore “Probably, yes”.

Genomic Impact

Activity resulting in horizontal dissemination is suggested, for example, by the observation that copies identical to Mycobacterium fortuitum IS6100 [32](Clade d) occur in other bacteria: as part of a plasmid-associated catabolic transposon carrying genes for nylon degradation in Arthrobacter sp. [33], from the Pseudomonas aeruginosa plasmid R1003 [34], and within the Xanthomonas campestris transposon Tn5393b [35]. Similar copies have also been reported in Salmonella enterica (typhimurium) [36], and on plasmid pACM1 from Klebsiella oxytoca (AF107205) [37].

A single member, ISDsp3, present in single copy in Dehalococcoides sp. BAV1 carries a passenger gene annotated as hypothetical protein.

IS257 [38](Clade h) (also known as IS431) has played an important role in sequestering a variety of antibiotic resistance genes in clinical isolates of methicillin resistant Staphylococcus aureus (MRSA) (e.g.[39][40][41][42]. It provides an outward oriented promoter which drives expression of genes located proximal to the left end. Moreover, both left and right ends appear to carry a –35 promoter component which would permit formation of hybrid promoters on insertion next to a resident –10 element [41][43]. Insertion of can result in activation of a neighboring gene using both a hybrid promoter and an indigenous promoter [41]. IS257 is also involved in expression of tetA [44] and dfrA [42] in S. aureus.

IS26 [6][7][8] (clade d) is encountered with increasing frequency in plasmids of clinical importance where it is involved in expression of antibiotic resistance genes and plasmid rearrangements (see [27][45][46][47][48][49]). Its transposition mechanism contributes to its ability to assemble anti-bacterial resistance genes into clusters (e.g. [50]). It can also form hybrid promoters capable of driving different antibiotic resistance genes: aphA7, blaS2A (Klebsiella pneumoniae [21]), blaSHV-2a (Pseudomonas aeruginosa [51]) and aphA7 (Pasteurella piscicida [52]) as well as the wide spectrum beta-lactam resistance gene blaKPC (Table IS and Gene Expression).

The formation of hybrid promoters on insertion (Table IS and Gene Expression) is clearly a general property of members of the IS6 family [21][41][42][53][54].

Another member, IS6100 [32] (Clade d), often used as an aid in classifying mycobacterial isolates[55] [56][57] has been found to drive strA strB expression in X. campestris pv. vesicatoria, [35].

This IS family is able to form transposons which resemble compound transposons with the flanking IS in direct repeat but, because of the particular transposition mechanism of IS6 family members (see below), were called pseudo-compound transposons [1]. These include Tn610 (flanked by IS6100 [32]), Tn4003 and others (flanked by IS257 [39]) and Tn6023 (flanked by IS26 [58]).


Clinical Importance of IS26.

In view of the particular importance of IS26 in sequestering antibiotic resistance genes and generating arrays of these genes in clinically important conjugative plasmids and in the host chromosome (see [27][50]), it is worthwhile devoting a separate section to the contribution of this IS to the clinical landscape. Recognition of its place as an important player has derived from the large number of sequences now available of multiple antibiotic resistance plasmids and chromosomal segments such as Genomic Resistance Islands (GRI). It is now no longer practical to provide a complete analysis of the literature. At present (19th November 2020) a PubMed search using IS26 as the search term yielded nearly 450 citations. The references in the following are not exhaustive but simply provide examples.

Arrays

IS6 family members are often found in arrays (Fig. IS6.5 and Fig. IS6.6) in direct and inverted repeat in multiple drug resistant plasmids (e.g. S. typhimurium ��[27,60,61]�, Klebsiella quasipneumoniae ��[62]�, Acinetobacter baumannii ��[48,63]�, Proteus mirabilis ��[64]� and uncultured sewage bacteria ��[65]� among many others). These are often intercalated in or next to other transposable elements rather than neatly flanking ABR genes and can form units able to undergo tandem amplification.

Amplification

Shropshire et al ��[66]�, studying clinical isolates of non-carbapenemase-producing Carbapenem-Resistant Enterobacteria, non-CP-CRE, isolated from several patients with recurrent bacteraemia, observed an increase in carbapenem resistance partially due to IS26-mediated amplification of up to 10 fold of a cassette blaOXA-1 and blaCTX-M-1 which forms part of a larger chromosomal structure of IS26 arrays which they call TnMB1860 (Fig. IS6.6). It was unclear whether this cassette amplification was due to transposition activity or, as had been observed in similar, IS1-mediated, gene amplifications ��[67–72]�. Another example has been revealed by Hastak et al ��[73]� who analysed a multi resistant derivative of the clinically important, globally dispersed pathogenic, Escherichia coli ST131 subclade H30Rx, isolated from a number of bacteraemic patients and revealed that increased piperacillin/tazobactam resistance was due to IS26-mediated amplification of blaTEM-1B. A similar type of limited (tandem dimer) amplification of an IS26-flanked blaSHV-5-carrying cassette found in plasmids from a number of geographically diverse enteric species was identified in a nosocomial E. cloacae strain ��[74]�. A more extensive amplification (>10 fold) was observed with the same cassette located in a different plasmid in a well-characterised laboratory strain of Escherichia coli and occurred in a recA-independent manner ��[46]� and even higher levels of tandem amplification (~65 fold) of the aphA1 gene in the IS26-based Tn6020 were identified in Acinetobacter baumannii ��[75]�.

Cointegrating plasmids.


Organization

Mechanism: the state of play

Cointegrate formation


Circular transposon molecules: translocatable units (TU)

Targeted transposition.

Acknowledgements

We would like to thank Susu He (Nanjing University) for stimulating discussions concerning the transposition models.

Bibliography

  1. 1.0 1.1 1.2 Galas DJ, Chandler M. Bacterial Insertion Sequences. In: Berg DE, Howe MM, editors. Mob DNA. Washington, D.C.: American Society for Microbiology; 1989. p. 109–162.
  2. Berg DE, Davies J, Allet B, Rochaix JD. Transposition of R factor genes to bacteriophage lambda. ProcNatlAcadSciUSA. 1975;72:3628–3632.
  3. Labigne-Roussel A, Courvalin P. IS15, a new insertion sequence widely spread in R plasmids of gram- negative bacteria. MolGenGenet. 1983;189:102–112.
  4. Trieu-Cuot P, Courvalin P. Nucleotide sequence of the transposable element IS15. Gene. 1984;30:113–120.
  5. <pubmed>2994132</pubmed>
  6. 6.0 6.1 <pubmed>PMC326375</pubmed>
  7. 7.0 7.1 7.2 <pubmed>PMC326375</pubmed>
  8. 8.0 8.1 <pubmed>3003524</pubmed>
  9. 9.0 9.1 <pubmed>PMC215669</pubmed>
  10. Nucken EJ, Henschke RB, Schmidt FR. Nucleotide-sequence of insertion element IS15 delta IV from plasmid pBP11. DNA Seq. 1990;1:85–88.
  11. <pubmed>6304469</pubmed>
  12. <pubmed>2999303</pubmed>
  13. 13.0 13.1 Colonna B, Bernardini M, Micheli G, Maimone F, Nicoletti M, Casalino M. The Salmonella wien virulence plasmid pZM3 carries Tn1935, a multiresistance transposon containing a composite IS1936- kanamycin resistance element. Plasmid. 1988;20:221–231.
  14. 14.0 14.1 <pubmed>PMC162495</pubmed>
  15. <pubmed>PMC305975</pubmed>
  16. <pubmed>32871211</pubmed>
  17. <pubmed>PMC1196216</pubmed>
  18. <pubmed>PMC3195058</pubmed>
  19. <pubmed>PMC3587239</pubmed>
  20. <pubmed>PMC219079</pubmed>
  21. 21.0 21.1 21.2 <pubmed>PMC209129</pubmed>
  22. Barberis-Maino L, Berger-Bachi B, Weber H, Beck WD, Kayser FH. IS431, a staphylococcal insertion sequence-like element related to IS26 from Proteus vulgaris. Gene. 1987;59:107–113.
  23. <pubmed>PMC174916</pubmed>
  24. <pubmed>3033719</pubmed>
  25. <pubmed>2543009</pubmed>
  26. Sundstrom L, Jansson C, Bremer K, Heikkila E, Olsson-Liljequist B, Skold O. A new dhfrVIII trimethoprim-resistance gene, flanked by IS26, whose product is remote from other dihydrofolate reductases in parsimony analysis. Gene. 1995;154:7–14.
  27. 27.0 27.1 27.2 <pubmed>19074421</pubmed>
  28. <pubmed>21393132</pubmed>
  29. <pubmed>PMC284528</pubmed>
  30. 30.0 30.1 <pubmed>PMC6807381</pubmed>
  31. <pubmed>PMC1847376</pubmed>
  32. 32.0 32.1 32.2 <pubmed>2163027</pubmed>
  33. <pubmed>PMC205175</pubmed>
  34. <pubmed>PMC196970</pubmed>
  35. 35.0 35.1 <pubmed>PMC167566</pubmed>
  36. <pubmed>10930753</pubmed>
  37. <pubmed>25291385</pubmed>
  38. Rouch D, Skurray R. IS257 from Staphylococcus aureus member of an insertion sequence superfamily Gram-positive and Gram-negative bacteria. Gene. 1989;76:195–205.
  39. 39.0 39.1 Rouch DA, Messerotti LJ, Loo SL, Jackson CA, Skurray RA. Trimethoprim resistance transposon Tn4003 from Staphylococcus aureus encodes genes for a dihydrofolate reductase and thymidylate synthetase flanked by three copies of IS257. Mol Microbiol. 1989;3:161–175.
  40. Stewart PR, Dubin DT, Chikramane SG, Inglis B, Matthews PR, Poston SM. IS257 and small plasmid insertions in the mec region of the chromosome of Staphylococcus aureus. Plasmid. 1994;31:12–20.
  41. 41.0 41.1 41.2 41.3 <pubmed>PMC101884</pubmed>
  42. 42.0 42.1 42.2 <pubmed>PMC284724</pubmed>
  43. <pubmed>PMC107441</pubmed>
  44. <pubmed>7830550</pubmed>
  45. <pubmed>PMC1913244</pubmed>
  46. <pubmed>23330672</pubmed>
  47. <pubmed>16870645</pubmed>
  48. <pubmed>23169892</pubmed>
  49. <pubmed>20093380</pubmed>
  50. 50.0 50.1 <pubmed>PMC4471558</pubmed>
  51. <pubmed>PMC89260</pubmed>
  52. <pubmed>27873653</pubmed>
  53. <pubmed>PMC2443897</pubmed>
  54. Allmansberger R, Brau B, Piepersberg W. Genes for gentamicin-(3)-N-acetyl-transferases III and IV. II. Nucleotide sequences of three AAC(3)-III genes and evolutionary aspects. MolGenGenet. 1985;198:514–520.
  55. <pubmed>PMC268253</pubmed>
  56. <pubmed>PMC330226</pubmed>
  57. <pubmed>PMC172007</pubmed>
  58. <pubmed>21702681</pubmed>