General Information/Major Groups are Defined by the Type of Transposase They Use
The principal factor in IS classification is the similarity, at the primary sequence level, of the enzymes which catalyze their movement, their transposases (Tpases). In addition, a variety of characteristics are also taken into account. These include: the length and sequence of the short imperfect terminal inverted repeat sequences (IRs) carried by many ISs at their ends (TIRs or ITRs in eukaryotes); the length and sequence of the short flanking direct target DNA repeats (DRs) (TSD, Target Site Duplication, in eukaryotes) often generated on insertion; the organization of their open reading frames or the target sequences into which they insert. IS and some transposons can also be divided into two major types based on the chemistry used in breaking and rejoining DNA during TE displacement: the DDE (and DEDD) and HUH enzymes. Additional types of transposase enzymes have been identified (Fig.7.1) but are generally associated with other types of transposon rather than IS.
A relatively new type of potential transposase, Cas1, is associated with so-called casposons, elements that may resemble complex IS and are related to CRISPRs (for more details please see The Casposases section).
Groups with DDE Transposases
DDE enzymes, so-called because of a conserved Asp, Asp, Glu triad of amino acids which coordinate essential metal ions, use OH (e.g. H20) as a nucleophile in a transesterification reaction (Fig.7.1) and (Fig.7.2). IS with DDE enzymes are the most abundant type in the public databases (Fig.4.2). This is partly due to the fact that the definition of an IS became implicitly coupled to the presence of a DDE Tpase, an idea probably reinforced by the similarity between Tpases of IS (and other TE) and the retroviral integrases (Fig.7.2) particularly in the region including the catalytic site. More precisely, for these TE, the triad is DD(35)E in which the second D and E are separated by 35 residues. As more DDE transposases were identified, the distance separating the D and E residues was found to vary slightly (TABLE MGE transposases examined using secondary structure prediction programmes). However, for certain IS, this distance was significantly larger. In these cases, the Tpases include an “insertion domain” between the second D and E residues  with either α-helical or β-strand configurations (Fig.1.8.2). Although in most cases this is a prediction, it has been confirmed by crystallographic studies for the IS50 [β-strand and Hermes [α-helical; Tpases. The function of these “insertion domains” is not entirely clear.
Transposases examined by secondary structure prediction programs
|Table 2. Adapted from Hickman et al. 2010, Integrating prokaryotes and eukaryotes: DNA transposases in light of structure. 1 Information on the number of copies within the host genome was obtained from ISfinder or the reference indicated by the asterisk. 2 Where indicated, the secondary structure predicts an insertion domain between β5 and α4 with predominantly either β-strands or α-helices. 3 Relevant references include reviews or papers that report the results of secondary structure prediction, report sequence alignments or consensus sequences, identify the DDE/D catalytic residues, or demonstrate that the element is active. The association of certain eukaryotic superfamilies to specific IS families is as per Feschotte and Pritham (2005) and references therein.|
|Family||Element (or protein) analyzed||Active or # copies in genome1||From secondary structure, type of DDE/D motif2||Relevant references3|
|IS1||IS1N||>40*||DD(24)E||*Nyman et al., 1981; Ohta et al., 2002, 2004; Siguier et al., 2009|
|IS1595||ISPna2||—||DD(36)N||Siguier et al., 2009|
|Merlin: MERLIN1_SM||consensus||DD(36)E||Feschotte, 2004|
|IS3||IS911||Active||DD(35)E||Polard and Chandler, 1995; Rousseau et al., 2002|
|IS481||IS481||~100*||DD(35)E||*Glare et al., 1990; Chandler and Mahillon, 2002|
|IS4||IS50R||Active||PDB ID: 1muh||Rezsöhazy et al., 1993; Davies et al., 2000|
|IS701||IS701||Active (15*)||DD(β-strand)E||*Mazel et al., 1991|
|IS1634||IS1634||Active (~30*)||DD(β-strand)E||*Vilei et al., 1999|
|IS5||IS903||Active||DD(65)E||Derbyshire et al., 1987; Rezsöhazy et al., 1993; Tavakoli et al., 1997|
|PIF/Harbinger: PIFa (Z. mays)||Active||DD(59)E||Zhang et al., 2001; Kapitonov and Jurka, 2004; Sinzelle et al., 2008|
|IS1182||IS660||3||DD(β-strand)E||Takami et al., 2001|
|IS6||IS6100||Active||DD(34)E||Martin et al., 1990; Mahillon and Chandler, 1998|
|IS21||IS21||Active||DD(45)E||Mahillon and Chandler, 1998; Berger and Haas, 2001|
|IS30||IS30||Active||DD(33)E||Caspers et al., 1984; Mahillon and Chandler, 1998|
|IS66||IS679||Active||DD(α-helical?)E||Han et al., 2001|
|IS110||IS492||Active||DEDD||Perkins-Balding et al., 1999; Buchner et al., 2005|
|IS256||IS256||Active||DD(α-helical)E||Mahillon and Chandler, 1998; Prudhomme et al., 2002|
|MuDr/Foldback (Mutator)||Active||Eisen et al., 1994; Babu et al., 2006; Hua-Van and Capy, 2008|
|IS630||ISY100||Active||DD(34)E||Doak et al., 1994; Feng and Colloms, 2007|
|Tc1/mariner: Mos1 (D. mauritiana)||PDB ID: 2f7t||Plasterk et al., 1999; Richardson et al., 2006|
|Zator: Zator-1_HM||36*||DD(43)E||*Bao et al., 2009|
|IS982||ISPfu3||5||DD(47)E||Mahillon and Chandler, 1998|
|IS1380||IS1380A||~100*||DD(β-strand)E||*Takemura et al., 1991; Chandler and Mahillon, 2002|
|piggyBac (T. ni)||Active||DD(β-strand)D||Cary et al., 1989; Sarkar et al., 2003; Mitra et al., 2008|
|ISL3||IS31831||Active||DD(α-helical)E||Suzuki et al., 2006|
|Tn3||Tn3 (E. coli)||Active||DD(α-helical?)E, DD(α-helical)E insertion||Grindley, 2002|
|hAT||Hermes||Active||PDB ID: 2bw3||Warren et al., 1994; Rubin et al., 2001; Hickman et al., 2005|
|CACTA||CACTA1 (A. thaliana) En/Spm ZM||Active||DD(α-helical?)E/D?||Miura et al., 2001; DeMarco et al., 2006|
|Transib||Transib1_AG||Consensus||DD(α-helical)E||Kapitonov and Jurka, 2005; Chen and Li, 2008|
|RAG1 (M. musculus)||Active||Kim et al., 1999; Landree et al., 1999; Lu et al., 2006|
|Sola||Sola3-3_HM||Multiple copies*||DD(40)E||*Bao et al., 2009|
Major DDE transposition pathways
Although DDE-type transposons share basic transposition chemistry, different TE vary in the steps leading to the formation of a unique insertion intermediate (Fig.7.4). They catalyze the cleavage of a single DNA strand to generate a 3’OH at the TE ends which is subsequently used as a nucleophile to attack the DNA target phosphate backbone. This is known as the transferred strand. The variations are due to the way in which the second (non-transferred) strand is processed.
There are several ways in which second-strand processing can occur (Fig.7.4): for certain IS, the second strand is not cleaved but replication following the transfer of the first strand fuses donor and target molecules to generate cointegrates with a directly repeated copy at each donor/target junction. This is known as replicative transposition (e.g. IS6 and Tn3 families) or, more precisely, Target Primed Replicative Transposition (TPRT) (Fig.7.4 pathway a).
In the other pathways, the flanking donor DNA can be shed in several different ways: the non-transferred strand may be cleaved initially several bases within the IS prior to cleavage of the transferred strand [e.g. IS630 and Tc1 (Fig.7.4 pathway d); the 3’OH generated by the first-strand cleavage may be used to attack the second strand to form a hairpin structure at the IS ends liberating the IS from flanking DNA and subsequently hydrolyzed to regenerate the 3’OH known as conservative or cut-and-paste transposition (e.g. IS4 family; (Fig.7.4 pathway f) and (Tn10 movie - see below) (Figs. IS4.5; IS4.6; IS4.7); the 3’OH of the transferred strand from one IS end may attack the other to generate a donor molecule with a single strand bridge which is then replicated to produce a double-strand transposon circle intermediate and regenerating the original donor molecule known as copy-out-paste-in or more precisely Donor Primed Replicative Transposition (DPRT) (e.g. IS3 family)  (Fig.7.3 pathway e) and (IS911 movie - see below); or the 3’OH at the flank of the non-transferred strand may attack the second strand to form a hairpin on the flanking DNA and a 3’OH on the transferred strand (at present this has only been demonstrated for eukaryotic TE of the hAT family and in V(D)J recombination ) (Fig.7.4 pathway g).
Clearly, many families produce double-strand circular intermediates, but this does not necessarily mean that they all use the copy-paste DPRT mechanism, since a circle could formally be generated by excision involving recombination of both strands. These differences are reflected in the different IS families.
IS911. Copy out - Paste in (column e in figure 8.3)
Tn10. Cut and paste (column f in figure 8.3)
Groups with DEDD Transposases
A similar type of Tpase, known as a DEDD Tpase, is related to the Holiday junction resolvase, RuvC  but is at present limited to only a single known IS family (IS110). The organization of family members is quite different from that of the DDE ISs: they do not contain the typical terminal IRs of the DDE IS (although one subgroup, IS1111, carry sub-terminal IR) and do not generate flanking target DRs on insertion. This implies that their transposition occurs using a different mechanism as the DDE IS. It seems probable that an intermediate resembling a four-way Holliday junction is involved. Moreover, in contrast to the DDE transposases in which a DNA binding domain invariably precedes the catalytic domain, DEDD transposases appear to include a DNA binding domain downstream from the catalytic domain.
Groups with HUH Enzymes
|TE encoding the second major type of Tpase, called HUH (named for the conserved active site amino acid residues H=Histidine and U=large hydrophobic residue )(Fig.7.1) and (Fig.7.5), has been identified more recently. HUH enzymes are widespread single-strand nucleases. They include Rep proteins involved in bacteriophage and plasmid rolling circle replication and relaxases or Mob proteins involved in conjugative plasmid transfer. They are limited to two prokaryotic (IS91 and IS200/IS605; ) and one eukaryotic (helitron) TE family.
As Tpases, they are involved in presumed rolling circle transposition and also in single-strand transposition (see ). Not only is the transposition chemistry radically different to that of DDE group elements, since it involves DNA cleavage using a tyrosine residue and transient formation of a phospho-tyrosine bond, but the associated transposons have an entirely different organization and include sub-terminal secondary structures instead of IRs (see IS families ). Note that these Tpases are not related to the well-characterized tyrosine site-specific recombinases such as phage integrases.
There are two major HUH Tpase families: Y1 and Y2 enzymes (Fig.7.1)(see ) depending on whether there is a single or two catalytic Y residues. One family includes IS91-family transposases, the other includes IS200/IS605 transposases. Although these enzymes use the same Y-mediated cleavage mechanism, IS200/IS605 family Y1 transposases and IS91 transposases appear to carry out the transposition process in quite different ways. Neither carries terminal IRs nor do they generate DRs on insertion. Members of these families transpose using an entirely different mechanism to IS with DDE transposases. The members of the IS91 insertion sequence family, are related to a newly defined group, the ISCR (see “IS91-related ISCRs”) and with eukaryotic helitrons (Fig.7.1). These IS carry sub-terminal sequences which are able to form hairpin secondary structures (Fig.3.2). This is particularly marked in the IS200/IS605 family elements and, at least in the case of this family, it is these structures that are recognized by the transposase.
Groups with S-Transposases
The third transposase family is represented by IS607 which carries a Tpase closely related to serine recombinases such as the resolvases of Tn3 family elements. Little is known about their transposition mechanism. However, it appears likely, in view of the known activities of resolvases, that IS607 transposition may involve a double-strand DNA intermediate (Grindley cited as pers. comm. in ) see also  (Fig.7.1).
Groups with Y-Transposases
Finally, tyrosine site-specific recombinases of the bacteriophage integrase (Int) type are often associated with conjugative transposons (Integrative Conjugative Elements or ICE)( IS related to ICE) and are considered to be Tpases. However, at present there are no known IS which use this type of enzyme (Fig.7.1).
- ↑ Chandler M, Mahillon J. Insertion Sequences Revisited. In: Craig NL, Lambowitz AM, Craigie R, Gellert M, editors. Mobile DNA II. American Society of Microbiology; 2002. p. 305–366.
- ↑ Mahillon J, Chandler M . Insertion sequences. - Microbiol Mol Biol Rev: 1998 Sep, 62(3);725-74 [PubMed:9729608] [DOI]
- ↑ Siguier P, Gourbeyre E, Chandler M . Bacterial insertion sequences: their genomic impact and diversity. - FEMS Microbiol Rev: 2014 Sep, 38(5);865-91 [PubMed:24499397] [DOI]
- ↑ Siguier P, Gourbeyre E, Varani A, Ton-Hoang B, Chandler M . Everyman's Guide to Bacterial Insertion Sequences. - Microbiol Spectr: 2015 Apr, 3(2);MDNA3-0030-2014 [PubMed:26104715] [DOI]
- ↑ 5.0 5.1 5.2 5.3 5.4 Hickman AB, Dyda F . Mechanisms of DNA Transposition. - Microbiol Spectr: 2015 Apr, 3(2);MDNA3-0034-2014 [PubMed:26104718] [DOI]
- ↑ Fayet O, Ramond P, Polard P, Prère MF, Chandler M . Functional similarities between retroviruses and the IS3 family of bacterial insertion sequences? - Mol Microbiol: 1990 Oct, 4(10);1771-7 [PubMed:1963920] [DOI]
- ↑ Khan E, Mack JP, Katz RA, Kulkosky J, Skalka AM . Retroviral integrase domains: DNA binding and the recognition of LTR sequences. - Nucleic Acids Res: 1991 Feb 25, 19(4);851-60 [PubMed:1850126] [DOI]
- ↑ Kulkosky J, Jones KS, Katz RA, Mack JP, Skalka AM . Residues critical for retroviral integrative recombination in a region that is highly conserved among retroviral/retrotransposon integrases and bacterial insertion sequence transposases. - Mol Cell Biol: 1992 May, 12(5);2331-8 [PubMed:1314954] [DOI]
- ↑ 9.0 9.1 9.2 9.3 Hickman AB, Chandler M, Dyda F . Integrating prokaryotes and eukaryotes: DNA transposases in light of structure. - Crit Rev Biochem Mol Biol: 2010 Feb, 45(1);50-69 [PubMed:20067338] [DOI]
- ↑ Davies DR, Mahnke Braam L, Reznikoff WS, Rayment I . The three-dimensional structure of a Tn5 transposase-related protein determined to 2.9-A resolution. - J Biol Chem: 1999 Apr 23, 274(17);11904-13 [PubMed:10207011] [DOI]
- ↑ Hickman AB, Perez ZN, Zhou L, Musingarimi P, Ghirlando R, Hinshaw JE, Craig NL, Dyda F . Molecular architecture of a eukaryotic DNA transposase. - Nat Struct Mol Biol: 2005 Aug, 12(8);715-21 [PubMed:16041385] [DOI]
- ↑ Turlan C, Chandler M . Playing second fiddle: second-strand processing and liberation of transposable elements from donor DNA. - Trends Microbiol: 2000 Jun, 8(6);268-74 [PubMed:10838584] [DOI]
- ↑ Curcio MJ, Derbyshire KM . The outs and ins of transposition: from mu to kangaroo. - Nat Rev Mol Cell Biol: 2003 Nov, 4(11);865-77 [PubMed:14682279] [DOI]
- ↑ Plasterk RH . The Tc1/mariner transposon family. - Curr Top Microbiol Immunol: 1996, 204;125-43 [PubMed:8556864] [DOI]
- ↑ van Luenen HG, Colloms SD, Plasterk RH . The mechanism of transposition of Tc3 in C. elegans. - Cell: 1994 Oct 21, 79(2);293-301 [PubMed:7954797] [DOI]
- ↑ Feng X, Colloms SD . In vitro transposition of ISY100, a bacterial insertion sequence belonging to the Tc1/mariner family. - Mol Microbiol: 2007 Sep, 65(6);1432-43 [PubMed:17680987] [DOI]
- ↑ Haniford DB, Ellis MJ . Transposons Tn10 and Tn5. - Microbiol Spectr: 2015 Feb, 3(1);MDNA3-0002-2014 [PubMed:26104553] [DOI]
- ↑ Chandler M, Fayet O, Rousseau P, Ton Hoang B, Duval-Valentin G . Copy-out-Paste-in Transposition of IS911: A Major Transposition Pathway. - Microbiol Spectr: 2015 Aug, 3(4); [PubMed:26350305] [DOI]
- ↑ Zhou L, Mitra R, Atkinson PW, Hickman AB, Dyda F, Craig NL . Transposition of hAT elements links transposable elements and V(D)J recombination. - Nature: 2004 Dec 23, 432(7020);995-1001 [PubMed:15616554] [DOI]
- ↑ Choi S, Ohta S, Ohtsubo E . A novel IS element, IS621, of the IS110/IS492 family transposes to a specific site in repetitive extragenic palindromic sequences in Escherichia coli. - J Bacteriol: 2003 Aug, 185(16);4891-900 [PubMed:12897009] [DOI]
- ↑ Buchner JM, Robertson AE, Poynter DJ, Denniston SS, Karls AC . Piv site-specific invertase requires a DEDD motif analogous to the catalytic center of the RuvC Holliday junction resolvases. - J Bacteriol: 2005 May, 187(10);3431-7 [PubMed:15866929] [DOI]
- ↑ Tobiason DM, Buchner JM, Thiel WH, Gernert KM, Karls AC . Conserved amino acid motifs from the novel Piv/MooV family of transposases and site-specific recombinases are required for catalysis of DNA inversion by Piv. - Mol Microbiol: 2001 Feb, 39(3);641-51 [PubMed:11169105] [DOI]
- ↑ 23.0 23.1 Chandler M, de la Cruz F, Dyda F, Hickman AB, Moncalian G, Ton-Hoang B . Breaking and joining single-stranded DNA: the HUH endonuclease superfamily. - Nat Rev Microbiol: 2013 Aug, 11(8);525-38 [PubMed:23832240] [DOI]
- ↑ 24.0 24.1 24.2 24.3 He S, Corneloup A, Guynet C, Lavatine L, Caumont-Sarcos A, Siguier P, Marty B, Dyda F, Chandler M, Ton Hoang B . The IS200/IS605 Family and "Peel and Paste" Single-strand Transposition Mechanism. - Microbiol Spectr: 2015 Aug, 3(4); [PubMed:26350330] [DOI]
- ↑ Thomas J, Pritham EJ . Helitrons, the Eukaryotic Rolling-circle Transposable Elements. - Microbiol Spectr: 2015 Aug, 3(4); [PubMed:26350323] [DOI]
- ↑ Zabala JC, de la Cruz F, Ortiz JM . Several copies of the same insertion sequence are present in alpha-hemolytic plasmids belonging to four different incompatibility groups. - J Bacteriol: 1982 Jul, 151(1);472-6 [PubMed:6282809] [DOI]
- ↑ Garcillán-Barcia MP, de la Cruz F . Distribution of IS91 family insertion sequences in bacterial genomes: evolutionary implications. - FEMS Microbiol Ecol: 2002 Nov 1, 42(2);303-13 [PubMed:19709290] [DOI]
- ↑ Lam S, Roth JR . Genetic mapping of IS200 copies in Salmonella typhimurim strain LT2. - Genetics: 1983 Dec, 105(4);801-11 [PubMed:6315530] [DOI]
- ↑ Lam S, Roth JR . Structural and functional studies of insertion element IS200. - J Mol Biol: 1986 Jan 20, 187(2);157-67 [PubMed:3009825] [DOI]
- ↑ Lam S, Roth JR . IS200: a Salmonella-specific insertion sequence. - Cell: 1983 Oct, 34(3);951-60 [PubMed:6313217] [DOI]
- ↑ Kersulyte D, Velapatiño B, Dailide G, Mukhopadhyay AK, Ito Y, Cahuayme L, Parkinson AJ, Gilman RH, Berg DE . Transposable element ISHp608 of Helicobacter pylori: nonrandom geographic distribution, functional organization, and insertion specificity. - J Bacteriol: 2002 Feb, 184(4);992-1002 [PubMed:11807059] [DOI]
- ↑ Höök-Nikanne J, Berg DE, Peek RM Jr, Kersulyte D, Tummuru MK, Blaser MJ . DNA sequence conservation and diversity in transposable element IS605 of Helicobacter pylori. - Helicobacter: 1998 Jun, 3(2);79-85 [PubMed:9631304] [DOI]
- ↑ Kersulyte D, Akopyants NS, Clifton SW, Roe BA, Berg DE . Novel sequence organization and insertion specificity of IS605 and IS606: chimaeric transposable elements of Helicobacter pylori. - Gene: 1998 Nov 26, 223(1-2);175-86 [PubMed:9858724] [DOI]
- ↑ del Pilar Garcillán-Barcia M, Bernales I, Mendiola MV, de la Cruz F . Single-stranded DNA intermediates in IS91 rolling-circle transposition. - Mol Microbiol: 2001 Jan, 39(2);494-501 [PubMed:11136468] [DOI]
- ↑ Ton-Hoang B, Guynet C, Ronning DR, Cointin-Marty B, Dyda F, Chandler M . Transposition of ISHp608, member of an unusual family of bacterial insertion sequences. - EMBO J: 2005 Sep 21, 24(18);3325-38 [PubMed:16163392] [DOI]
- ↑ Mendiola MV, de la Cruz F . IS91 transposase is related to the rolling-circle-type replication proteins of the pUB110 family of plasmids. - Nucleic Acids Res: 1992 Jul 11, 20(13);3521 [PubMed:1321417] [DOI]
- ↑ Mendiola MV, Jubete Y, de la Cruz F . DNA sequence of IS91 and identification of the transposase gene. - J Bacteriol: 1992 Feb, 174(4);1345-51 [PubMed:1310503] [DOI]
- ↑ Toleman MA, Bennett PM, Walsh TR . ISCR elements: novel gene-capturing systems of the 21st century? - Microbiol Mol Biol Rev: 2006 Jun, 70(2);296-316 [PubMed:16760305] [DOI]
- ↑ Kapitonov VV, Jurka J . Helitrons on a roll: eukaryotic rolling-circle transposons. - Trends Genet: 2007 Oct, 23(10);521-9 [PubMed:17850916] [DOI]
- ↑ Filée J, Siguier P, Chandler M . Insertion sequence diversity in archaea. - Microbiol Mol Biol Rev: 2007 Mar, 71(1);121-57 [PubMed:17347521] [DOI]
- ↑ Boocock MR, Rice PA . A proposed mechanism for IS607-family serine transposases. - Mob DNA: 2013 Nov 6, 4(1);24 [PubMed:24195768] [DOI]