Difference between revisions of "General Information/Transposase expression and activity"

From TnPedia
Jump to navigation Jump to search
Line 2: Line 2:
While many of the classical mechanisms of controlling gene expression, such as the production of transcriptional repressors (IS''1'': <ref><nowiki><pubmed>2553980</pubmed></nowiki></ref><ref><nowiki><pubmed>1848178</pubmed></nowiki></ref>; IS''2'': <ref><nowiki><pubmed>8107136</pubmed></nowiki></ref> or translational inhibitors (anti-sense RNA in the case of IS''10''; see <ref><nowiki><pubmed>8556869</pubmed></nowiki></ref> are known to operate in Tpase expression, several other mechanisms have also been uncovered.   
While many of the classical mechanisms of controlling gene expression, such as the production of transcriptional repressors (IS''1'': <ref><nowiki><pubmed>2553980</pubmed></nowiki></ref><ref><nowiki><pubmed>1848178</pubmed></nowiki></ref>; IS''2'': <ref><nowiki><pubmed>8107136</pubmed></nowiki></ref> or translational inhibitors (anti-sense RNA in the case of IS''10''; see <ref><nowiki><pubmed>8556869</pubmed></nowiki></ref> are known to operate in Tpase expression, several other mechanisms have also been uncovered.   
===Impinging transcription===
===Impinging transcription ===
Many ISs have evolved mechanisms which attenuate their activation by impinging transcription following insertion into active host genes. This was originally observed in the case of IS''1''<ref><nowiki><pubmed>6281761</pubmed></nowiki></ref><ref><nowiki><pubmed>6313938</pubmed></nowiki></ref>, for [[wikipedia:Bacteriophage_Mu|bacteriophage Mu]]<ref><nowiki><pubmed>3015876</pubmed></nowiki></ref> and IS''50''. To our knowledge other elements have not been examined.
Many ISs have evolved mechanisms which attenuate their activation by impinging transcription following insertion into active host genes. This was originally observed in the case of IS''1''<ref><nowiki><pubmed>6281761</pubmed></nowiki></ref><ref><nowiki><pubmed>6313938</pubmed></nowiki></ref>, for [[wikipedia:Bacteriophage_Mu|bacteriophage Mu]]<ref><nowiki><pubmed>3015876</pubmed></nowiki></ref> and IS''50''. To our knowledge other elements have not been examined.

Revision as of 18:41, 12 May 2020

Transposase expression and activity

While many of the classical mechanisms of controlling gene expression, such as the production of transcriptional repressors (IS1: [1][2]; IS2: [3] or translational inhibitors (anti-sense RNA in the case of IS10; see [4] are known to operate in Tpase expression, several other mechanisms have also been uncovered.

Impinging transcription

Many ISs have evolved mechanisms which attenuate their activation by impinging transcription following insertion into active host genes. This was originally observed in the case of IS1[5][6], for bacteriophage Mu[7] and IS50. To our knowledge other elements have not been examined.

This effect may be the result of disrupting complexes between Tpase and cognate DNA ends and could reflect either inhibition of transposase binding or disruption of extant transposase-end complexes.

In the case of bacteriophage Mu, transcription originating from within the element and impinging on the left end has also been shown to reduce activity[8]. It is possible that transcription disrupts the formation of intermediates including transposase and one or both Mu ends which lead to stable transposition complexes.

Sequestration of translation initiation signals

One such mechanism observed with IS10 and IS50, and potentially present in several other ISs, is the sequestering of translation initiation signals in an RNA secondary structure (Fig.1.32.1). These ISs carry inverted repeat sequences located close to the left end which include the ribosome binding site or translation initiation codon for the Tpase gene. Transcripts from the resident Tpase promoter include only the distal repeat unit which is unable to form the secondary structure, while transcripts from neighboring DNA include both repeats and would generate secondary structures in the mRNA which would sequester translation initiation signals[9][10]. This has been demonstrated experimentally for IS10 and IS50 but several additional insertion sequences carry such potential structures and might be expected to exhibit a similar mechanism (see [11]).

Fig 1.32.1. Sequestration of translation initiation signals. The figure shows the left end of an IS (green) with its terminal inverted repeat (red). A transcript impinging from outside the element (initiated at P extern) is shown as a wavy line above and transcription driven by the indigenous promoter (PTpase) is indicated below. Internal inverted repeat sequences are indicated by bold lines and their relative orientation is shown by arrows. The bottom panel shows the sequence and secondary structure features of IS10 in this region.

Programmed Translational Frameshifting

A second mechanism acts at the level of translation elongation and involves programmed translational frameshifting between two consecutive open reading frames (Fig.1.32.1). Typically a -1 frameshift is observed in which the translating ribosome slides one base upstream and resumes in the alternative phase. This generally occurs at the position of so-called slippery codons in a heptanucleotide sequence of the type X XXZ ZZN in phase 0 (where the bases paired with the anticodon are shown as triplets) which is read as XXX ZZZ N in the shifted -1 phase (Fig.1.32.1) (see e.g. [12][13][14], http://recode.genetics.utah.edu/). The sequence A AAA AAG is a common example of this type of heptanucleotide. Ribosomal shifting of this type is stimulated by structures in the mRNA which tend to impede the progression of the ribosome such as potential ribosome binding sites upstream or secondary structures (stem-loop structures and pseudoknots) downstream of the slippery codons. Translational control of transposition by frameshifting has been demonstrated both for IS1[15][16], and for members of the IS3 family (Fig.1.33.2) ([17]; see also [18]) but may also occur in several other IS elements (see for example IS5 and IS630 families). For IS1 and members of the IS3 family, the upstream frame appears to carry a DNA recognition domain whereas the downstream frame encodes the catalytic site. While the product of the upstream frame alone acts as a modulator of activity, presumably by binding to the IR sequences, frameshifting assembles both domains into a single protein, the Tpase, which directs the cleavages and strand transfer necessary for mobility of the element. The frameshifting frequency is thus critical in determining overall transposition activity. Although it has yet to be explored in detail, frameshifting could be influenced by host physiology thus coupling transposition activity to the state of the host cell.

Fig 1.33.1. Programmed translational frameshifting schema.A). Programmed translational frameshifting. The IS is shown in green and the terminal IRs in red. Two consecutive open reading frames (A and B) together with their relative reading phases (0 and –1, respectively) and the region of overlap are shown within the IS element in yellow. The overall secondary structure of the corresponding mRNA is shown below (bold line). The group of codons that permit the ribosome to slide back one nucleotide is also indicated by an open box. The potential to form a pseudoknot is also indicated. Both the secondary structure and pseudoknot slow down the ribosome and contribute to the frequency of translational frameshifting. The bottom of the figure shows how frameshifting can assemble two different functions into one protein. Here, this occurs between the N-terminal region that carries a helix–turn–helix motif (H–T–H) which allows sequence-specific DNA binding to the ISs as well as multimerization motifs (not shown) and the C-terminal region including the DD(35)E motif which is essential for catalysis. B). A general schematic showing the ribosome A, P and E sites, the slippery codons and the downstream secondary structure.
Fig 1.33.2. Frameshifiting in IS911 family. A). Organization of IS911. Top: Left (IRL) and right (IRR) terminal inverted repeats in grren; orfA (blue) and orfB (green) reading frames; IS911 promoter, pIRL. B). Structure/function map of OrfA and OrfAB; HTH: helix-turn-helix motif for DNA recognition and binding; LZ: leucine zipper motif involved in OrfAB and OrfA multimerization; M: region also necessary for OrfAB multimerization; DDE: conserved catalytic domain. C). Schematic of the stem-loop which facilitates ribosome slowdown and slippery codons at which the ribosome undergoes refphasing in the -1 direction.

Programmed Transcriptional Frameshifting

Another type of frameshifting may also occur in bacterial insertion sequences. This occurs at the transcriptional level and involves misreading and slippage of RNA polymerase on stretches of A residues. Although no real data is at present available, it may occur relatively frequently[19][20][21].

Unusual amino acids Pyrrolysine and Selenocycteine

Another type of recoding which appears to occur in Methanosarcina, is the “suppression” of the stop codon, UAG, by insertion of Pyrrolysine. This was first noted in the Methylamine methyltransferases which are important in the production of methane by archaeal methanogens. [22] identified an in-frame amber codon (TAG) in the trimethylamine methyltransferase genes of both M. barkeri and M. thermophila. However, at least in the case of M. barkeri, abundant quantities of the full-length protein could be obtained and it appeared that the TAG codon was read as Lys. This later proved to be the unusual amino acid pyrrolysine. Several IS copies in these archaeal methanogens carry TAG codons which are presumably “suppressed” by decoding as pyrrolysine.

In this framework, other stop codons are known to be suppressed by decoding as selenocysteine [23]; [24][25][26]. To our knowledge a single IS from the IS3 family, ISDvu3 from Desulfovibrio vulgaris, includes a selenocysteine inserted at a stop codon in its orfB frame (M. Land personal communication). Undoubtedly additional examples will be identified in the future.

The inclusion of these amino acids involves the presence of specific types of secondary structures in the mRNA (Fig.1.33.3).

Fig 1.33.3. Insertion of non-standard amino acids.Different RNA structures which signal the incorporation of non-canonical amino acids. From left to right: selenocysteine insertion element (SECIS). A specific factor, SelB, bridges selenocysteine loaded tRNA and SECIS. The result is that the selenocysteine is incorporated into the peptide at a UGA stop codon. Eukaryotes and archaea use a different type of structure; pyrrolysine insertion elements (PYLIS). This type of structure likely to signal for pyrrolysine to be inserted in the archaeal methyltransferase peptide at a UAG codon in a similarly to SECIS. From Schimmel and Beebe (2004)

Translation termination

A third potential mechanism derives from the observation that the translation termination codon of Tpase genes of certain elements is located within their IR sequences. Although to our knowledge, no extensive analysis of the significance of this arrangement has yet been undertaken, it seems possible that it may in some manner couple translation termination, transposase binding, and transposition activity. The transposase gene of several elements does not possess a termination codon. These include IS240C, a member of the IS6 family (Y. Chen and J.Mahillon unpublished), two members of the IS5 family, IS427[27] and ISMk1[28] and various members of the IS630 family including IS870 and ISRf1[29]. Instead, some of these elements insert into a relatively specific target sequence in which the target DR produced on insertion itself generates the Tpase termination codon (see: IS630 family). The relevance of this as a control mechanism has yet to be explored.

Transposase stability

Transposase stability can also contribute to the control of transposition activity. The Tpase of IS903 is sensitive to the E. coli Lon protease[30]. This sensitivity limits the activity of the Tpase both temporally and spatially and may provide an explanation for the observation that several Tpases function preferentially in cis. Indeed mutant IS903 Tpase derivatives have been isolated which exhibits an increased capacity to function in trans. These are more refractory to Lon degradation than the wildtype protein[31]. Some evidence that Lon may also be involved in regulating Tn5 (IS50) transposition has also been presented[32].

An observation which might also reflect Tpase instability is the temperature sensitive nature of IS1-mediated adjacent deletions in vivo[33], of Tn3 transposition[34] and of IS911 intramolecular recombination both in vivo and in vitro[35]. For IS911, incubation of the Tpase at 42°C results in an irreversible loss in activity. Further analyses showed that there was an increase in the proportion of transposase fragments some of which can presumably interact with full length transposase to inhibit its activity. This effect can be somewhat reduced by a series of mutation in the transposase gene whose function in stabilising the transposase is as yet unknown[36].

Co-translational binding and multimerization

Fig 1.36.1. Co-translational binding.Left. Artist impression. An RNA polymerase molecule (single blue sphere) in the process of transcribing the transposase gene while the following ribosome (double blue sphere) translates the N-terminal region of the transposase (brown ellipses). The emerging N-terminal ribosome-associated peptide is anchored close to the IS by the mRNA emerging from the RNApol transcription complex. The emerging peptide is shown binding to the neighboring double-strand end IS (green helix). Binding occurs within a short window of time during translation but before the C-terminus of the transposase has been synthesized. The C-terminal end is thought to mask the transposase N-terminal DNA binding domain preventing binding of the full-length protein. This type of control would prevent excessive transposition of other ISs in the cell (and consequent deleterious effects on genome integrity) due to the presence of free active transposase. Artwork adapted by David Villa (components from Fotolia). Right. Cartoon illustrating cotranslational binding. IRL and IRR are indicated as is the indigenous promoter pIRL, located partially in IRL. RNA polymerase (blue), both ribosome sub units (blue) and mRNA are also indicated. The nascent peptide is shown in brown. The cartoon is not to scale.

Certain prokaryotic IS transposases show a strong preference for acting on the element from which they are expressed rather than on other copies of the same element in the cell. This phenomenon of “cis” preference presumably serves to prevent general activation of several identical IS copies by any “accidental” (stochastic) transposase expression from a single IS. Several different IS such as IS1[37], IS10[38], IS50[39], IS903[40] and IS911[41] (see [42][43] and references therein) exhibit this regulatory phenotype but “cis” preference may be the result of a combination of diverse mechanisms. Thus the Lon protease enhances “cis” preference of the IS903 transposase[44]. Transposition is enhanced in the absence of Lon and can be overcome by increased transposase expression[45]. For IS10, it is influenced by translation levels, Tpase mRNA half-life and translation efficiency[46][47].

Another mechanism, co-translational binding based on tight coupling between prokaryotic transcription and translation, was proposed to explain the inability to complement a Tpase mutant of IS903[48] and, more specifically, for Tn5[49].

Some full length IS transposases bind weakly to their cognate IR but the isolated DNA binding domain can bind more strongly. This has been observed for transposases of several elements including IS1[50] and IS30[51][52] and has also been observed for that of IS911. Early studies using band shift assays demonstrated that full length OrfAB binds the IRs only weakly and that OrfA binding was even lower or undetectable[53][54]. However, a truncated version of OrfAB, OrfAB [1-149], which is amputated for the C-terminal catalytic domain bound both ends avidly[55] (see also Transposase Stability). It is important to note this implies that, in many in vitro systems, the majority of transposase is thus likely to be inactive or only partially active since it would not bind stably to its substrate. The observations suggest that the C-terminal (C-ter) domain inhibits specific binding by the sequence-specific N-terminal DNA binding domain possibly by steric masking (Fig.1.36.1). This idea is consistent with the observation that IS10 transposase activity is increased by partial denaturation (for example by treatment with low alcohol concentrations; [56]). It is also consistent with the observation that the OrfAB protein of IS2 can bind the IS2 IRs when it carries a large GFP tag[57][58].

One biological explanation for cis preference is that the nascent N-ter domain might fold before completion of translation of the C-ter domain and the nascent protein could initiate binding directly to the closest IS end. Once bound, it would no longer be sensitive to masking by the C-ter domain. If binding fails to occur after translation of the N-ter DNA binding domain, continuing translation and folding of the C-ter domain would then mask the DNA binding domain resulting in an inactive protein. This implies that binding necessary for subsequent catalysis would occur only transitorily early in translation (Fig.1.36.1).

Direct evidence for co-translational binding was provided for IS911 using an in vitro transcription/translation system[59] where it was also demonstrated that reducing the efficiency of the -1 translational frameshifting required for IS911 transposase expression resulted in an increase in binding of a nascent transposase peptide. This is presumably because slowing the frameshifting process increases the time that the N-terminal part of the protein (which carries the sequence-specific DNA binding domain) is present on the ribosome enhancing its probability of binding to a neighboring IS end. It is interesting to note that in many IS, the DNA binding domain which recognizes the IR is located at the N-terminal end of the protein which is translated first.

One of the remaining questions concerns transposase multimerization. They must form multimers within the transpososome at some stage in the transposition pathway. Some transposases are monomeric in the absence of DNA (e.g. MuA and Tn5; [60][61][62]) while others are multimeric dimeric in solution (e.g. INHIV-1; [63]; [64]; [65]; [66]).

In view of the possibility that many transposases undergo co-translational binding, and the observation that several different purified full-length transposases bind poorly to the ends of their cognate transposon (while the isolated N-terminal DNA binding domain alone binds robustly), it must be emphasized that purified transposases are probably largely inactive. This must be taken into account when assessing transposase properties.

A recent study has provided support for the idea that transposases may also be able to multimerize cotranslationally. This study, has shown that bacterial luciferase subunits LuxA and LuxB may assemble cotranslationally in vivo. This process requires ribosome-associated trigger factor. This chaperone apparently delays subunit interactions until the LuxB dimer interface is available[67].

Cleavage in Trans: A Committed Complex

Another level of regulation often occurs within the transpososome (see Reaction Mechanisms) itself. Transposases bound to their specific recognition sequence, generally at both ends of the TE, form a synaptic (SC) or paired-end complex (PEC) with a precise architecture for activity, and the transposase bound at one end is used to cleave the opposite end (Fig.1.36.2) (e.g. [68] and [69]). This ensures that the correct SC is formed prior to cleavage and strand transfer and thus avoids the generation of nonproductive cleavage products, which would cause damage to the donor DNA molecule.

Fig 1.36.2. Trans-cleavage. The figure shows a transposase dimer in a paired-end with two transposons ends. Cleavage of single-ends is inefficient. Transposase bound at one end cleaves the other end. Cleavage, therefore, requires preassembly of the synaptic complex.

Host factors

Transposition activity is frequently modulated by various host factors. These effects are generally specific for each element. A non-exhaustive list of such factors includes the DNA chaperones (or histone-like proteins), IHF, HU, HNS, and FIS, the replication initiator DnaA, the protein chaperone/proteases ClpX, P, and A, the SOS control protein LexA, and the Dam DNA methylase. In addition, proteins which govern DNA supercoiling in the cell can also influence transposition.


DNA chaperones may play roles in assuring the correct three dimensional architecture in the evolution of various nucleoprotein complexes necessary for productive transposition. They may also be involved in regulating Tpase expression. IHF, HU, HNS, and FIS have all been variously implicated in the case of bacteriophage Mu, in the control of Mu gene expression or directly in the transposition process (see [70] for review).

Several elements carry specific binding sites for IHF within, or close to, their terminal IRs. These can lie within (e.g. IS1: [71]; IS903: see [72] or close to (IS10: [73]) the Tpase promoter. IHF appears to influence the nature of IS10 transposition products by binding to a site 43 bp from one end [74], [75], [76].

IHF also stimulates Tpase binding to the ends of the Tn3 family member, Tn1000 or γδ [77]. Ironically, although IS1 was the first element in which IHF sites were identified (one within each IR), conditions have not yet been found in which IHF shows a clear effect on transposition or gene expression (D. Zerbib and M. Chandler, unpublished results).

The multiple roles of several of these proteins in both the IS10 and Tn5 (IS50) systems and the dynamics of their involvement has been determined in detail[78][79][80][81][82][83][84][85].

The histone-like nucleoid structuring protein H-NS, a global transcriptional regulator, has also been implicated in the regulation of bacterial transposition systems, including Tn10[86][87][88]. It appears to promote transposition by binding directly to the transposition complex (or transpososome).


In the case of IS50, an element of the same family as IS10, both the protein Fis and the replication initiator protein DnaA have been reported to intervene in transposition (see [89]). Finally another "histone-like" protein, HNS, has been reported to stimulate transposition of IS1 in certain circumstances[90].

Although their mode of action is at present unknown, several other host proteins with otherwise entirely different functions have been implicated in transposition.

Accessory proteins: Acyl carrier protein (ACP), ribosomal protein L29, PepA and ArgR Acyl carrier protein (ACP) was independently shown to stimulate 3' end cleavage of Tn3 by its cognate Tpase[91] and, together with ribosomal protein L29, to greatly increase binding of TnsD (a protein involved in Tn7 target selection) to the chromosomal insertion site, attTn7[92]. Moreover ACP and L29 moderately stimulate Tn7 transposition in vitro while L29 alone has a significant stimulatory effect in vivo[93]. The mode of action of these proteins may be similar to that of the accessory proteins PepA and ArgR which modify the architecture of the synaptic complex in certain XerC/XerD-mediated site-specific recombination reactions[94].


  1. <pubmed>2553980</pubmed>
  2. <pubmed>1848178</pubmed>
  3. <pubmed>8107136</pubmed>
  4. <pubmed>8556869</pubmed>
  5. <pubmed>6281761</pubmed>
  6. <pubmed>6313938</pubmed>
  7. <pubmed>3015876</pubmed>
  8. <pubmed>3015876</pubmed>
  9. <pubmed>2416461</pubmed>
  10. <pubmed>2438419</pubmed>
  11. <pubmed>7934941</pubmed>
  12. <pubmed>8384687</pubmed>
  13. <pubmed>8852897</pubmed>
  14. <pubmed>8811194</pubmed>
  15. <pubmed>2543983</pubmed>
  16. <pubmed>1848178</pubmed>
  17. <pubmed>1660923</pubmed>
  18. <pubmed>8384687</pubmed>
  19. <pubmed>PMC1088944</pubmed>
  20. <pubmed>16460832</pubmed>
  21. <pubmed>21673094</pubmed>
  22. <pubmed>10762254</pubmed>
  23. <pubmed>15788401</pubmed>
  24. <pubmed>23185002</pubmed>
  25. <pubmed>15372017</pubmed>
  26. <pubmed>12029131</pubmed>
  27. <pubmed>1963949</pubmed>
  28. <pubmed>8409920</pubmed>
  29. <pubmed>8387998</pubmed>
  30. <pubmed>2161528</pubmed>
  31. <pubmed>8898394</pubmed>
  32. <pubmed>2438419</pubmed>
  33. <pubmed>1101028</pubmed>
  34. <pubmed>378977</pubmed>
  35. <pubmed>9302015</pubmed>
  36. <pubmed>17078817</pubmed>
  37. <pubmed>6281761</pubmed>
  38. <pubmed>6299577</pubmed>
  39. <pubmed>6291787</pubmed>
  40. <pubmed>6271455</pubmed>
  41. <pubmed>22195971</pubmed>
  42. <pubmed>15207871</pubmed>
  43. <pubmed>2161528</pubmed>
  44. <pubmed>2161528</pubmed>
  45. <pubmed>8898394</pubmed>
  46. <pubmed>7692216</pubmed>
  47. <pubmed>8412678</pubmed>
  48. <pubmed>6271455</pubmed>
  49. <pubmed>6296834</pubmed>
  50. <pubmed>2826132</pubmed>
  51. <pubmed>15469518</pubmed>
  52. <pubmed>2154486</pubmed>
  53. <pubmed>10677279</pubmed>
  54. <pubmed>11352577</pubmed>
  55. <pubmed>9761671</pubmed>
  56. <pubmed>8132525</pubmed>
  57. <pubmed>22032517</pubmed>
  58. <pubmed>22277150</pubmed>
  59. <pubmed>22195971</pubmed>
  60. <pubmed>1655409</pubmed>
  61. <pubmed>9867814</pubmed>
  62. <pubmed>18680433</pubmed>
  63. <pubmed>12446721</pubmed>
  64. <pubmed>17157316</pubmed>
  65. <pubmed>19609359</pubmed>
  66. <pubmed>19229293</pubmed>
  67. <pubmed>26405228</pubmed>
  68. <pubmed>23135398</pubmed>
  69. <pubmed>10884228</pubmed>
  70. <pubmed>8805293</pubmed>
  71. <pubmed>2995832</pubmed>
  72. <pubmed>6261245</pubmed>
  73. <pubmed>8556869</pubmed>
  74. <pubmed>7744253</pubmed>
  75. <pubmed>9630232</pubmed>
  76. <pubmed>7556079</pubmed>
  77. <pubmed>2844529</pubmed>
  78. <pubmed>9630232</pubmed>
  79. <pubmed>11447129</pubmed>
  80. <pubmed>14530435</pubmed>
  81. <pubmed>14992723</pubmed>
  82. <pubmed>19696075</pubmed>
  83. <pubmed>19042975</pubmed>
  84. <pubmed>16166383</pubmed>
  85. <pubmed>17501923</pubmed>
  86. <pubmed>19696075</pubmed>
  87. <pubmed>16166383</pubmed>
  88. <pubmed>17501923</pubmed>
  89. <pubmed>7504907</pubmed>
  90. <pubmed>10583504</pubmed>
  91. <pubmed>9077464</pubmed>
  92. <pubmed>16166383</pubmed>
  93. <pubmed>16166383</pubmed>
  94. <pubmed>9348666</pubmed>