IS Families/IS21 family
IS21 was discovered in plasmid R68 where it was subsequently observed to undergo a tandem duplication which greatly facilitated the insertion of the plasmid into the Pseudomonas host genome. This led to the formation of Hfr strains and the subsequent demonstration that the Pseudomonas chromosome was genetically circular (Fig.IS21.1).
The conjugative plasmid R68 carries a single copy of IS21 and is inefficient in chromosome transfer. An R68 derivative, R68.45, exhibiting a marked increase in transfer of chromosomal genes, was isolated and proved to carry two directly repeated copies of IS21 separated by 3 bp (Fig.IS21.1).
In addition to the distinct examples restricted to individual species, several individual ISs are themselves quite widely distributed. Isoforms of IS21, several of which are incomplete, have been found in RP4 (IS8), in Shigella sonnei, and various clinical isolates of Escherichia coli, in the EHEC plasmid p0157 and in the Yersinia pestis virulence plasmid pCD1 (IS640; IS21p). Another family member, IS100 is found in several Yersinia plasmids, including pCD1, flanking a high pathogenicity island in Y. pestis (as well as multiple copies in the genome) and Y. pseudotuberculosis, and in the uropathogenic E. coli J96.
The IS21 family is fairly homogeneous. Family members have lengths in the range 1850-2800bp and are therefore among the largest bacterial IS elements (TABLE Characteristics of IS families). They carry related terminal IRs whose lengths may vary between 11 (IS21) and 50 bp (IS5376) and generally terminate in the dinucleotide 5'-CA-3' (Fig.IS21.2; IS21.3 and .IS21.4).
The IS21 family terminal IRs are complex. Several carry multiple repeated sequences at their ends which include part of the terminal IRs (Fig.IS21.2) and which may represent Tpase binding sites. IS21 itself also carries a degenerate set of such sites (Berger B, 2000, La séquence d’insertion IS21: la famille de cette élément bactérien, sa spécificité d'insertion et son utilisation pour une méthode de linker insertion mutagenesis in vitro, Université de Lausanne undergraduate thesis) which were not initially identified. Like a number of IS families, the ends of the element terminate with the dinucleotide 5’-CA-3’ (Fig.IS21.4).
Insertion results in a direct target repeat of 4 or, more frequently, 5 bp while two members (IS53 and IS408) may generate 8 bp.
IS21 family members encode two consecutive genes, along upstream frame designated istA the Tpase and a “helper” gene, and a shorter downstream frame, istB (Fig.IS21.2) which exhibits some similarity to the DnaA replication initiator protein due to the presence of an ATP binding motif, and often appears in BLAST searches of complete genomes.
Overall identities range from 10 to 59% for IstA and from 25 to 67% for IstB. For one member, IS53, a match of 45% identity of the potential istA frame with that of IS1326 can be obtained by the introduction of several frameshifts. An additional sequence, IS640 from Shigella sonnei, is a truncated iso-IS21. It carries an istA reading frame which is nearly identical to that of IS21 but is lacking istB and the right terminal IR. Many additional IS21 derivatives are appearing in the various bacterial genome sequences.
The istB frame may be located in a relative reading phase of -1 (e.g. IS21, IS5376) or +1 (e.g. IS232, IS1326) compared to istA. It can be slightly separated from istA (17 bp, IS408 - for which only a partial sequence lacking the right end is available) or overlap for one (IS21) or several base pairs (IS232, IS5376, IS1326), and is generally preceded by a potential ribosome binding site. In many members, the termination codon of istA and the initiation codon of istB overlap suggesting that IstB is produced by translational coupling (Fig.IS21.2).
The istA reading frame carries a motif related to the widespread integrase DDE motif but lacking the conserved K/R residue and a potential helix-turn-helix motif. Although there is no general conservation of the K/R residue expected seven residues downstream from the final E , there is good conservation of a basic amino acid at either position 7 or 8 (Berger B, 2000, La séquence d’insertion IS21: la famille de cette élément bactérien, sa spécificité d'insertion et son utilisation pour une méthode de linker insertion mutagenesis in vitro, Université de Lausanne undergraduate thesis). The istB frame carries a relatively well conserved potential NTP binding domain. Sequence alignments of IstA and of IstB gives very similar relationships.
IS21 has been studied in some detail. The tandem duplication observed in the original R68.45 can be envisaged to occur by various routes. It is presumably favoured by the marked tendency of IS21 to insert in, or close to, an IS21 end. It may also occur by unequal crossing over during replication of the donor plasmid or, as suggested for IS30, whose transposition mechanism shows some similarity with that of IS21 (see "IS30 family"), by dimerization of the donor plasmid and deletion of one intervening plasmid sequence located between the two IS copies (Fig.IS21.1). This type of mechanism has been demonstrated experimentally both for IS30 and for IS911; IS3 family). The tandemly repeated copies of IS21 promote insertion of the entire plasmid in a transposition event involving the abutted terminal IRs. Integration results in loss of the 3 bp located between the two ends in the parental R68.45 plasmid.
The resulting replicon fusion thus has the appearance of a true cointegrate (i.e. with a single directly repeated copy of the IS at each junction) although transposition does not involve duplication of the insertion sequence during the insertion event.
Both IstA and IstB are required for integration and production of a second IstA derivative (from an alternative translational initiation codon eight codons downstream of that of IstA itself) greatly increases the efficiency of replicon fusions for reasons that are at present unknown Integration is optimal when the distance separating the two ends is 4 bp. It is efficient with a 2 or 3 base pair separation but inefficient with smaller or larger intervening sequences. Like retroviruses, and several other transposable elements, IS21 terminates with a 5'-CA-3'. Mutation of the terminal A abolishes replicon fusion (Haas et al., 1996, Insertion sequence IS21: Related insertion sequence elements, transpositional mechanisms, and application to linker insertion mutagenesis, in Nakazawa T, Molecular biology of Pseudomonads).
The duplication of IS21 in R68.45 not only creates an efficient integrative donor structure by providing two closely spaced and correctly oriented IRs, but generates a strong promoter (as measured with a galK reporter gene) in which a -35 box located in the IRR of the upstream element is combined with a -10 box located in the abutted IRL of the downstream copy. This directs high levels of production of the IS21 transposition proteins in a similar manner to the junction promoter created by circularization of IS911 and IS2. The insertion of IS21 downstream of strong promoter results in the efficient expression of transposition proteins. Thus IS21 appears not to have adopted a mechanism which "protects" it against high levels of impinging transcription. It is presumably the necessity for a transposition substrate carrying two abutted ends and the low frequency of formation of such molecules which acts as a control on transposition activity. More recently, a low level of circular IS21 copies has been detected in preparations of IS21-carrying plasmid DNA (S. Schmid and D. Haas, pers.comm.), raising the possibility that IS21, like at least some of the IS3 family members, may also transpose via a circular intermediate.
It has been demonstrated in vitro, using E. coli cell extracts enriched for IstA, that this protein is responsible for the 3' end cleavage of IS21. A substrate carrying two abutted IS21 ends separated by 3 bp is cleaved specifically at both 3' ends in an IstA-dependent way [see ].
Current data are consistent with the notion that IS21 family members transpose using a two-step mechanism (Fig.IS21.7) by the formation of a reactive junction, similar to those formed in the copy-paste mechanism of IS3 and other families, in which two abutted IS21 ends are separated by several nucleotides. This is consistent with the marked tendency of IS21 to insert in, or close to an IS21 end. The reactive junction is subsequently integrated into the target DNA to give a DR of 4 to 8 bp. Integration is optimal when the distance separating the two ends in the junction is 4 bp. It is efficient with a 2- or 3-bp separation but inefficient with smaller or larger intervening sequences. IstA carries the DDE motif and is the Tpase. IstB is necessary for the target DNA capture and accurate insertion by IstA donor insertion by its cognate transposase.
Both IstA and IstB are required for efficient transposition while a product of alternative natural translation initiation within the Tpase gene (which removes the eight N-terminal amino acids) may facilitate integration to generate “cointegrates” and has been called a “cointegrase”.
Biochemical and structural data are now available concerning the role of IstB in IS21 transposition. Several transposable elements use regulatory AAA+ ATPases for function. For bacteriophage Mu (MuB) and Tn7 (TnsC), these proteins are involved in the choice of target sites. IS21, also carries a conserved potential NTP binding domain (Fig.IS21.5) which shares sequence similarity with the DnaA protein and because of this, often interferes with automatic bacterial genome annotation pipelines.
The X-ray crystallographic structure of IstB with its N-terminal domain removed including a bound inactive ADP-BeF3 ligand superposed well onto the AAA+ domain of DnaC (Fig.IS21.6). X-ray crystallography and cryo-electron microscopy showed that the full length IstB protein self-assembles into a helical decamer (a pentamer of dimers; a so-called clamshell-shape) forming a right-handed filament very similar to that of DnaC and MuB (Fig.IS21.6). Multimerization is dependent on the ATP-co-ordinating residues. This filament was capable of grasping DNA between two pentamers generating a 180° bend and also to introduce positive supercoils in an ATP dependent manner presumably by wrapping the DNA. Binding appears to show no sequence-specificity.
The transposase, IstA, was also shown to recognize the IstB DNA complex and to promote disassembly by stimulating ATP hydrolysis.
A model has been proposed in which IstB binding and subsequent DNA bending primes a selected insertion site for efficient transposition: an IstB decamer first associates with target DNA, and the complex then recruits IstA in the form of a transpososome or paired-end complex (PEC) including a pair of inverted IS21 repeats to generate a synaptic complex. Although it has been proposed that the PEC has precleared ends, this is not an essential requirement. The order of strand cleavage and the multiple stages of transpososome assembly must yet be defined. The inability of DNA to promote ATP turnover by IstB would impart sufficient stability and allow time for IstA recognition. IstA-stimulated ATP hydrolysis would lead to IstB dissociation to a clearing of the DNA, allowing the IstA access to the target DNA. In this model, IstB might be considered an AAA+-type co-chaperone (Fig.IS21.7).
IS21 has been reported to show a preference for insertion close to the end of a second copy of the element located in the target plasmid. This is a characteristic of many IS families whose members transpose via a circular dsDNA intermediate and presumably reflects their ability to form a synaptic complex between an IR carried by the intact IS and a second IR in the target molecule (see Fig.IS3.11).