General Information/IS Distribution
ISs are widespread (Fig.1.6.1) and can occur in very high numbers in prokaryotic genomes. A recent study concluded that proteins annotated as Tpases, or as proteins with related functions are by far the most abundant functional class in both the prokaryotic and eukaryotic genomic and metagenomic public databases (Fig.1.2.5).
Since the last surveys (e.g. ) many new ISs have been identified largely as a result of the massive increase in available sequenced prokaryotic genomes. Careful analysis of a number of these has also revealed that some genomes contain significant levels of truncated and partial ISs devoid of Tpase genes. These genomic "scars" represent traces of numerous ancestral transposition events. However, genome annotations are often based simply on the presence of Tpase genes (e.g. ) and do not include the entire DNA sequence with the IS ends. Indeed, a significant number of solo IS-related IRs have been identified in various genomes. Small IS fragments are rarely taken into account even though they can provide important insights into the evolutionary history of the host genome (Fig.1.6.2), (Fig.1.6.3) and (Fig.1.6.4). Not only can this seriously impair studies attempting to provide an overview of the evolutionary influence of TEs on bacterial and archeal genomes, but such fragments may encode truncated proteins and these could influence gene regulation (e.g. ). In bacteria  and eukaryotes truncated transposases have been shown to inhibit transposition. One example where annotation of IS fragments has provided important information is in the obligatory intracellular insect endosymbiont, Wolbachia, which also carries high numbers of full-length ISs. The sequence divergence observed suggests that several waves of IS invasion and elimination have occurred over evolutionary time .