ISAsp3
- Family IS4
- Group ISPepr1
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Anabaena sp. | Anabaena sp. 90 |
DNA section
IS Length : 1452 bp
Ends
IR Length : 21/24
IRL : CAATACTGTAGCCAAAAAAAGAGGATGAAGAGAGAGGGCTGATGTAGTAG
IRR : CAATACTGTAGCCAAATTACGAGGGTTCAACGGCTGTAGGAGCATTGTGA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CGGGAATATC | ACCAATGGAA | 0 | |
AAACCCTAAT | ACTAT | ACTTGTTTTA | 5 |
TTGCAATTTC | ACTT | CTAAAATGAC | 4 |
AATTTCACTT | ACTATTACTA | 0 | |
AATATATTCA | GTTAA | CTAACACCCA | 5 |
DNA sequence
CAATACTGTAGCCAAAAAAAGAGGATGAAGAGAGAGGGCTGATGTAGTAGAGTTATAGTCTTCCAAAACAAAAACTCGAAAACCACATACAGCCCATGTT
TGATATCCTAACACTCCTGCAATGCCTGCTACCAGAAATAAAAGTGACGACTATGCGGCAATTGAGTCAGATCATCATGGCCATGTTAGCAATGAGCGGA
CGAGTAACGATGTTGGGAATTTCGCGTTGGACGCTTAGTGGTGGTAGTTATCGGACGGTAATCAGATTCTTTCGGACAGTCATACCGTGGGCAACGGTGT
TCTGGGTGTTTTTTCGGCGGCATTTGTTTTGCCCAAATGATGTTTATTTGTTGGCAGGAGATGAAGTAGTAATAAGCAAAGCCGGGAAAAAAACCTATGG
ATTAGATAGGTTCTTCTCAAGTCTAACAAGCAAACCAATATCGGGGCTATCTTTCTTTACCTTATCATTAGTAAGTGTTCAACAAAGACACTCATTTCCG
ATTCAGATAGAACAGGTAATCAAGAGCGATGTAGAAAAAAGTATTGTCTCACCAATTCCAGAAGTAAAACCTCAAGAAAAACCTGGACGGGGACGACCAA
AAGGGAGTAAGAACAAAAATAAACAGGAGGTGATTTTTACATCTGAACTACTAAGAATTCAGAAAATGATTAATGAGCTATTTAAGTTAATAGCTAACTT
TATCCCCCTCACTTACTTGGTTGTAGATGGTCACTTTGGGAACAATAATGCTTTGCAAATGGCTAGACAGGTAAAGTTACACATAATTTCCAAATTACGC
CACGATTCGGCGTTATACATACCTTACCAACATCCTGACCCTAATCATCGTTCTCGTCGTAAATACGGAGATAAGCTGGACTGGCGTAATATTGCTGGGG
AATATTTACGTCAAAGCAGTATTGACGAAGATATCAAAACTGATATTTATCAAATGACTCTACTGCACAAAGAATTTGCTCAATCCTTGAATGTAGTGAT
TTTGGTAAAAACAAATATCAAGACGAATGCTGTTAGTCATGTGATTTTATTTTCCAGTGACTTAGATTTGTCTTATGAGAAAATAATCGACTACTATAGA
CTCCGCTTTCAAATCGAATTTAATTTTCGTGATGCCAAGCAGTTCTGGGGATTGGAAGATTTTATGAATCGGGGTCAAACTGCGGTGACTAATGCGGCCA
ATCTTTCTTTTTTTATGGTCAACTTATCTCATTATCTTTTAGCTCAATTCCGTCAAGATAATCCTGGTGCTGGAATTATTGATCTTAAGGCTTACTGTCG
TGGTTTTCGATATGTTCGTGAAATGTTAAAAATGCTTCCGGAACAACCAGAGCCTATTTTATTAGCCCAGATTTTTGCCAAGCTTACCTCTCTTGGTCGT
GTTCACAATGCTCCTACAGCCGTTGAACCCTCGTAATTTGGCTACAGTATTG
TGATATCCTAACACTCCTGCAATGCCTGCTACCAGAAATAAAAGTGACGACTATGCGGCAATTGAGTCAGATCATCATGGCCATGTTAGCAATGAGCGGA
CGAGTAACGATGTTGGGAATTTCGCGTTGGACGCTTAGTGGTGGTAGTTATCGGACGGTAATCAGATTCTTTCGGACAGTCATACCGTGGGCAACGGTGT
TCTGGGTGTTTTTTCGGCGGCATTTGTTTTGCCCAAATGATGTTTATTTGTTGGCAGGAGATGAAGTAGTAATAAGCAAAGCCGGGAAAAAAACCTATGG
ATTAGATAGGTTCTTCTCAAGTCTAACAAGCAAACCAATATCGGGGCTATCTTTCTTTACCTTATCATTAGTAAGTGTTCAACAAAGACACTCATTTCCG
ATTCAGATAGAACAGGTAATCAAGAGCGATGTAGAAAAAAGTATTGTCTCACCAATTCCAGAAGTAAAACCTCAAGAAAAACCTGGACGGGGACGACCAA
AAGGGAGTAAGAACAAAAATAAACAGGAGGTGATTTTTACATCTGAACTACTAAGAATTCAGAAAATGATTAATGAGCTATTTAAGTTAATAGCTAACTT
TATCCCCCTCACTTACTTGGTTGTAGATGGTCACTTTGGGAACAATAATGCTTTGCAAATGGCTAGACAGGTAAAGTTACACATAATTTCCAAATTACGC
CACGATTCGGCGTTATACATACCTTACCAACATCCTGACCCTAATCATCGTTCTCGTCGTAAATACGGAGATAAGCTGGACTGGCGTAATATTGCTGGGG
AATATTTACGTCAAAGCAGTATTGACGAAGATATCAAAACTGATATTTATCAAATGACTCTACTGCACAAAGAATTTGCTCAATCCTTGAATGTAGTGAT
TTTGGTAAAAACAAATATCAAGACGAATGCTGTTAGTCATGTGATTTTATTTTCCAGTGACTTAGATTTGTCTTATGAGAAAATAATCGACTACTATAGA
CTCCGCTTTCAAATCGAATTTAATTTTCGTGATGCCAAGCAGTTCTGGGGATTGGAAGATTTTATGAATCGGGGTCAAACTGCGGTGACTAATGCGGCCA
ATCTTTCTTTTTTTATGGTCAACTTATCTCATTATCTTTTAGCTCAATTCCGTCAAGATAATCCTGGTGCTGGAATTATTGATCTTAAGGCTTACTGTCG
TGGTTTTCGATATGTTCGTGAAATGTTAAAAATGCTTCCGGAACAACCAGAGCCTATTTTATTAGCCCAGATTTTTGCCAAGCTTACCTCTCTTGGTCGT
GTTCACAATGCTCCTACAGCCGTTGAACCCTCGTAATTTGGCTACAGTATTG
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1341 bp | 446 aa | 96 | 1436 | + | No |
Chemistry : DDE
ORF sequence :
MFDILTLLQCLLPEIKVTTMRQLSQIIMAMLAMSGRVTMLGISRWTLSGGSYRTVIRFFRTVIPWATVFWVFFRRHLFCPNDVYLLAGDEVVISKAGKKT
YGLDRFFSSLTSKPISGLSFFTLSLVSVQQRHSFPIQIEQVIKSDVEKSIVSPIPEVKPQEKPGRGRPKGSKNKNKQEVIFTSELLRIQKMINELFKLIA
NFIPLTYLVVDGHFGNNNALQMARQVKLHIISKLRHDSALYIPYQHPDPNHRSRRKYGDKLDWRNIAGEYLRQSSIDEDIKTDIYQMTLLHKEFAQSLNV
VILVKTNIKTNAVSHVILFSSDLDLSYEKIIDYYRLRFQIEFNFRDAKQFWGLEDFMNRGQTAVTNAANLSFFMVNLSHYLLAQFRQDNPGAGIIDLKAY
CRGFRYVREMLKMLPEQPEPILLAQIFAKLTSLGRVHNAPTAVEPS
YGLDRFFSSLTSKPISGLSFFTLSLVSVQQRHSFPIQIEQVIKSDVEKSIVSPIPEVKPQEKPGRGRPKGSKNKNKQEVIFTSELLRIQKMINELFKLIA
NFIPLTYLVVDGHFGNNNALQMARQVKLHIISKLRHDSALYIPYQHPDPNHRSRRKYGDKLDWRNIAGEYLRQSSIDEDIKTDIYQMTLLHKEFAQSLNV
VILVKTNIKTNAVSHVILFSSDLDLSYEKIIDYYRLRFQIEFNFRDAKQFWGLEDFMNRGQTAVTNAANLSFFMVNLSHYLLAQFRQDNPGAGIIDLKAY
CRGFRYVREMLKMLPEQPEPILLAQIFAKLTSLGRVHNAPTAVEPS
Blast result :
Comments
ISPepr1 is 41% aa similiar to ISDra7. There are 4 complete copies and 1 frameshifted copy on the genome.
References
1] Hao Wang (2009) Direct submission