ISAs20
- Family IS3
- Group IS3
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP022426 | ND | Aeromonas salmonicida | Aeromonas media WS Aeromonas salmonicida Aeromonas salmonicida subsp. pectinolytica 34mel |
DNA section
IS Length : 1674 bp
Ends
IR Length : 19/29
IRL : CCCCCGGTGTCAGAATAGTTGTCGCCTAATCTGAATAATCTAAATGTGAA
IRR : CGGCGTTTTTCAAGATAGTTGTCGCGCAAAGCTGTTTTAAGCAGCCTCTT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TTGGCCAGCC | ACTG | TTCGGCAGCG | 4 |
CAGCAGCAGC | AAGG | CCCCCAGCCA | 4 |
AAGCAAGCCA | ACGG | CTTGTCCCCA | 4 |
TCGCGTTAGG | AGTC | TTTGCTCGTA | 4 |
TGAGCACTCA | CCTC | CCGTCCGATC | 4 |
TTACGAAGGA | AGTGCGAGTG | 0 | |
TCGGCATGGG | CCCTTTCCCA | 0 | |
CCCAAATGCT | GATATGGGCT | 0 |
DNA sequence
CCCCCGGTGTCAGAATAGTTGTCGCCTAATCTGAATAATCTAAATGTGAAGACCGGGGGCGGATAAAGAGTGTCGAATGGCTCGTTATTCAAGTGAGCGC
AAGGCTGCTTTGCTCAAGAAGCTGCTGCCGCCCATCAATATGTCGGTGGCCGAGTTGGCTCGGCAGGAAAACATCAGCGAGGTCACTCTGTATAATTGGC
GCAAACACGCCAAAGACGGAGGGTCTCCTGTGCCCGGGGACAACAAACTGACCGATGAATGGCCCGCCGAGGCCAAGTTCGCCGTCGTGCTGGAAACCGC
TGCGCTTTCCGAGATTGAACTCAGTGAGTACTGCCGCCGTAAGGGTCTCTATCCCGAGCAAGTCCAGCAATGGCGCCAAGCCTGTATTCTGGGCCAGCAA
TCCGCTCGTACCTTACAACAGGCGGAGAAGGCACAGGCCAAAGCGGACAAGAAGCGTATCCGGCAACTGGAGCAAGAGCTGCGGCGCAAAGACAAGGCCC
TGGCCGAGGCGGCAGCTCTGCTGATATTGCAAAAAAAGCTCGATGCCTACTGGAGCAACGTCGACGAGGACAACTGACCTCGTTGCCAGAGCGGCAGCAA
TACGTTGCATGGTGGCGCGAAGCGCTTGAAGCTGGCGCCCGCAAGCATCCTGCGGCCGAGGTGTTGGGGCTGAGTCTGCGCACGCTCCAGCGCTGGTTGG
CCGGCCCTGAGCTGAGTGCCGATAGGCGACCGGATGCGGTTCGTTCCATACCTGCTCATGCATTGAGCCCAGAGGAGCGGCAGGCCGTGCTGGACGTCTG
CAATAGCCAGGAGTTCGCCTCGTTGCCGCCGAGTCAAATTGTGCCACGGCTGGCCGACCAGGGGCGCTACTTGGCCAGTGAGTCGAGCTTCTATCGCATC
TTGCGGGCGGCTGACCAGCAGCACAGGCGTGGCCGGAGCCAGCCACCGCGTCATGTGCCGGTACCAACCAGCCATACCGCGACTGGGCCCAATCAAGTGT
GGTCCTGGGATATCACCTACTTGCCGTCACTCGTGCGCGGCCAGTATTACTACCTCTATCTGATAGAGGACATCTACAGCCGCAAAGGGGTGGGCTGGGA
AGTGCATGAGCAAGAGTGCGGTGAACGGGCGGCGGCGTTGCTGCAACGGAGCATCATCCGCGAGCAGTGCTGGAAGCAGCCGCTGGTCTTGCACTCGGAC
AACGGCGCGCCAATGAAGTCAGTCACGCTGCTGACCAAGATGCATGACCTGGGTGTCACGCCCTCGCGGGGACGGCCTCGGGTCAGCAACGATAATCCGT
ACTCGGAGTCATTGTTCAGGACGCTGAAGTACTGTCCGCAGTGGCCATCAGACGGCTTTGCCAGTCTGGAAGCGGCCAGGGAGTGGGTGCGCGACTTTAT
GGCCTGGTATAACGAAGAGCACCGGCATAGCCGTATCCGCTTCGTCACACCCAACGAGCGGCATCGGGGTGACGATAAAGCCCTGCTGGCGAAACGGGAT
GCTGTGTACCAGGCCGCCCGAGCACAACACCCAGCGAGGTGGAGTGGCAAGACGCGAGACTGGACACCTATCGGTGCCGTGATGCTAAATCCGGAGCGGC
CCGAGAAACCCGAGCCGGAGCAGAAAGAGGCTGCTTAAAACAGCTTTGCGCGACAACTATCTTGAAAAACGCCG
AAGGCTGCTTTGCTCAAGAAGCTGCTGCCGCCCATCAATATGTCGGTGGCCGAGTTGGCTCGGCAGGAAAACATCAGCGAGGTCACTCTGTATAATTGGC
GCAAACACGCCAAAGACGGAGGGTCTCCTGTGCCCGGGGACAACAAACTGACCGATGAATGGCCCGCCGAGGCCAAGTTCGCCGTCGTGCTGGAAACCGC
TGCGCTTTCCGAGATTGAACTCAGTGAGTACTGCCGCCGTAAGGGTCTCTATCCCGAGCAAGTCCAGCAATGGCGCCAAGCCTGTATTCTGGGCCAGCAA
TCCGCTCGTACCTTACAACAGGCGGAGAAGGCACAGGCCAAAGCGGACAAGAAGCGTATCCGGCAACTGGAGCAAGAGCTGCGGCGCAAAGACAAGGCCC
TGGCCGAGGCGGCAGCTCTGCTGATATTGCAAAAAAAGCTCGATGCCTACTGGAGCAACGTCGACGAGGACAACTGACCTCGTTGCCAGAGCGGCAGCAA
TACGTTGCATGGTGGCGCGAAGCGCTTGAAGCTGGCGCCCGCAAGCATCCTGCGGCCGAGGTGTTGGGGCTGAGTCTGCGCACGCTCCAGCGCTGGTTGG
CCGGCCCTGAGCTGAGTGCCGATAGGCGACCGGATGCGGTTCGTTCCATACCTGCTCATGCATTGAGCCCAGAGGAGCGGCAGGCCGTGCTGGACGTCTG
CAATAGCCAGGAGTTCGCCTCGTTGCCGCCGAGTCAAATTGTGCCACGGCTGGCCGACCAGGGGCGCTACTTGGCCAGTGAGTCGAGCTTCTATCGCATC
TTGCGGGCGGCTGACCAGCAGCACAGGCGTGGCCGGAGCCAGCCACCGCGTCATGTGCCGGTACCAACCAGCCATACCGCGACTGGGCCCAATCAAGTGT
GGTCCTGGGATATCACCTACTTGCCGTCACTCGTGCGCGGCCAGTATTACTACCTCTATCTGATAGAGGACATCTACAGCCGCAAAGGGGTGGGCTGGGA
AGTGCATGAGCAAGAGTGCGGTGAACGGGCGGCGGCGTTGCTGCAACGGAGCATCATCCGCGAGCAGTGCTGGAAGCAGCCGCTGGTCTTGCACTCGGAC
AACGGCGCGCCAATGAAGTCAGTCACGCTGCTGACCAAGATGCATGACCTGGGTGTCACGCCCTCGCGGGGACGGCCTCGGGTCAGCAACGATAATCCGT
ACTCGGAGTCATTGTTCAGGACGCTGAAGTACTGTCCGCAGTGGCCATCAGACGGCTTTGCCAGTCTGGAAGCGGCCAGGGAGTGGGTGCGCGACTTTAT
GGCCTGGTATAACGAAGAGCACCGGCATAGCCGTATCCGCTTCGTCACACCCAACGAGCGGCATCGGGGTGACGATAAAGCCCTGCTGGCGAAACGGGAT
GCTGTGTACCAGGCCGCCCGAGCACAACACCCAGCGAGGTGGAGTGGCAAGACGCGAGACTGGACACCTATCGGTGCCGTGATGCTAAATCCGGAGCGGC
CCGAGAAACCCGAGCCGGAGCAGAAAGAGGCTGCTTAAAACAGCTTTGCGCGACAACTATCTTGAAAAACGCCG
Recoding section
- Recoding by frameshift
- Frame
- Type
- Experimentally demonstrated
Stimulators :
- Shine-Dalgarno sequence :
- Secondary structure :
Recoding motif :
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
501 bp | 166 aa | 77 | 577 | + | No |
Description : First part of the transposase
ORF sequence :
MARYSSERKAALLKKLLPPINMSVAELARQENISEVTLYNWRKHAKDGGSPVPGDNKLTDEWPAEAKFAVVLETAALSEIELSEYCRRKGLYPEQVQQWR
QACILGQQSARTLQQAEKAQAKADKKRIRQLEQELRRKDKALAEAAALLILQKKLDAYWSNVDEDN
QACILGQQSARTLQQAEKAQAKADKKRIRQLEQELRRKDKALAEAAALLILQKKLDAYWSNVDEDN
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1218 bp | 405 aa | 421 | 1638 | + | No |
Description : Second part of the transposase
ORF sequence :
GGEGTGQSGQEAYPATGARAAAQRQGPGRGGSSADIAKKARCLLEQRRRGQLTSLPERQQYVAWWREALEAGARKHPAAEVLGLSLRTLQRWLAGPELSA
DRRPDAVRSIPAHALSPEERQAVLDVCNSQEFASLPPSQIVPRLADQGRYLASESSFYRILRAADQQHRRGRSQPPRHVPVPTSHTATGPNQVWSWDITY
LPSLVRGQYYYLYLIEDIYSRKGVGWEVHEQECGERAAALLQRSIIREQCWKQPLVLHSDNGAPMKSVTLLTKMHDLGVTPSRGRPRVSNDNPYSESLFR
TLKYCPQWPSDGFASLEAAREWVRDFMAWYNEEHRHSRIRFVTPNERHRGDDKALLAKRDAVYQAARAQHPARWSGKTRDWTPIGAVMLNPERPEKPEPE
QKEAA
DRRPDAVRSIPAHALSPEERQAVLDVCNSQEFASLPPSQIVPRLADQGRYLASESSFYRILRAADQQHRRGRSQPPRHVPVPTSHTATGPNQVWSWDITY
LPSLVRGQYYYLYLIEDIYSRKGVGWEVHEQECGERAAALLQRSIIREQCWKQPLVLHSDNGAPMKSVTLLTKMHDLGVTPSRGRPRVSNDNPYSESLFR
TLKYCPQWPSDGFASLEAAREWVRDFMAWYNEEHRHSRIRFVTPNERHRGDDKALLAKRDAVYQAARAQHPARWSGKTRDWTPIGAVMLNPERPEKPEPE
QKEAA
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1562 bp | 520 aa | 77 | 1638 | + | Yes |
Chemistry : DDE
ORF sequence :
MARYSSERKAALLKKLLPPINMSVAELARQENISEVTLYNWRKHAKDGGSPVPGDNKLTDEWPAEAKFAVVLETAALSEIELSEYCRRKGLYPEQVQQWR
QACILGQQSARTLQQAEKAQAKADKKRIRQLEQELRRKDKALAEAAALLILQKKARCLLEQRRRGQLTSLPERQQYVAWWREALEAGARKHPAAEVLGLS
LRTLQRWLAGPELSADRRPDAVRSIPAHALSPEERQAVLDVCNSQEFASLPPSQIVPRLADQGRYLASESSFYRILRAADQQHRRGRSQPPRHVPVPTSH
TATGPNQVWSWDITYLPSLVRGQYYYLYLIEDIYSRKGVGWEVHEQECGERAAALLQRSIIREQCWKQPLVLHSDNGAPMKSVTLLTKMHDLGVTPSRGR
PRVSNDNPYSESLFRTLKYCPQWPSDGFASLEAAREWVRDFMAWYNEEHRHSRIRFVTPNERHRGDDKALLAKRDAVYQAARAQHPARWSGKTRDWTPIG
AVMLNPERPEKPEPEQKEAA
QACILGQQSARTLQQAEKAQAKADKKRIRQLEQELRRKDKALAEAAALLILQKKARCLLEQRRRGQLTSLPERQQYVAWWREALEAGARKHPAAEVLGLS
LRTLQRWLAGPELSADRRPDAVRSIPAHALSPEERQAVLDVCNSQEFASLPPSQIVPRLADQGRYLASESSFYRILRAADQQHRRGRSQPPRHVPVPTSH
TATGPNQVWSWDITYLPSLVRGQYYYLYLIEDIYSRKGVGWEVHEQECGERAAALLQRSIIREQCWKQPLVLHSDNGAPMKSVTLLTKMHDLGVTPSRGR
PRVSNDNPYSESLFRTLKYCPQWPSDGFASLEAAREWVRDFMAWYNEEHRHSRIRFVTPNERHRGDDKALLAKRDAVYQAARAQHPARWSGKTRDWTPIG
AVMLNPERPEKPEPEQKEAA
Blast result :
Comments
ISAs20 is 79% aa similar to ISPsy31.
The third ORF is a putative ORFAB transposase reconstructed in silico by a possible -1 frameshift.
The third ORF is a putative ORFAB transposase reconstructed in silico by a possible -1 frameshift.
References
1] F. Pfeiffer (2015) Direct submission
2] F. Pfeiffer, M.A. Zamora-Lagos, M. Blettinger, A. Yeroslaviz, A. Dahl, S. Gruber, B.H. Habermann (2018) BMC Genomics 19 (1), 20
2] F. Pfeiffer, M.A. Zamora-Lagos, M. Blettinger, A. Yeroslaviz, A. Dahl, S. Gruber, B.H. Habermann (2018) BMC Genomics 19 (1), 20