ISApr7
- Family IS1595
- Group ISNwi1
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NZ_ABHC01000017 | ND | Alpha proteobacterium | Alpha proteobacterium BAL199 |
DNA section
IS Length : 3923 bp
Ends
IR Length : 25/30
IRL : GGCGACTATGCACTTGACGACACCACTCGGCGGGTGTAGTTAGAATGCAT
IRR : GGCGACTATGCACTTTGCAACACCAAGGCCGCGCAAGGCTACAACCGGCC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TGGAATACGATA | GAAACTCG | CGCAGGAATCCCA | 8 |
DNA sequence
GGCGACTATGCACTTGACGACACCACTCGGCGGGTGTAGTTAGAATGCATGTCCAGCGACCTAACCCTTTTGGAAATCATGCGGCGCTTCTCCACCGAGG
AAGCGGCGCGTGCATATTTCGAGCGGATGCGCTGGCCGAACGGTCCGGTCTGCCCTCATTGCGGTAGCGCCGAAAAGAACTACGCGCTTACCCCCAACAA
GAAAGCCCGCATCCGCGAGGGCCTGTATAAGTGCGGAACCTGCGACGACCGATTTAGTGTGACCGTGGGGACGGTCATGGAGTCCTCGCATATCCCGCTG
CACAAGTGGCTGATCGCGTTCTATATGATGTGCGCCAGCAAAACGCAGATTTCCGCACTTCAGCTTCAGCGCCAGCTTGAGCTTGGTTCTTACCGGACCG
CGCACTTCCTATGTGCCCGCATTCGGTACGCCCTCAAGGATGCCGGCTCTGCTGGTCTGATTGGCGGCGAAGTCGAGGCCGACGAAACTTACATCGGCGG
CAAGGCCAAGGGCAAAGGTCGCGGCTACACCGGCAATAAGACCGCAGTCGTTTCGCTGGTCCAGCGCGGCGGTGAAGTTCGGTCGACCGTTGTTGCGGAG
CGCGTGACAGGTAAGACCATCGATACCCTGCTCCGACGCCACGTTACCGAGGAAGCCCACCTCAACACGGACGAGTCTCCCCTCTACAACAAGGCCGGTA
AGCGCTTCGCTTCGCATGCCCGCGTGAACCACTCCGCCGAAGAGTACGGCTATTACGATTACCGCTCGGGCCGCACCGTCACGACCAATACGGTCGAGGG
CTTCTTCGGCAACAGCAAGCGAAGCCTTGACGGTACGCACCACAACGTGAGCCGCCAGCATCTGCACCTATACACGGCGGAACTGGATTTCAAATACAAC
ACGCGGAAGTCGACGGACGGTGAGCGCACCGCCGAAGGCATCCGGCGCATTGAGGGGAAGCGCCTGATGTATAAACCTAAGGCTTCGGGCTGATGGCCCG
CGTTCGTGTTGCGGAAGTCCGCACGGAGGTTCTGCTAACCGAACTGCTCAAGGCCCAGGGATGGGATTGTCGGCGTCCGCCGAATGGCGAGATGCTGCGC
CAGCACGAATACAAAGACCATTCCCACCTGCGCGATGTGTTTCTGCACAGGAGCAAGGTGAGGATGATCGGGCATGGATTGCCCGAGGCCGTAGTGGTGG
ATCGGCAATCAATGCAGCCATTGATCGTGATCGAAGCGAAGGCGTCGATTTCAGACCTCGACAAAGCCCTGCGTGAGGCGACGGAGATTTACGGCAACGC
CTGTATCGACGCCGGTTACTCGCCCCTCGCCGTGGCTATCGCCGGGACCAGTGAGGATGACTTCGCAGTTCGCGTCCACAAGTGGAACGGCTCGGCGTGG
AAGGCCGTCACATACGAAGGCAACCCGATTGGGTGGATACCCAATCGTGTCGATGTCGAACGGCTCCGTGTGCCTTCCGCCACCCCCGAACTGCGCCCTT
CGGTCCCTAGTCCCGAAGTGCTGGCAAACTTCGCCGACGAGATTAACCGGCTGCTGCGCGAGTCCAACGTAAACGACCGCTCACGCCCCTCTGTCGTTGG
CGCGTGCATGCTCGCTCTCTGGCAGTCGAAGGGCGCCCTCCGCAAAGACCCGCGAAACATACTCGGCGACATAAATCAGGCGTGCGAAAAGGCGTTCTGG
AACGCGGGCAAAGCGGTGTTGGCCAAGAGCCTCCACGTTGACGAGGCGAATGACAAACTGGCGGTGAAGGCGCGGCGGATTATCAGTATCCTTGAGCGCC
TGAACGTCTCCGTTCTAACTGCCGAGCACGACTACCTTGGCCAGCTCTACGAGACGTTCTTCCGCTACGCTGGCGGCAATACGATTGGCCAGTATTTCAC
GCCGCGCCACATCGCGAGCTTCGGTGCCGATCTTCTCGGCGTTTCGATTGATGACGTAGTGCTCGACCCGACTTGCGGAACGGGCGGATTCCTCATCGCC
GCAATGGAGCGGGTCGCTCGCGAGCATCAGATTTCTCGCTCCGAAATGGTCAAGCTGGTAAGCACCCGGCTGATCGGCTTCGATGATGAACCCATCACGG
CTGCTCTTTGCGTCGCGAACATGATTTTGCGCGGCGATGGCTCATCTAGCGTGCATCGGGGCGATGCCTTCACGGCACCGGAGTATCCGATCGGCACGGC
GAGCGTTGTTCTCATGAACCCGCCGTACCCCCACAAGCAAACCGACACCCCTACCGAGGCGTTCGTGGAACGCGCGCTAGAGGGGTTGTCGCAAGGCTCG
CGCCTCGCTGCGGTCATTCCCCTGTCGCTGCTGGTCAAGAGCAACAAGGCAAGCTGGCGCAAGGCGATCCTGAAGAACAACACGCTAGAGGCAGCGATCA
AGCTCCCTGACGAGCTATTTCAGCCCTACGCGCAGCCCTACACGGTCATTGTCTATCTGCGGAAGGGCATCCCCCATCCGAAGGGCAAGCGCGCGTTCTT
CGCTCGTATCGAAAATGATGGCTTCCGCATTCGCAAGGCTGTTCGCGTTGCGTGCGAGGGATCTGAACTTCCCAAGATGCTGGCCCATTTTCAGGCCGGA
ACGAGCGAGGCAGGCGTCTGCGGTTGGTCCCAAGTGGACGAGGACGCAAGCTTTGGGCCGGGTGCATATATTCCCGCCAAGGAAATGACCGGCGAGGAAA
GCGACGACGCCACCCAAGAGGTAATTCGGGCACGCACATCGTTTGTCGCCTATCACGCCGCCGACCTGGTGCAGCTTTACACCGACAATCCGCTCGACGT
TCGTGCAATGCGCAAGAGGCCGTGGCAGTTCCAGGGTGTGAAGCTGGGCACCGTTGCCGCCTATTTCGACATCTACTACGGACAGAAAGAACTCCACAGC
AAAGATGGCCTGTTGCCGGGCCGCTCACTGGTCATATCGTCGTCCAAGTTCGATAACGGCTGCTACGGTCTCTTCGACTTCGAGCACATACTCAAACCGC
CTTTCGTGACCGTGCCGGGCAACGGCTCGATTGCCTACGCGCATGTTCAAGAGTGGCCCTGTGGCGTGTCCGATGATTGCATGCTCTTGCTTCCCAAGGA
GGGCGTGTCTCATTCGATGATGTACGTCGCCGCCGCAGCGATCCGAAACGAACGGTGGCGCTTCAGCTATGGGCGCAAGGCAACGCCGGACCGGATTGCG
GAGTTCCCGCTACCTCACACCGACGAACTGCTGGCGCGCGTTGACGAGTACCTTGCGCGAGCTGCTCGGGTTGAAGATCGCATGATCGAGGACGCAGAAG
ACGCGCTCGACAGCCAAACGGCACGCATGCGGCTAGCGGATTTGGGGAGCGGAAAAGCAACGGCAGTTTCGGGGGCGGAATTGGAGACGCGGCTTGCAGC
TATGATGGAGAACTAGGCGGGTGCCATACGTCGGCTTCGCCTTTACCACCGCCGCGCTGGACTTCTTAGCTACGCTGCCGCCGAAAATTCGGAAGCAGGT
CATTAAGAAGGCCAAGGCCCTGCATGCCAACCCGCATCCCCAAGGGTCGAAGAAGTTACACGGCGTGGTGACCGACGATGGCGATCCGGTGTATCGTGAG
CGATCTGGAGATTACCGCATCCTCTATGTGGTTCGCCCCGAAGAGGTGATGGTCCTCGATATTGACCATCGGAAGGACGTATATCGTATGCCTCAGACCA
AGGCAGAACCGGCCGACGAAATGAAAATGAAGGAAGCCGACTTCGACGCGATCATGAGCAAGGCGCTGGGCGTCGCTGCTCCACAGAACAAGGACGATGA
GCAGCCAGCTAAGCGGCTAAGCTCCTACCCGCCGAAGAAGCGCGGCACGTCCTAGGCCCAAAAAGCCGGTGTTGGCCGGTTGTAGCCTTGCGCGGCCTTG
GTGTTGCAAAGTGCATAGTCGCC
AAGCGGCGCGTGCATATTTCGAGCGGATGCGCTGGCCGAACGGTCCGGTCTGCCCTCATTGCGGTAGCGCCGAAAAGAACTACGCGCTTACCCCCAACAA
GAAAGCCCGCATCCGCGAGGGCCTGTATAAGTGCGGAACCTGCGACGACCGATTTAGTGTGACCGTGGGGACGGTCATGGAGTCCTCGCATATCCCGCTG
CACAAGTGGCTGATCGCGTTCTATATGATGTGCGCCAGCAAAACGCAGATTTCCGCACTTCAGCTTCAGCGCCAGCTTGAGCTTGGTTCTTACCGGACCG
CGCACTTCCTATGTGCCCGCATTCGGTACGCCCTCAAGGATGCCGGCTCTGCTGGTCTGATTGGCGGCGAAGTCGAGGCCGACGAAACTTACATCGGCGG
CAAGGCCAAGGGCAAAGGTCGCGGCTACACCGGCAATAAGACCGCAGTCGTTTCGCTGGTCCAGCGCGGCGGTGAAGTTCGGTCGACCGTTGTTGCGGAG
CGCGTGACAGGTAAGACCATCGATACCCTGCTCCGACGCCACGTTACCGAGGAAGCCCACCTCAACACGGACGAGTCTCCCCTCTACAACAAGGCCGGTA
AGCGCTTCGCTTCGCATGCCCGCGTGAACCACTCCGCCGAAGAGTACGGCTATTACGATTACCGCTCGGGCCGCACCGTCACGACCAATACGGTCGAGGG
CTTCTTCGGCAACAGCAAGCGAAGCCTTGACGGTACGCACCACAACGTGAGCCGCCAGCATCTGCACCTATACACGGCGGAACTGGATTTCAAATACAAC
ACGCGGAAGTCGACGGACGGTGAGCGCACCGCCGAAGGCATCCGGCGCATTGAGGGGAAGCGCCTGATGTATAAACCTAAGGCTTCGGGCTGATGGCCCG
CGTTCGTGTTGCGGAAGTCCGCACGGAGGTTCTGCTAACCGAACTGCTCAAGGCCCAGGGATGGGATTGTCGGCGTCCGCCGAATGGCGAGATGCTGCGC
CAGCACGAATACAAAGACCATTCCCACCTGCGCGATGTGTTTCTGCACAGGAGCAAGGTGAGGATGATCGGGCATGGATTGCCCGAGGCCGTAGTGGTGG
ATCGGCAATCAATGCAGCCATTGATCGTGATCGAAGCGAAGGCGTCGATTTCAGACCTCGACAAAGCCCTGCGTGAGGCGACGGAGATTTACGGCAACGC
CTGTATCGACGCCGGTTACTCGCCCCTCGCCGTGGCTATCGCCGGGACCAGTGAGGATGACTTCGCAGTTCGCGTCCACAAGTGGAACGGCTCGGCGTGG
AAGGCCGTCACATACGAAGGCAACCCGATTGGGTGGATACCCAATCGTGTCGATGTCGAACGGCTCCGTGTGCCTTCCGCCACCCCCGAACTGCGCCCTT
CGGTCCCTAGTCCCGAAGTGCTGGCAAACTTCGCCGACGAGATTAACCGGCTGCTGCGCGAGTCCAACGTAAACGACCGCTCACGCCCCTCTGTCGTTGG
CGCGTGCATGCTCGCTCTCTGGCAGTCGAAGGGCGCCCTCCGCAAAGACCCGCGAAACATACTCGGCGACATAAATCAGGCGTGCGAAAAGGCGTTCTGG
AACGCGGGCAAAGCGGTGTTGGCCAAGAGCCTCCACGTTGACGAGGCGAATGACAAACTGGCGGTGAAGGCGCGGCGGATTATCAGTATCCTTGAGCGCC
TGAACGTCTCCGTTCTAACTGCCGAGCACGACTACCTTGGCCAGCTCTACGAGACGTTCTTCCGCTACGCTGGCGGCAATACGATTGGCCAGTATTTCAC
GCCGCGCCACATCGCGAGCTTCGGTGCCGATCTTCTCGGCGTTTCGATTGATGACGTAGTGCTCGACCCGACTTGCGGAACGGGCGGATTCCTCATCGCC
GCAATGGAGCGGGTCGCTCGCGAGCATCAGATTTCTCGCTCCGAAATGGTCAAGCTGGTAAGCACCCGGCTGATCGGCTTCGATGATGAACCCATCACGG
CTGCTCTTTGCGTCGCGAACATGATTTTGCGCGGCGATGGCTCATCTAGCGTGCATCGGGGCGATGCCTTCACGGCACCGGAGTATCCGATCGGCACGGC
GAGCGTTGTTCTCATGAACCCGCCGTACCCCCACAAGCAAACCGACACCCCTACCGAGGCGTTCGTGGAACGCGCGCTAGAGGGGTTGTCGCAAGGCTCG
CGCCTCGCTGCGGTCATTCCCCTGTCGCTGCTGGTCAAGAGCAACAAGGCAAGCTGGCGCAAGGCGATCCTGAAGAACAACACGCTAGAGGCAGCGATCA
AGCTCCCTGACGAGCTATTTCAGCCCTACGCGCAGCCCTACACGGTCATTGTCTATCTGCGGAAGGGCATCCCCCATCCGAAGGGCAAGCGCGCGTTCTT
CGCTCGTATCGAAAATGATGGCTTCCGCATTCGCAAGGCTGTTCGCGTTGCGTGCGAGGGATCTGAACTTCCCAAGATGCTGGCCCATTTTCAGGCCGGA
ACGAGCGAGGCAGGCGTCTGCGGTTGGTCCCAAGTGGACGAGGACGCAAGCTTTGGGCCGGGTGCATATATTCCCGCCAAGGAAATGACCGGCGAGGAAA
GCGACGACGCCACCCAAGAGGTAATTCGGGCACGCACATCGTTTGTCGCCTATCACGCCGCCGACCTGGTGCAGCTTTACACCGACAATCCGCTCGACGT
TCGTGCAATGCGCAAGAGGCCGTGGCAGTTCCAGGGTGTGAAGCTGGGCACCGTTGCCGCCTATTTCGACATCTACTACGGACAGAAAGAACTCCACAGC
AAAGATGGCCTGTTGCCGGGCCGCTCACTGGTCATATCGTCGTCCAAGTTCGATAACGGCTGCTACGGTCTCTTCGACTTCGAGCACATACTCAAACCGC
CTTTCGTGACCGTGCCGGGCAACGGCTCGATTGCCTACGCGCATGTTCAAGAGTGGCCCTGTGGCGTGTCCGATGATTGCATGCTCTTGCTTCCCAAGGA
GGGCGTGTCTCATTCGATGATGTACGTCGCCGCCGCAGCGATCCGAAACGAACGGTGGCGCTTCAGCTATGGGCGCAAGGCAACGCCGGACCGGATTGCG
GAGTTCCCGCTACCTCACACCGACGAACTGCTGGCGCGCGTTGACGAGTACCTTGCGCGAGCTGCTCGGGTTGAAGATCGCATGATCGAGGACGCAGAAG
ACGCGCTCGACAGCCAAACGGCACGCATGCGGCTAGCGGATTTGGGGAGCGGAAAAGCAACGGCAGTTTCGGGGGCGGAATTGGAGACGCGGCTTGCAGC
TATGATGGAGAACTAGGCGGGTGCCATACGTCGGCTTCGCCTTTACCACCGCCGCGCTGGACTTCTTAGCTACGCTGCCGCCGAAAATTCGGAAGCAGGT
CATTAAGAAGGCCAAGGCCCTGCATGCCAACCCGCATCCCCAAGGGTCGAAGAAGTTACACGGCGTGGTGACCGACGATGGCGATCCGGTGTATCGTGAG
CGATCTGGAGATTACCGCATCCTCTATGTGGTTCGCCCCGAAGAGGTGATGGTCCTCGATATTGACCATCGGAAGGACGTATATCGTATGCCTCAGACCA
AGGCAGAACCGGCCGACGAAATGAAAATGAAGGAAGCCGACTTCGACGCGATCATGAGCAAGGCGCTGGGCGTCGCTGCTCCACAGAACAAGGACGATGA
GCAGCCAGCTAAGCGGCTAAGCTCCTACCCGCCGAAGAAGCGCGGCACGTCCTAGGCCCAAAAAGCCGGTGTTGGCCGGTTGTAGCCTTGCGCGGCCTTG
GTGTTGCAAAGTGCATAGTCGCC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
915 bp | 304 aa | 79 | 993 | + | No |
Chemistry : DDE
ORF sequence :
MRRFSTEEAARAYFERMRWPNGPVCPHCGSAEKNYALTPNKKARIREGLYKCGTCDDRFSVTVGTVMESSHIPLHKWLIAFYMMCASKTQISALQLQRQL
ELGSYRTAHFLCARIRYALKDAGSAGLIGGEVEADETYIGGKAKGKGRGYTGNKTAVVSLVQRGGEVRSTVVAERVTGKTIDTLLRRHVTEEAHLNTDES
PLYNKAGKRFASHARVNHSAEEYGYYDYRSGRTVTTNTVEGFFGNSKRSLDGTHHNVSRQHLHLYTAELDFKYNTRKSTDGERTAEGIRRIEGKRLMYKP
KASG
ELGSYRTAHFLCARIRYALKDAGSAGLIGGEVEADETYIGGKAKGKGRGYTGNKTAVVSLVQRGGEVRSTVVAERVTGKTIDTLLRRHVTEEAHLNTDES
PLYNKAGKRFASHARVNHSAEEYGYYDYRSGRTVTTNTVEGFFGNSKRSLDGTHHNVSRQHLHLYTAELDFKYNTRKSTDGERTAEGIRRIEGKRLMYKP
KASG
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
2424 bp | 807 aa | 993 | 3416 | + | No |
Annotation : N-6 DNA methylaseDescription :
ORF sequence :
MARVRVAEVRTEVLLTELLKAQGWDCRRPPNGEMLRQHEYKDHSHLRDVFLHRSKVRMIGHGLPEAVVVDRQSMQPLIVIEAKASISDLDKALREATEIY
GNACIDAGYSPLAVAIAGTSEDDFAVRVHKWNGSAWKAVTYEGNPIGWIPNRVDVERLRVPSATPELRPSVPSPEVLANFADEINRLLRESNVNDRSRPS
VVGACMLALWQSKGALRKDPRNILGDINQACEKAFWNAGKAVLAKSLHVDEANDKLAVKARRIISILERLNVSVLTAEHDYLGQLYETFFRYAGGNTIGQ
YFTPRHIASFGADLLGVSIDDVVLDPTCGTGGFLIAAMERVAREHQISRSEMVKLVSTRLIGFDDEPITAALCVANMILRGDGSSSVHRGDAFTAPEYPI
GTASVVLMNPPYPHKQTDTPTEAFVERALEGLSQGSRLAAVIPLSLLVKSNKASWRKAILKNNTLEAAIKLPDELFQPYAQPYTVIVYLRKGIPHPKGKR
AFFARIENDGFRIRKAVRVACEGSELPKMLAHFQAGTSEAGVCGWSQVDEDASFGPGAYIPAKEMTGEESDDATQEVIRARTSFVAYHAADLVQLYTDNP
LDVRAMRKRPWQFQGVKLGTVAAYFDIYYGQKELHSKDGLLPGRSLVISSSKFDNGCYGLFDFEHILKPPFVTVPGNGSIAYAHVQEWPCGVSDDCMLLL
PKEGVSHSMMYVAAAAIRNERWRFSYGRKATPDRIAEFPLPHTDELLARVDEYLARAARVEDRMIEDAEDALDSQTARMRLADLGSGKATAVSGAELETR
LAAMMEN
GNACIDAGYSPLAVAIAGTSEDDFAVRVHKWNGSAWKAVTYEGNPIGWIPNRVDVERLRVPSATPELRPSVPSPEVLANFADEINRLLRESNVNDRSRPS
VVGACMLALWQSKGALRKDPRNILGDINQACEKAFWNAGKAVLAKSLHVDEANDKLAVKARRIISILERLNVSVLTAEHDYLGQLYETFFRYAGGNTIGQ
YFTPRHIASFGADLLGVSIDDVVLDPTCGTGGFLIAAMERVAREHQISRSEMVKLVSTRLIGFDDEPITAALCVANMILRGDGSSSVHRGDAFTAPEYPI
GTASVVLMNPPYPHKQTDTPTEAFVERALEGLSQGSRLAAVIPLSLLVKSNKASWRKAILKNNTLEAAIKLPDELFQPYAQPYTVIVYLRKGIPHPKGKR
AFFARIENDGFRIRKAVRVACEGSELPKMLAHFQAGTSEAGVCGWSQVDEDASFGPGAYIPAKEMTGEESDDATQEVIRARTSFVAYHAADLVQLYTDNP
LDVRAMRKRPWQFQGVKLGTVAAYFDIYYGQKELHSKDGLLPGRSLVISSSKFDNGCYGLFDFEHILKPPFVTVPGNGSIAYAHVQEWPCGVSDDCMLLL
PKEGVSHSMMYVAAAAIRNERWRFSYGRKATPDRIAEFPLPHTDELLARVDEYLARAARVEDRMIEDAEDALDSQTARMRLADLGSGKATAVSGAELETR
LAAMMEN
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
435 bp | 144 aa | 3421 | 3855 | + | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MPYVGFAFTTAALDFLATLPPKIRKQVIKKAKALHANPHPQGSKKLHGVVTDDGDPVYRERSGDYRILYVVRPEEVMVLDIDHRKDVYRMPQTKAEPADE
MKMKEADFDAIMSKALGVAAPQNKDDEQPAKRLSSYPPKKRGTS
MKMKEADFDAIMSKALGVAAPQNKDDEQPAKRLSSYPPKKRGTS
Blast result :
Comments
ISApr7 is 57% aa similar to ISRpa1. The first ORF is the transposase, the second is a N-6 DNA methylase and the third is an hypothetical protein.
References
1] Hagstrom,A., Ferriera,S., Johnson,J., Kravitz,S., Beeson,K., Sutton,G., Rogers,Y.-H., Friedman,R., Frazier,M. and Venter,J.C. (2007) J Craig Venter Institute Direct submission GenBank.
2] ISfinder annotation (2008)
2] ISfinder annotation (2008)