ISAzca2
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_009937 | ND | Azorhizobium caulinodans | Azorhizobium caulinodans ORS 571 |
DNA section
IS Length : 2458 bp
Ends
IR Length : 19/25
IRL : GTAAGCGTCGTCCTACCCCCACCTTTCGCCGGCGAGGCTGAGGGGGCGAT
IRR : GTAAGCGTCGTCTTCAGGCCACGTTGCTGAGGACCGGCTGTGCTGGGTAG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GTCACAAACG | GC | TTTTAGGGGG | 2 |
DNA sequence
GTAAGCGTCGTCCTACCCCCACCTTTCGCCGGCGAGGCTGAGGGGGCGATGTTCCGAGCCTGGAGGGCTTGGACATATGACGATTTCAGAGCTTACGCTT
AAATCTAGCGCCGAGGAGCCGGTGCGGCGGTTCGAGGTTTTCACCGGAGCCGGGCGTCGCCGAGAGTGGGCTCCCGAGGAGAAGGCGCGGATTGTCACCG
AGAGCTTTGAGCCTGGAGCCACGGTGAGTGCAGTGGCGCGCCGCCACGGACTGTCGCCGCAGCAGTTGTTCACGTGGCGCCGTGAGGCGCGGAAAGCGAG
CGAGACGATCCCGGCTTTCGTGCCGGCGGTGGTCGCTCCGGAACCGGTTCCCGCATCTGAGCCGTCCGCCCAACCGTCACGGCCGAAGCCGCGTGATCGG
CAACGTCGCGCGGCAGCCATCGAGGTCGACGTCGCCGGGGTCAGGGTGACGATTGAGAACGGGGCCTCGCCCGCCACCATCGCAGCGGTGCTCGGCGCGC
TGAAGGCCGGATCGTGATCGGCCCGAGCGGCGCGGTCCGGGTCATGGTGGCGACACGGCCGGTGGACTTCCGCAAGGGGGCGGAGGGTCTCGCCGCTTTG
GTGCGCGAGACCATGGGCGCCGATCCGTTCTCCGGCACCGTGTACGTGTTCCGGGCCAAGCGAGCGGACCGGGTGAAGCTGGTGTTCTGGGACGGCACCG
GCGTCGTGCTGGTGGCGAAGCGGCTTGAGGACGGGGAGTTCCGCTGGCCGAAGATCGAGGACGGCACCCTGCAGCTGACGTCGGCCCAGCTCCAGGCCCT
GCTCGAGGGCCTCGACTGGCGGCGAGTGCATGAGGCGCAGCGCACGATGACACCGGTCACCGCCAGCTGAAGAGGCTCGCAGGGGCCTGGCGAATGTGAT
TCACTTCCGGTCATGGCCTCGCTCTCCACCGAGCCCCCCCGTGACCCCGAGACGCTCCAGGCGATGCTGATCGCGCGGGACACGGAGATCGAGCGCCTGC
GCCAGATCATCAAGGAGTTGCAGCGCCACCGCTTCGGACGGCGGGCGGAGAGCCTGCCCGAGGACCAACTGCTGCTGGCGCTCGAAGAGGCCGAACAGGT
CGAGGCTGCCTACTTCGCCGCGTCCGATGAACAGGTGCCGGCCGAGAAGGCCGAGCGGGTGGCCAAACGCCGTGCGAACCGCGGTGCTTTGCCCGCCCAT
CTGCCGCGGATCGAGACCGTGATCGATATCGAGAGTGACATCTGCCCCTGCTGTTCCGGCAAGCTGCACCGCATCGGCGAGGATGTGGCGGAACGGCTCG
ACATGGTGCCTGCCCAGTTCCGCGTCCTGGTGGTGCGGCGGCCGAAATATGCCTGCCGGTCCTGCGCGGACGTGGTGGTGCAGGCCCCCGCTTCGGCTCG
GTTGATCGAAGGTGGCATCCCGACCGAGGCAACCGTCGCCCATGTGCTGGTCTCGAAATACGCCGACCATCTGCCCCTGTATCGCCAAGTCCAGATCTAT
GCGCGGCAGGGCATCGCGCTCGACCGCTCGACGCTGGCGGACTGGGTCGGGCGCGCCGCCTTCCTGCTGCGACCTGTCCACGAGCGGCTGCTGGCGATGC
TAAGGGCCTCGACCAAGCTGTTCGCCGACGAGACCACGGCGCCGGTGCTCGATCCCGGCCGTGGCCGCACCAAGACCGGGCAGCTCTTCGCCTATGCCCG
TGATGACCGTCCGTGGGGCGGAGCCGATCCGCCCGCCGTCGCTTATGTCTATGCCCCCGATCGTAAGAGCGAGCGGCCGGTCGCCCACCTCGACGGCTTC
AGCGGCATCCTGCAGGTCGATGGCTATGGCGGCTACAAGGCGCTAGCCGGACGCAACGCCGTGAATCTGGCGTTCTGCTGGGCCCACGTGCGCCGCCGCT
TCTACGAGCTCGCTCAGAGCGGTCCCGCGCCGATCGCCAGCGAGGCCCTGCAGCGCATCGCCGAGCTCTATCGCCTCGAGGCTGCGATCCACGGCCGAAC
CCCAGATGAGCGCCGAGCCTCGCGCCAAGAGAAGAGCAGGCCGATCATCGAGGCCATGGAGCCATGGCTGCGCGAGAAGCTGGCTCTGATCAGCCAGAAG
ACCAAGCTGGCCGAGGCGATCCGCTATGCGCTCTCGCGCTGGCAGGGGCTGACCCCGTTCCTCGACGACGGCCGCGTCGAGTTGGACAACAACATCGTCG
AGCGCTCCATCCGACCCCTGGCCCTGACCCGGAAGAATGCGCTCTTCGCCGGCTCCGACGGCGGCGCCGAGCACTGGGCTGTCATGGCTTCGCGGGTCGA
GACCTGCAAGCTCAACGACATCGACCCCCAGGCCTACCTCGCGGACGTCATCACCCGCATCGTCAACAGCCACCCCAACAGCCGCATCGACGACCTCATG
CCATGGGCCTACCCAGCACAGCCGGTCCTCAGCAACGTGGCCTGAAGACGACGCTTAC
AAATCTAGCGCCGAGGAGCCGGTGCGGCGGTTCGAGGTTTTCACCGGAGCCGGGCGTCGCCGAGAGTGGGCTCCCGAGGAGAAGGCGCGGATTGTCACCG
AGAGCTTTGAGCCTGGAGCCACGGTGAGTGCAGTGGCGCGCCGCCACGGACTGTCGCCGCAGCAGTTGTTCACGTGGCGCCGTGAGGCGCGGAAAGCGAG
CGAGACGATCCCGGCTTTCGTGCCGGCGGTGGTCGCTCCGGAACCGGTTCCCGCATCTGAGCCGTCCGCCCAACCGTCACGGCCGAAGCCGCGTGATCGG
CAACGTCGCGCGGCAGCCATCGAGGTCGACGTCGCCGGGGTCAGGGTGACGATTGAGAACGGGGCCTCGCCCGCCACCATCGCAGCGGTGCTCGGCGCGC
TGAAGGCCGGATCGTGATCGGCCCGAGCGGCGCGGTCCGGGTCATGGTGGCGACACGGCCGGTGGACTTCCGCAAGGGGGCGGAGGGTCTCGCCGCTTTG
GTGCGCGAGACCATGGGCGCCGATCCGTTCTCCGGCACCGTGTACGTGTTCCGGGCCAAGCGAGCGGACCGGGTGAAGCTGGTGTTCTGGGACGGCACCG
GCGTCGTGCTGGTGGCGAAGCGGCTTGAGGACGGGGAGTTCCGCTGGCCGAAGATCGAGGACGGCACCCTGCAGCTGACGTCGGCCCAGCTCCAGGCCCT
GCTCGAGGGCCTCGACTGGCGGCGAGTGCATGAGGCGCAGCGCACGATGACACCGGTCACCGCCAGCTGAAGAGGCTCGCAGGGGCCTGGCGAATGTGAT
TCACTTCCGGTCATGGCCTCGCTCTCCACCGAGCCCCCCCGTGACCCCGAGACGCTCCAGGCGATGCTGATCGCGCGGGACACGGAGATCGAGCGCCTGC
GCCAGATCATCAAGGAGTTGCAGCGCCACCGCTTCGGACGGCGGGCGGAGAGCCTGCCCGAGGACCAACTGCTGCTGGCGCTCGAAGAGGCCGAACAGGT
CGAGGCTGCCTACTTCGCCGCGTCCGATGAACAGGTGCCGGCCGAGAAGGCCGAGCGGGTGGCCAAACGCCGTGCGAACCGCGGTGCTTTGCCCGCCCAT
CTGCCGCGGATCGAGACCGTGATCGATATCGAGAGTGACATCTGCCCCTGCTGTTCCGGCAAGCTGCACCGCATCGGCGAGGATGTGGCGGAACGGCTCG
ACATGGTGCCTGCCCAGTTCCGCGTCCTGGTGGTGCGGCGGCCGAAATATGCCTGCCGGTCCTGCGCGGACGTGGTGGTGCAGGCCCCCGCTTCGGCTCG
GTTGATCGAAGGTGGCATCCCGACCGAGGCAACCGTCGCCCATGTGCTGGTCTCGAAATACGCCGACCATCTGCCCCTGTATCGCCAAGTCCAGATCTAT
GCGCGGCAGGGCATCGCGCTCGACCGCTCGACGCTGGCGGACTGGGTCGGGCGCGCCGCCTTCCTGCTGCGACCTGTCCACGAGCGGCTGCTGGCGATGC
TAAGGGCCTCGACCAAGCTGTTCGCCGACGAGACCACGGCGCCGGTGCTCGATCCCGGCCGTGGCCGCACCAAGACCGGGCAGCTCTTCGCCTATGCCCG
TGATGACCGTCCGTGGGGCGGAGCCGATCCGCCCGCCGTCGCTTATGTCTATGCCCCCGATCGTAAGAGCGAGCGGCCGGTCGCCCACCTCGACGGCTTC
AGCGGCATCCTGCAGGTCGATGGCTATGGCGGCTACAAGGCGCTAGCCGGACGCAACGCCGTGAATCTGGCGTTCTGCTGGGCCCACGTGCGCCGCCGCT
TCTACGAGCTCGCTCAGAGCGGTCCCGCGCCGATCGCCAGCGAGGCCCTGCAGCGCATCGCCGAGCTCTATCGCCTCGAGGCTGCGATCCACGGCCGAAC
CCCAGATGAGCGCCGAGCCTCGCGCCAAGAGAAGAGCAGGCCGATCATCGAGGCCATGGAGCCATGGCTGCGCGAGAAGCTGGCTCTGATCAGCCAGAAG
ACCAAGCTGGCCGAGGCGATCCGCTATGCGCTCTCGCGCTGGCAGGGGCTGACCCCGTTCCTCGACGACGGCCGCGTCGAGTTGGACAACAACATCGTCG
AGCGCTCCATCCGACCCCTGGCCCTGACCCGGAAGAATGCGCTCTTCGCCGGCTCCGACGGCGGCGCCGAGCACTGGGCTGTCATGGCTTCGCGGGTCGA
GACCTGCAAGCTCAACGACATCGACCCCCAGGCCTACCTCGCGGACGTCATCACCCGCATCGTCAACAGCCACCCCAACAGCCGCATCGACGACCTCATG
CCATGGGCCTACCCAGCACAGCCGGTCCTCAGCAACGTGGCCTGAAGACGACGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
441 bp | 146 aa | 77 | 517 | + | No |
AG : IS66 TnpA
ORF sequence :
MTISELTLKSSAEEPVRRFEVFTGAGRRREWAPEEKARIVTESFEPGATVSAVARRHGLSPQQLFTWRREARKASETIPAFVPAVVAPEPVPASEPSAQP
SRPKPRDRQRRAAAIEVDVAGVRVTIENGASPATIAAVLGALKAGS
SRPKPRDRQRRAAAIEVDVAGVRVTIENGASPATIAAVLGALKAGS
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
357 bp | 118 aa | 514 | 870 | + | No |
AG : IS66 TnpB
ORF sequence :
VIGPSGAVRVMVATRPVDFRKGAEGLAALVRETMGADPFSGTVYVFRAKRADRVKLVFWDGTGVVLVAKRLEDGEFRWPKIEDGTLQLTSAQLQALLEGL
DWRRVHEAQRTMTPVTAS
DWRRVHEAQRTMTPVTAS
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1533 bp | 510 aa | 913 | 2445 | + | No |
Chemistry : DDE
ORF sequence :
MASLSTEPPRDPETLQAMLIARDTEIERLRQIIKELQRHRFGRRAESLPEDQLLLALEEAEQVEAAYFAASDEQVPAEKAERVAKRRANRGALPAHLPRI
ETVIDIESDICPCCSGKLHRIGEDVAERLDMVPAQFRVLVVRRPKYACRSCADVVVQAPASARLIEGGIPTEATVAHVLVSKYADHLPLYRQVQIYARQG
IALDRSTLADWVGRAAFLLRPVHERLLAMLRASTKLFADETTAPVLDPGRGRTKTGQLFAYARDDRPWGGADPPAVAYVYAPDRKSERPVAHLDGFSGIL
QVDGYGGYKALAGRNAVNLAFCWAHVRRRFYELAQSGPAPIASEALQRIAELYRLEAAIHGRTPDERRASRQEKSRPIIEAMEPWLREKLALISQKTKLA
EAIRYALSRWQGLTPFLDDGRVELDNNIVERSIRPLALTRKNALFAGSDGGAEHWAVMASRVETCKLNDIDPQAYLADVITRIVNSHPNSRIDDLMPWAY
PAQPVLSNVA
ETVIDIESDICPCCSGKLHRIGEDVAERLDMVPAQFRVLVVRRPKYACRSCADVVVQAPASARLIEGGIPTEATVAHVLVSKYADHLPLYRQVQIYARQG
IALDRSTLADWVGRAAFLLRPVHERLLAMLRASTKLFADETTAPVLDPGRGRTKTGQLFAYARDDRPWGGADPPAVAYVYAPDRKSERPVAHLDGFSGIL
QVDGYGGYKALAGRNAVNLAFCWAHVRRRFYELAQSGPAPIASEALQRIAELYRLEAAIHGRTPDERRASRQEKSRPIIEAMEPWLREKLALISQKTKLA
EAIRYALSRWQGLTPFLDDGRVELDNNIVERSIRPLALTRKNALFAGSDGGAEHWAVMASRVETCKLNDIDPQAYLADVITRIVNSHPNSRIDDLMPWAY
PAQPVLSNVA
Blast result :
Comments
orfA, orfB and orfC of ISAzca2 are respectively 62%, 87% and 82% aa similar to orfA, orfB and orfC of ISFpe1.
References
1] Suzuki,S., Aono,T., Lee,K.B., Suzuki,T., Liu,C.T., Miwa,H., Wakao,S., Iki,T. and Oyaizu,H. (2007) Appl. Environ. Microbiol. 73 (20), 6650-6659.
2] Lee,K.B., Aono,T., Kaneko,T. and Oyaizu,H. (2007) Direct Submission GenBank.
3] ISfinder annotation (2008)
2] Lee,K.B., Aono,T., Kaneko,T. and Oyaizu,H. (2007) Direct Submission GenBank.
3] ISfinder annotation (2008)