ISAau4
- Family Tn3
- Group
Isoform Synonym(s) TnAau4
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_008711 | ND | Arthrobacter aurescens | Arthrobacter aurescens TC1 |
DNA section
IS Length : 3898 bp
Ends
IR Length : 18/21
IRL : GGGGTCTTGGTAGTAACGGCGGGAAATTGAACGCTAAGCCTCGTTGGGCT
IRR : GGAGTCTCGGTAGTAGCGGCGTGTGTGGATTTCGGCGCTGTTTGGTGGCG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
AGAGCCTTGC | AAAGTGGAGA | 0 |
DNA sequence
GGGGTCTTGGTAGTAACGGCGGGAAATTGAACGCTAAGCCTCGTTGGGCTGGTCTTGGTGCTGCTCGGTAGCCTTGGTGTGTGTCGGTGGAGTTTCTGAG
CGAGGAGCAGGCGGGCGGGTTCGGGCGTTTCCTGGGTGAGCCGTCGCGGGCTGATCTGGAGCGGTTTTTCTATCTTGATGACGCTGATCTGGAGCTGATC
GCGAAGCGGCGTGGAGACCACAACCGGCTCGGCTTTGCCGTGCAGCTTGGGACGATCCGGTTCCTGGGTGTGCTGCTTGCGGATCCGCTTGATGTTCCGT
GGGGTGTTGTCGATTACCTTTCGGCCCGGCTGGGCACCGCGGACCCCTCGATCGTGAAGAAATACATGCGGCGGCGACCGACGGTTCACGAGCACGCCCG
CGAGATCCGCGCTGTTTACGGCTACCGTGACCTGGTCGGGCCTGTCCTGGAAGACCTGTCGGCGTACGTCTACTCGAGGGCGTGGACGCACGGGGAGGGT
CCGAGTGTTCTTTTCGAGCTGGCGACGGCGTGGCTTCGTCGGGAGCGTGTGCTTCTTCCCGGGGTGACGACGCTCGTGCGGGTGGTGCAGTCTGCGCGCG
AAGCGGCGCAGTCCGGGGTGTACGGCGTCGTTGCGACGGCGGCCAGCGCGGTCGATCCGCGGTTGCCGGTGGTGCTGCGCGGGCTGCTCGTGACTGACCG
GGGTGAGCGGGTCTCGCGGTTGGAGTTGCTGCGGGCGGGTCCGACGAGGGTCTCGGGCCCGGAGCTGGACAAGGCGTTGGGCAGGGTCGCTGCGTTGCGG
GCGCTCGGGGCCAGGGCGGTGGACCTGTCGGCGGTGCCGCCGGCGCGGGTGCGTGCCTTGGCCCGGTACGGGATCGGGGCCAAGGCGCAGTCGTTGCGGC
GTTTGGCCGAACCACGACGCACGGCGACGCTCGTGGCGACGGTCACGGCGTTGGAGGCCAACGCGGTCGATGATGCGCTGGACCTGTTTGATCTGCTGAT
GACCACGCGGGTGCTCGACCCCTCGCGCCGTGCGGCGGTCGCGGAGCGGTTGGCGAAGATGCCCGAACTGGAGAAGGCCTCGGGCGTTCTGGCCCGGGTC
GGAGCCCGGCTGCTTCGCGTGCTTGAGGAGTCCGGCGACCAGGTTGATGTCGCGGCAGCGTGGGCGGCTCTGGAACAGGTCGCCGCGCGGGACCGGATCG
CCGATGCGGTGGCGAAGGTGGGTGAGCTCGTCCCCGACGAGAGCGGCGCCGACGGGGCGATGCGTGGGCAGATGGCGCGTCGTTTCCGGACCGTGGCACC
GTTCCTGCGGCTGTTGGCCACCACGATCCCGTGGGGTGCGACCGCCGCCGGCCAGCCCCTGCTCGAGGCGCTTGCCCGCCTGGACGGGTTGCGGGGTCGG
CGCAAGGTGCGGCGCGAGGAAATCGACGAGGCGCTGGTGCCGCGGGCCTGGCACGCGGCGGTGTTCGGCCGCGCCGGCGGGGCCGGGGTGGACCGGGACG
CGTGGGTGGTGTGCGTGCTGGAACAGCTGCGTTCGGGCCTTCGCCGCCGCGACGTATTCGCGGTCGGCTCCACCAGGTGGGGCGACCCGCGCACCCGCCT
GCTCGACGGTCCCGCTTGGGAGGCGGTGCGCGAACAGGCGTTGACGAGCCTGAGCCTTCACGCCCCGGTGTCCGAGCACCTGCGCACTCGGACGGAGGTG
CTCGACGCCGCCTGGCGAGGGCTCGCCGCGGCGATCGGGCAGACCGGCCCGGACGGGTCCGTGCAGCTGACCGAGGGGCCGGACGGAAGGGTCAGGCTCA
CCGTCTCCCCGCTCGAGGCCTTGGAGATCCCCGACTCCCTCACGAAGCTGCGCAAGCAGGTGGCAGCGATGCTGCCGCGGGTGGACCTGCCCAAGATCCT
GCTGGAGGTCCACTCCTGGACCGGGTTCCTTCACGCCTACACCCACATCGGACAGTCCGGTTCCCGGATGAGGGATCTTCCGGTCTCGGTCGCCGCGGTC
CTGATCGCCCAGGCCTGCAACGTCGGCCTGACACCGGTCGTCGCCGAGGGGCACCCGGCGCTGACCCGGGACCGGCTGGGGCACGTGGACGCGAACTACG
TGCGCGCCGAGACCCACGCCGCCGCGAACGCCCTCCTGATCGATGCGCAGGCCGGGGTGCCGATCGCGAGCTCGTGGGGCGGCGGGCTGCTGGCCTCGGT
GGACGGGCTGCGGTTCGTCGTGCCGGTGCGCACCATCAACGCCGCGCCGAACCCGAAGTACTTCGGCCGCGGCCGGGGGCTGACCTGGTTCAACGCGGTC
AACGACCAGGCCGCCGGGATCGGCGGAGTCGTCGTGCCCGGCACGGTGCGCGACTCGCTGTACGTGCTGGACACCATGCTCAACCTCGACGGCGGCCCGA
AGCCCGAGATGGTCGCCTCCGACACCGCCTCCTACTCCGACCTGGTCTTCGGGATCTTCACGCTGCTCGGCTACCGCTTCGCACCGCGCATCGCGGACCT
GTCCGACCAACGCCTGTGGCGCACCGGGATGCCCGGCGGCGAGGCGGACTACGGGGCGCTGAACGCGGTGGCGCGCAACAAGGTCAACCTGGCGAAGATC
ACCGCCCACTGGGACGACATGACCCGCGTGGCCGCCTCGCTGGTGACCGGGACGGTCCGCGCCTACGACGTGCTGCGCATGCTCACCCGCGACAACGGGG
CACCGAACCCGCTCGGGGCGGCGATCGCGGAGTACGGGCGCATCGCCAAGACCCTGCACCTGCTGGCCCTCATCGACCCGACGGATGAGACCTACCGGCG
TTCGATCAACACCCAGCTGACCGTCCAGGAGTCACGCCACCGCCTGGCCAGGGCGATCTTCCACGGCCGGCGCGGGCAGATCCACCAGCGCTACCGAGAG
GGCCAGGAGGACCAGCTCGGCGCGCTCGGACTCGTGCTCAACGCCGTCGTGCTATGGAACACCCGCTACACCGCCGCCGCCGTCACGGCGCTCCGCGAGG
CCGGACAGGACATCCCCGAGACTGACCTCGCCCGGCCGTCACCGCTGGCCGATCAGCACATCAACATGCTCGGCCGCTACGCCTTCACCGCACCCACACC
CGACGGGCTACGACCGCTGCAAGACCCCGCAGCCGGGCAAATCGAGCGCTGACCGGCACCCGTTCGGCACATAGCATTGGCTGCAGAGCACGTTGCCGAG
AATTCGCCCCTCGCAATCGACGGTCGCGCGATCACCGGCACCACCGCCACATCGAAGGACGGCAAGATCATCGAATGCGAACAGGGGGAAGGCGCATGAC
TTCTCCGCACACAGGGCTTCTGCACCACGTGGAGCTCTGGGTTCCCGACATCGAACGCGCCGCGGCCCAGTGGGGATGGCTGCTCGAGGAGATCGGCTAT
GACCCGTTCCAGGTGTGGCCAGGCGGCCGCAGCTGGAGGCTCGCCCACACCTACATCGTGCTCGAGCAGTCACCCGACATGCGTGGCGGGAACCACGACC
GCAAGCGCCCGGGCCTCAACCATCTCGCCTTCTACGCCGGGAACCGGCAGCGTGTCGATGACCTCGCCACCGCCGCCCCGCACCACGGCTGGACACTCCT
CTTCCCGGATCGCCACCCACACGCCGGCGGACCGCAAACGTATGCCGCCTACCTGACCAACACCGACGGCTACGAGGCCGAACTGATCGCCCACGACTGA
ACCGCCCTCCACGGGCCGTGACGGTTTCCCGGCAGCACTCGACCACGTCCCCGAAAACAACCAGCCCGAGACCGTCACCGATGACGTCCTCGAAGCCGCC
GTGAGAATCCTTGCCTTCCACAACCAGCGACATCGAGCGATCACAAAACGCCACCAAACAGCGCCGAAATCCACACACGCCGCTACTACCGAGACTCC
CGAGGAGCAGGCGGGCGGGTTCGGGCGTTTCCTGGGTGAGCCGTCGCGGGCTGATCTGGAGCGGTTTTTCTATCTTGATGACGCTGATCTGGAGCTGATC
GCGAAGCGGCGTGGAGACCACAACCGGCTCGGCTTTGCCGTGCAGCTTGGGACGATCCGGTTCCTGGGTGTGCTGCTTGCGGATCCGCTTGATGTTCCGT
GGGGTGTTGTCGATTACCTTTCGGCCCGGCTGGGCACCGCGGACCCCTCGATCGTGAAGAAATACATGCGGCGGCGACCGACGGTTCACGAGCACGCCCG
CGAGATCCGCGCTGTTTACGGCTACCGTGACCTGGTCGGGCCTGTCCTGGAAGACCTGTCGGCGTACGTCTACTCGAGGGCGTGGACGCACGGGGAGGGT
CCGAGTGTTCTTTTCGAGCTGGCGACGGCGTGGCTTCGTCGGGAGCGTGTGCTTCTTCCCGGGGTGACGACGCTCGTGCGGGTGGTGCAGTCTGCGCGCG
AAGCGGCGCAGTCCGGGGTGTACGGCGTCGTTGCGACGGCGGCCAGCGCGGTCGATCCGCGGTTGCCGGTGGTGCTGCGCGGGCTGCTCGTGACTGACCG
GGGTGAGCGGGTCTCGCGGTTGGAGTTGCTGCGGGCGGGTCCGACGAGGGTCTCGGGCCCGGAGCTGGACAAGGCGTTGGGCAGGGTCGCTGCGTTGCGG
GCGCTCGGGGCCAGGGCGGTGGACCTGTCGGCGGTGCCGCCGGCGCGGGTGCGTGCCTTGGCCCGGTACGGGATCGGGGCCAAGGCGCAGTCGTTGCGGC
GTTTGGCCGAACCACGACGCACGGCGACGCTCGTGGCGACGGTCACGGCGTTGGAGGCCAACGCGGTCGATGATGCGCTGGACCTGTTTGATCTGCTGAT
GACCACGCGGGTGCTCGACCCCTCGCGCCGTGCGGCGGTCGCGGAGCGGTTGGCGAAGATGCCCGAACTGGAGAAGGCCTCGGGCGTTCTGGCCCGGGTC
GGAGCCCGGCTGCTTCGCGTGCTTGAGGAGTCCGGCGACCAGGTTGATGTCGCGGCAGCGTGGGCGGCTCTGGAACAGGTCGCCGCGCGGGACCGGATCG
CCGATGCGGTGGCGAAGGTGGGTGAGCTCGTCCCCGACGAGAGCGGCGCCGACGGGGCGATGCGTGGGCAGATGGCGCGTCGTTTCCGGACCGTGGCACC
GTTCCTGCGGCTGTTGGCCACCACGATCCCGTGGGGTGCGACCGCCGCCGGCCAGCCCCTGCTCGAGGCGCTTGCCCGCCTGGACGGGTTGCGGGGTCGG
CGCAAGGTGCGGCGCGAGGAAATCGACGAGGCGCTGGTGCCGCGGGCCTGGCACGCGGCGGTGTTCGGCCGCGCCGGCGGGGCCGGGGTGGACCGGGACG
CGTGGGTGGTGTGCGTGCTGGAACAGCTGCGTTCGGGCCTTCGCCGCCGCGACGTATTCGCGGTCGGCTCCACCAGGTGGGGCGACCCGCGCACCCGCCT
GCTCGACGGTCCCGCTTGGGAGGCGGTGCGCGAACAGGCGTTGACGAGCCTGAGCCTTCACGCCCCGGTGTCCGAGCACCTGCGCACTCGGACGGAGGTG
CTCGACGCCGCCTGGCGAGGGCTCGCCGCGGCGATCGGGCAGACCGGCCCGGACGGGTCCGTGCAGCTGACCGAGGGGCCGGACGGAAGGGTCAGGCTCA
CCGTCTCCCCGCTCGAGGCCTTGGAGATCCCCGACTCCCTCACGAAGCTGCGCAAGCAGGTGGCAGCGATGCTGCCGCGGGTGGACCTGCCCAAGATCCT
GCTGGAGGTCCACTCCTGGACCGGGTTCCTTCACGCCTACACCCACATCGGACAGTCCGGTTCCCGGATGAGGGATCTTCCGGTCTCGGTCGCCGCGGTC
CTGATCGCCCAGGCCTGCAACGTCGGCCTGACACCGGTCGTCGCCGAGGGGCACCCGGCGCTGACCCGGGACCGGCTGGGGCACGTGGACGCGAACTACG
TGCGCGCCGAGACCCACGCCGCCGCGAACGCCCTCCTGATCGATGCGCAGGCCGGGGTGCCGATCGCGAGCTCGTGGGGCGGCGGGCTGCTGGCCTCGGT
GGACGGGCTGCGGTTCGTCGTGCCGGTGCGCACCATCAACGCCGCGCCGAACCCGAAGTACTTCGGCCGCGGCCGGGGGCTGACCTGGTTCAACGCGGTC
AACGACCAGGCCGCCGGGATCGGCGGAGTCGTCGTGCCCGGCACGGTGCGCGACTCGCTGTACGTGCTGGACACCATGCTCAACCTCGACGGCGGCCCGA
AGCCCGAGATGGTCGCCTCCGACACCGCCTCCTACTCCGACCTGGTCTTCGGGATCTTCACGCTGCTCGGCTACCGCTTCGCACCGCGCATCGCGGACCT
GTCCGACCAACGCCTGTGGCGCACCGGGATGCCCGGCGGCGAGGCGGACTACGGGGCGCTGAACGCGGTGGCGCGCAACAAGGTCAACCTGGCGAAGATC
ACCGCCCACTGGGACGACATGACCCGCGTGGCCGCCTCGCTGGTGACCGGGACGGTCCGCGCCTACGACGTGCTGCGCATGCTCACCCGCGACAACGGGG
CACCGAACCCGCTCGGGGCGGCGATCGCGGAGTACGGGCGCATCGCCAAGACCCTGCACCTGCTGGCCCTCATCGACCCGACGGATGAGACCTACCGGCG
TTCGATCAACACCCAGCTGACCGTCCAGGAGTCACGCCACCGCCTGGCCAGGGCGATCTTCCACGGCCGGCGCGGGCAGATCCACCAGCGCTACCGAGAG
GGCCAGGAGGACCAGCTCGGCGCGCTCGGACTCGTGCTCAACGCCGTCGTGCTATGGAACACCCGCTACACCGCCGCCGCCGTCACGGCGCTCCGCGAGG
CCGGACAGGACATCCCCGAGACTGACCTCGCCCGGCCGTCACCGCTGGCCGATCAGCACATCAACATGCTCGGCCGCTACGCCTTCACCGCACCCACACC
CGACGGGCTACGACCGCTGCAAGACCCCGCAGCCGGGCAAATCGAGCGCTGACCGGCACCCGTTCGGCACATAGCATTGGCTGCAGAGCACGTTGCCGAG
AATTCGCCCCTCGCAATCGACGGTCGCGCGATCACCGGCACCACCGCCACATCGAAGGACGGCAAGATCATCGAATGCGAACAGGGGGAAGGCGCATGAC
TTCTCCGCACACAGGGCTTCTGCACCACGTGGAGCTCTGGGTTCCCGACATCGAACGCGCCGCGGCCCAGTGGGGATGGCTGCTCGAGGAGATCGGCTAT
GACCCGTTCCAGGTGTGGCCAGGCGGCCGCAGCTGGAGGCTCGCCCACACCTACATCGTGCTCGAGCAGTCACCCGACATGCGTGGCGGGAACCACGACC
GCAAGCGCCCGGGCCTCAACCATCTCGCCTTCTACGCCGGGAACCGGCAGCGTGTCGATGACCTCGCCACCGCCGCCCCGCACCACGGCTGGACACTCCT
CTTCCCGGATCGCCACCCACACGCCGGCGGACCGCAAACGTATGCCGCCTACCTGACCAACACCGACGGCTACGAGGCCGAACTGATCGCCCACGACTGA
ACCGCCCTCCACGGGCCGTGACGGTTTCCCGGCAGCACTCGACCACGTCCCCGAAAACAACCAGCCCGAGACCGTCACCGATGACGTCCTCGAAGCCGCC
GTGAGAATCCTTGCCTTCCACAACCAGCGACATCGAGCGATCACAAAACGCCACCAAACAGCGCCGAAATCCACACACGCCGCTACTACCGAGACTCC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
3072 bp | 1023 aa | 81 | 3152 | + | No |
Chemistry : DDE
ORF sequence :
MSVEFLSEEQAGGFGRFLGEPSRADLERFFYLDDADLELIAKRRGDHNRLGFAVQLGTIRFLGVLLADPLDVPWGVVDYLSARLGTADPSIVKKYMRRRP
TVHEHAREIRAVYGYRDLVGPVLEDLSAYVYSRAWTHGEGPSVLFELATAWLRRERVLLPGVTTLVRVVQSAREAAQSGVYGVVATAASAVDPRLPVVLR
GLLVTDRGERVSRLELLRAGPTRVSGPELDKALGRVAALRALGARAVDLSAVPPARVRALARYGIGAKAQSLRRLAEPRRTATLVATVTALEANAVDDAL
DLFDLLMTTRVLDPSRRAAVAERLAKMPELEKASGVLARVGARLLRVLEESGDQVDVAAAWAALEQVAARDRIADAVAKVGELVPDESGADGAMRGQMAR
RFRTVAPFLRLLATTIPWGATAAGQPLLEALARLDGLRGRRKVRREEIDEALVPRAWHAAVFGRAGGAGVDRDAWVVCVLEQLRSGLRRRDVFAVGSTRW
GDPRTRLLDGPAWEAVREQALTSLSLHAPVSEHLRTRTEVLDAAWRGLAAAIGQTGPDGSVQLTEGPDGRVRLTVSPLEALEIPDSLTKLRKQVAAMLPR
VDLPKILLEVHSWTGFLHAYTHIGQSGSRMRDLPVSVAAVLIAQACNVGLTPVVAEGHPALTRDRLGHVDANYVRAETHAAANALLIDAQAGVPIASSWG
GGLLASVDGLRFVVPVRTINAAPNPKYFGRGRGLTWFNAVNDQAAGIGGVVVPGTVRDSLYVLDTMLNLDGGPKPEMVASDTASYSDLVFGIFTLLGYRF
APRIADLSDQRLWRTGMPGGEADYGALNAVARNKVNLAKITAHWDDMTRVAASLVTGTVRAYDVLRMLTRDNGAPNPLGAAIAEYGRIAKTLHLLALIDP
TDETYRRSINTQLTVQESRHRLARAIFHGRRGQIHQRYREGQEDQLGALGLVLNAVVLWNTRYTAAAVTALREAGQDIPETDLARPSPLADQHINMLGRY
AFTAPTPDGLRPLQDPAAGQIER
TVHEHAREIRAVYGYRDLVGPVLEDLSAYVYSRAWTHGEGPSVLFELATAWLRRERVLLPGVTTLVRVVQSAREAAQSGVYGVVATAASAVDPRLPVVLR
GLLVTDRGERVSRLELLRAGPTRVSGPELDKALGRVAALRALGARAVDLSAVPPARVRALARYGIGAKAQSLRRLAEPRRTATLVATVTALEANAVDDAL
DLFDLLMTTRVLDPSRRAAVAERLAKMPELEKASGVLARVGARLLRVLEESGDQVDVAAAWAALEQVAARDRIADAVAKVGELVPDESGADGAMRGQMAR
RFRTVAPFLRLLATTIPWGATAAGQPLLEALARLDGLRGRRKVRREEIDEALVPRAWHAAVFGRAGGAGVDRDAWVVCVLEQLRSGLRRRDVFAVGSTRW
GDPRTRLLDGPAWEAVREQALTSLSLHAPVSEHLRTRTEVLDAAWRGLAAAIGQTGPDGSVQLTEGPDGRVRLTVSPLEALEIPDSLTKLRKQVAAMLPR
VDLPKILLEVHSWTGFLHAYTHIGQSGSRMRDLPVSVAAVLIAQACNVGLTPVVAEGHPALTRDRLGHVDANYVRAETHAAANALLIDAQAGVPIASSWG
GGLLASVDGLRFVVPVRTINAAPNPKYFGRGRGLTWFNAVNDQAAGIGGVVVPGTVRDSLYVLDTMLNLDGGPKPEMVASDTASYSDLVFGIFTLLGYRF
APRIADLSDQRLWRTGMPGGEADYGALNAVARNKVNLAKITAHWDDMTRVAASLVTGTVRAYDVLRMLTRDNGAPNPLGAAIAEYGRIAKTLHLLALIDP
TDETYRRSINTQLTVQESRHRLARAIFHGRRGQIHQRYREGQEDQLGALGLVLNAVVLWNTRYTAAAVTALREAGQDIPETDLARPSPLADQHINMLGRY
AFTAPTPDGLRPLQDPAAGQIER
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
123 bp | 40 aa | 3177 | 3299 | + | No |
Annotation : Description :
ORF sequence :
MAAEHVAENSPLAIDGRAITGTTATSKDGKIIECEQGEGA
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
405 bp | 134 aa | 3296 | 3700 | + | No |
Annotation : putative glyoxalase family proteinDescription :
ORF sequence :
MTSPHTGLLHHVELWVPDIERAAAQWGWLLEEIGYDPFQVWPGGRSWRLAHTYIVLEQSPDMRGGNHDRKRPGLNHLAFYAGNRQRVDDLATAAPHHGWT
LLFPDRHPHAGGPQTYAAYLTNTDGYEAELIAHD
LLFPDRHPHAGGPQTYAAYLTNTDGYEAELIAHD
Blast result :
Comments
ISAau4 ORFA (the Transposase) is 57% aa similar to ISThsp9.
ORFB and ORFC are passenger genes.
ORFC is a putative glyoxalase family protein.
ORFB and ORFC are passenger genes.
ORFC is a putative glyoxalase family protein.
References
1] ISfinder annotation (2009)
2] Mongodin,E.F., Shapir,N., Daugherty,S.C., DeBoy,R.T., Emerson,J.B., Shvartzbeyn,A., Radune,D., Vamathevan,J., Riggs,F., Grinberg,V., Khouri,H., Wackett,L.P., Nelson,K.E. and Sadowsky,M.J. (2006) PLoS Genet. 2 (12), E214
2] Mongodin,E.F., Shapir,N., Daugherty,S.C., DeBoy,R.T., Emerson,J.B., Shvartzbeyn,A., Radune,D., Vamathevan,J., Riggs,F., Grinberg,V., Khouri,H., Wackett,L.P., Nelson,K.E. and Sadowsky,M.J. (2006) PLoS Genet. 2 (12), E214