Tn5 sequence
PAU at UK.AC.AFRC.NFL
Fri Jun 12 10:08:00 EST 1992
>I have been searching for a recent Tn5 restriction map, or its sequence,
>without much success. I would be extremely grateful for any assistance in this
>regard.
Here is the Tn5 sequence, which we assembled from segments in the database.
I hope it's useful, but I can give no guarantee as to its accuracy. It is
in UWGCG format, with the headers included so that the sources and joins
can be seen.
Richard Pau, Nitrogen Fixation Laboratory, Brighton, Sussex.
Email PAU at UK.AC.AFRC.NFL
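For readers who want the bare sequence rather than the annotated listing, the sketch below shows one way to pull the bases out of a numbered layout like the one in this post. It is not the UWGCG reader and it does not verify the "Check:" value; the name extract_bases and the pattern ROW are illustrative assumptions. It keeps any line made up only of an optional position number followed by blocks of A, C, G and T, so the short unnumbered base lines at the joins are kept and the SEQED marker lines are skipped.

```python
import re

# A sequence row: an optional leading position number, then blocks of bases.
# Marker lines such as ">SEQED" or "1300>" do not match and are skipped.
ROW = re.compile(r"^\s*(?:\d+\s+)?([ACGTacgt]+(?:\s+[ACGTacgt]+)*)\s*$")


def extract_bases(lines):
    """Concatenate the bases found on sequence rows, ignoring everything else."""
    parts = []
    for line in lines:
        match = ROW.match(line)
        if match:
            # Drop the spaces between the 10-base blocks.
            parts.append("".join(match.group(1).split()).upper())
    return "".join(parts)


# Excerpt copied from the first join in the listing.
excerpt = [
    "1701 GCTGTCAGCG CAGGGGCGCC GGTTCTTTTT GTCAAGA",
    ">SEQED",
    "(include) of: trn5ne.gcg check: 3912 from: 302 to:",
    "1300>",
    "CCG ACCTGTCCGG",
    "1751 TGCCCTGAAT GAACTGCAGG ACGAGGCAGC GCGGCTATCG TGGCTGGCCA",
]

seq = extract_bases(excerpt)
print(len(seq))
```

Running the whole listing through the same function and comparing the result with the stated length of 5865 would be a simple sanity check on the joins. The heuristic would also pick up any header line that happens to consist only of the letters A, C, G and T, so the output is worth a quick look by eye.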
--------------------------------------------------------------------
>DL;TRN5IR1
Transposon Tn5 left inverted repeat with genes for transposase and
Km-resistance (aminoglycoside-3'-O-phosphotransferase)
LOCUS TRN5IR1 1737 bp ds-DNA BCT 15-DEC-1988
DEFINITION Transposon Tn5 left inverted repeat with genes for transposase and
Km-resistance (aminoglycoside-3'-O-phosphotransferase).
ACCESSION V00615
KEYWORDS aminoglycoside-3'-O-phosphotransferase; drug resistance;
insertion sequence; kanamycin resistance; transposase.
SEGMENT 1 of 2
SOURCE Transposon Tn5 DNA [2],[1].
ORGANISM Transposon Tn5
Prokaryota; Bacteria; Transposon Tn5.
REFERENCE 1 (bases 1 to 100)
AUTHORS Fuller,R.S., Funnell,B.E. and Kornberg,A.
TITLE The dnaA protein complex with the E. coli chromosomal replication
origin (oriC) and other DNA sites
JOURNAL Cell 38, 889-900 (1984)
STANDARD full staff_review
REFERENCE 2 (bases 38 to 1737)
AUTHORS Auerswald,E.-A., Ludwig,G. and Schaller,H.
TITLE Structural analysis of Tn5
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 45, 107-113 (1981)
STANDARD full staff_review
COMMENT Even though an unidentified reading frame is indicated in [2], a
protein of the correct size coded for by the sequence shown here
has been identified. The protein is believed to be a transposase.
FEATURES Location/Qualifiers
CDS 129..1481
/note="transposase"
CDS 1588..>1737
/note="aminoglycoside-3'-O-phosphotransferase"
repeat_region 38..1571
/note="left inverted repeat"
misc_binding 45..53
/note="dnaA binding site [Cell 38, 889-900 (1984)]"
BASE COUNT 421 a 445 c 515 g 356 t
ORIGIN HaeIII site.
tn5.seq Length: 5865 November 21, 1990 16:37 Check: 517 ..
1 CCAGCAAGCA AGCTAAAAAG TAAAGCAACA ACATAACCTG ACTCTTATAC
51 ACAAGTAGCG TCCTGAACGG AACCTTTCCC GTTTTCCAGG ATCTGACTTC
101 CATGTGACCT CCTAACATGG TAACGTTCAT GATAACTTCT GCTCTTCATC
151 GTGCGGCCGA CTGGGCTAAA TCTGTGTTCT CTTCGGCGGC GCTGGGTGAT
201 CCTCGCCGTA CTGCCCGCTT GGTTAACGTC GCCGCCCAAT TGGCAAAATA
251 TTCTGGTAAA TCAATAACCA TCTCATCAGA GGGTAGTGAA GCCATGCAGG
301 AAGGCGCTTA CCGATTTTAC CGCAATCCCA ACGTTTCTGC CGAGGCGATC
351 AGAAAGGCTG GCGCCATGCA AACAGTCAAG TTGGCTCAGG AGTTTCCCGA
401 ACTGCTGGCC ATTGAGGACA CCACCTCTTT GAGTTATCGC CACCAGGTCG
451 CCGAAGAGCT TGGCAAGCTG GGCTCTATTC AGGATAAATC CCGCGGATGG
501 TGGGTTCACT CCGTTCTCTT GCTCGAGGCC ACCACATTCC GCACCGTAGG
551 ATTACTGCAT CAGGAGTGGT GGATGCGCCC GGATGACCCT GCCGATGCGG
601 ATGAAAAGGA GAGTGGCAAA TGGCTGGCAG CGGCCGCAAC TAGCCGGTTA
651 CGCATGGGCA GCATGATGAG CAACGTGATT GCGGTCTGTG ACCGCGAAGC
701 CGATATTCAT GCTTATCTGC AGGACAGGCT GGCGCATAAC GAGCGCTTCG
751 TGGTGCGCTC CAAGCACCCA CGCAAGGACG TAGAGTCTGG GTTGTATCTG
801 ATCGACCATC TGAAGAACCA ACCGGAGTTG GGTGGCTATC AGATCAGCAT
851 TCCGCAAAAG GGCGTGGTGG ATAAACGCGG TAAACGTAAA AATCGACCAG
901 CCCGCAAGGC GAGCTTGAGC CTGCGCAGTG GGCGCATCAC GCTAAAACAG
951 GGGAATATCA CGCTCAACGC GGTGCTGGCC GAGGAGATTA ACCCGCCCAA
1001 GGGTGAGACC CCGTTGAAAT GGTTGTTGCT GACCGGCGAA CCGGTCGAGT
1051 CGCTAGCCCA AGCCTTGCGC GTCATCGACA TTTATACCCA TCGCTGGCGG
1101 ATCGAGGAGT TCCATAAGGC ATGGAAAACC GGAGCAGGAG CCGAGAGGCA
1151 ACGCATGGAG GAGCCGGATA ATCTGGAGCG GATGGTCTCG ATCCTCTCGT
1201 TTGTTGCGGT CAGGCTGTTA CAGCTCAGAG AAAGCTTCAC GCTGCCGCAA
1251 GCACTCAGGG CGCAAGGGCT GCTAAAGGAA GCGGAACACG TAGAAAGCCA
1301 GTCCGCAGAA ACGGTGCTGA CCCCGGATGA ATGTCAGCTA CTGGGCTATC
1351 TGGACAAGGG AAAACGCAAG CGCAAAGAGA AAGCAGGTAG CTTGCAGTGG
1401 GCTTACATGG CGATAGCTAG ACTGGGCGGT TTTATGGACA GCAAGCGAAC
1451 CGGAATTGCC AGCTGGGGCG CCCTCTGGTA AGGTTGGGAA GCCCTGCAAA
1501 GTAAACTGGA TGGCTTTCTT GCCGCCAAGG ATCTGATGGC GCAGGGGATC
1551 AAGATCTGAT CAAGAGACAG GATGAGGATC GTTTCGCATG ATTGAACAAG
1601 ATGGATTGCA CGCAGGTTCT CCGGCCGCTT GGGTGGAGAG GCTATTCGGC
1651 TATGACTGGG CACAACAGAC AATCGGCTGC TCTGATGCCG CCGTGTTCCG
1701 GCTGTCAGCG CAGGGGCGCC GGTTCTTTTT GTCAAGA
>SEQED
(include) of: trn5ne.gcg check: 3912 from: 302 to:
1300>
CCG ACCTGTCCGG
1751 TGCCCTGAAT GAACTGCAGG ACGAGGCAGC GCGGCTATCG TGGCTGGCCA
1801 CGACGGGCGT TCCTTGCGCA GCTGTGCTCG ACGTTGTCAC TGAAGCGGGA
1851 AGGGACTGGC TGCTATTGGG CGAAGTGCCG GGGCAGGATC TCCTGTCATC
1901 TCACCTTGCT CCTGCCGAGA AAGTATCCAT CATGGCTGAT GCAATGCGGC
1951 GGCTGCATAC GCTTGATCCG GCTACCTGCC CATTCGACCA CCAAGCGAAA
2001 CATCGCATCG AGCGAGCACG TACTCGGATG GAAGCCGGTC TTGTCGATCA
2051 GGATGATCTG GACGAAGAGC ATCAGGGGCT CGCGCCAGCC GAACTGTTCG
2101 CCAGGCTCAA GGCGCGCATG CCCGACGGCG AGGATCTCGT CGTGACCCAT
2151 GGCGATGCCT GCTTGCCGAA TATCATGGTG GAAAATGGCC GCTTTTCTGG
2201 ATTCATCGAC TGTGGCCGGC TGGGTGTGGC GGACCGCTAT CAGGACATAG
2251 CGTTGGCTAC CCGTGATATT GCTGAAGAGC TTGGCGGCGA ATGGGCTGAC
2301 CGCTTCCTCG TGCTTTACGG TATCGCCGCT CCCGATTCGC AGCGCATCGC
2351 CTTCTATCGC CTTCTTGACG AGTTCTTCTG AGCGGGACTC TGGGGTTCGA
2401 AATGACCGAC CAAGCGACGC CCAACCTGCC ATCACGAGAT TTCGATTCCA
2451 CCGCCGCCTT CTATGAAAGG TTGGGCTTCG GAATCGTTTT CCGGGACGCC
2501 GGCTGGATGA TCCTCCAGCG CGGGGATCTC ATGCTGGAGT TCTTCGCCCA
2551 CCCCGGGCTC GATCCCCTCG CGAGTTGGTT CAGCTGCTGC CTGAGGCTGG
2601 ACGACCTCGC GGAGTTCTAC CGGCAGTGCA AATCCGTCGG CATCCAGGAA
2651 ACCAGCAGCG GCTATCCGCG CATCCATGCC CCCGAACTGC AGGAGTGGGG
2701 AGGCACGATG GCCGCTTTGG TCGACCCGGA CGGGA
<SEQED
(include) of: trn5ne.gcg check: 3912 from: 302 to:
1300<
C
>SEQED
(include) of: TRN5STR.GCG check: 9384 from: 390 to:
2040>
CCGG ACGGGACGCT
2751 CCTGCGCCTG ATACAGAACG AATTGCTTGC AGGCATCTCA TGAGTGTGTC
2801 TTCCCGTTTT CCGCCTGAGG TCACTGCGTG GATGGAGCGC TGGCGCCTGC
2851 TGCGCGACGG CGAGCTGCTC ACCACCCACT CGAGCTGGAT ACTTCCCGTC
2901 CGCCAGGGGG ACATGCCGGC GATGCTGAAG GTCGCGCGCA TTCCCGATGA
2951 AGAGGCCGGT TACCGCCTGT TGACCTGGTG GGACGGGCAG GGCGCCGCCC
3001 GAGTCTTCGC CTCGGCGGCG GGCGCTCTGC TCATGGAGCG CGCGTCCGGG
3051 GCCGGGGACC TTGCACAGAT AGCGTGGTCC GGCCAGGACG ACGAGGCTTG
3101 CAGGATCCTC TGCGACACCG CCGCTCGTCT GCACGCGCCG CGGTCCGGAC
3151 CGCCGCCCGA TCTCCATCCG CTACAGGAAT GGTTCCAGCC GCTTTTCCGG
3201 TTGGCCGCTG AGCACGCGGC ACTTGCGCCC GCCGCCAGCG TAGCGCGCCA
3251 ACTTCTGGCG GCGCCGCGCG AGGTGTGCCC GCTCCACGGC GACCTGCACC
3301 ACGAGAACGT GCTCGACTTC GGCGACCGCG GCTGGCTGGC CATCGACCCG
3351 CACGGACTGC TCGGCGAGCG CACCTTCGAC TATGCCAACA TCTTCACGAA
3401 TCCCGATCTC AGCGACCCCG GTCGCCCGCT TGCGATCCTG CCGGGCAGGC
3451 TGGAGGCTCG ACTCAGCATT GTGGTCGCGA CGACCGGGTT TGAGCCCGAA
3501 CGGCTTCTTC GCTGGATCAT TGCATGGACG GGCTTGTCGG CAGCCTGGTT
3551 CATCGGCGAC GGCGACGGCG AGGGCGAGGG CGCTGCGATT GATCTGGCCG
3601 TAAACGCCAT GGCACGCCGG TTGCTTGACT AGCGCGGTCA CCGATCTCAC
3651 CTGGTCGTCG AGCTAGGTCA GGCCGTGTCG GGCGTGATCC GCTGGAAGTC
3701 GTTGCGGGCC ACACCCGCCG CCTCGAAGCC CTGCACCAGG CCGGCATCGT
3751 GGTGTGCGTG GCCGAGGGAC TATGGAAGGT GCCGGACGAT CTGCCCGAGC
3801 AGGGCCGCCG CTATGACGCC CAGCGTCTTG GTGGCGTGAC GGTGGAGCTG
3851 AAATCGCACC TGCCCATCGA GCGGCAGGCC CGCGTGATCG GTGCCACCTG
3901 GCTTGACCAG CAGTTGATCG ACGGTGGCTC GGGCTTGGGC GACCTGGGCT
3951 TTAGCAGTGA GGCCAAGTAG GCGATACAGC AGCGCGCGGA CTTCCTGGCC
4001 GAACAGGGAC TGGCCGAGCG GCGCGGGCAG CGCGTGATCC TCACCGGAAT
4051 CTGCTCGGCA GCAGCGGGCT CGGGAACTGG CGCAGGCCGC GAAGGACATT
4101 GCCGCCGATA CCGGCCTGGA GCATCGCCCC GTGGCCGACG GCCAGCGCGT
4151 TGCCGGCGTC TACCGGCGCC CCGTCATGCT CGCCAGCGGG CGAAATGGGA
4201 TGCTTGATGA CGCCAAGGGG TCCAGCCTCG TGCGGTGGAA GCCCATCGAA
4251 CAGCGGCTTG GGGAGCAGCT CGCCGCGACG GTGCGCGGTG GCGGCGTGTC
4301 TTGGGAGATT GGACGACAGC GTGGGCCGGC CCCTGTCTCT TGATCAGATC
4351 TTGATCCCCT GCGCCATCAG ATCCTTGGCG GCAAGAA
<SEQED
(include) of: TRN5STR.GCG check: 9384 from: 390 to:
2040<
>SEQED
(include) of: TN5R.SEQ check: 6145 from: 123
to: 1600>
AGC CATCCAGTTT
4401 ACTTTGCAGG GCTTCCCAAC CTTCCCAGAG GGCGCCCCAG CTGGCAATTC
4451 CGGTTCGCTT GCTGTCCATA AAACCGCCCA GTCTAGCTAT CGCCATGTAA
4501 GCCCACTGCA AGCTACCTGC TTTCTCTTTG CGCTTGCGTT TTCCCTTGTC
4551 CAGATAGCCC AGTAGCTGAC ATTCATCCGG GGTCAGCACC GTTTCTGCGG
4601 ACTGGCTTTC TACGTGTTCC GCTTCCTTTA GCAGCCCTTG CGCCCTGAGT
4651 GCTTGCGGCA GCGTGAAGCT TTCTCTGAGC TGTAACAGCC TGACCGCAAC
4701 AAACGAGAGG ATCGAGACCA TCCGCTCCAG ATTATCCGGC TCCTCCATGC
4751 GTTGCCTCTC GGCTCCTGCT CCGGTTTTCC ATGCCTTATG GAACTCCTCG
4801 ATCCGCCAGC GATGGGTATA AATGTCGATG ACGCGCAAGG CTTGGGCTAG
4851 CGACTCGACC GGTTCGCCGG TCAGCAACAA CCATTTCAAC GGGGTCTCAC
4901 CCTTGGGCGG GTTAATCTCC TCGGCCAGCA CCGCGTTGAG CGTGATATTC
4951 CCCTGTTTTA GCGTGATGCG CCCACTGCGC AGGCTCAAGC TCGCCTTGCG
5001 GGCTGGTCGA TTTTTACGTT TACCGCGTTT ATCCACCACG CCCTTTTGCG
5051 GAATGCTGAT CTGATAGCCA CCCAACTCCG GTTGGTTCTT CAGATGGTCG
5101 ATCAGATACA ACCCAGACTC TACGTCCTTG CGTGGGTGCT TGGAGCGCAC
5151 CACGAAGCGC TCGTTATGCG CCAGCCTGTC CTGCAGATAA GCATGAATAT
5201 CGGCTTCGCG GTCACAGACC GCAATCACGT TGCTCATCAT GCTGCCCATG
5251 CGTAACCGGC TAGTTGCGGC CGCTGCCAGC CATTTGCCAC TCTCCTTTTC
5301 ATCCGCATCG GCAGGGTCAT CCGGGCGCAT CCACCACTCC TGATGCAGTA
5351 ATCCTACGGT GCGGAATGTG GTGGCCTCGA GCAAGAGAAC GGAGTGAACC
5401 CACCATCCGC GGGATTTATC CTGAATAGAG CCCAGCTTGC CAAGCTCTTC
5451 GGCGACCTGG TGGCGATAAC TCAAAGAGGT GGTGTCCTCA ATGGCCAGCA
5501 GTTCGGGAAA CTCCTGAGCC AACTTGACTG TTTGCATGGC GCCAGCCTTT
5551 CTGATCGCCT CGGCAGAAAC GTTGGGATTG CGGTAAAATC GGTAAGCGCC
5601 TTCCTGCATG GCTTCACTAC CCTCTGATGA GATGGTTATT GATTTACCAG
5651 AATATTTTGC CAATTGGGCG GCGACGTTAA CCAAGCGGGC AGTACGGCGA
5701 GGATCACCCA GCGCCGCCGA AGAGAACACA GATTTAGCCC AGTCGGCCGC
5751 ACGATGAAGA GCAGAAGTTA TCATGAACGT TACCATGTTA GGAGGTCACA
5801 TGGAAGTCAG ATCCTGGAAA ACGGGAAAGG TTCCGTTCAG GACGCTACTT
5851 GTGTATAAGA GTCAG
<SEQED (include) of: TN5R.SEQ check:
6145 from: 123 to: 1600<
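One quick consistency check on an assembly like this is to compare the two ends, since the feature table describes a left inverted repeat starting at base 38. The sketch below takes the first 25 bases of that repeat (positions 38 to 62 in the listing) and the last 25 bases of the assembled sequence (positions 5841 to 5865) and tests whether one is the reverse complement of the other. The 25-base window and the names used are arbitrary choices for illustration, and the check says nothing about the accuracy of the interior.

```python
# Translation table for complementing uppercase DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# Positions 38-62 of the listing: start of the left inverted repeat.
left_end = "CTGACTCTTATACACAAGTAGCGTC"
# Positions 5841-5865 of the listing: the last 25 bases of the assembly.
right_end = "GACGCTACTTGTGTATAAGAGTCAG"

# True when the right end mirrors the left end on the opposite strand.
ends_match = reverse_complement(left_end) == right_end
print(ends_match)
```

The same function can be applied to longer windows, for example to see how far into the two repeats the match extends.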