Q: free restriction analysis software

Pierre Lindenbaum lindenb at jouy.inra.fr
Thu Jul 15 02:44:48 EST 1999


: Markus Hoenicka wrote:
: > 
: > I'd like to have a simple tool for restriction analysis, i.e I feed
: > a sequence file to it and a list of restriction enzymes, and the tool
: > tells me the number and location of the restriction sites, the length
: > of the released fragments etc.
: > 

Here is a description of my online program called 'CloneIt'.
I hope it will help you...

	Pierre LINDENBAUM

------------------------------------------------------------------------------


     _________________________________________________________________

                                    CLONEIT(tm)
                                  ONLINE VERSION

   A WEB BASED INTERFACE FINDING SUB-CLONING STRATEGIES, IN-FRAME DELETIONS
         AND FRAMESHIFTS USING RESTRICTION ENZYMES AND DNA POLYMERASES.

	        http://topaze.jouy.inra.fr/cgi-bin/CloneIt/CloneIt	

     _________________________________________________________________


 What is CloneIt ?


The use of plasmid vectors in cloning strategies is the basis of any
molecular biologist repertoire. Efficiency is essential and requires the
use of the most appropriate restriction enzymes choices for the desired
application. Selection of those enzymes remains a skill. Several points
that merit consideration include (1) the enzymes characteristics, (2)
location of the restriction sites within the sequence, (3) complementarity
of protuding ends, (4) possible self ligation, (5) use of modifying DNA
polymerase generating blunt ends, (6) in frame cloning constraints,(7) use
of partial digestions and (8) the creation of a stop codon at the ligation
site. This exercise is labor intensive, even with the help of the
classical DNA analysis softwares. These programs typically facilitate
localizing restriction sites in a plasmid sequence;  however it is
impossible to examine all restriction sites and /or combinations then, a
simple cloning strategy may be over-looked as a result of incomplete or
inadequate searching. 

CloneIt Online was created in order to quickly find cloning 
strategies (including sub-cloning, in-frame deletions and frameshifts) 
while controlling the problems described above.

CloneIt Online is available at the following URL: 

        http://topaze.jouy.inra.fr/cgi-bin/CloneIt/CloneIt	

----------------------------------------------------------------------------
  Samples
----------------------------------------------------------------------------
SubCloning
............................................................................

CloneIt V1.0 has found a solution:

 Digest VECTOR with EcoRI [G^AATTC] (878) and Sal I [g^tcgac] (894).

 5'  --G.CCG.G/AA.TT C.CCG.GGG.ATC.CG/T.CGA. CCT.GCA.GCC.AAG--  3'
 3'  --C.GGC.C TT.AA/G.GGC.CCC.TAG.GC A.GCT./GGA.CGT.CGG.TTC--  5'
 NH2     P   E    F    P   G   I   R    R   P    A   A   K   .COOH

Digest first with EcoRI .Then treat with Klenow DNA polymerase.
Finally digest with Sal I .

 Digest INSERT with Sca I [AGT^ACT] (1034) and Sal I [g^tcgac] (3605).

 5' --AT.AAA.GT/ A.CTT.TCA.AAG.AAA.G-- --TA.GAG./TCG.A CC.TGC.AGG.CAT.G-- 3'
 3' --TA.TTT.CA/ T.GAA.AGT.TTC.TTT.C-- --AT.CTC. AGC.T/GG.ACG.TCC.GTA.C-- 3'
 NH2     K   V     L   S   K   K   E -  --  E   S    T    C   R   H   A .COOH

Treat with T4 DNA polymerase.

Sites wil be in frame ligated in 5'.

The first stop codon detected BEFORE the EcoRI site (878) is localized at
position 428 on vector. The first stop codon detected AFTER the Sca I
site (1034) is localized at position 3558 on insert.

Digestion post-ligation:  no enzyme was found.

.......................................................................
        Finding in-frame deletions
.......................................................................

 CloneIt has found an in-frame deletion:

  Digest INSERT with Bcl I [t^gatca] (1427) and PstI [CTGCA^G] (3611).

5'  --TG.TAT./GAT.C AG.GTT.CTT.ACT.G--  --CG.ACC. TGC.A/GG.CAT.GCA.AGC.T-- 3'
3'  --AC.ATA. CTA.G/TC.CAA.GAA.TGA.C--  --GC.TGG./ACG.T CC.GTA.CGT.TCG.A-- 5'
NH2      Y   D    Q    V   L   T   E --  --  T   C    R    H   A   S   F .COOH

 Treat with T4 DNA polymerase.

  Cloning boxes boundaries :[880-1155] [3359-3634].
        Original: 5' ================================================ 3'
        Deletion: 5' =========......................................= 3'
        Equivalent to a  deletion of 728 amino acids [79 %]

Digestion post-ligation: BamHI Sal I Acc I Nsi I .

The first stop codon detected AFTER the PstI site (3611) is localized at
position 3639 on insert.

Translated truncated sequence:...
NSSSVPGAIKGSMAYRKRGARREANINNNDRMQEKDDEKQDQNNRMQLSDKVLSKKEEVV
TDSQEEIKIADEVKKSTKEESKQLEVLKTKEEHQKEIQYEILQKTIPTFEPKESILKKLE
DIKPEQAKKQTKLFRIFEPRQLPIYRANGEKELRNRTYTKLKKDTLPGDYDVREYFLNLY
DR----------------------------------------------------------
------------------------------------------------------------
------------------------------------------------------------
------------------------------------------------------------
------------------------------------------------------------
------------------------------------------------------------
------------------------------------------------------------
------------------------------------------------------------
------------------------------------------------------------
------------------------------------------------------------
------------------------------------------------------------
------------------------------------------------------------
--HASFCS...

.........................................................................
Finding frameshifts in INSERT.
.........................................................................

CloneIt  has found an Enzyme that could induce frameshift.

  Digest INSERT with AccI [gt^mkac] (1659).

  5'  --.ATC.AGT.//AT  A.CAC.ATA.AAT.GAT--  3'
  3'  --.TAG.TCA.  TA//T.GTG.TAT.TTA.CTA--  5'
  NH2    I   S   I       H   I   N   D   .COOH

Treat with Klenow DNA polymerase.
Beware : AccI [1 partial site] .

After digestion, fill-in and ligation:

  5'  .ATC.AGT.ATA.TAC.ACA.TAA.ATG 3'
  3'  .TAG.TCA.TAT.ATG.TGT.ATT.TAC 5'
  NH2  I   S   I   Y   T   *   M        COOH

        ¥ FrameShift (+ 2)
        ¥ 2 bases Added.
        ¥ Site is NOT reconstitued after ligation.
        ¥ [51 %] percentage of Insert.
        FIGURE:
                =========================
                                        | (+2)
                                        ===========================

Translated sequence:...EFELGTRGSMATFKDACYHYKRLNKLNSLVLKLGANDETRPAPMTKYKGTCL
YTNLTYCRGCALYHVCQTCSQYNRCFLDEEPHLLRMRTFKDVVTKEDIEGLLTMYETLFPINEKLVNKFINSVKQ
EYLLETYNHLLMPITLQALTINLEDNVYYIFGYYDCMEHENQTPFQFINLLEKYDKLLLDDRNFHRMSHLPVILQ
RYFSKSRFLSKGKKRLSRSDFSDNLMEDRHSPTSLMQVVRNCISiyT*...

............................................................................
Informations
............................................................................


Get information about Bst1107 I

INSERT Pattern
--------------
Bst1107I [GTA^TAC] (1659) p5_1 digestion.
 1 4652 pb      Bst1107I  1744 - Bst1107I  1659
 2 85 pb        Bst1107I  1659 - Bst1107I  1744
...........................................................................
.
 Prototype                :SnaI
 Microorganism            :Bacillus stearothermophilus RFL1107
 Source                   :A.A. Janulaitis
 Methylation site         :
 Commercial availability  :
        Angewandte Gentechnologie Systeme
        Fermentas AB
        Takara Shuzo Co. Ltd.
        Boehringer-Mannheim
 New England Biolabs Refs :457
.........................................................................
Looking for Isoschizomers. (*):Commercialy available)
        ( ) BspM90I  GTA^TAC.
        (*) BssNAI  GTA^TAC.
             also available at:
                        SibEnzyme Ltd.
        ( ) BstBSI  GTA^TAC.
        (*) BstZ17I  GTA^TAC.
             also available at:
                        New England BioLabs
        ( ) XcaI  GTA^TAC.

...........................................................................
Restriction Map
...........................................................................

                                     SACI   SALI
                               ECORI :      hincii
   styi                        acsi  :      acci         ECO52I
   NCOI                   alwi :     ECL136II            eaei
   bstdsi                xhoii :     bsp1286i            bsh1285i
MSCI           xmni      BAMHI :     banii  :           NOTI
eaei    ecorv  : vspi    alwi  :     alw21i :     HINDIII:
:  :    :      : :       ::    :     :      :     :     ::
TGGCCATGGATATCGGAATTAATTCGGATCCGAATTCGAGCTCCGTCGACAAGCTTGCGGCCGCA \ 5266
    ¥         ¥         ¥         ¥         ¥         ¥         ¥  \
ACCGGTACCTATAGCCTTAATTAAGCCTAGGCTTAAGCTCGAGGCAGCTGTTCGAACGCCGGCGT   \ 5330
T  P  T :I  S  E :L  I  R::I  R: I  R: A  P :S  T :S  L :R  P  H  ->
:A :M  D: I  G :I: N  S  D: P  N  S  S  S  V: D  K: L  A::A  A  L ->
: H: G  Y  R  N: *  F  G :S  E :F  E :L  R  R  Q  A  C  G: R  T   ->
: P: V  $  L  R: L  *  A :$  A :*  A :R  P  L  Q  E  F  A: P  T


...........................................................................
Intersections
...........................................................................

                            _______________________________
                            |    VECTOR    |    INSERT    |
                            _______________________________
                            |  IN  |  OUT  |  IN  |  OUT  |
___________________________________________________________
  AatII    [GACGT^C       ] |    0 |    1  |    0 |    1  |
  AccI     [GT^MKAC       ] |    1 |    3  |    1 |    3  |
  AccIII   [T^CCGGA       ] |    0 |    0  |    0 |    0  |
  Acc65I   [G^GTACC       ] |    0 |    1  |    2 |    0  |
  AccBSI   [CCGCTC(-3/-3) ] |    0 |    3  |    0 |    3  |

(...)

  VspI     [AT^TAAT       ] |    0 |    6  |    3 |    6  |
  XbaI     [T^CTAGA       ] |    0 |    0  |    2 |    1  |
  XcmI     [CCANNNNN^NNNNT] |    0 |    2  |    0 |    0  |
  XhoI     [C^TCGAG       ] |    0 |    0  |    0 |    1  |
  XhoII    [R^GATCY       ] |    2 |    7  |    2 |    6  |
  XmaI     [C^CCGGG       ] |    1 |    0  |    1 |    0  |
  XmnI     [GAANN^NNTTC   ] |    0 |    4  |    0 |    3  |
___________________________________________________________
                            |  IN  |  OUT  |  IN  |  OUT  |
                            _______________________________
                            |    VECTOR    |    INSERT    |
                            _______________________________





More information about the Bio-soft mailing list