New version of tacg, a restriction enzyme mapping tool for unix

Harry Mangalam mangalam at
Sat Sep 28 00:40:13 EST 1996

tacg, a program for the restriction enzyme analysis of DNA. << Release 1.5 >>
by Harry Mangalam, UC Irvine (mangalam at, 714 824 4824)

tacg is a character-based, command line tool for the restriction enzyme 
analysis of DNA for unix-like operating systems.  

Briefly, it takes sequence from stdin, strips out anything not an 'a', 'c', 
't', or 'g', filters the resulting sequence thru the restriction enzyme 
database to generate the restriction sites and then does some basic 
calculations on that data to generate enzyme cut-frequency summary tables, 
tables of site and fragment data, a linear map (with or without 
translations), pseudo-graphical ladder and summary maps as well as gel 
simulations (using a simple log fit).  Enzymes can be filtered by a variety 
of criteria; these affect what is printed as output.

For the impatient, here's how one might use it:

tail +44 seq.file | tacg -n6  -o5  -F2  -l -g10 -L -w90 >

Translation: chop off the top 44 lines of seq.file (tail +44 seq.file) and 
pipe (|) the resulting sequence to tacg, returning info on all 6+ cutters 
(-n6) that generate 5' overlaps (-o5), giving me the sorted fragment sizes 
of those enz's that match (-F2) and both ladder (-l) and gel (-g10) map, 
along with the default linear restriction map (-L) and 1 letter, 1 frame 
translation (the default) and write the output 90 characters wide (-w90) to 
a file called

A longer version of this document, examples of output, binaries, and source code
are at:

Binaries have been compiled on and are available for a variety of 
Unix-related Operating Systems:

   Silicon Graphics/IRIX 5.3 (R3000, R4X00)
   Sun SPARC/SunOS (5.3, 4.1.3)
   DEC Alpha/DEC Unix aka OSF/1 (V3.0/347)
   DECStations (MIPS R3000-based)/Ultrix (4.2)
   Convex C3480/ConvexOS
   NeXT boxes (Black)/NeXTSTEP (3.2) 
   Intel PCs/Linux (1.2.8(486), 2.0.0 (586))

Source code has been uploaded to Don Gilbert's IUBIO archive and should 
eventually come to rest at:

as well as Sunsite's Linux archive:
(latter also contains the Linux binaries)

Both executables and the ANSI C source code are freely available for 
nonprofit purposes. 

I'd also be interested in hearing suggested refinements and critiques.

Harry J Mangalam, MolBio+Biochem / Dev+Cell Bio, Rm 4201, BioSciII  UC Irvine, Irvine, CA, 92717, (714) 824-4824, fax (714) 824 8598

More information about the Bio-soft mailing list