MSA 2.0 (Multiple Sequence Alignment)
Alex Schaffer
schaffer at cs.rice.edu
Sun Mar 19 00:51:45 EST 1995
I am pleased to announce the availability of MSA version 2.0, which is
an improved version of the MSA program to do multisequence alignment
under the sum-of-pairs cost criterion. Version 2.0 is distributed by
Alejandro Schaffer (schaffer at cs.rice.edu). Please contact him if you
have any problems with MSA 2.0.
MSA version 1.0 was written by J. Kececioglu, S. Altschul, D. Lipman,
and R. Miner and distributed in 1989, and it was briefly described in:
1. D. Lipman, S. Altschul and J. Kececioglu, "A Tool for Multiple
Sequence Alignment", Proc. Natl. Acad. Sci. USA 86 (1989) 4412-4415.
MSA 2.0 is an improved version that uses substantially less space and
time. It was developed by S. K. Gupta and A. A. Schaffer,
with help from some notes by J. Kececioglu. It is described in:
2. S. K. Gupta, J. Kececioglu, and A. A. Schaffer, "Making the
Shortest-Paths Approach to Sum-of-Pairs Multiple Sequence Alignment
More Space Efficient in Practice", (extended abstract) to appear in
Proc. 6th Annual Combinatorial Pattern Matching conference (CPM '95);
(full paper) submitted for publication.
Paper 2 describes the improvements and also includes a more detailed
explanation of the most important algorithm in MSA 1.0. A copy of
paper 2 is included as paper.ps with the MSA 2.0 distribution.
Some other papers that helped develop the theory behind MSA are:
3. Carrillo & Lipman, "The Multiple Sequence Alignment Problem in Biology",
SIAM J. Appl. Math. 48 (1988) 1073-1082;
4. Altschul & Lipman, "Trees, Stars, and Multiple Biological Sequence
Alignment", SIAM J. Appl. Math. 49 (1989) 197-209;
5. Altschul, "Gap Costs for Multiple Sequence Alignment",
J. Theor. Biol. 138 (1989) 297-309;
6. Altschul, Carroll & Lipman, "Weights for Data Related by a Tree",
J. Molec. Biol. 207 (1989) 647-653;
7. Altschul, "Leaf Pairs and Tree Dissections",
SIAM J. Discrete Math. 2 (1989) 293-299;
MSA is designed to run under the UNIX operating
system, and generally requires several megabytes of memory.
The form that input to the program is required to take is
described in the accompanying manual page.
MSA is available by anonymous ftp from softlib.cs.rice.edu as follows:
ftp softlib.cs.rice.edu
login as anonymous
leave *full* e-mail address as password
cd pub/msa
Retrieve either msa.tar.Z or all the other files in the directory.
close
quit
If you retrived the files all together as msa.tar.Z, then run:
uncompress msa.tar.Z
tar xvf msa.tar
and this will pull the files apart and leave them in a subdirectory called
softlib.
To make a compiled version of msa, run
make msa
To try the program, run:
msa SampleData.txt
If you wish to be kept apprised of all bug fixes and other updates,
please send e-mail to Alejandro Schaffer (schaffer at cs.rice.edu).
More information about the Bio-soft
mailing list