*Announce* BOXSHADE v3.2 available

Unknown Unknown
Tue Apr 16 06:24:29 EST 1996


**************************ANNOUNCEMENT**********************************
*                                                                      *
*                     BOXSHADE version 3.2                             *
*                                                                      *
************************************************************************


A new version of BOXSHADE, the PASCAL program intended for 
shading multiple aligned sequence files, is now available from the EMBL 
software archive. The upgrade to version 3 was never properly announced,
so the following details, extracted from the documentation 
(BOXSHAD3.DOC), are to bring us up to date.


------------------------BOXSHADE v3.2--------------------------------

The major change is the addition of PICT file output. PICT files are 
standard on Macs, and are also read by most of the PC drawing packages 
(e.g. Corel, Harvard Graphics, FreeLance for Windows). The files have 
been tested on every Mac and PC program I could get my hands on, and 
import correctly. Extra notation, outlining regions, etc., can now be 
done to the shaded alignment, which can also be in colour (text and 
shading).
*Nothing* is perfect in this world, however, and an editable figure 
containing (for 6 sequences of 300 amino acids each) about 4000 
editable objects will give most computers a headache unless they are 
very fast and have buckets of free RAM. The image that is imported is 
'grouped', and can be moved around and sized, and things put over or 
behind it. If you 'ungroup' it, you will have problems. Be warned!!
Partly because of this problem, I have included the option of 
'printing' either the shading *only*, or both shading and text. The 
text alone, without modification, can be produced using the ASCII file 
output option, or with PRETTY or other programs. This can be put on 
top of the shading. The shading can be resized to fit the text, as 
required. This procedure gives you less than half the number of 
'objects', and makes everything quicker.

A few other small things have been tidied up: the 'master' sequence 
can be set in the command line for 'comparison to a single sequence, 
and /split is enforced for EPSF and PICT output. I tried to take care 
of the situation where the block is bigger than the page (anyone out 
there aligning 100 sequences?). Note that the program will still allow 
you to put too many characters on the page for PS, EPSF, HPGL, FIG and 
PICT output, where different font sizes are possible, but, hey, we're 
all intelligent people, right?



-------------------- BOXSHADE    3.0 -----------------------------------

This is version 3.0 rather than 2.8 because the shading algorithm has 
been completely rewritten, and is now more sophisticated. Basically, 
MDB wrote it to do what he would do if he was shading an alignment by 
hand. As an unsophisticated molecular biologist, he hopes it will meet 
the requirement of all the other unsophisticated molecular biologists 
out there. The full description of the shading process is below.

Also in this version, the introduction of a 'threshold' fraction of 
residues that must agree before there is a consensus at that position.

Two more output formats have been included. These are a .FIX file for 
the Unix program xfig, and ASCII text output, with varying or conserved 
residues printed. 

-------------------------------------------------------------------------
 SHORT PROGRAM DESCRIPTION
-------------------------------------------------------------------------

- Purpose -

BOXSHADE is a program for creating good looking printouts from multiple-
aligned protein or DNA sequences. The program does no alignment by 
itself,
it has to take as input a file preprocessed by a multiple alignment 
program
or a multiple file editor. See below for a list of supported input 
formats
and output devices.
In the standard BOXSHADE output, identical and similar residues in the
multiple-alignment chart are represented by different colors or shadings.
There are some more options concerning the kind of shading to be applied,
sequence numbering, consensus output and so on.
The user interface is a bit clumsy at the moment, one has to answer a lot
of questions in order to get the desired output. There is, however, the
possibility to use default parameters from a standard parameter file or 
to
supply the program with parameters from the command line.
At the moment, the VMS and DOS versions of BOXSHADE have identical user
interfaces.


- Input formats -

BOXSHADE 3.2 knows about the following input file formats:
 (some of them are generally used only for MSDOS or VMS systems)
 + CLUSTAL and CLUSTALV, multiple alignment program, DOS/VMS/MAC
   default extension .ALN
 + ESEE, multiple sequence editor, DOS
   default extension .ESE
 + PHYLIP, phylogenetic analysis package, DOS, VMS, UNIX
   default extension .PHY
 + PILEUP and PRETTY of the GCG sequence analysis package VMS/UNIX
   default extensions .MSF and .PRE 
   NB!!  you are strongly encouraged NOT to use the PRETTY format as 
input, it may be incompatible with the revised version of  .MSF input. 
We can't actually think why anyone would use this format now, .MSF 
files are more useful generally.
 + MALIGNED, multiple sequence editor, VMS only
   default extension .MAL
BOXSHADE tries to determine the file type from the extension but will 
work
also if different extensions are used.


- Output devices -

BOXSHADE 3.2 supports the following output devices
  + POSTSCRIPT/EPS creates POSTSCRIPT(TM) files for printing on a 
Laserprinter
    or for further conversion with a POSTCRIPT interpreter (like 
GHOSTSCRIPT)
  + HPGL for export to various graphics programs or for 
conversion/printing
    with the shareware program PRINTGL. Plotting BOXSHADE output on a 
plotter
    is generally not recommended
  + RTF for export to various word-processing and graphics programs
  + CRT, uses direct screen writes to the PC-monitor. Possible options 
depend
    on the graphics adapter used. This output device is supported only in
    the MSDOS version.
  + ANSI. On a PC, this option uses an ANSI device driver (ANSI.SYS) that 
has 
    to be loaded in CONFIG.SYS previously. Possible character renditions 
are 
    reverse, bold,underlined, blinking etc. On non-DOS systems, this 
option
    behaves more or less like the VT100 output mode.
  + VT100 for display on a VT100 compatible terminal or emulator.
  + ReGISterm for display on a ReGIS compatible graphics terminal or 
emulator. 
  + ReGISfile for later conversion by the program RETOS (copyright DEC)
    in order to print on DIGITALs printer series.
  + LJ250 for printing on DIGITALS LJ250 color printer.
  + ASCII output showing either the conserved residues or the varying 
    ones (others as '-').
  + FIX file for xfig 2.1. 
  + PICT file for MAC and PCs


- Shading strategies (similarity to consensus or single sequence) -

Starting with version 3, BOXSHADE has a new shading system. The first 
difference is the introduction of a threshold fraction of residues 
that must agree for there to be a consensus. Previously, the program 
assumed that SOME residue was always the consensus. If no two residues 
were the same, the first sequence provided the consensus residue. The 
threshold fraction can be any number between 0.0 and 1.0. The number 
of sequences that must agree for there to be a consensus is, as you 
might expect, this fraction times the total number of sequences in the 
alignment (fractions of a sequence count as one, e.g. 3.2 becomes 4).

The second difference is the idea of 'consensus by similarity'; this 
tries to take account of the situations where all the sequences may 
have (for example) R or K at a position, but neither in a majority. It 
would not be logical to shade one type of residue as 'identical' 
and the other as 'similar'; the threshold function might also 
eliminate both as being in too small numbers. Therefore, if there is not 
a 
single residue that is conserved (greater than the threshold) at a 
position, the program looks for a 'group' of amino acids that fulfills 
the requirements. 'Groups' are defined in the .grp files. Users can 
tailor these to their personal prejudices. Any amino acid not listed 
is assumed not to be in a group. 

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

PROGRAM AVAILABILITY:

The full source is distributed with the archive. Anyone is welcome 
(indeed encouraged!) to recompile this for other platforms. Executables
are available for VAX/VMS (boxshad3.uue in the VAX subdirectory) and 
AXP/OSF1 (box_exe_axposf1.tar.Z in the unix subdirectory). Peter A. 
Stockwell (peter at sanger.otago.ac.nz) has ported it to DEC's Ultrix 4.4 
and Sun/Solaris 2.3. There is a DOS executable of version 3.0 in the DOS 
subdirectory of the EMBL archive (Sorry, we *don't* have facilities to
make new compiled DOS versions. Anyone out there with TurboPascal care to 
help?) 

The program can also be run from ISREC's web page. The URL for the 
boxshade server is under http://ulrec3.unil.ch/software

The number of options available via WWW is somewhat reduced, not only
to make things simpler, but mainly since the WWW forms interface does not
offer conditional menus.
Maybe one day someone will write a Java version... 


Kay Hofmann
Bioinformatics group
ISREC
CH-1066 Epalinges s/Lausanne
Switzerland
E-mail: khofmann at isrec-sun1.unil.ch

Michael D. Baron
Institute for Animal Health
Ash Road
Pirbright
Surrey GU24 0NF
U.K.
E-mail: michael.baron at bbsrc.ac.uk





More information about the Bio-soft mailing list