BLAST Software Update

usenet news poster usenet at nlm.nih.gov
Wed Sep 18 19:43:02 EST 1991


September 18, 1991

New versions of the Basic Local Alignment Search Tool (BLAST) family
of programs BLASTP, BLASTN, BLASTX, and BLAST3, along with a new addition
to the family, TBLASTN, are available for anonymous ftp on ncbi.nlm.nih.gov
(IP address 130.14.20.1).  Source code for these and other programs is
posted in the pub/blast, pub/dfa, pub/gish, and pub/ncbi subdirectories.

Features of the distribution include:

o  a new program TBLASTN, that compares an amino acid query sequence against
   a nucleotide sequence database.  By default, the database is dynamically
   translated and searched in all 6 reading frames (3 on each strand), with
   command line options to search just the "top" or just the "bottom" strand
   and to select from one of nine built-in genetic codes.

o  a new utility program named "pam" to generate PAM substitution matrices
   of any desired generation (from PAM 2 to PAM 511) and scale.

o  codons which contain IUPAC ambiguity letters are translated by BLASTX and
   TBLASTN into specific amino acids when such inference is possible.

o  second pass, low stringency (using a lower cutoff score) comparisons
   against a database sequence are performed by BLASTP and TBLASTN when they
   find at least one alignment against a sequence that satisfies the primary
   cutoff score.

o  improved report format, including:  a summary section of one-line
   sequence descriptions of those database sequences whose alignments
   satisfied the cutoff score; and a histogram of maximum alignment
   scores for all sequences in the database.

o  each of the 3 sections of the output report (histogram, one-line summaries,
   and sequence alignments) can be independently turned off or regulated
   by command line options.

o  BLASTDB and BLASTPAM environment variables can be used to point the
   programs to alternate directories for database and substitution
   matrix files.

o  tested compatibility with several UNIX platforms, including: Sun
   Microsystems SunOS 4.1.1, Silicon Graphics IRIX 3.3.2, Next 2.1,
   Apple A/UX 2.0.1, AIX/370 1.2.0; at least partial compatibility
   with Hewlett-Packard HP/UX and Cray Research Unicos.

o  support for multiprocessing on Silicon Graphics (SGI) PowerIRIS computers
   running SGI IRIX System V Release 3.3.2 or later, providing typical
   overall improvement in speed of about 7-fold on machines with 8 processors.

o  a new default word length (W) of 3 in the programs BLASTP, BLASTX, TBLASTN,
   and BLAST3, which reduces memory demands at a modest cost in overall
   execution speed.

o  rather than using fixed constants, the default values for parameters
   T (neighborhood word score threshold) and X (alignment extension drop-off
   score) are calculated dynamically, using ad hoc equations that utilize
   measures of information for each individual database query.

o  the default, calculated value for the neighborhood word score threshold
   T is now lower than in the previous distribution, to obtain high
   sensitivity by default (at the expense of increased execution time).

o  on systems which support it, improved utilities to manage memory-resident
   database files using UNIX System V interprocess communication facilities.

  Warren Gish                           phone:  (301) 496-2475, ext. 64
  Staff Fellow                          FAX:  (301) 480-9241
  National Center                       Internet:  gish at ncbi.nlm.nih.gov
     for Biotechnology Information
  National Library of Medicine
  Building 38A, Room 8S-806
  8600 Rockville Pike
  Bethesda, MD 20894-0001




More information about the Bio-soft mailing list