NCBI UniGene files

Francis Durst francis.durst at bota-ulpnospam.u-strasbg.fr
Wed Jan 31 13:13:34 EST 2001


In article <3A776EF3.642CDBDC at staff.usyd.edu.au> Bill Blackhall <b.blackhall at staff.usyd.edu.au> writes:
>From: Bill Blackhall <b.blackhall at staff.usyd.edu.au>
>Subject: Re: NCBI UniGene files
>Date: Wed, 31 Jan 2001 12:48:36 +1100

>The NCBI files have a file extension of .cgi (what that means, I have no
>idea). They appear to be simple text files with each EST within them in
>fasta format. Each EST begins on a new line with the > symbol, then some
>text, and then the sequence starting on a new line. There is no trace
>data associated with them. Some of the files contain 100 or more ESTs,
>so copying and pasting into separate files could get tedious.

You may use Seqverter (www.genestudio.com/seqverter.htm) to
split multi-sequence fasta files into individual files (also fasta).

All the best

Francis
PS mine runs on Winnt 4, don't know about other OS






More information about the Staden mailing list