IUBio Biosequences .. Software .. Molbio soft .. Network News .. FTP

GCG and EMBOSS format public biosequence data availability

Don Gilbert gilbertd at bio.indiana.edu
Thu Mar 7 16:43:45 EST 2002

We are making GCG plus EMBOSS format databanks of recent GenBank
DNA databank plus non-redundent EMBL, GenPept, PIR and SwissProt
available on a trial basis for public use.  
You can fetch these data from IUBio Archive:


 Mar  6 22:36 Readme
 Mar  6 22:33 emboss.default
 Mar  6 22:15 gcgdbconfigure/
 Mar  6 18:50 gcgembl/       (rel 69, non-redundant w/ genbank)
 Mar  7 00:14 gcggenbank1/   (core genbank, release 128)
 Mar  7 13:48 gcggenbank2/   (est,gss of rel 128)
 Mar  6 19:11 gcggenpept/ 
 Mar  6 18:46 gcgpir/
 Mar  6 19:04 gcgswissprot/

These are gzip compressed, but otherwise should drop into a GCG 
system with minor editing of the gcgdbconfigure file 
paths.  Included are EMBOSS package indices with each data set
(total size about 60 GB uncompressed; 20 GB compressed).

This is a trial to see if those of you
who support GCG/EMBOSS want such a pre-digested set of
data + indices.  Let us know if you find it useful.

-- Don Gilbert

-- d.gilbert--bioinformatics--indiana-u--bloomington-in-47405
-- gilbertd at bio.indiana.edu


More information about the Bio-soft mailing list

Send comments to us at biosci-help [At] net.bio.net