From owner-embl-db@hgmp.mrc.ac.uk  Thu Jul  6 14:49:26 2000
Return-Path: <owner-embl-db@hgmp.mrc.ac.uk>
Received: by mercury.hgmp.mrc.ac.uk (Postfix, from userid 110)
	id F37A917A90; Thu,  6 Jul 2000 14:49:25 +0100 (BST)
Received: by mercury.hgmp.mrc.ac.uk (Postfix, from userid 6014)
	id 7C1BC17AD0; Thu,  6 Jul 2000 14:49:24 +0100 (BST)
Received: by mercury.hgmp.mrc.ac.uk (Postfix, from userid 6024)
	id EFC2117A9C; Tue,  4 Jul 2000 17:32:27 +0100 (BST)
Received: from moderated.news.pipex.net (moderated.news.pipex.net [158.43.192.79])
	by mercury.hgmp.mrc.ac.uk (Postfix) with SMTP id 62BE8415D4
	for <bionet-molbio-embldatabank@net.bio.net>; Tue,  4 Jul 2000 17:32:22 +0100 (BST)
Received: (qmail 29663 invoked by alias); 4 Jul 2000 16:32:21 -0000
Received: (qmail 29659 invoked from network); 4 Jul 2000 16:32:20 -0000
Received: from mailer3.bham.ac.uk (147.188.128.54)
  by moderated.news.pipex.net with SMTP; 4 Jul 2000 16:32:20 -0000
Received: from bham.ac.uk ([147.188.128.127])
	by mailer3.bham.ac.uk with esmtp (Exim 3.02 #16)
	id 139VcR-0000cb-00
	for bionet-molbio-embldatabank@moderators.isc.org; Tue, 04 Jul 2000 17:32:19 +0100
Received: from usenet.bham.ac.uk ([147.188.128.47])
	by bham.ac.uk with esmtp (Exim 3.10 #1)
	id 139VcR-0007Te-00
	for bionet-molbio-embldatabank@moderators.isc.org; Tue, 04 Jul 2000 17:32:19 +0100
Received: from news by usenet.bham.ac.uk with local (Exim 3.03 #1)
	id 139VeD-0002if-00
	for bionet-molbio-embldatabank@moderators.isc.org; Tue, 04 Jul 2000 17:34:09 +0100
To: bionet-molbio-embldatabank@moderators.isc.org
From: Miklos Cserzo <miklos@pugh.bip.bham.ac.uk>
Newsgroups: embnet.general,bionet.molbio.embldatabank
Subject: Re: EMBL 63 available
Organization: The University of Birmingham news server
Message-ID: <Pine.SGI.4.21.0007041706380.22247-100000@pugh.bip.bham.ac.uk>
References: <395B6EF1.C08A3C0C@ebi.ac.uk>
Mime-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
X-Trace: usenet.bham.ac.uk 962728449 10442 147.188.9.20 (4 Jul 2000 16:34:09 GMT)
X-Complaints-To: usenet@usenet.bham.ac.uk
NNTP-Posting-Date: 4 Jul 2000 16:34:09 GMT
To: Peter Stoehr <stoehr@ebi.ac.uk>
Date: Thu,  6 Jul 2000 14:49:24 +0100 (BST)
Sender: owner-embl-db@hgmp.mrc.ac.uk
Precedence: bulk



On 29 Jun 2000, Peter Stoehr wrote:

Hi Folks,

> Release 63 of the EMBL Nucleotide Sequence Database is available from:
> the EBI ftp server and other mirror sites below:
>   
>   ftp://ftp.ebi.ac.uk/pub/databases/embl/release           (UK)
>   ftp://ftp.dk.embnet.org/pub/databases/embl               (Denmark)
>   ftp://ftp.es.embnet.org/pub/databases/embl/release       (Spain)
> 

how long does it normally take to download the full release? (This is the
fifth day for me between Hinxton and B'ham. :((()

miklos

Miklos Cserzo                                     University of Birmingham
c/o School of Biosciences                     MRC - Bioinformatics Project
Tel: +44-121-414-3037                  Schools of Biosciences and Medicine
Fax: +44-121-414-3982                        Edgbaston, Birmingham B15 2TT
E-mail: miklos@bip.bham.ac.uk                               United Kingdom





From owner-embl-db@hgmp.mrc.ac.uk  Fri Jul  7 10:40:00 2000
Return-Path: <owner-embl-db@hgmp.mrc.ac.uk>
Received: by mercury.hgmp.mrc.ac.uk (Postfix, from userid 110)
	id DD13D17B19; Fri,  7 Jul 2000 10:39:59 +0100 (BST)
Received: by mercury.hgmp.mrc.ac.uk (Postfix, from userid 6014)
	id E14A117B13; Fri,  7 Jul 2000 10:39:57 +0100 (BST)
Received: by mercury.hgmp.mrc.ac.uk (Postfix, from userid 6024)
	id 36F6217B90; Thu,  6 Jul 2000 16:32:17 +0100 (BST)
Received: from briar.org (backdraft.briar.org [192.207.123.123])
	by mercury.hgmp.mrc.ac.uk (Postfix) with ESMTP id 6FF31415DC
	for <bionet-molbio-embldatabank@net.bio.net>; Thu,  6 Jul 2000 16:32:15 +0100 (BST)
Received: (from smap@localhost) by briar.org (8.9.1b+Sun/8.6.12) id LAA16354 for <bionet-molbio-embldatabank@moderators.isc.org>; Thu, 6 Jul 2000 11:32:13 -0400 (EDT)
X-Authentication-Warning: backdraft.briar.org: smap set sender to <news@news.nottingham.ac.uk> using -f
Received: from pat.ccc.nottingham.ac.uk(128.243.40.194) by backdraft via smap (V2.1)
	id xma016351; Thu, 6 Jul 00 11:32:07 -0400
Received: from oyez.ccc.nottingham.ac.uk ([128.243.241.164] helo=news.nottingham.ac.uk)
	by nottingham.ac.uk with esmtp (Exim 3.13 #2)
	id 13ADbm-00071S-00
	for bionet-molbio-embldatabank@moderators.isc.org; Thu, 06 Jul 2000 16:30:34 +0100
Received: from news by news.nottingham.ac.uk with local (Exim 1.92 #1)
	for bionet-molbio-embldatabank@moderators.isc.org
	id 13ADdD-0004k4-00; Thu, 6 Jul 2000 16:32:03 +0100
To: bionet-molbio-embldatabank@moderators.isc.org
From: Keith Bradnam <keith@thale.life.nottingham.ac.uk>
Newsgroups: embnet.general,bionet.molbio.embldatabank
Subject: Re: EMBL 63 available
Organization: ACS, The University of Nottingham
Message-ID: <Pine.SOL.3.96.1000706162614.23901O-100000@thale>
References: <395B6EF1.C08A3C0C@ebi.ac.uk> <Pine.SGI.4.21.0007041706380.22247-100000@pugh.bip.bham.ac.uk>
Mime-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
X-Trace: oyez.ccc.nottingham.ac.uk 962897523 18230 128.243.116.150 (6 Jul 2000 15:32:03 GMT)
X-Complaints-To: usenet@news.nottingham.ac.uk
NNTP-Posting-Date: 6 Jul 2000 15:32:03 GMT
To: Miklos Cserzo <miklos@pugh.bip.bham.ac.uk>
X-Sender: keith@thale
Date: Fri,  7 Jul 2000 10:39:57 +0100 (BST)
Sender: owner-embl-db@hgmp.mrc.ac.uk
Precedence: bulk

On 6 Jul 2000, Miklos Cserzo wrote:

> 
> 
> On 29 Jun 2000, Peter Stoehr wrote:
> 
> Hi Folks,
> 
> > Release 63 of the EMBL Nucleotide Sequence Database is available from:
> > the EBI ftp server and other mirror sites below:
> >   
> >   ftp://ftp.ebi.ac.uk/pub/databases/embl/release           (UK)
> >   ftp://ftp.dk.embnet.org/pub/databases/embl               (Denmark)
> >   ftp://ftp.es.embnet.org/pub/databases/embl/release       (Spain)
> > 
> 
> how long does it normally take to download the full release? (This is the
> fifth day for me between Hinxton and B'ham. :((()

>From personal experience of downloading EMBL (or more specifically just
downloading the plant sequences in EMBL) I would say that it will take a
*long* time if you try and download it during the day, during the week.
If you can schedule to try and download it at the weekend, it can make a
big difference and you *should* be able to download the entire database.

I think that database growth is still exceeding developments in bandwidth
though so this might get worse before it gets better.

Keith

~  Keith Bradnam - Developer, Arabidopsis Genome Resource (AGR)
~  Nottingham Arabidopsis Stock Centre - http://nasc.nott.ac.uk/
~  University Park, University of Nottingham, NG7 2RD, UK
~  Tel: (0115) 951 3091 






From owner-embl-db@hgmp.mrc.ac.uk  Fri Jul  7 10:40:07 2000
Return-Path: <owner-embl-db@hgmp.mrc.ac.uk>
Received: by mercury.hgmp.mrc.ac.uk (Postfix, from userid 110)
	id 2770617B16; Fri,  7 Jul 2000 10:40:07 +0100 (BST)
Received: by mercury.hgmp.mrc.ac.uk (Postfix, from userid 6014)
	id 3C5F217B13; Fri,  7 Jul 2000 10:40:04 +0100 (BST)
Received: by mercury.hgmp.mrc.ac.uk (Postfix, from userid 6024)
	id 9279F17A60; Thu,  6 Jul 2000 14:56:12 +0100 (BST)
Received: from rutgers.rutgers.edu (bsd.rutgers.edu [165.230.4.71])
	by mercury.hgmp.mrc.ac.uk (Postfix) with ESMTP id 9E9BF415D4
	for <bionet-molbio-embldatabank@net.bio.net>; Thu,  6 Jul 2000 14:56:10 +0100 (BST)
Received: from cliff.niehs.nih.gov (root@cliff.niehs.nih.gov [157.98.8.7])
	by rutgers.rutgers.edu (8.8.8/8.8.8) with ESMTP id JAA18868
	for <bionet-molbio-embldatabank@moderators.isc.org>; Thu, 6 Jul 2000 09:56:08 -0400 (EDT)
Received: from cliff.niehs.nih.gov (IDENT:root@localhost [127.0.0.1])
	by cliff.niehs.nih.gov (8.9.3/8.9.3/NIEHS-POST-1.3) with ESMTP id JAA08438
	for <bionet-molbio-embldatabank@moderators.isc.org>; Thu, 6 Jul 2000 09:56:07 -0400
Received: from trollope.niehs.nih.gov (trollope.niehs.nih.gov [157.98.13.26])
	by cliff.niehs.nih.gov (8.9.3/8.9.3/NIEHS-PRE-1.3) with ESMTP id JAA08411;
	Thu, 6 Jul 2000 09:56:05 -0400
Received: by trollope.niehs.nih.gov with Internet Mail Service (5.5.2650.21)
	id <3FANWSYX>; Thu, 6 Jul 2000 09:56:20 -0400
Message-ID: <4D693F933DD8D311A18C00E018B00576021FBC8C@trollope.niehs.nih.gov>
From: "Staffa.Nick" <staffa@niehs.nih.gov>
To: "'Miklos Cserzo '" <miklos@pugh.bip.bham.ac.uk>,
	"'bionet-molbio-embldatabank@moderators.isc.org '" <bionet-molbio-embldatabank@moderators.isc.org>,
	"'Peter Stoehr '" <stoehr@ebi.ac.uk>
Subject: RE: EMBL 63 available
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain
Newsgroups: bionet.molbio.embldatabank
Date: Fri,  7 Jul 2000 10:40:04 +0100 (BST)
Sender: owner-embl-db@hgmp.mrc.ac.uk
Precedence: bulk

 'Twas a real pain for me here.
Genbank completed overnight, but between my mistakes, embl going down, and
general slowness of downloads, it was more than 3 days.

Nick Staffa
National Institute of Environmental Health Sciences
ITSS contract computer support
NIH
RTP, NC 
USA

>From the Birthplace of the Star Spangled Banner,
Baltimore Maryland

-----Original Message-----
From: Miklos Cserzo
To: bionet-molbio-embldatabank@moderators.isc.org; Peter Stoehr
Sent: 7/6/00 9:49 AM
Subject: Re: EMBL 63 available



On 29 Jun 2000, Peter Stoehr wrote:

Hi Folks,

> Release 63 of the EMBL Nucleotide Sequence Database is available from:
> the EBI ftp server and other mirror sites below:
>   
>   ftp://ftp.ebi.ac.uk/pub/databases/embl/release           (UK)
>   ftp://ftp.dk.embnet.org/pub/databases/embl               (Denmark)
>   ftp://ftp.es.embnet.org/pub/databases/embl/release       (Spain)
> 

how long does it normally take to download the full release? (This is
the
fifth day for me between Hinxton and B'ham. :((()

miklos

Miklos Cserzo                                     University of
Birmingham
c/o School of Biosciences                     MRC - Bioinformatics
Project
Tel: +44-121-414-3037                  Schools of Biosciences and
Medicine
Fax: +44-121-414-3982                        Edgbaston, Birmingham B15
2TT
E-mail: miklos@bip.bham.ac.uk                               United
Kingdom






From owner-embl-db@hgmp.mrc.ac.uk  Sat Jul  8 19:30:19 2000
Return-Path: <owner-embl-db@hgmp.mrc.ac.uk>
Received: by mercury.hgmp.mrc.ac.uk (Postfix, from userid 110)
	id B193117B44; Sat,  8 Jul 2000 19:30:18 +0100 (BST)
Received: by mercury.hgmp.mrc.ac.uk (Postfix, from userid 6014)
	id 5164B17A80; Sat,  8 Jul 2000 19:30:17 +0100 (BST)
Received: by mercury.hgmp.mrc.ac.uk (Postfix, from userid 6024)
	id 6901917B21; Fri,  7 Jul 2000 17:26:40 +0100 (BST)
Received: from moderated.news.pipex.net (moderated.news.pipex.net [158.43.192.79])
	by mercury.hgmp.mrc.ac.uk (Postfix) with SMTP id 3EA70415D5
	for <bionet-molbio-embldatabank@net.bio.net>; Fri,  7 Jul 2000 17:26:39 +0100 (BST)
Received: (qmail 29430 invoked by alias); 7 Jul 2000 16:26:38 -0000
Received: (qmail 29426 invoked from network); 7 Jul 2000 16:26:38 -0000
Received: from pat.ccc.nottingham.ac.uk (HELO nottingham.ac.uk) (128.243.40.194)
  by moderated.news.pipex.net with SMTP; 7 Jul 2000 16:26:38 -0000
Received: from oyez.ccc.nottingham.ac.uk ([128.243.241.164] helo=news.nottingham.ac.uk)
	by nottingham.ac.uk with esmtp (Exim 3.13 #2)
	id 13Aaw4-0003ne-00
	for bionet-molbio-embldatabank@moderators.isc.org; Fri, 07 Jul 2000 17:25:05 +0100
Received: from news by news.nottingham.ac.uk with local (Exim 1.92 #1)
	for bionet-molbio-embldatabank@moderators.isc.org
	id 13AaxW-0001mW-00; Fri, 7 Jul 2000 17:26:34 +0100
To: bionet-molbio-embldatabank@moderators.isc.org
From: Keith Bradnam <keith@thale.nott.ac.uk>
Newsgroups: bionet.molbio.embldatabank
Subject: RE: EMBL 63 available
Organization: ACS, The University of Nottingham
Message-ID: <Pine.SOL.3.96.1000707170611.408J-100000@thale>
References: <4D693F933DD8D311A18C00E018B00576021FBC8C@trollope.niehs.nih.gov>
Reply-To: Keith Bradnam <keith@thale.nott.ac.uk>
Mime-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
X-Trace: oyez.ccc.nottingham.ac.uk 962987194 6850 128.243.116.150 (7 Jul 2000 16:26:34 GMT)
X-Complaints-To: usenet@news.nottingham.ac.uk
NNTP-Posting-Date: 7 Jul 2000 16:26:34 GMT
X-Sender: keith@thale
Date: Sat,  8 Jul 2000 19:30:17 +0100 (BST)
Sender: owner-embl-db@hgmp.mrc.ac.uk
Precedence: bulk

On 7 Jul 2000, Staffa.Nick wrote:

>  'Twas a real pain for me here.
> Genbank completed overnight, but between my mistakes, embl going down, and
> general slowness of downloads, it was more than 3 days.


It's easy to imagine that in the (near) future we'll all be using Gigabit
(or higher) ethernet/internet connections and will laugh at the day we
were restricted to 100 Mbits per sec...but I'm not sure how much more
(seemingly exponential) sequence database growth will occur before any
major improvements in bandwidth occur.

I was processing EMBL 63 updates today for my Arabidopsis database and
found that there were 50,000 new Arabidopsis EST sequences in one day's
update! This one update represents a third of all the sequences we
currently have and a major jump in database size (especially when you add
on associated information such as blast homologies).

Therefore...

Does anybody know if anyone has looked at developing better compression
tools purposely for EMBL/GenBank records?  I know that GenBank has only
recently moved over to Gzip but I feel that there might be something
better that could be developed. This is based on the observation that most
new EMBL sequences appear to be very long and therefore the DNA part of
the sequence entry constitutes the greatest fraction of the total file
size (particularly for the HTG sequences where there is little
annotation).

As a test, I recently took a very long DNA sequence and tried different
compression programs on it and found...

Original sequence - 203,335 bases/bytes
pack - 56,298 bytes
compress - 57,711
winzip (maximum compression setting) - 60,427
gzip (maximum compression setting) - 60,457
gzip (default) - 61,996
winzip (default) - 63,072


So in this case, an older UNIX compression tool ('pack') beats the rest by
a small margin.  However, I wrote a dead simple script to further knock
the sequence down to 50,834 bytes, i.e. 75% compression which is easily
possibly if you encode 4 DNA characters as 1 bit of an 8-bit byte.

Of course this doesn't work for protein sequences, or where there are N's
in the DNA sequence, or for all the ancillarly information in an EMBL
record, and it's only a small saving.  But apply that saving in
compression to an entire database and you might save a few hours
downloading time.

Does anybody know of any research being done on this?  I know somebody at
Nottingham University who is kind of interested, but wants to know if
there would be interest in such a compression program...and for that to
happen I guess it would have to be accepted as a standard by all major
databases and made very easy to get hold of.

I can't help feel that extra compression could be gained by further
considering some of the more frequent hexanucleotides that occur in DNA
sequences.  

Anyone have any info/thoughts/views???

Keith

P.S. I accept that in some ways this is arguing about something that might
be blown out of the water by any new developments in bandwidth...but maybe
there is some mileage in this.

~  Keith Bradnam - Developer, Arabidopsis Genome Resource (AGR)
~  Nottingham Arabidopsis Stock Centre - http://nasc.nott.ac.uk/
~  University Park, University of Nottingham, NG7 2RD, UK
~  Tel: (0115) 951 3091 







From owner-embl-db@hgmp.mrc.ac.uk  Sat Jul  8 19:30:27 2000
Return-Path: <owner-embl-db@hgmp.mrc.ac.uk>
Received: by mercury.hgmp.mrc.ac.uk (Postfix, from userid 110)
	id 5858417B43; Sat,  8 Jul 2000 19:30:27 +0100 (BST)
Received: by mercury.hgmp.mrc.ac.uk (Postfix, from userid 6014)
	id 2227317AB2; Sat,  8 Jul 2000 19:30:26 +0100 (BST)
Received: by mercury.hgmp.mrc.ac.uk (Postfix, from userid 6024)
	id B30C617A71; Sat,  8 Jul 2000 00:22:43 +0100 (BST)
Received: from mailbox1.ucsd.edu (mailbox1.ucsd.edu [132.239.1.53])
	by mercury.hgmp.mrc.ac.uk (Postfix) with ESMTP id 3FA64415D4
	for <bionet-molbio-embldatabank@net.bio.net>; Sat,  8 Jul 2000 00:22:42 +0100 (BST)
Received: from Astrovan.cstone.net (astrovan.cstone.net [209.145.64.80])
	by mailbox1.ucsd.edu (8.9.3/8.9.3) with ESMTP id QAA15355
	for <bionet-molbio-embldatabank@moderators.isc.org>; Fri, 7 Jul 2000 16:22:39 -0700 (PDT)
Received: from box6.cho.cstone.net ([209.145.64.50]) by Astrovan.cstone.net
          (Post.Office MTA v3.5.3 release 223 ID# 0-59789U13500L1350S0V35)
          with ESMTP id net
          for <bionet-molbio-embldatabank@moderators.isc.org>;
          Fri, 7 Jul 2000 19:15:29 -0400
Received: (from news@localhost)
	by box6.cho.cstone.net (8.9.3/8.9.3) id TAA74215
	for bionet-molbio-embldatabank@moderators.isc.org; Fri, 7 Jul 2000 19:18:22 -0400 (EDT)
	(envelope-from news@mail.cstone.net)
To: bionet-molbio-embldatabank@moderators.isc.org
From: Michael Black <mbb8n@virginia.edu>
Newsgroups: embnet.general,bionet.molbio.embldatabank
Subject: Re: EMBL 63 available
Organization: University of Virginia
Message-ID: <396666CE.23ADCEDB@virginia.edu>
References: <395B6EF1.C08A3C0C@ebi.ac.uk> <Pine.SGI.4.21.0007041706380.22247-100000@pugh.bip.bham.ac.uk> <Pine.SOL.3.96.1000706162614.23901O-100000@thale>
Mime-Version: 1.0
Content-Type: multipart/mixed;
 boundary="------------8B869823E0EBBAB7A21E9560"
X-Trace: box6.cho.cstone.net 963011902 74211 209.145.71.5 (7 Jul 2000 23:18:22 GMT)
X-Complaints-To: abuse@cstone.net
NNTP-Posting-Date: 7 Jul 2000 23:18:22 GMT
X-Mailer: Mozilla 4.73 [en] (X11; I; Linux 2.2.15-4mdk i686)
X-Accept-Language: en
Date: Sat,  8 Jul 2000 19:30:26 +0100 (BST)
Sender: owner-embl-db@hgmp.mrc.ac.uk
Precedence: bulk

This is a multi-part message in MIME format.
--------------8B869823E0EBBAB7A21E9560
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit

When I first tried downloading from the ebi site, I was only seeing 2-4K/sec,
and after two days I killed the download.  The next day I switched to the
Danish site and got transfers of 230-300K/sec and got the entire release in
one working day (last Wednesday, during normal business hours).  Try the
mirrors and see if your speed inproves.

Keith Bradnam wrote:

> On 6 Jul 2000, Miklos Cserzo wrote:
>
> >
> >
> > On 29 Jun 2000, Peter Stoehr wrote:
> >
> > Hi Folks,
> >
> > > Release 63 of the EMBL Nucleotide Sequence Database is available from:
> > > the EBI ftp server and other mirror sites below:
> > >
> > >   ftp://ftp.ebi.ac.uk/pub/databases/embl/release           (UK)
> > >   ftp://ftp.dk.embnet.org/pub/databases/embl               (Denmark)
> > >   ftp://ftp.es.embnet.org/pub/databases/embl/release       (Spain)
> > >
> >
> > how long does it normally take to download the full release? (This is the
> > fifth day for me between Hinxton and B'ham. :((()
>
> >From personal experience of downloading EMBL (or more specifically just
> downloading the plant sequences in EMBL) I would say that it will take a
> *long* time if you try and download it during the day, during the week.
> If you can schedule to try and download it at the weekend, it can make a
> big difference and you *should* be able to download the entire database.
>
> I think that database growth is still exceeding developments in bandwidth
> though so this might get worse before it gets better.
>
> Keith
>
> ~  Keith Bradnam - Developer, Arabidopsis Genome Resource (AGR)
> ~  Nottingham Arabidopsis Stock Centre - http://nasc.nott.ac.uk/
> ~  University Park, University of Nottingham, NG7 2RD, UK
> ~  Tel: (0115) 951 3091

--
Michael Black, Ph.D.
Molecular Biology Computing Support
University of Virginia, ITC-ACHS
P.O. Box 800777
Charlottesville, VA
22908-0777
voice: (804)982-4039
fax: (804)982-4030
mblack@virginia.edu
--



--------------8B869823E0EBBAB7A21E9560
Content-Type: text/x-vcard; charset=us-ascii;
 name="mbb8n.vcf"
Content-Transfer-Encoding: 7bit
Content-Description: Card for Michael Black
Content-Disposition: attachment;
 filename="mbb8n.vcf"

begin:vcard 
n:Black;Michael
tel;fax:(894)982-4030
tel;work:(804)982-4039
x-mozilla-html:TRUE
url:http://www.people.virginia.edu/~mbb8n/
org:University of Virginia;ITC-Academic Computing Health Sciences
version:2.1
email;internet:mblack@virginia.edu
title:Molecular Biology Computing Support
adr;quoted-printable:;;University of Virginia=0D=0AITC-ACHS=0D=0AP.O. Box 800777;Charlottesville;VA;22908-0777;USA
x-mozilla-cpt:;0
fn:Michael Black, Ph.D.
end:vcard

--------------8B869823E0EBBAB7A21E9560--





From owner-embl-db@hgmp.mrc.ac.uk  Mon Jul 10 16:48:00 2000
Return-Path: <owner-embl-db@hgmp.mrc.ac.uk>
Received: by mercury.hgmp.mrc.ac.uk (Postfix, from userid 110)
	id BB6AE17B03; Mon, 10 Jul 2000 16:47:59 +0100 (BST)
Received: by mercury.hgmp.mrc.ac.uk (Postfix, from userid 6014)
	id 390D417A89; Mon, 10 Jul 2000 16:47:58 +0100 (BST)
Received: by mercury.hgmp.mrc.ac.uk (Postfix, from userid 6024)
	id 072F317AD2; Mon, 10 Jul 2000 12:59:28 +0100 (BST)
Received: from niobium.hgmp.mrc.ac.uk (niobium [193.62.192.41])
	by mercury.hgmp.mrc.ac.uk (Postfix) with ESMTP id 8221E415CF
	for <bionet-molbio-embldatabank@net.bio.net>; Mon, 10 Jul 2000 12:59:28 +0100 (BST)
Received: (from news@localhost)
	by niobium.hgmp.mrc.ac.uk (8.9.3+Sun/8.8.8) id MAA07006
	for bionet-molbio-embldatabank@net.bio.net; Mon, 10 Jul 2000 12:59:28 +0100 (BST)
To: bionet-molbio-embldatabank@net.bio.net
From: Peter Stoehr <stoehr@ebi.ac.uk>
Newsgroups: embnet.general,bionet.molbio.embldatabank
Subject: Re: EMBL 63 available
Organization: EMBL-EBI
Message-ID: <3969BA9F.F27B16AD@ebi.ac.uk>
References: <395B6EF1.C08A3C0C@ebi.ac.uk> <Pine.SGI.4.21.0007041706380.22247-100000@pugh.bip.bham.ac.uk> <Pine.SOL.3.96.1000706162614.23901O-100000@thale> <396666CE.23ADCEDB@virginia.edu>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
X-Trace: niobium.hgmp.mrc.ac.uk 963230367 7004 193.62.196.237 (10 Jul 2000 11:59:27 GMT)
X-Complaints-To: news@net.bio.net
NNTP-Posting-Date: 10 Jul 2000 11:59:27 GMT
X-Mailer: Mozilla 4.7 [en] (WinNT; I)
X-Accept-Language: en
Date: Mon, 10 Jul 2000 16:47:58 +0100 (BST)
Sender: owner-embl-db@hgmp.mrc.ac.uk
Precedence: bulk

That's good news about the use of the Danish site, and we will put more effort
into establishing more mirror sites at strategic network locations. We have had
over 150 sites now downloading the full release from EBI alone, about 600GB to
shift, and although we will soon improve our network connection further, it
will be important to get several accurate copies of releases (and updates) out
there to spread the load.

Regards,
Peter Stoehr
EMBL-EBI
 
Michael Black wrote:
> 
> When I first tried downloading from the ebi site, I was only seeing 2-4K/sec,
> and after two days I killed the download.  The next day I switched to the
> Danish site and got transfers of 230-300K/sec and got the entire release in
> one working day (last Wednesday, during normal business hours).  Try the
> mirrors and see if your speed inproves.
> 
> Keith Bradnam wrote:
> 
> > On 6 Jul 2000, Miklos Cserzo wrote:
> >
> > >
> > >
> > > On 29 Jun 2000, Peter Stoehr wrote:
> > >
> > > Hi Folks,
> > >
> > > > Release 63 of the EMBL Nucleotide Sequence Database is available from:
> > > > the EBI ftp server and other mirror sites below:
> > > >
> > > >   ftp://ftp.ebi.ac.uk/pub/databases/embl/release           (UK)
> > > >   ftp://ftp.dk.embnet.org/pub/databases/embl               (Denmark)
> > > >   ftp://ftp.es.embnet.org/pub/databases/embl/release       (Spain)
> > > >
> > >
> > > how long does it normally take to download the full release? (This is the
> > > fifth day for me between Hinxton and B'ham. :((()
> >
> > >From personal experience of downloading EMBL (or more specifically just
> > downloading the plant sequences in EMBL) I would say that it will take a
> > *long* time if you try and download it during the day, during the week.
> > If you can schedule to try and download it at the weekend, it can make a
> > big difference and you *should* be able to download the entire database.
> >
> > I think that database growth is still exceeding developments in bandwidth
> > though so this might get worse before it gets better.
> >
> > Keith
> >
> > ~  Keith Bradnam - Developer, Arabidopsis Genome Resource (AGR)
> > ~  Nottingham Arabidopsis Stock Centre - http://nasc.nott.ac.uk/
> > ~  University Park, University of Nottingham, NG7 2RD, UK
> > ~  Tel: (0115) 951 3091
> 
> --
> Michael Black, Ph.D.
> Molecular Biology Computing Support
> University of Virginia, ITC-ACHS
> P.O. Box 800777
> Charlottesville, VA
> 22908-0777
> voice: (804)982-4039
> fax: (804)982-4030
> mblack@virginia.edu
> --





