From owner-embldatabank@net.bio.net Wed Mar 03 22:00:00 1993
Path: biosci!daresbury!buzz.bmc.uu.se!embl-heidelberg.de!omond
From: omond@embl-heidelberg.de (Roy Omond)
Newsgroups: bionet.general,bionet.molbio.embldatabank,embnet.general
Subject: Scheduled downtime at EMBL
Message-ID: <1993Mar4.161720.74650@embl-heidelberg.de>
Date: 4 Mar 93 15:17:20 GMT
Organization: EMBL, European Molecular Biology Laboratory
Lines: 27
Xref: biosci bionet.general:4226 bionet.molbio.embldatabank:151


Notice of scheduled downtime:

The entire main VAXcluster at the EMBL will be offline on Sunday 7-Mar-93
from 10:30 MET until ca. 23:00 MET.    We are installing a new release of
VMS, and doing "other housekeeping stuff".

All of EMBL's eMail-based services will be unavailable during this time
(although most mailers should queue files until we're back online again).

This includes the following services:

	Netserv		:	network file server
	FASTA		:	FASTA server
	QUICK		:	GCG Quicksearch server
	Blitz		:	Sequence search server on MasPar MP-1
	PredictProtein	:	Secondary structure prediction server

*Not* affected are EMBL's anonymous ftp server at ftp.EMBL-Heidelberg.DE
and the gopher hole at the same address.

Apologies for the downtime :-)

Roy Omond
System Manager/PostMaster etc. etc.
European Molecular Biology Laboratory
Heidelberg, Germany.

From owner-embldatabank@net.bio.net Sun Mar 07 22:00:00 1993
Path: biosci!agate!howland.reston.ans.net!zaphod.mps.ohio-state.edu!cs.utexas.edu!tamsun.tamu.edu!tamuts!gco5031
From: gco5031@tamuts.tamu.edu (Guilherme Co. Oliveira)
Newsgroups: bionet.molbio.embldatabank
Subject: Blast search
Message-ID: <1ng1hkINN54j@tamsun.tamu.edu>
Date: 8 Mar 93 17:59:16 GMT
Organization: Texas A&M University, College Station
Lines: 7
NNTP-Posting-Host: tamuts.tamu.edu

Could anyone please send me information on how to
do the BLAST search (mail format, and address for
submission)?
Thanks in advance,
	Guilherme Oliveira
	Department of Biology
	Texas A&M University

From owner-embldatabank@net.bio.net Mon Mar 08 22:00:00 1993
Path: biosci!embl-heidelberg.de!cameron
From: cameron@embl-heidelberg.de
Newsgroups: bionet.announce,bionet.molbio.embldatabank
Subject: European Bioinformatics Institute - Press release
Message-ID: <1993Mar9.172445.75382@embl-heidelberg.de>
Date: 9 Mar 93 16:24:45 GMT
Sender: kristoff@net.bio.net
Organization: EMBL, European Molecular Biology Laboratory
Lines: 63
Approved: bionews-moderator@net.bio.net
Xref: biosci bionet.announce:393 bionet.molbio.embldatabank:153


              The European Bioinformatics Institute (EBI)
                         A New EMBL Outstation
                           EMBL Press Release
                        Heidelberg, 9 March 1993

The EMBL Council decided today that its recently  announced  Outstation,
the  European  Bioinformatics  Institute  (EBI)  will  be  located  near
Cambridge in the UK.  Its goal will be to ensure that the  growing  body
of   information   from   molecular   biology  and  genome  research  is
conveniently accessible to all facets of  the  European  scientific  and
biotechnology  community  in  ways which promote scientific progress and
global competitiveness.  The EBI will expand on the  work  of  the  EMBL
Data  Library,  which  was established more than a decade ago to build a
database of nucleotide sequences, and today supplies this and many other
kinds of biological information to scientists throughout the world.

Today's competitive biological research and biotechnology industries are
crucially   dependent   on   up-to-date  information.   Modern  research
scientists  and  biotechnologists  are  becoming  intensive   users   of
electronic  information  resources  -  computers, software, networks and
databases, and the EBI will be a European focus for the  development  of
and access to products and services based on these technologies.

The  EBI  will  continue  the  Data  Library's  work  in  preparing  and
distributing  databases  of biological interest in collaboration with an
extensive network of other independent European centres.  It will expand
in  areas  crucial  to  the  long-term achievement of this service goal,
which include:  technology tracking in biology and informatics, research
and development, training and user support.  Specific areas for research
will include comparison algorithms, networked information resources, and
new database design.

Within two years the EBI will employ almost 70 scientific and  technical
staff  in  a  custom-designed  building  on the Genome Campus at Hinxton
Park, south of Cambridge.  The site already houses  the  Sanger  Centre,
directed  by  Dr.   John  Sulston,  which  specialises  in  genome-scale
sequencing.

After evaluation of very credible offers from Germany,  Sweden  and  the
United  Kingdom, the decision on the UK location was reached at the EMBL
Council meeting of March 9.  Strong commitment from  the  UK  Government
and  scientific  community  resulted  in  a  proposal  from  the Medical
Research Council in partnership  with  the  Wellcome  Trust  to  provide
excellent   facilities   for  the  new  EMBL  Outstation.   The  British
Government has furthermore committed itself  to  conclude  an  agreement
with  EMBL  such that the EBI will enjoy the same privileges as EMBL has
in Germany and France.

The major part of the EBI operating budget is expected to come from EMBL
and  the  Commission  of  the  European  Communities  (EC) both of which
finance the present Data Library.  DG XII of the EC has  been  a  strong
supporter of bioinformatics throughout Europe and especially the work at
EMBL.

EMBL is an international, intergovernmental  organisation  dedicated  to
basic  research,  advanced  training  and  the  provision of services in
molecular biology.  It was founded in 1973 and has  become  one  of  the
premier  biological research centres in the world.  The EBI is the third
Outstation of  EMBL;  the  other  two,  located  in  Hamburg,  FRG,  and
Grenoble,  France,  provide  facilities  for  the analysis of biological
structures at the atomic level.  The EBI like the other two  Outstations
will be administered by EMBL.

From owner-embldatabank@net.bio.net Mon Mar 08 22:00:00 1993
Path: biosci!daresbury!buzz.bmc.uu.se!embl-heidelberg.de!cameron
From: cameron@embl-heidelberg.de
Newsgroups: bionet.molbio.embldatabank
Subject: European Bioinformatics Institute - Press release
Message-ID: <1993Mar9.172954.75384@embl-heidelberg.de>
Date: 9 Mar 93 16:29:54 GMT
Organization: EMBL, European Molecular Biology Laboratory
Lines: 63


              The European Bioinformatics Institute (EBI)
                         A New EMBL Outstation
                           EMBL Press Release
                        Heidelberg, 9 March 1993

The EMBL Council decided today that its recently  announced  Outstation,
the  European  Bioinformatics  Institute  (EBI)  will  be  located  near
Cambridge in the UK.  Its goal will be to ensure that the  growing  body
of   information   from   molecular   biology  and  genome  research  is
conveniently accessible to all facets of  the  European  scientific  and
biotechnology  community  in  ways which promote scientific progress and
global competitiveness.  The EBI will expand on the  work  of  the  EMBL
Data  Library,  which  was established more than a decade ago to build a
database of nucleotide sequences, and today supplies this and many other
kinds of biological information to scientists throughout the world.

Today's competitive biological research and biotechnology industries are
crucially   dependent   on   up-to-date  information.   Modern  research
scientists  and  biotechnologists  are  becoming  intensive   users   of
electronic  information  resources  -  computers, software, networks and
databases, and the EBI will be a European focus for the  development  of
and access to products and services based on these technologies.

The  EBI  will  continue  the  Data  Library's  work  in  preparing  and
distributing  databases  of biological interest in collaboration with an
extensive network of other independent European centres.  It will expand
in  areas  crucial  to  the  long-term achievement of this service goal,
which include:  technology tracking in biology and informatics, research
and development, training and user support.  Specific areas for research
will include comparison algorithms, networked information resources, and
new database design.

Within two years the EBI will employ almost 70 scientific and  technical
staff  in  a  custom-designed  building  on the Genome Campus at Hinxton
Park, south of Cambridge.  The site already houses  the  Sanger  Centre,
directed  by  Dr.   John  Sulston,  which  specialises  in  genome-scale
sequencing.

After evaluation of very credible offers from Germany,  Sweden  and  the
United  Kingdom, the decision on the UK location was reached at the EMBL
Council meeting of March 9.  Strong commitment from  the  UK  Government
and  scientific  community  resulted  in  a  proposal  from  the Medical
Research Council in partnership  with  the  Wellcome  Trust  to  provide
excellent   facilities   for  the  new  EMBL  Outstation.   The  British
Government has furthermore committed itself  to  conclude  an  agreement
with  EMBL  such that the EBI will enjoy the same privileges as EMBL has
in Germany and France.

The major part of the EBI operating budget is expected to come from EMBL
and  the  Commission  of  the  European  Communities  (EC) both of which
finance the present Data Library.  DG XII of the EC has  been  a  strong
supporter of bioinformatics throughout Europe and especially the work at
EMBL.

EMBL is an international, intergovernmental  organisation  dedicated  to
basic  research,  advanced  training  and  the  provision of services in
molecular biology.  It was founded in 1973 and has  become  one  of  the
premier  biological research centres in the world.  The EBI is the third
Outstation of  EMBL;  the  other  two,  located  in  Hamburg,  FRG,  and
Grenoble,  France,  provide  facilities  for  the analysis of biological
structures at the atomic level.  The EBI like the other two  Outstations
will be administered by EMBL.

From owner-embldatabank@net.bio.net Mon Mar 29 23:00:00 1993
Path: biosci!daresbury!cnbvx3.cnb.uam.es!mrege
From: mrege@cnbvx3.cnb.uam.es
Newsgroups: bionet.molbio.embldatabank
Subject: sequence not available
Message-ID: <1993Mar30.203442.22@cnbvx3.cnb.uam.es>
Date: 30 Mar 93 20:34:41 GMT
Organization: C.N.Biotecnologia,  CSIC
Lines: 32


Last year we submitted a DNA sequence to EMBL Databank with the corresponding
protein sequence deduced from it (bss gene from Serratia marcescens, encoding
for bacteriocin 28b). We marked it as not available until publication. 
X62454 was the accession number it was given.

Since this sequence was published on August 1992 (Viejo et al. Cloning and
DNA sequence analysis of a bacteriocin gene of Serratia marcescens'. Journal
of General Microbiology, 138:1737-1743), we thought it was made available
to database users. The fact is it's not, so we wonder:

	1. How long is the avg time from publication to release of a sequence?

	2. What is the correct way to submit such a sequence?

	3. Where can we address to update this entry?

	4. Are there a lot of sequences lost for the searches because of 
	people (like us) simply forgot 'resubmit' them after publication?

I don't know if this is a FAQ question, but we have not been able to find
in the submission form any suggestion (anything like 'Please do inform when
and where your sequence/s have been published to the following address...')


With compliments,


Dr. Miquel Regue			e-mail:regue@farmacia.ub.es
Dept of Microbiology
Faculty of Pharmacy
Barcelona (Spain)

From owner-embldatabank@net.bio.net Tue Mar 30 23:00:00 1993
Path: biosci!daresbury!bioftp.unibas.ch!comp.bioz.unibas.ch!doelz
From: doelz@comp.bioz.unibas.ch (Reinhard Doelz)
Newsgroups: bionet.molbio.embldatabank
Subject: Re: sequence not available
Message-ID: <1993Mar31.095957.17595@comp.bioz.unibas.ch>
Date: 31 Mar 93 09:59:57 GMT
References: <1993Mar30.203442.22@cnbvx3.cnb.uam.es>
Sender: usenet@comp.bioz.unibas.ch (NEWS transaction account)
Reply-To: doelz@urz.unibas.ch
Organization: EMBnet Switzerland [BASEL]
Lines: 29
Nntp-Posting-Host: biox.embnet.unibas.ch

In article <1993Mar30.203442.22@cnbvx3.cnb.uam.es>, mrege@cnbvx3.cnb.uam.es writes:
|> for bacteriocin 28b). We marked it as not available until publication. 
|> X62454 was the accession number it was given.
|> 
ID   S45302     backbone; DNA; PRO; 1568 BP.                             
AC   S45302; X62454;                                                        
DT   16-FEB-1993 (Rel. 34, Created)                                         
DT   16-FEB-1993 (Rel. 34, Last updated, Version 1)                         

RN   [1]                                                                    
RP   1-1568                                                                 
RA   Viejo M.B., Gargallo D., Ferrer S., Enfedaque J., Regue M.;            
RT   "Cloning and DNA sequence analysis of a bacteriocin gene of         
RT   Serratia marcescens.";
RL   J. Gen. Microbiol. 138:1737-1743(1992).
XX
CC   This entry [NCBI gibbsq 113316] was created by the journal scanning
CC   component of NCBI/GenBank at the National Library of Medicine. This
CC   sequence comes from Fig. 4.

-- 
+----------------------------------+-------------------------------------+
|    Dr. Reinhard Doelz            | RFC     doelz@urz.unibas.ch         |
|      Biocomputing                | DECNET  20579::48130::doelz         |
|Biozentrum der Universitaet       | X25     022846211142036::doelz      |
|   Klingelbergstrasse 70          | FAX     x41 61 261- 6760 or 267- 2078     
|     CH 4056 Basel                | TEL     x41 61 267- 2076 or 2247    |   
+------------- bioftp.unibas.ch is the SWISS EMBnet node ----------------+
               -----------------------------------------

From owner-embldatabank@net.bio.net Tue Mar 30 23:00:00 1993
Path: biosci!daresbury!daresbury!news
From: rlippens@ulb.ac.be (Lippens Renee)
Newsgroups: bionet.molbio.embldatabank
Subject: Re: sequence not available
Message-ID: <1993Mar31.093441.6087@gserv1.dl.ac.uk>
Date: 31 Mar 93 10:32:52 GMT
Sender: rlippens%be.ac.ulb@be.ac.kuleuven.cc1
Distribution: bionet
Lines: 44
Original-To: embl-db@uk.ac.daresbury
X-Mailer: ELM [version 2.3 PL11]

>Miquel Regue writes :
>
>Last year we submitted a DNA sequence to EMBL Databank with the corresponding
>protein sequence deduced from it (bss gene from Serratia marcescens, encoding
>for bacteriocin 28b). We marked it as not available until publication.
>	< text deleted >
>Since this sequence was published on August 1992 < text deleted >
>we thought it was made available to database users. The fact is it's not.
>	< text deleted >
>	4. Are there a lot of sequences lost for the searches because of
>       people (like us) simply forgot 'resubmit' them after publication?

	Answer to point 4. seems to be YES.
	Very recently I have experienced this problem for four sequences
  within 2 months.  I have lost a bundle of time and energy to find
  e-mail addresses and then to find out which one of GenBank, EMBL or
  DDBJ was in position to release the published information !!

	I hope that suggestions to shorten the delay between publication and 
  release of the sequence which will be expressed on the net, will be 
  considered by the databank managers.  Because, obviously, asking the authors
  to notify the databank when data has been published is not ideal. 

Below are some useful addresses to people looking desperatly for some
nucleic acid sequence :
EMBL		Kate Rice			Kate.Rice@EMBL-Heidelberg.DE
GenBank		Barbara Rapp			rapp@ncbi.nlm.nih.gov
						rose@weft.nlm.nih.gov
		Jolene Schwertfeger		jjs@temin.Lanl.GOV
DDBJ		citation and data update	ddbjsub@ddbj.nig.ac.jp
		Shigeko Suzuki			ssuzuki@genes.nig.ac.jp

-- 
  +_______________________________________________________________+
  +                                                               +
  +  Lippens Renee               | Internet : rlippens@ulb.ac.be  +
  +  Computer Support Assistant  | EARN     : rlippens@bbrbfu60   +
  +                                                               +
  +  Universite Libre de Bruxelles        | Fax   : 32 67 287777  +
  +  Departement de Biologie Moleculaire  | Phone : 32 67 287762  +  
  +  Service de Genetique Appliquee       |                       +
  +  Rue de l'Industrie, 24               | Internal postmail     +
  +  1400  Nivelles      Belgium          | CP 800                +
  +_______________________________________________________________+

From owner-embldatabank@net.bio.net Tue Mar 30 23:00:00 1993
Path: biosci!agate!ames!haven.umd.edu!uunet!mcsun!news.funet.fi!hydra!klaava!biotek5.pc.Helsinki.FI!roos
From: roos@Operoni.Helsinki.FI (Christophe Roos)
Newsgroups: bionet.molbio.embldatabank
Subject: Re: sequence not available
Message-ID: <roos.18@Operoni.Helsinki.FI>
Date: 31 Mar 93 14:51:41 GMT
References: <1993Mar30.203442.22@cnbvx3.cnb.uam.es>
Sender: news@klaava.Helsinki.FI (Uutis Ankka)
Organization: Institute of Biotechnology (Univ. Helsinki)
Lines: 25

In article <1993Mar30.203442.22@cnbvx3.cnb.uam.es> mrege@cnbvx3.cnb.uam.es writes:


>Last year we submitted a DNA sequence to EMBL Databank with the corresponding
>protein sequence deduced from it (bss gene from Serratia marcescens, encoding
>for bacteriocin 28b). We marked it as not available until publication. 
>X62454 was the accession number it was given.

>Since this sequence was published on August 1992 (Viejo et al. Cloning and
>DNA sequence analysis of a bacteriocin gene of Serratia marcescens'. Journal
>of General Microbiology, 138:1737-1743), we thought it was made available
>to database users. The fact is it's not, so we wonder: (...)

Once the paper has been accepted / published, an e-mail to "datasubs@embl-
heidelber.de" telling that accession number ...... can be made public makes 
life easier to those who keep the database up to date and can in 
general make sequences faster available to all of us.

I have encountered the situation with published sequences not being marked 
as public (they had been entered long ago). Again an e-mail to Heidelberg
has usually made it available within 24 hours. Each time it was a sequence 
published in Nature. Nature is by the way not requiring the sequences to be 
submitted to a database before publication: archaic !

Christophe Roos

From owner-embldatabank@net.bio.net Tue Mar 30 23:00:00 1993
Path: biosci!lhc!ray!dab
From: dab@ray.nlm.nih.gov (Dennis Benson)
Newsgroups: bionet.molbio.embldatabank
Subject: Re: sequence not available
Message-ID: <1993Mar31.192735.26903@nlm.nih.gov>
Date: 31 Mar 93 19:27:35 GMT
References: <roos.18@Operoni.Helsinki.FI>
Sender: news@nlm.nih.gov
Organization: National Library of Medicine
Lines: 44
X-Newsreader: Tin 1.1 PL4

roos@Operoni.Helsinki.FI (Christophe Roos) writes:
: 
: Once the paper has been accepted / published, an e-mail to "datasubs@embl-
: heidelber.de" telling that accession number ...... can be made public makes 
: life easier to those who keep the database up to date and can in 
: general make sequences faster available to all of us.
: 
: I have encountered the situation with published sequences not being marked 
: as public (they had been entered long ago). Again an e-mail to Heidelberg
: has usually made it available within 24 hours. Each time it was a sequence 
: published in Nature. Nature is by the way not requiring the sequences to be 
: submitted to a database before publication: archaic !
: 
: Christophe Roos
: 

Ditto for GenBank.  Users are encouraged to send mail to:

     update@ncbi.nlm.nih.gov

if they fail to find a sequence in the database that has appeared in print with
an accession number.  We route these messages to the database (NCBI-GenBank,
EMBL, DDBJ) that has the author's direct submission so that release can occur
and the record can be exchanged among the databases.  Before sending a message,
however, it would help us if users checked the updates for the missing sequence.
With the NCBI mail server a mail message as below will check the latest GenBank
updates, the updates from EMBL and DDBJ (which get incorporated within
24 hours of their release), plus the most recent full GenBank release.

Send a message in the form:

DATALIB genbank
BEGIN
M92819    (enter accession number(s) here)

to:

   retrieve@ncbi.nlm.nih.gov



Dennis Benson
National Center for Biotechnology Information
National Library of Medicine/NIH

From owner-embldatabank@net.bio.net Wed Mar 31 23:00:00 1993
Path: biosci!uwm.edu!wupost!darwin.sura.net!newsserver.jvnc.net!yale.edu!yale!hsdndev!husc-news.harvard.edu!husc.harvard.edu!robison1
From: robison1@husc10.harvard.edu (Keith Robison)
Newsgroups: bionet.molbio.embldatabank
Subject: Re: sequence not available
Message-ID: <robison1.733621222@husc.harvard.edu>
Date: 31 Mar 93 23:40:22 GMT
References: <1993Mar31.093441.6087@gserv1.dl.ac.uk>
Distribution: bionet
Lines: 27
Nntp-Posting-Host: husc10.harvard.edu

The best way to get data released is to send the accession number and
publication information to 

	update@ncbi.nlm.nih.gov


because NCBI then figures out who's responsible (the first letter in
the accession number will tell you, but you have to learn the code).
You of course do NOT have to be the original author -- I have used this
mechanism many times when stuff isn't in the database when the journal 
paper shows up.

	This is clearly not a complete solution to the problem.  There
is a related problem of released entries whose JOURNAL fields are stuck
at "in press" -- it makes tracking down citations a real nuisance.
I think that all entries marked "in press" or being held should be
periodically reviewed (perhaps after 3 months).  This, plus the journal
scanning by NCBI, should greatly reduce the number of entries which are
marooned in one fashion or another.


Keith Robison
Harvard University
Department of Cellular and Developmental Biology
Department of Genetics / HHMI

robison@biosun.harvard.edu 

From owner-embldatabank@net.bio.net Wed Mar 31 23:00:00 1993
Path: biosci!daresbury!daresbury!news
From: rlippens@ulb.ac.be (Lippens Renee)
Newsgroups: bionet.molbio.embldatabank
Subject: Re: sequence not available
Message-ID: <1993Apr1.090522.23287@gserv1.dl.ac.uk>
Date: 1 Apr 93 10:00:38 GMT
Sender: rlippens%be.ac.ulb@be.ac.kuleuven.cc1
Distribution: bionet
Lines: 24
Original-To: embl-db@uk.ac.daresbury
X-Mailer: ELM [version 2.3 PL11]

Keith Robison writes:
>
>	< Text deleted >
> There is a related problem of released entries whose JOURNAL fields
> are stuck at "in press" -- it makes tracking down citations a real nuisance.
> I think that all entries marked "in press" or being held should be
> periodically reviewed (perhaps after 3 months).
>                		       ^^^^^^^^
>	< Text deleted >

	The suggestion is good but 3 months is really too long,
  I feel that screening every month will be necessary.
  But why not ask to the Journals or Publishers to notify the databanks when
  a publication containing an Accession Number is issued.
-- 
  +_______________________________________________________________+
  +                                                               +
  +  Lippens Renee               | Internet : rlippens@ulb.ac.be  +
  +  Computer Support Assistant  | EARN     : rlippens@bbrbfu60   +
  +                                                               +
  +  Universite Libre de Bruxelles        | Fax   : 32 67 287777  +
  +  Departement de Biologie Moleculaire  | Phone : 32 67 287762  +  
  +  Service de Genetique Appliquee       | BELGIUM               +
  +_______________________________________________________________+

From owner-embldatabank@net.bio.net Wed Mar 31 23:00:00 1993
Path: biosci!TEMIN.LANL.GOV!pgil
From: pgil@TEMIN.LANL.GOV (Paul Gilna)
Newsgroups: bionet.molbio.embldatabank
Subject: Re: sequence not available
Message-ID: <9304011633.AA04460@temin.lanl.gov>
Date: 1 Apr 93 16:33:30 GMT
Sender: daemon@net.bio.net
Distribution: bionet
Lines: 104



Keith Robison makes the point:

> 	This is clearly not a complete solution to the problem.  There
> is a related problem of released entries whose JOURNAL fields are stuck
> at "in press" -- it makes tracking down citations a real nuisance.
> I think that all entries marked "in press" or being held should be
> periodically reviewed (perhaps after 3 months).  This, plus the journal
> scanning by NCBI, should greatly reduce the number of entries which are
> marooned in one fashion or another.
> 
and Renee Lippens suggests

> 	The suggestion is good but 3 months is really too long,
>   I feel that screening every month will be necessary.
>   But why not ask to the Journals or Publishers to notify the databanks when
>   a publication containing an Accession Number is issued.


Keith makes a good point regarding the lack of progression from "in
press" to published. However the solutions proposed all suggest that
the databases devote some effort to scanning the publication medium to
address the problem AFTER it`s occurred.

Let me ask the following (rhetorical please!) question: does someone
continually remind you to renew your driving permit or auto tax once
they have expired?  More to the point, does someone do it for you? Not
in this country, it is assumed that ownership of a vehicle or
possession of driving permit bring with it the responsibility for
keeping either current, you're lucky if you get a reminder at all.

When you submit your data to GenBank, you are explicitly asked
(extracted from GenBank submission acknowlegement text):

> 
> We list the citation associated with your sequence data as follows:
> 
>         rDNA sequences of Encephalitozoon hellem and Encephalitizoon 
>         cuniculi: species identification and phylogenetic construction
>         J. Eukaryot. Microbiol. (1993) In Press
> 
> Please inform us of changes in the citation or corrections to sequence data
> by sending electronic mail to
> 
> 	update@ncbi.nlm.nih.gov 

In other words, we have conferred upon the submitting author, the
responsibility for updating and maintaining his or her data.


Michael Cinkosky mentioned:

> During the
> time between submission and publication, we repeatedly contact the original
> submitters requesting information on the status of the publication and
> asking for permission to release the data.  In this way, many of the
> submissions that are originally confidential are still released prior to
> publication (with the authors permission).

We believe that we can provide a better and more effective service to
our submitting customer (and in turn the community) if we can help them
exercise that responsibility rather then have us do it for them.

At the moment, we are doing this relatively manually, but expect to
have automated a lot of this step within the next few months so that
"reminder" messages go out automatically on a periodic basis.  While we
hadn't specifically considered it before this, Keith's observation
suggests that extending this to other publication statuses (manuscript
in prep, submitted to "journal", unpublished, etc.,) would be a
worthwhile move.

The primary gain from this approach (in addition to reinforcing the
notion that responsibility for data maintenance lies with the owner of the
data) is that one increases one's chances of updating the data BEFORE
they go wrong.

Today, for each new sequence that we release to the nets, we release at
least another updated sequence, where the update was provided to us by
the submitting author. This suggests that along with the exponential
increase in volume of new data, the changes to those data are keeping
the same pace. Not long ago, I was fond of quoting the fact that
updates from authors represented a third of our output--I think this
vision thing is catching on!

Finally, I should mention that our contract with NCBI calls for us to
develop a data importing capability for their ASN.1 feed, which should
give us an automated method for flagging changes in publication status
for submissions which have been published; this should give us a
partial failsafe for those authors who have yet to catch on (even
there, however, I expect we would pass something back to the author)!


Cheers,

--paul


Paul Gilna
co-PI, GenBank/LANL


----- End Included Message -----


From owner-embldatabank@net.bio.net Wed Mar 31 23:00:00 1993
Path: biosci!daresbury!buzz.bmc.uu.se!embl-heidelberg.de!stoehr
From: stoehr@embl-heidelberg.de
Newsgroups: bionet.molbio.embldatabank
Subject: Re: sequence not available
Message-ID: <1993Apr1.225827.79625@embl-heidelberg.de>
Date: 1 Apr 93 21:58:27 GMT
References: <1993Mar31.093441.6087@gserv1.dl.ac.uk>
Distribution: bionet
Organization: EMBL, European Molecular Biology Laboratory
Lines: 29

In article <1993Mar31.093441.6087@gserv1.dl.ac.uk>, rlippens@ulb.ac.be
(Lippens Renee) writes:
> 	I hope that suggestions to shorten the delay between publication and 
>   release of the sequence which will be expressed on the net, will be 
>   considered by the databank managers.  Because, obviously, asking the authors
>   to notify the databank when data has been published is not ideal. 

Of course they will be considered. I agree, asking the authors to notify the
database is not the whole answer, but it helps when other more automated
mechanisms fail. Matching published to submitted sequences is surpisingly messy 
and labour-intensive if you have to rely on comparisons of sequence, author
names, species etc. We positively welcome interaction with sequence authors as
this often yields significant new information.

> Below are some useful addresses to people looking desperatly for some
> nucleic acid sequence :
> EMBL		Kate Rice			Kate.Rice@EMBL-Heidelberg.DE

Our Kate is indeed very useful and quick to help, but nobody will read her
private e-mail while she is away, let alone answer it.
PLEASE use the addresses:
 datalib@embl-heidelberg.de         (for any general query or flame)
 update@embl-heidelberg.de          (if you have details of updates to a
                                     database entry, including publication
                                     information)

Regards,
Peter Stoehr
EMBL Data Library

