TrEMBL release 12 announcement

Claire O'Donovan odonovan at ebi.ac.uk
Wed Nov 24 05:44:42 EST 1999


 
-------------- next part --------------

INTRODUCTION
============

TrEMBL is a protein sequence database supplementing the SWISS-PROT
Protein Sequence Data Bank. TrEMBL contains the translations of all
coding sequences (CDS) present in the EMBL Nucleotide Sequence
Database not yet integrated in SWISS-PROT. TrEMBL can be considered
as a preliminary section of SWISS-PROT. For all TrEMBL entries
which should finally be upgraded to the standard SWISS-PROT
quality, SWISS-PROT accession numbers have been assigned.


RELEASE 12.0 OF TrEMBL
=====================

This TrEMBL release was created from the EMBL Nucleotide Sequence Database 
release 60 and contains 276'472 sequence entries, comprising 75'524'740 amino
acids. To minimize redundancy, the translations of all coding sequences (CDS)
in the EMBL Nucleotide Sequence Database already included in SWISS-PROT release
38 and updates up to 18.10.1999 have been removed from TrEMBl release 12.

TrEMBL is split in two main sections; SP-TrEMBL and REM-TrEMBL:

SP-TrEMBL (SWISS-PROT TrEMBL) contains the entries (225'878),
which should be eventually incorporated into SWISS-PROT.
SWISS-PROT accession numbers have been assigned for all SP-TrEMBL
entries.

SP-TrEMBL is organized in subsections:

arc.dat (Archaea):             10017 entries
fun.dat (Fungi):                7130 entries
hum.dat (Human):                9417 entries
inv.dat (Invertebrates):       26516 entries
mam.dat (Other Mammals):        3498 entries
mhc.dat (MHC proteins):         4563 entries
org.dat (Organelles):          18993 entries
phg.dat (Bacteriophages):       2201 entries
pln.dat (Plants):              18993 entries
pro.dat (Prokaryotes):         51151 entries
rod.dat (Rodents):              7992 entries
unc.dat (Unclassified):          129 entries
vrl.dat (Viruses):             61006 entries
vrt.dat (Other Vertebrates):    4897 entries


REM-TrEMBL (REMaining TrEMBL) contains the entries (50'566) that we do
not want to include in SWISS-PROT.


WEEKLY UPDATES OF TrEMBL AND NON-REDUNDANT DATA SETS
====================================================
Weekly cumulative updates of TrEMBL are available by anonymous FTP and
from the EBI SRS server.
We also produce every week a complete non-redundant protein sequence
collection by providing three compressed files (these are in the
directory /pub/databases/sp_tr_nrdb on the EBI FTP server):
sprot.dat.Z, trembl.dat.Z and trembl_new.dat.Z.


ACCESS/DATA DISTRIBUTION
========================

FTP server:     ftp.ebi.ac.uk/pub/databases/trembl
SRS server:     http://srs.ebi.ac.uk/

TREMBL is also available on the SWISS-PROT CD-ROM.
SWISS-PROT + TREMBL is searchable on the FASTA3, BLAST2 and Bic_sw
servers of the EBI.



TrEMBL HAS BEEN PREPARED BY:
============================

Rolf Apweiler, Kirsty Bates, Margaret Biswas, Sergio Contrino, 
Wolfgang Fleischmann, Gill Fraser, Cathy Gedman, Henning Hermjakob, 
Vivien Junker, Youla Karavidopoulou, Fiona Lang, Minna Lehvaslaiho, 
Michele Magrane, Maria Jesus Martin, Steffen Moeller, Nicoletta 
Mitaritonna, Virginie Mittard, Nicola Mulder, Claire O'Donovan, 
Isabelle Phan, Sandrine Pilbout, Lucia Rodriguez-Monge and Eleanor
Whitfield at the EMBL Outstation - European Bioinformatics Institute 
(EBI) in Hinxton, UK;

Amos Bairoch and Alain Gateau at the Swiss Institute of Bioinformatics
in Geneva, Switzerland.


-----------------------------------------------------------------
Claire O'Donovan                       email:odonovan at ebi.ac.uk
EMBL Outstation EBI
(European Bioinformatics Institute)    URL: http://www.ebi.ac.uk
Wellcome Trust Genome Campus           Tel: +44 (1223) 494460
Hinxton                                fax: +44 (1223) 494468
Cambridge
CB10 1SD UK
-----------------------------------------------------------------





More information about the Embl-db mailing list