Release 8 of TrEMBL, a protein sequence database supplementing SWISS-PROT

Maria Jesus Martin martin at ebi.ac.uk
Mon Nov 23 07:58:35 EST 1998


INTRODUCTION
============

TrEMBL is a protein sequence database supplementing the SWISS-PROT
Protein Sequence Data Bank. TrEMBL contains the translations of all
coding sequences (CDS) present in the EMBL Nucleotide Sequence
Database that are not yet integrated in SWISS-PROT. TrEMBL can be
considered a preliminary section of SWISS-PROT. SWISS-PROT accession
numbers have been assigned to all TrEMBL entries that should
eventually be upgraded to standard SWISS-PROT quality.


RELEASE 8.0 OF TrEMBL
=====================

This TrEMBL release is created from the EMBL Nucleotide Sequence
Database release 56 and contains 224'543 sequence entries,
comprising 60'188'661 amino acids.

TrEMBL is split into two main sections, SP-TrEMBL and REM-TrEMBL:

SP-TrEMBL (SWISS-PROT TrEMBL) contains the entries (180'763) that
should eventually be incorporated into SWISS-PROT. SWISS-PROT
accession numbers have been assigned to all SP-TrEMBL entries.

SP-TrEMBL is organized in subsections:

arc.dat (Archaea):              7397 entries
fun.dat (Fungi):                6007 entries
hum.dat (Human):                7688 entries
inv.dat (Invertebrates):       22829 entries
mam.dat (Other Mammals):        2892 entries
mhc.dat (MHC proteins):         3985 entries
org.dat (Organelles):          14230 entries
phg.dat (Bacteriophages):       1824 entries
pln.dat (Plants):              14749 entries
pro.dat (Prokaryotes):         39777 entries
rod.dat (Rodents):              6923 entries
unc.dat (Unclassified):           44 entries
vrl.dat (Viruses):             48472 entries
vrt.dat (Other Vertebrates):    3946 entries


REM-TrEMBL (REMaining TrEMBL) contains the entries (43'780) that we do
not want to include in SWISS-PROT.
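
The entry counts quoted in this release note can be cross-checked with
a few lines of arithmetic. The following is a minimal Python sketch:
the figures are copied from this announcement, while the variable
names are purely illustrative.

```python
# Entry counts per SP-TrEMBL subsection, copied from the list above.
SP_TREMBL_SUBSECTIONS = {
    "arc.dat": 7397,
    "fun.dat": 6007,
    "hum.dat": 7688,
    "inv.dat": 22829,
    "mam.dat": 2892,
    "mhc.dat": 3985,
    "org.dat": 14230,
    "phg.dat": 1824,
    "pln.dat": 14749,
    "pro.dat": 39777,
    "rod.dat": 6923,
    "unc.dat": 44,
    "vrl.dat": 48472,
    "vrt.dat": 3946,
}

# Totals as quoted in the release note.
SP_TREMBL_TOTAL = 180_763
REM_TREMBL_TOTAL = 43_780
RELEASE_TOTAL = 224_543
RELEASE_AMINO_ACIDS = 60_188_661

# Sum of the subsections, and of the two main sections.
sp_sum = sum(SP_TREMBL_SUBSECTIONS.values())
release_sum = sp_sum + REM_TREMBL_TOTAL

# Mean sequence length over the whole release.
mean_length = RELEASE_AMINO_ACIDS / RELEASE_TOTAL

print("SP-TrEMBL subsections sum to", sp_sum)
print("SP-TrEMBL + REM-TrEMBL =", release_sum)
print("Mean sequence length: %.1f amino acids" % mean_length)
```

Both sums agree with the quoted totals, and the mean works out to
roughly 268 amino acids per entry.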


WEEKLY UPDATES OF TrEMBL AND NON-REDUNDANT DATA SETS
====================================================

Weekly cumulative updates of TrEMBL are available by anonymous FTP and
from the EBI SRS server.
Every week we also produce a complete non-redundant protein sequence
collection, provided as three compressed files in the directory
/pub/databases/sp_tr_nrdb on the EBI FTP server:
sprot.dat.Z, trembl.dat.Z and trembl_new.dat.Z.
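
For readers who script their weekly downloads, the three file
locations can be assembled as sketched below. This is only an
illustration: the host name ftp.ebi.ac.uk is taken from the access
section of this announcement, while the ftp:// URL form and the helper
names nrdb_urls and fetch_nrdb are assumptions, not part of any EBI
tool.

```python
import os
import urllib.request

# Host and directory as given in this announcement.
FTP_HOST = "ftp.ebi.ac.uk"
NRDB_DIR = "/pub/databases/sp_tr_nrdb"
NRDB_FILES = ("sprot.dat.Z", "trembl.dat.Z", "trembl_new.dat.Z")


def nrdb_urls():
    """Return the assumed anonymous-FTP URLs of the three files."""
    return ["ftp://%s%s/%s" % (FTP_HOST, NRDB_DIR, name)
            for name in NRDB_FILES]


def fetch_nrdb(target_dir="."):
    """Download the three files into target_dir (hypothetical helper).

    Requires network access and assumes the server still offers the
    files under the names listed above.
    """
    saved = []
    for url in nrdb_urls():
        dest = os.path.join(target_dir, url.rsplit("/", 1)[1])
        urllib.request.urlretrieve(url, dest)
        saved.append(dest)
    return saved


if __name__ == "__main__":
    for url in nrdb_urls():
        print(url)
```

The .Z suffix indicates Unix compress format, which can be unpacked
with uncompress or gzip -d.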


ACCESS/DATA DISTRIBUTION
========================

FTP server:     ftp.ebi.ac.uk/pub/databases/trembl
SRS server:     http://srs.ebi.ac.uk/

TrEMBL is also available on the SWISS-PROT CD-ROM.
SWISS-PROT + TrEMBL can be searched on the FASTA3, BLAST2 and Bic_sw
servers of the EBI.



TrEMBL HAS BEEN PREPARED BY:
============================

Rolf Apweiler, Sergio Contrino, Wolfgang Fleischmann, Gill Fraser,
Henning Hermjakob, Vivien Junker, Stephanie Kappus, Youla Karavidopoulou,
Fiona Lang, Michele Magrane, Maria Jesus Martin, Steffen Moeller,
Nicoletta Mitaritonna and Claire O'Donovan at the EMBL Outstation
- European Bioinformatics Institute (EBI) in Hinxton, UK;
Amos Bairoch and Alain Gateau at the Medical Biochemistry Department of
the University of Geneva, Switzerland.


-----------------------------------------------------------------
Maria Jesus Martin                     email:martin at ebi.ac.uk
EMBL Outstation EBI
(European Bioinformatics Institute)    URL: http://www.ebi.ac.uk
Wellcome Trust Genome Campus           Tel: +44 (1223) 494408
Hinxton                                fax: +44 (1223) 494468
Cambridge
CB10 1SD UK
-----------------------------------------------------------------





