Release 10 of TrEMBL, a protein sequence database supplementing SWISS-PROT

Maria Jesus Martin martin at ebi.ac.uk
Tue Jun 1 04:36:54 EST 1999


INTRODUCTION
============

TrEMBL is a protein sequence database supplementing the SWISS-PROT
Protein Sequence Data Bank. TrEMBL contains the translations of all
coding sequences (CDS) present in the EMBL Nucleotide Sequence
Database that are not yet integrated into SWISS-PROT. TrEMBL can be
considered a preliminary section of SWISS-PROT. SWISS-PROT accession
numbers have been assigned to all TrEMBL entries that should
eventually be upgraded to standard SWISS-PROT quality.


RELEASE 10.0 OF TrEMBL
======================

This TrEMBL release is created from the EMBL Nucleotide Sequence
Database release 58 and contains 244'862 sequence entries,
comprising 66'562'800 amino acids.
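
As a rough illustration of how totals like these could be re-derived from
a downloaded file, the sketch below counts entries and residues. It
assumes the usual SWISS-PROT flat-file conventions (each entry ends with
a "//" line and states its length on the "SQ   SEQUENCE" line); the
function name is hypothetical and this is not the procedure used to
produce the figures above.

```python
import re

# Matches the sequence header line, e.g. "SQ   SEQUENCE   256 AA;  29735 MW; ..."
# (assumed layout; the captured group is the sequence length in residues).
SQ_LINE = re.compile(r"^SQ\s+SEQUENCE\s+(\d+)\s+AA;")


def count_entries_and_residues(lines):
    """Return (number of entries, total amino acids) for flat-file lines.

    Hypothetical helper: an entry is counted at each "//" terminator and
    residues are summed from the lengths declared on the SQ lines.
    """
    entries = 0
    residues = 0
    for line in lines:
        if line.startswith("//"):
            entries += 1
            continue
        match = SQ_LINE.match(line)
        if match:
            residues += int(match.group(1))
    return entries, residues
```

Because the function accepts any iterable of lines, an open text file
(after decompression) can be passed to it directly without loading the
whole file into memory.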

TrEMBL is split into two main sections, SP-TrEMBL and REM-TrEMBL:

SP-TrEMBL (SWISS-PROT TrEMBL) contains the 201'082 entries that
should eventually be incorporated into SWISS-PROT. SWISS-PROT
accession numbers have been assigned to all SP-TrEMBL entries.

SP-TrEMBL is organized into the following subsections:

arc.dat (Archaea):              7408 entries
fun.dat (Fungi):                6679 entries
hum.dat (Human):                8518 entries
inv.dat (Invertebrates):       23653 entries
mam.dat (Other Mammals):        3130 entries
mhc.dat (MHC proteins):         4236 entries
org.dat (Organelles):          16261 entries
phg.dat (Bacteriophages):       1971 entries
pln.dat (Plants):              17352 entries
pro.dat (Prokaryotes):         45992 entries
rod.dat (Rodents):              7480 entries
unc.dat (Unclassified):           44 entries
vrl.dat (Viruses):             53916 entries
vrt.dat (Other Vertebrates):    4442 entries
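
The subsection counts above can be checked against the SP-TrEMBL total
of 201'082 with a few lines of Python. This is only a sanity check on
the published numbers; the dictionary and function names are chosen for
illustration.

```python
# Entry counts per SP-TrEMBL subsection, copied from the list above.
SP_TREMBL_COUNTS = {
    "arc.dat": 7408,
    "fun.dat": 6679,
    "hum.dat": 8518,
    "inv.dat": 23653,
    "mam.dat": 3130,
    "mhc.dat": 4236,
    "org.dat": 16261,
    "phg.dat": 1971,
    "pln.dat": 17352,
    "pro.dat": 45992,
    "rod.dat": 7480,
    "unc.dat": 44,
    "vrl.dat": 53916,
    "vrt.dat": 4442,
}


def total_entries(counts):
    """Return the sum of all per-file entry counts."""
    return sum(counts.values())


if __name__ == "__main__":
    print(total_entries(SP_TREMBL_COUNTS))  # prints 201082
```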


REM-TrEMBL (REMaining TrEMBL) contains the 46'785 entries that we do
not want to include in SWISS-PROT.


WEEKLY UPDATES OF TrEMBL AND NON-REDUNDANT DATA SETS
====================================================

Cumulative updates of TrEMBL are made available every week by anonymous
FTP and from the EBI SRS server.
Each week we also produce a complete non-redundant protein sequence
collection, provided as three compressed files in the directory
/pub/databases/sp_tr_nrdb on the EBI FTP server:
sprot.dat.Z, trembl.dat.Z and trembl_new.dat.Z.
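
For scripted retrieval, the three file locations can be assembled as in
the sketch below. The directory and file names are taken from this
announcement; combining them with the host ftp.ebi.ac.uk and the ftp://
scheme is an assumption, and the function name is hypothetical.

```python
# Host as listed for the EBI FTP server; directory and file names as
# given in this announcement.
FTP_HOST = "ftp.ebi.ac.uk"
NRDB_DIR = "/pub/databases/sp_tr_nrdb"
NRDB_FILES = ("sprot.dat.Z", "trembl.dat.Z", "trembl_new.dat.Z")


def nrdb_urls(host=FTP_HOST, directory=NRDB_DIR, files=NRDB_FILES):
    """Build one ftp:// URL per file of the non-redundant collection."""
    return [f"ftp://{host}{directory}/{name}" for name in files]


if __name__ == "__main__":
    for url in nrdb_urls():
        print(url)
```

The .Z suffix indicates Unix compress format, which Python's gzip module
does not read; the files would need to be unpacked with a tool such as
uncompress or gzip -d before processing.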


ACCESS/DATA DISTRIBUTION
========================

FTP server:     ftp.ebi.ac.uk/pub/databases/trembl
SRS server:     http://srs.ebi.ac.uk/

TrEMBL is also available on the SWISS-PROT CD-ROM.
SWISS-PROT + TrEMBL can be searched on the FASTA3, BLAST2 and Bic_sw
servers of the EBI.



TrEMBL HAS BEEN PREPARED BY:
============================

Rolf Apweiler, Kirsty Bates, Margaret Biswas, Sergio Contrino,
Wolfgang Fleischmann, Gill Fraser, Henning Hermjakob, Vivien Junker,
Youla Karavidopoulou, Fiona Lang, Minna Lehvaslaiho, Michele Magrane,
Maria Jesus Martin, Steffen Moeller, Nicoletta Mitaritonna,
Nicola Mulder, Claire O'Donovan and Eleanor Whitfield
at the EMBL Outstation - European Bioinformatics Institute (EBI) in
Hinxton, UK;
Amos Bairoch and Alain Gateau at the Swiss Institute of Bioinformatics
in Geneva, Switzerland.


-----------------------------------------------------------------
Maria Jesus Martin                     email:martin at ebi.ac.uk
EMBL Outstation EBI
(European Bioinformatics Institute)    URL: http://www.ebi.ac.uk
Wellcome Trust Genome Campus           Tel: +44 (1223) 494408
Hinxton                                fax: +44 (1223) 494468
Cambridge
CB10 1SD UK
-----------------------------------------------------------------




