The Beta release of TREMBL, a protein sequence database supplementing the
SWISS-PROT Protein Sequence Data Bank is now available from
ftp.ebi.ac.uk in directory /pub/databases/trembl.
INTRODUCTION
============
TREMBL is a protein sequence database supplementing the SWISS-PROT Protein
Sequence Data Bank. TREMBL contains the translations of all coding
sequences (CDS) present in the EMBL Nucleotide Sequence Database not
integrated in SWISS-PROT. At the moment we treat TREMBL as an independent
dataset in SWISS-PROT format, but shortly TREMBL will become a part of
SWISS-PROT.
BETA RELEASE OF TREMBL
======================
This TREMBL release is created from the EMBL nucleotide sequence release 46
and contains 98'447 sequence entries, comprising 25'879'831 amino acids.
TREMBL is split in two main sections; SP-TREMBL and REM-TREMBL:
SP-TREMBL (SWISS-PROT TREMBL) contains the entries (81'861) which should be
incorporated into SWISS-PROT. SP-TREMBL is partially redundant against
SWISS-PROT, since approximately 40'000 SP-TREMBL entries are only
additional sequence reports of proteins already in SWISS-PROT. We will try
to merge these sequence reports as fast as possible with the already
existing SWISS-PROT entries for these proteins, so as to make SWISS-PROT
and TREMBL completely nonredundant.
SP-TREMBL is organized in subsections:
fun.dat (Fungi): 3772 entries
inv.dat (Invertebrates): 9689 entries
mam.dat (Other Mammals): 1907 entries
mhc.dat (MHC proteins): 2007 entries
org.dat (Organelles): 5241 entries
phg.dat (Bacteriophages): 939 entries
Pln.dat (Plants): 5294 entries
pri.dat (Primates): 6425 entries
pro.dat (Prokaryotes): 15645 entries
rod.dat (Rodents): 5837 entries
unc.dat (Unclassified): 201 entries
vrl.dat (Viruses): 22677 entries
vrt.dat (Other Vertebrates): 2227 entries
REM-TREMBL (REMaining TREMBL) contains the entries (16'586) that we do
not want to include in SWISS-PROT.
ACCESS/DATA DISTRIBUTION
========================
FTP server: ftp.ebi.ac.uk/pub/databases/trembl
TREMBL is also available on the SWISS-PROT CD-ROM.
TREMBL HAS BEEN PREPARED BY:
============================
Rolf Apweiler, Alain Gateau, Vivien Junker, Fiona Lang, and Claire
O'Donovan at the EMBL Outstation - European Bioinformatics Institute
(EBI) in Hinxton;
Amos Bairoch at the Medical Biochemistry Department of the University
of Geneva.
=======================================================================
Rolf Apweiler | Email:apweiler at ebi.ac.uk
EBI - European Bioinformatics Institute | URL: http://www.ebi.ac.uk
Wellcome Trust Genome Campus, Hinxton | Tel: +44 (1223) 494435
Cambridge CB10 1RQ, UK | Fax: +44 (1223) 494968
========================================================================