Release 14 of TREMBL, a protein sequence database supplementing SWISS-PROT

Maria Jesus Martin martin at
Wed Jun 28 04:12:00 EST 2000


TrEMBL is a protein sequence database supplementing the SWISS-PROT
Protein Sequence Data Bank. TrEMBL contains the translations of all
coding sequences (CDS) present in the EMBL Nucleotide Sequence
Database not yet integrated in SWISS-PROT. TrEMBL can be considered
as a preliminary section of SWISS-PROT. For all TrEMBL entries
which should finally be upgraded to the standard SWISS-PROT
quality, SWISS-PROT accession numbers have been assigned.


The goal of this TrEMBL release is to achieve synchronization with
SWISS-PROT release 39.0. Therefore, all sequence entries present in
SWISS-PROT release 39.0 have been removed from TrEMBL release 13,
further upgrading of existing TrEMBL entries was achieved and only
a very few new entries were incorporated.

TrEMBL release 14 contains 351'834 sequence entries, comprising
100'069'442 amino acids.

TrEMBL is split in two main sections: SP-TrEMBL and REM-TrEMBL.

SP-TrEMBL (SWISS-PROT TrEMBL) contains the entries (297'973) which
should be eventually incorporated into SWISS-PROT. SWISS-PROT accession
numbers have been assigned for all SP-TrEMBL entries.

SP-TrEMBL is organized in subsections:

arc.dat (Archaea):             11776 entries
fun.dat (Fungi):                8877 entries
hum.dat (Human):               13757 entries
inv.dat (Invertebrates):       44786 entries
mam.dat (Other Mammals):        4977 entries
mhc.dat (MHC proteins):         5264 entries
org.dat (Organelles):          25730 entries
phg.dat (Bacteriophages):       2666 entries
pln.dat (Plants):              28878 entries
pro.dat (Prokaryotes):         62006 entries
rod.dat (Rodents):             10226 entries
unc.dat (Unclassified):           31 entries
vrl.dat (Viruses):             72237 entries
vrt.dat (Other Vertebrates):    6762 entries

REM-TrEMBL (REMaining TrEMBL) contains the entries (53'861)
that we do not want to include in SWISS-PROT.


FTP server:
SRS server:

TrEMBL is also available on the SWISS-PROT CD-ROM.
SWISS-PROT + TrEMBL is searchable on the following servers at the EBI:

Bic_sw  (
Scanps  (
MPSrch  (


Rolf Apweiler, Kirsty Bates, Margaret Biswas, Sergio Contrino,
Kirill Degtyarenko, Wolfgang Fleischmann, Gill Fraser, Cathy Gedman,
Henning Hermjakob, Vivien Junker, Youla Karavidopoulou, Paul Kersey,
Fiona Lang, Minna Lehvaslaiho, Michele Magrane, Maria Jesus Martin,
Steffen Moeller, Nicoletta Mitaritonna, Virginie Mittard, Nicola
Mulder, Claire O'Donovan, John F. OM-4Rourke, Isabelle Phan,
Sandrine Pilbout, Lucia Rodriguez-Monge, Eleanor Whitfield and
Allyson Williams at the
EMBL Outstation - European Bioinformatics Institute (EBI)
in Hinxton, UK;
Amos Bairoch and Alain Gateau at the Swiss Institute of Bioinformatics
in Geneva, Switzerland.

Maria Jesus Martin                     email:martin at
EMBL Outstation EBI
(European Bioinformatics Institute)    URL:
Wellcome Trust Genome Campus           Tel: +44 (1223) 494408
Hinxton                                fax: +44 (1223) 494468

More information about the Bioforum mailing list