Statistics on GenBank, PIR, etc., required

POSTMAST at GUNBRF.BITNET POSTMAST at GUNBRF.BITNET
Tue Oct 8 13:54:00 EST 1991


In message <9110081756.AA14008 at genbank.bio.net>, Geir Egil Hauge asks:
> Do someone have some statistics on EMBL(NUC), GENBANK(NUC) and NBRF(PIR)?
> I would be grateful to receive such statistics. That is: Information about
> previous releases:
>   RELEASENUMBER, DATE, NUMBER OF SEQUENCES, NUMBERS OF AM.ACIDS/NUCLEOTIDES

Here is a table for PIR

Growth of the Protein Sequence Database

Rel. Release        Number of Sequences               Number of  Residues
---- -------- -------------------------------- --------------------------------
Num. Date     PIR1    PIR2     PIR3    Total   PIR1        PIR2        PIR3
---- -------- -------------------------------- --------------------------------
 7   08/15/83  2,372                     2,372    430,262
 8.1 01/12/84  2,511                     2,511    470,158
 1.0 04/24/84  2,676                     2,676    526,466
 2.1 08/13/84  2,784                     2,784    557,759
 3.0 11/15/84  2,898                     2,898    591,717
 4.0 02/25/85  3,061                     3,061    657,289
 5.0 05/17/85  3,182     202             3,384    694,014     50,980
 6.0 08/28/85  3,309                     3,309    738,997
 7.0 11/27/85  3,447     168             3,615    778,218     41,826
 8.0 02/28/86  3,557     303             3,860    809,285     74,515
 9.0 05/28/86  3,712     281             3,993    862,289     62,140
10.0 08/13/86  3,800     448             4,248    890,703     72,293
11.0 12/04/86  4,028     584             4,612    963,031     99,118
12.0 03/17/87  4,253     497             4,750  1,029,056     74,390
13.0 06/30/87  4,525     890             5,415  1,116,951    186,015
14.0 09/30/87  4,721   1,697             6,418  1,118,149    393,114
15.0 12/28/87  4,931   1,865             6,796  1,264,388    420,173
16.0 03/31/88  5,251   2,145             7,396  1,384,621    471,361
17.0 06/30/88  5,407   3,181             8,588  1,448,175    761,222
18.0 09/30/88  5,556   3,582             9,138  1,510,026    867,973
19.0 12/31/88  5,722   4,805            10,527  1,568,922  1,233,133
20.0 03/31/89  5,980   5,178            11,158  1,681,392  1,320,790
21.0 06/30/89  6,158   6,318            12,476  1,766,843  1,639,179
22.0 09/30/89  6,330   7,083            13,413  1,850,665  1,856,749
23.0 12/31/89  6,550   7,822            14,372  1,942,966  2,034,937
24.0 03/31/90  6,858   9,666            16,524  2,065,365  2,562,028
25.0 06/30/90  7,068  10,663            17,731  2,139,528  2,859,535
26.0 09/30/90  7,235  12,216    6,363   25,814  2,221,416  3,348,438  1,779,096
27.0 12/31/90  7,747  12,607    6,444   26,798  2,386,941  3,417,043  1,816,684
28.0 03/31/91  7,967  12,607    7,658   28,232  2,469,675  3,416,095  2,190,727
29.0 06/30/91  8,309  12,601   10,985   31,895  2,633,415  3,390,420  3,067,214

The figures for this table are from various database files:
PIR1.NAM, PIR2.NAM, and PIR3.NAM  (for the Release Number and Date, also the
number of sequences and residues in each dataset)
------------------------------------------------------------------------
                                 Dr. John S. Garavelli
                                 Database Coordinator
                                 Protein Identification Resource
                                 National Biomedical Research Foundation
                                 Washington, DC  20007
                                 POSTMASTER at GUNBRF.BITNET



More information about the Bioforum mailing list