The release notes for GenBank 205.0 (gbrel.txt) have been patched,
in order to (partially) address the lack of statistics for TSA
sequencing projects which make use of the WGS approach:
-r--r--r-- 1 gbupdate gbrel 379581 Dec 16 15:05 gbrel.txt
TSA stats have been added to Section 2.2.8 , and the content of
the patch is provided below.
Mark Cavanaugh
GenBank
NCBI/NLM/NIH/HHS
The following table provides the number of bases and the number of sequence
records for bulk-oriented Transcriptome Shotgun Assembly (TSA) RNA sequencing
projects processed at GenBank, beginning with Release 190.0 in June of 2012.
TSA sequences processed prior to Release 190.0 in June of 2012 were
handled individually, and are present in the gbtsa*.seq files of GenBank
releases (hence, they contribute to the statistics in the first table
of this section).
Subsequent to that date NCBI began processing TSA submissions using an
approach that is analogous to the bulk-oriented approach used for WGS,
assigning a TSA project code (for example: GAAA) to each TSA submission.
Note 1 : Although we provide statistics for bulk-oriented TSA submissions
as of the dates for GenBank releases, TSA files are not distributed or updated
in conjunction with those releases. Rather, per-project TSA data files are
continuously available in the TSA areas of the NCBI FTP site:
ftp://ftp.ncbi.nih.gov/ncbi-asn1/tsaftp://ftp.ncbi.nih.gov/genbank/tsa
Note 2 : NCBI's partner institutions within the INSDC might still choose
to treat TSA submissions as separate records, without a common TSA project
code. In which case, they will not be included in this table.
Note 3 : This table is incomplete. Statistics for Releases 190.0 to
200.0 will be back-filled, as time allows.
Release Date Base Pairs Entries
201 Apr 2014 23632325832 29734989
202 Jun 2014 31707343431 38011942
203 Aug 2014 33676182560 40556905
204 Oct 2014 36279458440 43567759
205 Dec 2014 46056420903 62635617