[Genbank-bb] GenBank 205.0 Release Notes : TSA statistics added

Cavanaugh, Mark (NIH/NLM/NCBI) [E] via genbankb@net.bio.net (by cavanaug from ncbi.nlm.nih.gov)
Tue Dec 16 15:11:38 EST 2014


The release notes for GenBank 205.0 (gbrel.txt) have been patched,
in order to (partially) address the lack of statistics for TSA
sequencing projects which make use of the WGS approach:

-r--r--r-- 1 gbupdate gbrel 379581 Dec 16 15:05 gbrel.txt

TSA stats have been added to Section 2.2.8, and the content of
the patch is provided below.

Mark Cavanaugh
GenBank
NCBI/NLM/NIH/HHS


  The following table provides the number of bases and the number of sequence
records for bulk-oriented Transcriptome Shotgun Assembly (TSA) RNA sequencing
projects processed at GenBank, beginning with Release 190.0 in June of 2012.

  TSA sequences processed prior to Release 190.0 in June of 2012 were
handled individually, and are present in the gbtsa*.seq files of GenBank
releases (hence, they contribute to the statistics in the first table
of this section).

  Subsequent to that date, NCBI began processing TSA submissions using an
approach that is analogous to the bulk-oriented approach used for WGS,
assigning a TSA project code (for example: GAAA) to each TSA submission.

  Note 1 : Although we provide statistics for bulk-oriented TSA submissions
as of the dates for GenBank releases, TSA files are not distributed or updated
in conjunction with those releases. Rather, per-project TSA data files are
continuously available in the TSA areas of the NCBI FTP site:

          ftp://ftp.ncbi.nih.gov/ncbi-asn1/tsa
          ftp://ftp.ncbi.nih.gov/genbank/tsa

  Note 2 : NCBI's partner institutions within the INSDC might still choose
to treat TSA submissions as separate records, without a common TSA project
code, in which case they will not be included in this table.

  Note 3 : This table is incomplete. Statistics for Releases 190.0 to
200.0 will be back-filled, as time allows.

Release      Date     Base Pairs     Entries

  201    Apr 2014    23632325832    29734989
  202    Jun 2014    31707343431    38011942
  203    Aug 2014    33676182560    40556905
  204    Oct 2014    36279458440    43567759
  205    Dec 2014    46056420903    62635617
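
As a rough illustration of how the table above can be read, the following
Python sketch parses the five rows and reports how many bases and entries
each release added over the previous one. The names TABLE, parse_rows and
growth are hypothetical helpers chosen for this example; they are not part
of any NCBI tooling, and the row data is copied verbatim from the table.

```python
# Rows copied from the table above: release, month, year, base pairs, entries.
TABLE = """\
201    Apr 2014    23632325832    29734989
202    Jun 2014    31707343431    38011942
203    Aug 2014    33676182560    40556905
204    Oct 2014    36279458440    43567759
205    Dec 2014    46056420903    62635617
"""


def parse_rows(text):
    """Turn whitespace-separated table rows into a list of dicts."""
    rows = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 5:
            # Skip blank lines or anything that is not a data row.
            continue
        release, month, year, bases, entries = fields
        rows.append({
            "release": int(release),
            "date": f"{month} {year}",
            "bases": int(bases),
            "entries": int(entries),
        })
    return rows


def growth(rows):
    """Compute the change between each pair of consecutive releases."""
    deltas = []
    for prev, cur in zip(rows, rows[1:]):
        added_bases = cur["bases"] - prev["bases"]
        deltas.append({
            "release": cur["release"],
            "added_bases": added_bases,
            "added_entries": cur["entries"] - prev["entries"],
            "pct_bases": 100.0 * added_bases / prev["bases"],
        })
    return deltas


if __name__ == "__main__":
    for d in growth(parse_rows(TABLE)):
        print(f"{d['release']}: +{d['added_bases']} bases, "
              f"+{d['added_entries']} entries ({d['pct_bases']:.1f}% more bases)")
```

Because Statistics for Releases 190.0 to 200.0 are not yet in the table, the
sketch can only describe growth from Release 201 onward.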



