Here is a relevant report I wrote for internal circulation a
few months ago. Note that a few large chunks of sequeunce
have been deposited into Genbank since then.
September 30, 1995
Here's a run-down on the progress of sequencing the human
genome to date (Genbank Release 90.0), considering only DNA (i.e. no
mRNA, RNA or ss-DNA):
Number of Contigs Greater Than Total Length in these Contigs
40 kb 27 2.60 Mb
30 kb 59 3.74 Mb
20 kb 86 4.40 Mb
10 kb 240 6.50 Mb
5 kb 631 9.15 Mb
1 kb 4115 16.59 Mb
Note on table interpretation: There are 59 contigs greater than 30kb,
not 27+59.
Considering contigs greater than 10 kb as being useful for building
the Human Genome sequence, that makes roughly (6.50 Mb)/(3 Gb)=0.22% of
the genome sequenced to date.
Here are the loci greater than 40kb:
HUMTCRB 684973 TCRb 7
HUMRETBLAS 180388 Retinoblastoma 13
HUMFMR1S 152351
HSU07000 152141
HUMIDUR 130000
HUMNEUROF 100849 Neurofibromatosis 17
HUMTCRADCV 97634 TCRa 14
HSABLGR3 84539
HSTCRBV (redundant with HUMTCRB) 77743
HUMHBB 73308 Beta Globin 11
HUMFGLBTK 69363
HUMMMDBC 68468
HUMGHCSA 66495 Growth Hormone and Chorionic Somatomammotropin 17
HSMHCAPG 66109 Major Histocompatibility Complex 6
HSABLGR2 59012
HUMHDABCD 58864
HUMHPRTB 56737 Hypoxanthine Phosphoribosyl Transferase X
HUMVITDBP 55136 Vitamin D Binding Protein Gene
HUMPKD1GEN 53522
HSG6PDGEN 52173
HSU24498 47934
HSU34879 46610
HSU15177 43599
HSU13369 42999
HSU15422 40573
HUMHDAC 40289
HSL261H12 40198
HUMHDAD 40103
Jared Roach
roach at u.washington.eduhttp://weber.u.washington.edu/~roach/