IUBio Biosequences .. Software .. Molbio soft .. Network News .. FTP

Genome Factoids Wanted

Jared Roach roach at u.washington.edu
Wed Feb 14 19:29:09 EST 1996


Here is a relevant report I wrote for internal circulation a
few months ago.  Note that a few large chunks of sequeunce 
have been deposited into Genbank since then.


September 30, 1995
	Here's a run-down on the progress of sequencing the human 
genome to date (Genbank Release 90.0), considering only DNA (i.e. no 
mRNA, RNA or ss-DNA):

Number of Contigs Greater Than		Total Length in these Contigs
40 kb		27					 2.60 Mb
30 kb		59					 3.74 Mb
20 kb		86					 4.40 Mb
10 kb		240					 6.50 Mb
 5 kb		631					 9.15 Mb
 1 kb		4115					16.59 Mb

Note on table interpretation: There are 59 contigs greater than 30kb, 
not 27+59.


Considering contigs greater than 10 kb as being useful for building 
the Human Genome sequence, that makes roughly (6.50 Mb)/(3 Gb)=0.22% of 
the genome sequenced to date.


Here are the loci greater than 40kb:
HUMTCRB	684973	TCRb	7
HUMRETBLAS	180388	Retinoblastoma	13
HUMFMR1S	152351		
HSU07000	152141		
HUMIDUR	130000		
HUMNEUROF	100849	Neurofibromatosis	17
HUMTCRADCV	97634	TCRa	14
HSABLGR3	84539		
HSTCRBV (redundant with HUMTCRB)	77743		
HUMHBB	73308	Beta Globin	11
HUMFGLBTK	69363		
HUMMMDBC	68468		
HUMGHCSA	66495	Growth Hormone and Chorionic Somatomammotropin	17
HSMHCAPG	66109	Major Histocompatibility Complex	6
HSABLGR2	59012		
HUMHDABCD	58864		
HUMHPRTB	56737	Hypoxanthine Phosphoribosyl Transferase	X
HUMVITDBP	55136	Vitamin D Binding Protein Gene	
HUMPKD1GEN	53522		
HSG6PDGEN	52173		
HSU24498	47934		
HSU34879	46610		
HSU15177	43599		
HSU13369	42999		
HSU15422	40573		
HUMHDAC	40289		
HSL261H12	40198		
HUMHDAD	40103		

Jared Roach
roach at u.washington.edu
http://weber.u.washington.edu/~roach/



More information about the Biochrom mailing list

Send comments to us at biosci-help [At] net.bio.net