Gene identification through BAC end sequencing

Eugen Buehler buehler at
Fri Aug 8 10:18:09 EST 1997

The Arabidopsis thaliana Genome Center at the University of Pennsylvania
is sequencing chromosome 1 of Arabidopsis in collaboration with groups at
Stanford and Berkeley.  As part of this project, the ATGC is using BAC
end sequencing to aid in mapping and selection of clones for sequencing.
A benefit of this method has been the early identification of genes
through similarity to known protein and nucleotide sequences.

The Arabidopsis thaliana Genome Center has updated its table of protein
and nucleotide neighbors for BAC end sequences generated at ATGC.
Sequence similarity data come from the GenBank dbGSS entry for each end
sequence.  Several groups have already identified genes of interest
using these data.  The table can be found at our home page:

There are two pages available, one containing hyperlinks to sequences:

and the other containing a linkless table (for slower, wimpy computers):

!!!!!!!!! PLEASE NOTE !!!!!!!!!

If you wish to obtain a particular BAC we have end sequenced, please
contact the maintainer of that library.  Information about whom to
contact is available at:


Eugen Buehler
buehler at
Arabidopsis thaliana Genome Center
University of Pennsylvania
