Further GenBank Complaints
Tom Schneider
toms at fcs260c2.ncifcrf.gov
Wed Apr 8 17:03:01 EST 1992
Now that you all think GenBank is wonderful and all problems are solved
(or about to be solved with the new transition), I would like to point out
a problem I just came across. Two entries contain binding sites for Gal4,
K02115 has
misc_binding 368..384
/note="binding site for GAL4 (positive control protein)
M81879 has
misc_feature 217..233
/note="GAL4 site"
This means that no program can treat these the same way. Are there no rules on
how to enter things in GenBank? Why don't these things have NAMES? You can't
parse notes. (That is, no program can be smart enough to read these notes and
automatically extract the sites.) If this kind of problem continues to be
ignored, there will be a terrible price to pay later on!
Tom Schneider
National Cancer Institute
Laboratory of Mathematical Biology
Frederick, Maryland 21702-1201
toms at ncifcrf.gov
More information about the Bioforum
mailing list