Duplicate use of entry codes

Cary O'Donnell ODONNELL at UK.AC.AFRC.ARCB
Thu Aug 27 11:55:00 EST 1992


** PLEASE IGNORE PRECEDING MAIl - WRONG FILE SENT IN ERROR **


K.P. Lam at University of Kent found the following in PIR33:

+++++++++++++++++++   begin scripts  ++++++++++++++++++++++++++++++

=========
pir2.seq
=========
>F1;A26616
Cytochrome-b5 reductase (EC 1.6.2.2), placental - Human (fragment)
/LGHMVLFPVWFLYSLLMKLFQRSTPAITLESPDIKYPLRLIDREIISHDTRRFRFALPSPQHILGLPVGQHIYLSARI
DGNLVVRPYTPISSDDDKGFVDLVIKVYFKDTHPKFPAGGKMSQYLESMQIGDTIEFRGPSGLLVYQGKGKFAIRPDKK
SNPIIRTVKSVGMIAGGTGITPMLQVIRAIMKDPDDHTVCHLLFANQTEKDILLRPELEELRNKHSARFKLWYTLDWDY
GQGFVNEEMIRDHLPPPE
EEPLVLMCGPPPMIQYACLPNLDHVGHPTERCFVF*

=========

pir3.seq
=========
>F1;A26616
*Cytochrome-b5 reductase (EC 1.6.2.2), placental - Human (fragment)
/LGHMVLFPVWFLYSLLMKLFQRSTPAITLESPDIKYPLRLIDREIISHDTRRFRFALPSPQHILGLPVGQHIYLSARI
DGNLVVRPYTPISSDDDKGFVDLVIKVYFKDTHPKFPAGGKMSQYLESMQIGDTIEFRGPSGLLVYQGKGKFAIRPDKK
SNPIIRTVKSVGMIAGGTGITPMLQVIRAIMKDPDDHTVCHLLFANQTEKDILLRPELEELRNKHSARFKLWYTLDRAP
EAWDYGQGFVNEEMIRDH
LPPPEEEPLVLMCGPPPMIQYACLPNLDHVGHPTERCFVF*

+++++++++++++++++++   end scripts  ++++++++++++++++++++++++++++++

Clearly, this shows that there has been a duplicated use of the sequence ID 
for two very similar (almost identical!!) sequences.  

----------------------------------------------------------------------------
I can fish out both entries using XQS, and FETCH in the GCG package. Just in
case I was going mad, I thought I would check using the PIR3.SEQ with GCG's 
FASTA on PIR32 and then on PIR33 :

PIR32:
======
The best scores are:				            init1 initn opt..

Pir3:Js0468  *Cytochrome-b5 reductase (EC 1.6.2.2), place...1558  1558  1558
Pir2:A23896  Cytochrome-b5 reductase (EC 1.6.2.2) - Bovine  1487  1487  1487
Pir2:A40495  Cytochrome-b5 reductase (EC 1.6.2.2) - Rat     1443  1443  1443
Pir1:Rdhub5  Cytochrome-b5 reductase (EC 1.6.2.2) - Human   1434  1434  1434
Pir2:B26616  Cytochrome-b5 reductase (EC 1.6.2.2), hepati...1398  1398  1402
Pir2:A26616  Cytochrome-b5 reductase (EC 1.6.2.2), placen...1210  1210  1511
  ^^^^^^^^^
   There it is

The best scores are:				            init1 initn opt..

>> Pir3:A26616  *Cytochrome-b5 reductase (EC 1.6.2.2), place...1559  1559  1559
Pir3:Js0468  *Cytochrome-b5 reductase (EC 1.6.2.2), place...1558  1558  1558
Pir2:A23896  Cytochrome-b5 reductase (EC 1.6.2.2) - Bovine  1487  1487  1487
Pir2:A40495  Cytochrome-b5 reductase (EC 1.6.2.2) - Rat     1443  1443  1443
Pir1:Rdhub5  Cytochrome-b5 reductase (EC 1.6.2.2) - Human   1434  1434  1434
Pir2:B26616  Cytochrome-b5 reductase (EC 1.6.2.2), hepati...1398  1398  1402
>> Pir2:A26616  Cytochrome-b5 reductase (EC 1.6.2.2), placen...1210  1210  1511

Is this an accidental use of duplicating entry codes? It can cause problems
when creating derivative databases, as KPL was doing.

Cary


*****************************************************************************
AFRC Computing Division    JANET   : ODONNELL at UK.AC.AFRC.ARCB
West Common                INTERNET: ODONNELL at ARCB.AFRC.AC.UK
Harpenden                  GlobeNET: 00d 21m 45s W   51d 48m 30s N 
Herts AL5 2JE              Tel: (+44) 582 762271 xt 229 Fax: (+44) 582 761710
U.K.                       (AFRC = Agricultural & Food Research Council)
-----------------------------------------------------------------------------



More information about the Proteins mailing list