Script for extracting genes ?

Jeremy Cook cookj at landcare.cri.nz
Mon Jul 22 15:51:10 EST 1996

In article <Pine.SOL.3.93.960709132407.17049A-100000 at evolution> Thomas Sicheritz <thomas at evolution.bmc.uu.se> writes:
>From: Thomas Sicheritz <thomas at evolution.bmc.uu.se>
>Subject: Script for extracting genes ?
>Date: Tue, 9 Jul 1996 13:49:38 +0200


>I need a script/program which can scan an EMBL or Genbank file and
>extract all gene names with start-stop, length, type, direction ...
>(maybe exon information too)

>the output should be something like that: 
>rice chloroplast:
>name      type         dir      start   stop    length
>rps16:x2:2      CDS     R       4487    4635    148
>rps16:x1:2      CDS     R       5514    5553    39
>trnQ    tRNA    R       6615    6687    72
>psbK    CDS     F       7033    7218    185
>psbI    CDS     F       7608    7718    110
>trnS    tRNA    R       7829    7916    87
>ORF100  CDS     F       8349    8651    302

>has anybody seen or written such a script/program (in perl,tcl,C,C++) ?


Are all this required data held in these EMBL or Genbank files, and are these 
two source formats text?


More information about the Embl-db mailing list

Send comments to us at biosci-help [At] net.bio.net