<p class=MsoNormal><font size=2 face=Arial><span style='font-size:10.0pt;
font-family:Arial'>Is anyone aware of any software (not commercial), that can
be used for extraction of portions of the<o:p></o:p></span></font></p>
<p class=MsoNormal><span class=SpellE><span class=GramE><font size=2
face=Arial><span style='font-size:10.0pt;font-family:Arial'>flatfiles</span></font></span></span><span
class=GramE><font size=2 face=Arial><span style='font-size:10.0pt;font-family:
Arial'> ?</span></font></span><font size=2 face=Arial><span style='font-size:
10.0pt;font-family:Arial'> (I think that’s how they call the <span
class=SpellE>GenBank</span> entries you get after a search<span class=GramE>) .</span>
For example if in a database <span class=SpellE>flatfile</span> entry, <o:p></o:p></span></font></p>
<p class=MsoNormal><span class=GramE><font size=2 face=Arial><span
style='font-size:10.0pt;font-family:Arial'>you</span></font></span><font
size=2 face=Arial><span style='font-size:10.0pt;font-family:Arial'> have a reference
to coding sequence as “ CDS : 235 ….. 1500 <span class=SpellE>bp</span><span
class=GramE>” ,</span> is there a software that can find the keyword
“CDS” <o:p></o:p></span></font></p>
<p class=MsoNormal><span class=GramE><font size=2 face=Arial><span
style='font-size:10.0pt;font-family:Arial'>in</span></font></span><font size=2
face=Arial><span style='font-size:10.0pt;font-family:Arial'> the <span
class=SpellE>flatfile</span>, and then read and return the string composed of
the letters a c g t, that is between the numbers 235…..1500 <o:p></o:p></span></font></p>
<p class=MsoNormal><span class=GramE><font size=2 face=Arial><span
style='font-size:10.0pt;font-family:Arial'>in</span></font></span><font size=2
face=Arial><span style='font-size:10.0pt;font-family:Arial'> the sequence at
the end of the file ? I am particularly interested, to extract promoter regions
from whole gene entries of <o:p></o:p></span></font></p>