[Bioperl-l] Bio/SeqIO/swiss.pm parsing error

Erik er at xs4all.nl
Tue Nov 14 09:40:20 UTC 2006


Btw and FYI:

I have run this regex against two releases each of SwissProt and TrEMBL
(to be version-precise: SwissProt 51 and 50; TrEMBL 33 and 34).

It matched all ID lines.

Thanks :)

Erik

> ...
>     unless(  m{^
>                  ID              \s+     #
>                  (\S+)           \s+     #  $1  entryname
>                  ([^\s;]+);      \s+     #  $2  DataClass
>                  (?:PRT;)?       \s+     #  Molecule Type (optional)
>                  [0-9]+[ ]AA     \.      #  Sequencelength (capture?)
>                  $
>                  }ox ) {
> ...
>
> The molecule type was always PRT and was a carryover from EMBL format
> divisions.
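
For anyone who wants to repeat the check on a small scale, here is a minimal sketch that applies the quoted pattern to two sample ID lines, one with the PRT; field and one without. The helper name parse_id_line and the sample lines are illustrative assumptions, not code from Bio/SeqIO/swiss.pm, and the /o flag is left out because the pattern contains no interpolated variables.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Bio::SeqIO::swiss): applies the quoted
# pattern to one ID line and returns (entry name, data class), or an empty
# list when the line does not match.
sub parse_id_line {
    my ($line) = @_;
    return unless $line =~ m{^
        ID              \s+     #
        (\S+)           \s+     #  capture 1: entry name
        ([^\s;]+);      \s+     #  capture 2: data class
        (?:PRT;)?       \s+     #  molecule type (optional)
        [0-9]+[ ]AA     \.      #  sequence length
        $
    }x;
    return ($1, $2);
}

# Illustrative ID lines in the older and the newer layout (sample values only).
my @lines = (
    'ID   CYC_BOVIN      STANDARD;      PRT;   104 AA.',
    'ID   CYC_BOVIN               Reviewed;         104 AA.',
);

for my $line (@lines) {
    my ($name, $class) = parse_id_line($line);
    print defined $name ? "$name\t$class\n" : "no match: $line\n";
}
```

Both sample lines should match. Note that when PRT; is absent, the two consecutive \s+ parts require at least two whitespace characters between the data class and the sequence length, which the column padding in these sample lines satisfies.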
