[Biopython-dev] SwissProt parsing inconsistency between Bio.SeqIO, Bio.SwissProt

Peter Cock p.j.a.cock at googlemail.com
Tue Apr 21 20:06:56 UTC 2009


On Tue, Apr 21, 2009 at 1:04 PM, Peter Cock <p.j.a.cock at googlemail.com> wrote:
>>> It looks like the SwissProt format has changed, and we
>>> should be parsing the new extended DE lines more
>>> carefully, and splitting these entries up and recording
>>> them in the SeqRecord.annotations dictionary?
>>
>> That sounds reasonable. The dictionary will have to be
>> nested though. Something like this ...
>>

Thinking this over, we should take that SwissProt file and load it
into BioSQL using BioPerl, and see how they dealt with the DE lines,
and try and do the same for Bio.SeqIO in order that loading it into
BioSQL with Biopython gives more or less the same thing.

Peter



More information about the Biopython-dev mailing list