[Biopython-dev] [Bug 1747] GenBank parser is very slow and memory hungry for large input files

bugzilla-daemon at portal.open-bio.org
Wed Mar 9 11:34:45 EST 2005


http://bugzilla.open-bio.org/show_bug.cgi?id=1747

biopython-bugzilla at maubp.freeserve.co.uk changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
 Attachment #198 is|0                           |1
           obsolete|                            |



------- Additional Comments From biopython-bugzilla at maubp.freeserve.co.uk  2005-03-09 11:34 -------
Created an attachment (id=200)
 --> (http://bugzilla.open-bio.org/attachment.cgi?id=200&action=view)
Patch to the class _Scanner in Bio/GenBank/__init__.py

This version of the patch completely replaces the Martel code in the class
_Scanner.

In my opinion, the feature parsing code is now simpler than it was in the
previous patch.

However, with all the new code added to deal with the "GenBank header section",
the feed function is probably ripe for splitting up into sub-functions.
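
As a rough illustration of what such a split might look like, here is a minimal
sketch. It is not the code from the attached patch: the function names
(read_header, read_features, read_sequence, scan_record) and the toy record are
hypothetical, and it only assumes the usual GenBank layout (feature keys
indented by five spaces, a FEATURES line, an ORIGIN line and a closing "//").
It reads the handle line by line, so the whole file never needs to be held in
memory at once.

```python
from io import StringIO


def read_header(handle):
    """Collect header lines up to (not including) the FEATURES line."""
    header = []
    for line in handle:
        if line.startswith("FEATURES"):
            break
        header.append(line.rstrip("\n"))
    return header


def read_features(handle):
    """Collect (key, location) pairs until the ORIGIN line."""
    features = []
    for line in handle:
        if line.startswith("ORIGIN"):
            break
        # Assumption: a feature key line has exactly five leading spaces;
        # qualifier lines are indented further and are skipped here.
        if line.startswith("     ") and line[5:6].strip():
            key, location = line.split(None, 1)
            features.append((key, location.strip()))
    return features


def read_sequence(handle):
    """Join the sequence blocks until the '//' record terminator."""
    chunks = []
    for line in handle:
        if line.startswith("//"):
            break
        # Drop the leading base count, keep the sequence blocks.
        chunks.append("".join(line.split()[1:]))
    return "".join(chunks)


def scan_record(handle):
    """Hypothetical top-level function: one sub-function per section."""
    header = read_header(handle)
    features = read_features(handle)
    sequence = read_sequence(handle)
    return header, features, sequence


# Toy record, invented purely for this example.
SAMPLE = (
    "LOCUS       TEST0001      10 bp    DNA\n"
    "DEFINITION  Toy record.\n"
    "FEATURES             Location/Qualifiers\n"
    "     source          1..10\n"
    '                     /organism="synthetic"\n'
    "     gene            2..8\n"
    "ORIGIN\n"
    "        1 acgtacgtac\n"
    "//\n"
)

header, features, sequence = scan_record(StringIO(SAMPLE))
print(len(header), features, sequence)
```

Each sub-function consumes the handle up to the start of the next section, so
the top-level function only has to call them in order.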

This has been fully tested with Tests/test_GenBank.py as shipped with
Biopython 1.40b.

As before, the patch was created on Windows XP with the Cygwin diff command against
Bio/GenBank/__init__.py revision 1.53 (shipped with Biopython 1.30 and 1.40b):

diff my_version.py vcs_version.py > patch.txt




