[Biopython] Bio.trie

Andrew Dalke dalke at dalkescientific.com
Tue Jan 4 21:59:09 EST 2011


On Dec 29, 2010, at 4:24 AM, Michiel de Hoon wrote:
> We would like to know though how many users Bio.trie has, so we can decide whether it is worthwhile to update this module. If you are using Bio.trie, please let us know (preferably via the mailing list). If there are no current users, I suggest that we deprecate and later remove this module from Biopython.

I am not a user, but the other day I was looking through the Python bug list and came across this comment:

   http://bugs.python.org/issue9520

   The best existing implementation I've been able to find so far
   is one in the BioPython. Compared to defaultdict(int) on the
   task of counting words. Dataset 123,981,712 words (6,504,484
   unique), 1..21 characters long:
     * bio.tree - 459 Mb/0.13 Hours, good O(1) behavior
     * defaultdict(int) - 693 Mb/0.32 Hours, poor, almost O(N) behavior
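
For anyone who wants to see what the two approaches in that comparison look like, here is a minimal sketch of the word-counting task done both ways. The trie below is a hypothetical pure-Python one built from nested dicts, not Bio.trie itself, and the helper names (count_with_defaultdict, count_with_trie, trie_get) are made up for illustration. A nested-dict trie like this should not be expected to reproduce the memory or timing figures quoted above; it only shows the shape of the task.

```python
from collections import defaultdict

# Sentinel key that stores the count of words ending at a node.
# Every other key in a node is a single character, so "" cannot collide.
_END = ""


def count_with_defaultdict(words):
    """Count words with a flat hash table: one entry per unique word."""
    counts = defaultdict(int)
    for word in words:
        counts[word] += 1
    return counts


def count_with_trie(words):
    """Count words with a nested-dict trie (illustrative, not Bio.trie).

    Words that share a prefix share the nodes along that prefix.
    """
    root = {}
    for word in words:
        node = root
        for ch in word:
            # Descend to the child for this character, creating it if needed.
            node = node.setdefault(ch, {})
        node[_END] = node.get(_END, 0) + 1
    return root


def trie_get(root, word):
    """Return the stored count for word, or 0 if it was never inserted."""
    node = root
    for ch in word:
        node = node.get(ch)
        if node is None:
            return 0
    return node.get(_END, 0)


words = "acgt acg acgt ttg acg acgt".split()
flat = count_with_defaultdict(words)
trie = count_with_trie(words)
```

Both structures give the same counts for the same input; the difference the bug report is describing is in how memory use and lookup time behave as the number of unique keys grows.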



				Andrew
				dalke at dalkescientific.com




