[Biopython-dev] BioSQL bugs

Brad Chapman chapmanb at uga.edu
Tue Mar 30 19:31:37 EST 2004


Hi Mark;

[I ask how we can make tests work without autocommit]

> I think bioperl makes use of functions (see the biosqldb-pg.sql). I was 
> thinking about adding some of these function calls to the DBUtils 
> section to speed up the transactions. Removing some of the constraints 
> will increase the speed as the database grows. This code works fine for 
> small sets, but it quickly slows down (probably because of the checks).

That would be great -- honestly, I am not a database expert at all
(as you can probably tell from my mails). This seems like a good
place to start. I'd definitely appreciate more contributions along
this line from you if you'd be willing to do more work on it. 

[I'm confused about the taxon table as well]
> These values are needed for nested-set representation 
> <http://www.oreillynet.com/pub/a/network/2002/11/27/bioconf.html?page=1>. 
> They are used to quickly limit a branch of a tree. Selecting on the 
> values >= the left and <= the right gives you all the elements under 
> that part of the tree. I don't think it would be easy to add a new 
> element to the tree with out rebuilding the whole representation. 
> Therefore, I just skip it and put in a null (and print out that it 
> wasn't known). This needs to be fixed in the source of the data.

Thanks for the link. That makes good sense now -- it seems as if the
intent is to have the taxonomy information pre-loaded from taxon
tables, and then linking to the taxon table when loading records.

I agree with you -- I think the best way to handle it is to add
functionality (maybe to a mixin class that DBServer can derive from)
to load taxon table information into a database. Then, if this taxon
information exists link to it, otherwise add nulls as you suggest.

Thanks for the explanations!
Brad



More information about the Biopython-dev mailing list