[BioSQL-l] Re: Singapore schema

Hilmar Lapp hlapp at gmx.net
Thu Feb 20 09:23:48 EST 2003


On Thursday, February 20, 2003, at 05:13  AM, McCulloch, Alan wrote:

> Hi Hilmar,
>
> Is there a Singapore schema yet ? - just checked
> http://cvs.bioperl.org/cgi-bin/viewcvs/viewcvs.cgi/biosql-schema/doc/ 
> ?cv
> sroot=biosql
> but the diagram looks like the old one , although Aaron's docs are
> there.

The diagram is still old. It will not change before I get back home and  
get time to update it. (It is generated by ERwin.)

You need to look at the DDL in sql/biosqldb-{mysql,pg}.sql. The Oracle  
version will also be updated only later.

>   For example , how tightly coupled to this schema are BioJava and
> BioPerl going to be ?
>
>   Is the aim to provide essentially a Genbank repository , or are there
> plans to go beyond that ?

Both. There are different use cases, ranging from a temporary sequence  
cache, a SRS like repository, a toolkit's persistence store, to a rich  
gene annotation database.

>   We have been advised that for our bioinformatics developers to be
> productive with BioJava and BioPerl , we need to change our existing  
> in-house database schema so that it conforms to the BioSQL schema.

This depends entirely on what you want to use Bioperl and Biojava for.  
Biosql is their primary persistence store. If you want to have  
persistent Biojava or bioperl objects, you need to provide a  
biosql-compliant schema in order to use the native APIs. Otherwise  
there is no need for that.

>   we would not be wanting to create dependencies of the Bio* projects  
> on
> any particular database
>   technology (i.e. relational) or schema.
>

The toolkits themselves are not dependent, but their persistence  
mechanisms are.

>   Is it really true that if BioJava and BioPerl want to talk to a DB it
> has in practice to be BioSQL ?

Yes, unless you want to write your own persistence layer. In both  
bioperl and biojava we are striving to provide a relatively generic  
persistence framework, but the physical object-relational mapping will  
have to come from you if you don't run a biosql-compliant database.  
Also, both toolkits have plans to support other schemas, for instance  
Chado. Those are only plans yet though ...

>  Do you have any feel for what people with existing databases are  
> doing ? (I would have thought , writing drivers for their own DB's to  
> support sequence-object construction  with their own DB as source. How  
> hard is it to set up these drivers ?)

It depends on how incompatible your relational model is with the  
bioperl and biojava object models. If your schema is very similar to  
biosql, the idea is you just change the physical OR mapping and you're  
done. Generic OR mapping is hard though, so I'd expect a bumpy ride if  
your schema is a lot different.

> I have a slight concern that BioSQL appears to be being seen by some  
> as a de facto interoperability standard, as well as a handy Genbank  
> repository,

It is meant to be one of the interoperability standards, yes.

> and this might introduce some unfortunate dependencies.

Possibly. We've tried other technologies (BioCORBA), with little  
success over the long run.

	-hilmar

>
>   These might seem like strange questions, but there is some  
> uncertainty
> here about the
>   scope of BioSQL and its relationship to the other Open* projects, and
> we are
>   unsure what to make of the suggestion that we need to change our
> database
>   schema. Perhaps an addition to the docs of an overview section might
> set us straight.
>
>
> Alan McCulloch
> Bioinformatics Software Engineer
> AgResearch NZ
>
> PS
>
> Thanks for all your work ! - the BioSQL schema is great and we have  
> used
> alot of its ideas recently.
>
> PPS sorry for replying to you directly but for some reason I am unable
> to
> post to the list, I get errors about suspicious message header.
>
>
>
> -----Original Message-----
> From: biosql-l-request at open-bio.org
> [mailto:biosql-l-request at open-bio.org]
> Sent: Thursday, 20 February 2003 6:00 a.m.
> To: biosql-l at open-bio.org
> Subject: BioSQL-l Digest, Vol 2, Issue 5
>
>
> Send BioSQL-l mailing list submissions to
> 	biosql-l at open-bio.org
>
> To subscribe or unsubscribe via the World Wide Web, visit
> 	http://open-bio.org/mailman/listinfo/biosql-l
> or, via email, send a message with subject or body 'help' to
> 	biosql-l-request at open-bio.org
>
> You can reach the person managing the list at
> 	biosql-l-owner at open-bio.org
>
> When replying, please edit your Subject line so it is more specific  
> than
> "Re: Contents of BioSQL-l digest..."
>
>
> Today's Topics:
>
>    1. BioSQL release candidate (Hilmar Lapp)
>
>
> ----------------------------------------------------------------------
>
> Date: Wed, 19 Feb 2003 17:38:45 +0800
> From: Hilmar Lapp <hlapp at gmx.net>
> To: Biosql <biosql-l at open-bio.org>
> Cc: Bioperl <bioperl-l at bioperl.org>
> Subject: [BioSQL-l] BioSQL release candidate
> Message-ID: <F0CD1C46-43ED-11D7-9655-000393B4BFF6 at gmx.net>
> Content-Type: text/plain; charset=US-ASCII; format=flowed
> MIME-Version: 1.0 (Apple Message framework v551)
> Content-Transfer-Encoding: 7bit
> Precedence: list
> Message: 1
>
> We're preparing BioSQL for a schema freeze and subsequent release in
> Singapore at the BioHackathon. Based on a lot of feedback and review
> with and by Aaron I introduced quite a number of changes to the schema.
> I committed a release candidate for both MySQL and PostgreSQL to the
> repository.
>
> Aaron also committed a schema documentation in doc/schema-overview.txt.
>
> Please take a look at the changes and make your voice heard if you see
> us heading for trouble or if you've got any comments and/or suggestions
> ...
>
> 	-hilmar
>
> -- 
> -------------------------------------------------------------
> Hilmar Lapp                            email: lapp at gnf.org
> GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
> -------------------------------------------------------------
>
> ------------------------------
>
> _______________________________________________
> BioSQL-l mailing list
> BioSQL-l at open-bio.org http://open-bio.org/mailman/listinfo/biosql-l
>
>
> End of BioSQL-l Digest, Vol 2, Issue 5
> **************************************
> =======================================================================
> Attention: The information contained in this message and/or attachments
> from AgResearch Limited is intended only for the persons or entities
> to which it is addressed and may contain confidential and/or privileged
> material. Any review, retransmission, dissemination or other use of, or
> taking of any action in reliance upon, this information by persons or
> entities other than the intended recipients is prohibited by AgResearch
> Limited. If you have received this message in error, please notify the
> sender immediately.
> =======================================================================
>
>
-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------



More information about the BioSQL-l mailing list