[BioSQL-l] PhyloDB module updates
Hilmar Lapp
hlapp at gmx.net
Mon Feb 18 21:39:59 UTC 2008
On Feb 14, 2008, at 10:32 AM, aaron.j.mackey at gsk.com wrote:
>>> Though speaking of which, what has been bugging me lately is that
>>> they only provide for a single taxonomy to be used (presumably the
>>> one from NCBI, since one column expressly refer to the
>>> NCBI_taxon_ID. I've been thinking about suggesting to generalize
>>> that a little, but the backwards compatibility issues might just
>>> outweigh the benefits.
>
> One approach would be to reuse the existing ontology framework (and
> perhaps include a NCBI-taxonomy-specific "view" to recapitulate the
> existing taxon tables). This would have the added advantage of
> allowing
> DAG-like taxonomies.
Indeed that's a strong possibility too, and in fact for example the
NCBI taxonomy can be downloaded from the OBO foundry in OBO format
(and also in OWL I believe). We are also converting another taxonomy
(Eschmayer's Catalog of Fishes) into an ontology (Teleost Taxonomy
Ontology - TTO - can also be downloaded from the NCBO).
That said, most taxonomies aren't (yet?) available as an ontology,
and for the exercise of converting there are a couple of gotchas that
require manual review of the result.
Maybe the ideal level of support for this (i.e., when a taxonomy
doesn't come as an ontology to begin with) would be in the toolkit
(s), mapping from a taxonomy model (Bio::Taxonomy, or Bio::Tree, for
example?) to the ontology model (Bio::Ontology).
(This is mostly a stream of thoughts, not clear arguments one way or
another.)
> "taxon_id" foreign keys would then instead be
> "term_id" (too bad there's no such thing as column-level aliases/
> views).
Indeed. But there's more than that needed. For example, ranks would
be extra relationships ('has_rank') that would need to be joined in.
-hilmar
--
===========================================================
: Hilmar Lapp -:- Durham, NC -:- hlapp at gmx dot net :
===========================================================
More information about the BioSQL-l
mailing list