[BioSQL-l] Getting gene name, function etc. from biosql

Nick Loman n.j.loman at bham.ac.uk
Mon Aug 23 19:08:52 UTC 2010


Hi Michael

You need a join on seqfeature_qualifier_value to get this detail. This 
table stores feature qualifiers as key/value pairs, with the 
corresponding key name ('name', 'product', etc.) belonging to the 
relation 'term', so you'll need to join on that too.

HTH

Cheers

Nick


Michael Muratet wrote:
> Greetings
>
> I am working on assembling gene CDS sequences on a medium scale, e.g.,  
> for all S. aureus strains, and I'm trying to find a way to get gene  
> names from biosql entries I created from Genbank files with  
> load_seqdatabase.pl. I'm using a query like this:
>
> SELECT
>      c.seqfeature_id, b.strand, SUBSTR(a.seq, b.start_pos, b.end_pos- 
> b.start_pos+1) as seq
> FROM
>      biosequence a
>      JOIN
>      seqfeature c
>      ON (a.bioentry_id=c.bioentry_id)
>      JOIN
>      location b
>      ON (b.seqfeature_id=c.seqfeature_id)
> WHERE
>      c.type_term_id=12
>      AND
>      c.bioentry_id=221
>
> This seems to work OK to get the sequence with the provision that one  
> needs to reverse complement the sequence if the strand is minus.
>
> But I don't see anything in the schema that will allow me to identify  
> the gene name or product from the seqfeature_id.
>
> Is gene name or product in the schema somewhere and I've missed it?
>
> Thanks
>
> Mike
>
>
> Michael Muratet, Ph.D.
> Senior Scientist
> HudsonAlpha Institute for Biotechnology
> mmuratet at hudsonalpha.org
> (256) 327-0473 (p)
> (256) 327-0966 (f)
>
> Room 4005
> 601 Genome Way
> Huntsville, Alabama 35806
>
>
>
>
>
> _______________________________________________
> BioSQL-l mailing list
> BioSQL-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/biosql-l
>   



More information about the BioSQL-l mailing list