[BioSQL-l] Relationship between Gene CDS and mRNA Seqfeatures

Peter biopython at maubp.freeserve.co.uk
Wed Nov 3 16:20:05 UTC 2010


On Wed, Nov 3, 2010 at 3:49 PM, Nicholas Thrower <throwern at msu.edu> wrote:
>
> Hello all,
>
> I¹m new to this list so I hope this question hasn¹t been asked before. I
> couldn¹t find an archive to search.
>
> I have a biosql schema in oracle loaded with several genbank entries
> including TAIR9. I would like some help determining the relationship between
> mrna and cds features that belong to a single Gene entry.
>
> My seqfeature table has the following entries.
>
> OID         RANK    DISPLAY_NAME    ENT_OID     TYPE_TRM_OID
> SOURCE_TRM_OID
> 666269  22747   Gene                       643516        626447
> 626431
> 666270  22748   Mrna                       643516        626451
> 626431
> 666271  22749   Mrna                       643516        626451
> 626431
> 666272  22750   Cds                          643516        626455
> 626431
> 666273  22751   Cds                          643516        626455
> 626431
>
> I¹m trying to correlate CDS to mRNA so that I can display them. All of the
> seqfeatures have the same exact locus_tag qualifier and I can¹t find any
> other qualifiers noting this relationship. Do I need to make an assumption
> based on seqfeature id, or rank in order to determine the relationship
> between mRNA and CDS.
>
> Here is an example from TAIR of what this data could look like.
> http://arabidopsis.org/servlets/TairObject?id=134549&type=locus
>
> How do I know which CDS and mRNA are paired together?
>
> -Nick

In a GenBank file this relationship between CDS and mRNA or gene
features is implicit from the order in the file. I think the rank is used in
BioSQL to record the feature order, so you'd have to look at that.
(This is assuming the Bio* wrapper you used to load the GenBank
file into BioSQL isn't making any inferences like this for you).

P.S. Which Bio* wrapper are you using? BioPerl?

Peter




More information about the BioSQL-l mailing list