[DAS2] feature search algorithm (was Re: feature locations)
Helt,Gregg
Gregg_Helt at affymetrix.com
Mon Aug 21 16:28:28 UTC 2006
4) Return the entire feature group because in the feature group:
a) a location of the root of the group overlaps (30,40)
b) there is a feature of type exon
c) there is a feature with title "Foo"
Pushing this farther:
Foo: type=transcript, location=(10, 60)
Bip: type=polyA-site location = (50,55)
Bar: type=exon, location=(10, 20)
Baz: type=exon, location=(50, 60)
Search: overlaps(30,40), type=polyA-site, title=Baz
Also returns the feature group because:
Foo meets root overlaps(30,40)
Bip meets type=polyA-site
Baz meets title=Baz
In trying to work backwards from what I feel multi-filter queries should
return, here's the rules that seem to give me what I want:
a) For range filters, the feature group passes the filter if the root of
the feature group meets the range requirement.
b) For non-range filters, the feature group passes the filter if any
feature in the feature group meets the filter requirement.
c) All filters are AND'd together
gregg
> -----Original Message-----
> From: das2-bounces at lists.open-bio.org [mailto:das2-bounces at lists.open-
> bio.org] On Behalf Of Andrew Dalke
> Sent: Saturday, August 19, 2006 7:19 AM
> To: DAS/2
> Subject: Re: [DAS2] feature search algorithm (was Re: feature
locations)
>
> > Given a database:
> >
> > Foo: (10, 60)
> > Bar: (10, 20)
> > Baz: (50, 60)
>
> I'll modify it a bit more. Make it be
>
> Foo: type=transcript, location=(10, 60)
> Bar: type=exon, location=(10, 20)
> Baz: type=exon, location=(50, 60)>
> What does the search for
>
> overlaps(30,40), type==exon, title==Foo
>
> return and why? I can think of three answers:
>
> 1) return everything because in the feature group
>
> there is a feature which overlaps(30,40)
> there is a feature which is of type exon
> there is a feature with title "Foo"
>
> (call this the "each query term must match at least
> one feature in a feature group" algorithm)
>
> 2) return nothing because there is no feature
> which overlaps(30,40) and has type exon and
> has title "Foo"
>
> (call this the "at least one feature must be
> matched by all query terms" algorithm. This is
> the current algorithm)
>
> 3) return nothing because while the root feature
> overlaps(30,40) there is no feature which is both
> of type exon and with title "Foo".
>
> (call this the "range searches are special" algorithm.)
>
>
> Now what does the search for
>
> overlaps(30,40), type==exon, title==Bar
>
> return and why? Using the same three algorithms:
>
> 1) return everything because each of the three
> criteria are matched by at least one feature in
> the feature group
>
> 2) return nothing because no feature matches all
> three criteria.
>
> 3) return everything because the root feature
> overlaps(30,40) and the Bar feature meets the
> other two criteria.
>
>
> Andrew
> dalke at dalkescientific.com
>
> _______________________________________________
> DAS2 mailing list
> DAS2 at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/das2
More information about the DAS2
mailing list