[Bioperl-l] SeqFeature::Generic 'frame' method
Jason Stajich
jason@chg.mc.duke.edu
Mon, 26 Feb 2001 18:32:20 -0500 (EST)
That does seem safer. I can easily transform from 1,2,3 to 0,1,2
so frame +1 in blast will be equivalent to frame 0 in GFF. But what about
cases where we don't have a frame, is Frame 0 technically correct there?
Strand information for a HSP currently comes from direction
of start/end points on the query seq. Is it concievable that they
wouldn't match? i.e. will a match in -3 frame always correspond to a
match on rev strand of query.... This might be a dilhema... I already
checked in code which breaks because of invalid frames, I will fix so it
is at least transformed to valid number space of gff. Strand will get
ignored for now.
-Jason
On Mon, 26 Feb 2001, Hilmar Lapp wrote:
> Jason Stajich wrote:
> >
> > Currently we only support frames of 0,1,2 as part of gff support I'm sure.
> > However, as I try and map a TBLASTN HSP to a SeqFeature, frames
> > can be -3,-2,-1,0,1,2,3. Does anyone see problems with extending the
> > supported values in the frame method to handle those above. Does this
> > break GFF standards somehow? I can imagine someone parsing a blast
> > report and writing the HSPs out as GFF features so it is quite likely that
> > the two cases shall meet.
> >
>
> Hmm. As long as there's a clear context of an HSP being the result of
> parsing a Blast report there's no problem with extending the set of
> allowed values. However, as soon as this context gets lost, like you
> said (and going through GFF is not the only way to lose that), there's
> a problem, because then frame 1 and frame 1 no longer mean the same
> thing. Bad.
>
> Can we map TBLASTN frame to the definitions we have set forth in
> SeqFeatureI? E.g. TBLASTN parsed frame -3 would become SeqFeatureI
> strand -1, frame 2. TBLASTN parsed frame 1 would be mapped to strand
> 1, frame 0, and so forth. This would keep both the definitions and the
> TBLASTN information.
>
> The only remaining problem is TBLASTN frame 0. What does it mean? If
> it's no frame, then it's equivalent to SeqFeatureI frame 0.
>
> What do you think?
>
> Hilmar
> --
> -------------------------------------------------------------
> Hilmar Lapp email: lapp@gnf.org
> GNF, San Diego, Ca. 92121 phone: +1-858-812-1757
> -------------------------------------------------------------
>
Jason Stajich
jason@chg.mc.duke.edu
Center for Human Genetics
Duke University Medical Center
http://www.chg.duke.edu/