[Bioperl-l] Re: [SO-devel] Re: GFF3 preliminary
Richard Durbin
rd at sanger.ac.uk
Fri Feb 21 10:21:19 EST 2003
I have an edited version of SO which I will check back in shortly if it
does not clash with too many other SO changes since Monday, in which the
SOFA:SOFA tags that Michael and I put in last week are replaced by
either SOFA:junction or SOFA:region. This makes explicit which SOFA
terms are junctions and which are regions.
Note that SO/SOFA themselves don't mandate whether you use base
coordinates (as in GFF) or interbase coordinates (as perhaps in some
wizzy new format from Berkeley). i.e. SO/SOFA are not formats, but
ontologies of terms, and you can and should use them with other formats
than GFF. The distinction between regions and junctions is real
however, and will have to be reflected in any format. Some things
inherently have extent covering bases (regions) and some things are
boundaries between neighbouring bases (junctions).
Richard
Chris Mungall wrote:
>
> On Fri, 21 Feb 2003, Richard Durbin wrote:
>
>
>>length
>>just before the start of the ATG and after the stop codon. There are SO
>>terms for the start_codon and stop_codon but you really don't want to
>>use them to define the start and end. e.g. Michael's 4 base start
>>codons, but also many start and stop codons are broken by introns.
>>There is no reason not to use junctions.
>
>
> Ah, ok - spaced on the distinction between codon/junction there
>
> I'm still not comfortable with the hack to get round not having interbase
> coordinates, where certain subgraphs of SOFA are treated as flanking bases
> and the rest is treated as inclusive bases. I guess it's necessary for GFF
> though.
>
> should we make this explicit in the names; eg all types for features have
> flanking-base coordinates have the suffix '_junction'?
>
>
>>Richard
>>
>>
>>
>>
>
>
More information about the Bioperl-l
mailing list