[BioPerl] Re: [Bioperl-l] gff_string on an HSPI object is not
Bio::DB::GFF friendly
Aaron J.Mackey
amackey at virginia.edu
Fri Jan 9 16:42:27 EST 2004
Forgive me for a stupid question, but does GBrowse (v1.58) now support
GFF3? Namely, can I have start/stops in sub-feature coordinates in my
input GFF3 and expect bp_load_gff.pl to behave properly (i.e. generate
"canonical" top-level coordinates for storage)? I didn't see anything
in the documentation, so I was surprised to see some of the words in
these posts ...
Thanks
On Jan 9, 2004, at 4:09 PM, Mark Wilkinson wrote:
> Cool. I'm heavily into making the HSP's output proper GFF3 today for
> some of the Gbrowse tools that I have been working on, so I will jump
> in
> and do this over the next day or two.
>
> Cheers!
>
> Mark
>
> On Fri, 2004-01-09 at 14:49, Scott Cain wrote:
>> I think everything you wrote below is correct. As far as I know, only
>> Allen and I have been working BTGFF's GFF3 code, and we haven't
>> touched
>> the alignment portion, so I am not surprised that it is wrong. I
>> suppose fixing BTGFF may break some tools, but I know that the chado
>> loader I wrote will handle it correctly :-)
>>
>> Thanks,
>> Scott
>>
>>
>> On Fri, 2004-01-09 at 15:45, Mark Wilkinson wrote:
>>> On Fri, 2004-01-09 at 11:22, Scott Cain wrote:
>>>
>>>> - be sure to use a SO term for the type (ie, match or one of its
>>>> children)
>>>
>>> So... actually the existing implementation of GFF3 in bioperl
>>> from Bio::Tools::GFF->new(-gff_version => 3)
>>> does not generate correctly formatted GFF3 for alignment features,
>>> yeah?
>>>
>>> e.g. for column 9 of an alignment feature I get:
>>>
>>> Target=gi|2828774:54232..54206
>>>
>>> whereas I think I should be getting
>>>
>>> Target=gi|2828774+54232+54206
>>>
>>> In addition, it passes through all sorts of other tags that begin
>>> with
>>> capital letters:
>>>
>>> Bits=46.1;FracId=0.962962962962963
>>>
>>> these should be
>>>
>>> bits=46.1;fracId=0.962962962962963
>>>
>>> if I am reading the spec correctly.
>>>
>>> Finally, the column-3 term that comes out is "similarity", but it
>>> should be
>>> one of the *match terms. Is that also correct?
>>>
>>> Please confirm that I am interpreting the GFF3 spec correctly for
>>> these
>>> Alignment features and I would be happy to go in and fix things
>>> (a.k.a. break
>>> everyone else's tools ;-) )
>>>
>>> Cheerio!
>>>
>>> Mark
>>>
> --
> Mark Wilkinson <markw at illuminae.com>
> Illuminae
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
More information about the Bioperl-l
mailing list