[Bioperl-l] problems with Bio::Tools::GFF

Lincoln Stein lstein at cshl.edu
Sat Nov 8 08:48:09 EST 2003


No, it is still tab delimited, but people consistently screw it up and it 
might be better to split on space, since spaces are no longer allowed in the 
columns.

Lincoln

On Monday 03 November 2003 12:41 pm, Scott Cain wrote:
> Hi Jason and Lincoln,
>
> I have a few concerns with Bio::Tools::GFF. The first is with the method
> _from_gff3_string, which does a split on \t to separate columns.  I
> think the GFF3 spec says it can be space delimited, so that should
> probably be \s+.  Additionally, to split the groups column, it uses
> \s*;\s*, but I think that spaces have to be escaped, therefore, it
> should only split on ; and spaces would indicate a problem (especially
> if one splits on spaces as indicated above).
>
> Finally, it doesn't provide a method of accessing the sequence that is
> optionally at the bottom of the file.  I am not exactly sure how to
> implement that (or I would), but I suspect it will have to be handled in
> the next_feature method.  Of course, the problem with handling it there
> is that it is not a feature.
>
> Scott

-- 
Lincoln Stein
lstein at cshl.edu
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor, NY 11724
(516) 367-8380 (voice)
(516) 367-8389 (fax)


More information about the Bioperl-l mailing list