[Bioperl-l] lightweight features

Hilmar Lapp hlapp at gmx.net
Wed Nov 7 23:19:15 UTC 2007


It seems to me that there are applications where you're dealing with  
a huge number of features (such as GFF) and where therefore a  
lightweight object makes tremendous sense. But when you parse a  
genbank file, I'm not sure that's the bottleneck, unless maybe it's a  
large contig with lots of feature annotations.

I guess we'll ultimately want a way to control the type of feature  
being instantiated by a parser, e..g using a factory.

	-hilmar

On Nov 7, 2007, at 4:12 PM, Chris Fields wrote:

> I can see preferring a lightweight simple SF over SF::Generic in the
> next BioPerl dev cycle.  I guess we would just layer split locations
> as simple sub-features/segments, typing when necessary?  That
> shouldn't be much more overhead than creating a layered  
> Location::Split.
>
> chris
>
> On Nov 7, 2007, at 2:09 PM, Jason Stajich wrote:
>
>> It uses hashes there so technically it is not entirely array based.
>>
>> -jason
>> On Nov 7, 2007, at 3:04 PM, Lincoln Stein wrote:
>>
>>> I wonder if it is worth moving to the array-based version more
>>> generally,
>>> then.
>>>
>>> How does the array based feature object deal with tags?
>>>
>>> Lincoln
>>>
>>> On Nov 7, 2007 2:52 PM, Jason Stajich <jason at bioperl.org> wrote:
>>>
>>>> The array-based Bio::SeqFeature::Slim is only about 7% faster than
>>>> Bio::Graphics::Feature so I suspect most of the speedup comes from
>>>> removing
>>>> location objects.
>>>>
>>>> Generic     6.75        --      -37%      -41%
>>>> GraphicsF   4.26       58%        --       -7%
>>>> Slim        3.98       70%        7%        --
>>>>
>>>> this is using code on the lightweight_feature_branch so
>>>> cvs -d:ext:USERNAME at pub.open-bio.org:/home/repository/bioperl co -r
>>>> lightweight_feature_branch -d core_lwf bioperl-live
>>>>
>>>> http://jason.open-bio.org/~jason/bioperl/
>>>> seqfeature_speed.pl<http://jason.open-bio.org/%7Ejason/bioperl/
>>>> seqfeature_speed.pl>
>>>> and the GFF3 file I used to parse
>>>> http://jason.open-bio.org/~jason/bioperl/sgd.gff3.bz2<http://
>>>> jason.open-bio.org/%7Ejason/bioperl/sgd.gff3.bz2>
>>>>
>>>> -jason
>>>>
>>>
>>>
>>>
>>> -- 
>>> Lincoln D. Stein
>>> Cold Spring Harbor Laboratory
>>> 1 Bungtown Road
>>> Cold Spring Harbor, NY 11724
>>> (516) 367-8380 (voice)
>>> (516) 367-8389 (fax)
>>> FOR URGENT MESSAGES & SCHEDULING,
>>> PLEASE CONTACT MY ASSISTANT,
>>> SANDRA MICHELSEN, AT michelse at cshl.edu
>>
>> _______________________________________________
>> Bioperl-l mailing list
>> Bioperl-l at lists.open-bio.org
>> http://lists.open-bio.org/mailman/listinfo/bioperl-l
>
> Christopher Fields
> Postdoctoral Researcher
> Lab of Dr. Robert Switzer
> Dept of Biochemistry
> University of Illinois Urbana-Champaign
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/bioperl-l

-- 
===========================================================
: Hilmar Lapp  -:-  Durham, NC  -:-  hlapp at gmx dot net :
===========================================================








More information about the Bioperl-l mailing list