[Bioperl-l] seq truncation by Bio::Tools::Run::Signalp

Hilmar Lapp hlapp at gmx.net
Wed May 26 12:11:03 EDT 2004


Marc, what you spotted is clearly a bug. Do you have cvs write access? 
If you don't yet, would you be comfortable with getting it and 
committing your changes yourself?

	-hilmar

On Wednesday, May 26, 2004, at 06:39  AM, Marc Logghe wrote:

> Hi,
> I don't know what your experience is with Bio::Tools::Run::Signalp, 
> but I was feeling a little uncomfortable about the current behaviour 
> of this package. The issue is that when you do:
> my @signals = $factory->run($seq)
> your sequence gets truncated to 50 aa !!!
> AFAIK, one expects of Bio::Tools::Run::* that your seq object remains 
> intact and that you only obtain your returned features, no ?
> Anyhow, this behaviour can be fixed with the patch below. Basically, 
> it passes the '-trunc 50' parameter to the signalp executable, which 
> has the same effect as first truncating your query seq to 50 aa.
> HTH,
> Marc
>
> *** /usr/lib/perl5/site_perl/5.6.1/Bio/Tools/Run/Signalp.pm	Wed May 26 
> 15:21:55 2004
> --- Signalp.pm 	Wed May 26 13:20:23 2004
> *************** sub run {
> *** 179,184 ****
> --- 179,190 ----
>               $self->throw("cannot use filehandle");
>           }
>
> +   #first 50 aa is enough for signalp
> + 	if ($seq->length>50){
> +
> + 	    my $sub_seq = $seq->subseq(1, 50);
> + 	    $seq->seq($sub_seq);
> + 	}
>   	my $infile1 = $self->_writeSeqFile($seq);
>
>   	$self->_input($infile1);
> *************** sub run {
> *** 191,197 ****
> --- 197,207 ----
>   	my $in  = Bio::SeqIO->new(-file => $seq, '-format' =>'fasta');
>   	my $infile1;
>
> +   #first 50 aa is enough for signalp
>   	while ( my $tmpseq = $in->next_seq() ) {
> +
> + 	    my $sub_seq = $tmpseq->length > 50 ? $tmpseq->subseq(1,50) : 
> $tmpseq->seq;
> + 	    $tmpseq->seq($sub_seq);
>   	    $infile1 = $self->_writeSeqFile($tmpseq);
>   	}
>
> *************** sub _run {
> *** 233,239 ****
>        my ($self)= @_;
>
>        my ($tfh1,$outfile) = 
> $self->io->tempfile(-dir=>$self->tempdir());
> !      my $str =$self->executable." -t euk -trunc 50 
> ".$self->{'input'}." > ".$outfile;
>        my $status = system($str);
>        $self->throw( "Signalp call ($str) crashed: $? \n") unless 
> $status==0;
>
> --- 243,249 ----
>        my ($self)= @_;
>
>        my ($tfh1,$outfile) = 
> $self->io->tempfile(-dir=>$self->tempdir());
> !      my $str =$self->executable." -t euk ".$self->{'input'}." > 
> ".$outfile;
>        my $status = system($str);
>        $self->throw( "Signalp call ($str) crashed: $? \n") unless 
> $status==0;
>
> ***********************************************************
> Marc Logghe, Ph.D.
> Senior Scientist
> Scientific Computing Group
> Devgen nv
> Technologiepark 9
> As of March 1st, 2004 : Technologiepark 30
> B - 9052 Ghent-Zwijnaarde
> Belgium
> Tel: +32 9 324 24 88
> Fax: +32 9 324 24 25
>
>> **** DISCLAIMER 
>> **********************************************************
>> "This e-mail and any attachments thereto may contain information
>> which is confidential and/or protected by intellectual property
>> rights and are intended for the sole use of the recipient(s) named 
>> above.
>> Any use of the information contained herein (including, but not 
>> limited to,
>> total or partial reproduction, communication or distribution in any 
>> form)
>> by persons other than the designated recipient(s) is prohibited.
>> If you have received this e-mail in error, please notify the sender 
>> either
>> by telephone or by e-mail and delete the material from any computer.
>> Thank you for your cooperation."
>>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
>
-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------




More information about the Bioperl-l mailing list