[Bioperl-l] parsing tblastn using BPlite
Christoffels Alan
calan@mcbsgs1.imcb.nus.edu.sg
Fri, 26 Oct 2001 13:51:33 +0800 (PST)
---1987827149-722225268-1004075493=:198262
Content-Type: TEXT/PLAIN; charset=US-ASCII
Jason,
I have attached the output from the BPlite test.
alan
> you can run the test individually by typing
>
> % perl -I. -w t/BPlite.t
>
> This will tell you what is failing with respect to test 30, as well as the line
> number in the test. If you can send us this output we may be able to
> diagnose the problem.
>
> -jason
>
> On Thu, 25 Oct 2001, Christoffels Alan wrote:
>
> >
> > > Pretty sure that some form of this bug is fixed in current versions - we
> > > are trying to get 1.0 finished by the end of the year, you could
> > > alternatively run off the CVS head or download a developer's release to
> > > get a fix for this.
> >
> > I downloaded the developer's release. One test for BPlite failed when I
> > installed it (Test 30).
> >
> >
> > t/BPlite............FAILED test 30
> > Failed 1/97 tests, 98.97% okay
> >
> > What is test 30? Could this have contributed to the problem I am
> > encountering with BPlite? I get the same error as before when parsing my
> > tblastn results (processing 16K records instead of 33K).
> >
> > Alan
> >
> >
> > >
> > >
> > >
> > > On Thu, 25 Oct 2001, Christoffels Alan wrote:
> > >
> > > > > Hmm, which version of bioperl/BPlite are you using?
> > > >
> > > > $Id: BPlite.pm,v 1.14.2.5 2001/04/08 21:28:44 lapp Exp $
> > > >
> > > >
> > > > Alan
> > > >
> > > >
> > > > >
> > > > > % perldoc -m Bio::Tools::BPlite | grep Id
> > > > >
> > > > > Will help us diagnose if this is a new or old bug.
> > > > >
> > > > > -jason
> > > > > On Wed, 24 Oct 2001, Christoffels Alan wrote:
> > > > >
> > > > > > Hi,
> > > > > >
> > > > > > I am using BPlite to parse a tblastn output file containing multiple
> > > > > > tblastn reports (~30K query sequences).
> > > > > >
> > > > > > > When I count the reports actually being read by the script, I only get
> > > > > > > about 16K. Can anyone tell me what I am doing wrong in this script?
> > > > > >
> > > > > > use Bio::Tools::BPlite;
> > > > > > use strict;
> > > > > > my $file = $ARGV[0];
> > > > > > my $cutoff = $ARGV[1];
> > > > > > my $total_rec = 0;
> > > > > > open(BLAST, "$file");
> > > > > > {
> > > > > > my $report = new Bio::Tools::BPlite(-fh=>\*BLAST);
> > > > > > $total_rec++;
> > > > > > my $QUERY = $report->query;
> > > > > > my $rec = 0;
> > > > > > GG:
> > > > > > while (my $sbjct = $report->nextSbjct) {
> > > > > > $rec++;
> > > > > > my $SUBJ = $sbjct->name;
> > > > > > while (my $hsp = $sbjct->nextHSP) {
> > > > > > my $HSP = $hsp->P;
> > > > > > if ($HSP < $cutoff) {
> > > > > > print "$QUERY\t$SUBJ\t$HSP\n";
> > > > > > last;
> > > > > > }
> > > > > > }
> > > > > > if ($rec ==1) {
> > > > > > last GG;
> > > > > > }
> > > > > > }
> > > > > > last if ($report->_parseHeader == -1);
> > > > > > redo;
> > > > > > }
> > > > > > close(BLAST);
> > > > > > print STDERR "total records .$total_rec.\n";
> > > > > >
> > > > > > Alan Christoffels
> > > > > > Institute of Molecular and Cell Biology
> > > > > > 30 Medical Drive
> > > > > > Singapore
> > > > > > 117609
> > > > > > Tel: 65-8741489
> > > > > >
> > > > > >
> > > > > >
> > > > > > _______________________________________________
> > > > > > Bioperl-l mailing list
> > > > > > Bioperl-l@bioperl.org
> > > > > > http://bioperl.org/mailman/listinfo/bioperl-l
> > > > > >
> > > > >
> > > > > --
> > > > > Jason Stajich
> > > > > Duke University
> > > > > jason@cgt.mc.duke.edu
> > > > >
> > > >
> > >
> >
>
Christoffels Alan
Institute of Molecular and Cell Biology
30 Medical Drive
Singapore
117609
Tel: 65-8741489
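One way to cross-check the report count independently of BPlite is to count the "Query=" lines in the file. This is only a sketch: it assumes every report in the tblastn output has exactly one line starting with "Query=", and the function name count_reports and the sample file contents are made up for illustration.

```shell
#!/bin/sh
# Count BLAST reports in a concatenated output file by counting "Query=" lines.
# Assumption: each report contains exactly one line beginning with "Query=".
count_reports() {
    grep -c '^Query=' "$1"
}

# Tiny stand-in file with two reports (made-up content, not real BLAST output).
sample=$(mktemp)
printf 'TBLASTN\nQuery= seq1\n>hit1\nTBLASTN\nQuery= seq2\n>hit2\n' > "$sample"
count_reports "$sample"
rm -f "$sample"
```

If this number is close to 33K while the script reports about 16K, the reports are in the file and the shortfall is somewhere in how the script walks through them.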
---1987827149-722225268-1004075493=:198262
Content-Type: TEXT/PLAIN; charset=US-ASCII; name=bplitelog
Content-Transfer-Encoding: 7BIT
Content-ID: <Pine.SGI.3.96.1011026135133.198262F@mcbsgs1.imcb.nus.edu.sg>
Content-Description:
1..97
ok 1
ok 2
ok 3
ok 4
ok 5
ok 6
ok 7
ok 8
ok 9
ok 10
ok 11
ok 12
ok 13
ok 14
ok 15
ok 16
ok 17
ok 18
ok 19
ok 20
ok 21
ok 22
ok 23
ok 24
ok 25
ok 26
ok 27
ok 28
ok 29
not ok 30
# Test 30 got: '4e-07' (t/BPlite.t at line 73)
#    Expected: '4.e-07'
ok 31
ok 32
ok 33
ok 34
ok 35
ok 36
ok 37
ok 38
ok 39
ok 40
ok 41
ok 42
ok 43
ok 44
ok 45
ok 46
ok 47
ok 48
ok 49
ok 50
ok 51
ok 52
ok 53
ok 54
ok 55
ok 56
ok 57
ok 58
ok 59
ok 60
ok 61
ok 62
ok 63
ok 64
ok 65
ok 66
ok 67
ok 68
ok 69
ok 70
ok 71
ok 72
ok 73
ok 74
ok 75
ok 76
ok 77
ok 78
ok 79
ok 80
ok 81
ok 82
ok 83
ok 84
ok 85
ok 86
ok 87
ok 88
ok 89
ok 90
ok 91
ok 92
ok 93
ok 94
ok 95
ok 96
ok 97
---1987827149-722225268-1004075493=:198262--
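The attached log reports test 30 as got '4e-07' where '4.e-07' was expected. A rough sketch of why that looks like a difference in how the P value is written rather than in its value: the two strings differ as text but convert to the same number. The helper names same_text and same_number are made up for illustration, and nothing here shows whether test 30 is related to the report-count problem.

```shell
#!/bin/sh
# Values copied from the test log: what t/BPlite.t got and what it expected.
got='4e-07'
expected='4.e-07'

# same_text: succeeds only if the two strings are identical character by character.
same_text() {
    [ "$1" = "$2" ]
}

# same_number: succeeds if both strings convert to the same floating-point value.
# (Hypothetical helper; awk performs the string-to-number conversion.)
same_number() {
    awk -v a="$1" -v b="$2" 'BEGIN { exit !((a + 0) == (b + 0)) }'
}

if same_text "$got" "$expected"; then echo "text: same"; else echo "text: different"; fi
if same_number "$got" "$expected"; then echo "number: same"; else echo "number: different"; fi
```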