[Bioperl-l] parsing only the summary part of a blast report

Zhao, David [PRI] DZhao1@prius.jnj.com
Wed, 11 Oct 2000 12:47:53 -0400


This message is in MIME format. Since your mail reader does not understand
this format, some or all of this message may not be legible.

------_=_NextPart_001_01C033A2.FFC457B0
Content-Type: text/plain

Thanks. However, BPlite is not in Bioperl 0.6.2 release, is it? I can't find
it under Tools. I do find Ian Korf's BPlite module, is this the one you were
talking about?
David


> -----Original Message-----
> From:	Jason Stajich [SMTP:jason@chg.mc.duke.edu]
> Sent:	Wednesday, October 11, 2000 6:19 AM
> To:	Hilmar Lapp
> Cc:	Zhao, David [PRI]; Bioperl
> Subject:	Re: [Bioperl-l] parsing only the summary part of a blast
> report
> 
> see the perldoc for Bio::Tools::BPlite for all the complete api.
> 
> This script will parse and print all of the hits in your report.
> 
> #!/usr/local/bin/perl -w
> 
> use Bio::Tools::BPlite;
> 
> my $report = new Bio::Tools::BPlite(-fh=>\*STDIN);
> print $report->query(), ",", $report->database(), "\n";
> while ( my $sbjct = $report->nextSbjct ) {
>     print "name is ", $sbjct->name(), "\n";
> }
> 
> Alternatively you can use Bio::Tools::Blast, but you will find it slower
> and more memory intensive.  It does support more features so it depends on
> what your needs are.
> 
> On Wed, 11 Oct 2000, Hilmar Lapp wrote:
> 
> > > "Zhao, David [PRI]" wrote:
> > > 
> > > Hi there,
> > > How can I parse the summary part of a blast report using bioperl
> > > modules? such as:
> > > 
> > 
> > I'm almost sure you can't using Blast.pm. Maybe you can with BPlite
> > (development trunk only). I know that Blast.pm takes a signifant time to
> > parse long reports (i.e., with  hundreds of alignments), but we haven't
> > checked yet whether BPlite is significantly faster in such cases. I
> guess
> > you're asking because you bother about the time lost in parsing the
> > alignments, although you needed only the summary data which are already
> > present in the hit list.
> > 
> > BTW you can pass a significance threshold to Blast.pm, and although I'm
> > not sure I think it won't parse those alignments beyond the significance
> > threshold.
> 
> [Regarding Bio::Tools::Blast]
> 
> I'm pretty sure the signifigance threshold only applies to when you are
> running Blast not parsing a report.  In fact you should not put a
> signif=>$value in your parameter hash if you are just parsing a report
> file.
> 
> > 
> > 	Hilmar
> > -- 
> > -----------------------------------------------------------------
> > Hilmar Lapp                                email: hlapp@gmx.net
> > NFI Vienna, IFD/Bioinformatics             phone: +43 1 86634 631
> > A-1235 Vienna                                fax: +43 1 86634 727
> > -----------------------------------------------------------------
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l@bioperl.org
> > http://bioperl.org/mailman/listinfo/bioperl-l
> > 
> 
> Jason Stajich
> jason@chg.mc.duke.edu
> http://galton.mc.duke.edu/~jason/
> (919)684-1806 (office) 
> (919)684-2275 (fax) 
> Center for Human Genetics - Duke University Medical Center
> http://wwwchg.mc.duke.edu/ 
> 
> 
> 

------_=_NextPart_001_01C033A2.FFC457B0
Content-Type: text/html
Content-Transfer-Encoding: quoted-printable

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2//EN">
<HTML>
<HEAD>
<META HTTP-EQUIV=3D"Content-Type" CONTENT=3D"text/html; =
charset=3Dus-ascii">
<META NAME=3D"Generator" CONTENT=3D"MS Exchange Server version =
5.5.2651.75">
<TITLE>RE: [Bioperl-l] parsing only the summary part of a blast =
report</TITLE>
</HEAD>
<BODY>

<P><FONT COLOR=3D"#000080" FACE=3D"Comic Sans MS">Thanks. However, =
BPlite is not in Bioperl 0.6.2 release, is it? I can't find it under =
Tools. I do find Ian Korf's BPlite module, is this the one you were =
talking about?</FONT></P>

<P><FONT COLOR=3D"#000080" FACE=3D"Comic Sans MS">David</FONT>
</P>
<BR>

<P><FONT SIZE=3D1 FACE=3D"Arial">-----Original Message-----</FONT>
<BR><B><FONT SIZE=3D1 FACE=3D"Arial">From:&nbsp;&nbsp;</FONT></B> <FONT =
SIZE=3D1 FACE=3D"Arial">Jason Stajich =
[SMTP:jason@chg.mc.duke.edu]</FONT>
<BR><B><FONT SIZE=3D1 FACE=3D"Arial">Sent:&nbsp;&nbsp;</FONT></B> <FONT =
SIZE=3D1 FACE=3D"Arial">Wednesday, October 11, 2000 6:19 AM</FONT>
<BR><B><FONT SIZE=3D1 =
FACE=3D"Arial">To:&nbsp;&nbsp;&nbsp;&nbsp;</FONT></B> <FONT SIZE=3D1 =
FACE=3D"Arial">Hilmar Lapp</FONT>
<BR><B><FONT SIZE=3D1 =
FACE=3D"Arial">Cc:&nbsp;&nbsp;&nbsp;&nbsp;</FONT></B> <FONT SIZE=3D1 =
FACE=3D"Arial">Zhao, David [PRI]; Bioperl</FONT>
<BR><B><FONT SIZE=3D1 =
FACE=3D"Arial">Subject:&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</FONT>=
</B> <FONT SIZE=3D1 FACE=3D"Arial">Re: [Bioperl-l] parsing only the =
summary part of a blast report</FONT>
</P>

<P><FONT SIZE=3D2 FACE=3D"Arial">see the perldoc for Bio::Tools::BPlite =
for all the complete api.</FONT>
</P>

<P><FONT SIZE=3D2 FACE=3D"Arial">This script will parse and print all =
of the hits in your report.</FONT>
</P>

<P><FONT SIZE=3D2 FACE=3D"Arial">#!/usr/local/bin/perl -w</FONT>
</P>

<P><FONT SIZE=3D2 FACE=3D"Arial">use Bio::Tools::BPlite;</FONT>
</P>

<P><FONT SIZE=3D2 FACE=3D"Arial">my $report =3D new =
Bio::Tools::BPlite(-fh=3D&gt;\*STDIN);</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">print $report-&gt;query(), =
&quot;,&quot;, $report-&gt;database(), &quot;\n&quot;;</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">while ( my $sbjct =3D =
$report-&gt;nextSbjct ) {</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&nbsp;&nbsp;&nbsp; print &quot;name =
is &quot;, $sbjct-&gt;name(), &quot;\n&quot;;</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">}</FONT>
</P>

<P><FONT SIZE=3D2 FACE=3D"Arial">Alternatively you can use =
Bio::Tools::Blast, but you will find it slower</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">and more memory intensive.&nbsp; It =
does support more features so it depends on</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">what your needs are.</FONT>
</P>

<P><FONT SIZE=3D2 FACE=3D"Arial">On Wed, 11 Oct 2000, Hilmar Lapp =
wrote:</FONT>
</P>

<P><FONT SIZE=3D2 FACE=3D"Arial">&gt; &gt; &quot;Zhao, David =
[PRI]&quot; wrote:</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; &gt; </FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; &gt; Hi there,</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; &gt; How can I parse the summary =
part of a blast report using bioperl</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; &gt; modules? such as:</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; &gt; </FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; </FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; I'm almost sure you can't using =
Blast.pm. Maybe you can with BPlite</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; (development trunk only). I know =
that Blast.pm takes a signifant time to</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; parse long reports (i.e., =
with&nbsp; hundreds of alignments), but we haven't</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; checked yet whether BPlite is =
significantly faster in such cases. I guess</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; you're asking because you bother =
about the time lost in parsing the</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; alignments, although you needed =
only the summary data which are already</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; present in the hit list.</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; </FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; BTW you can pass a significance =
threshold to Blast.pm, and although I'm</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; not sure I think it won't parse =
those alignments beyond the significance</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; threshold.</FONT>
</P>

<P><FONT SIZE=3D2 FACE=3D"Arial">[Regarding Bio::Tools::Blast]</FONT>
</P>

<P><FONT SIZE=3D2 FACE=3D"Arial">I'm pretty sure the signifigance =
threshold only applies to when you are</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">running Blast not parsing a =
report.&nbsp; In fact you should not put a</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">signif=3D&gt;$value in your parameter =
hash if you are just parsing a report</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">file.</FONT>
</P>

<P><FONT SIZE=3D2 FACE=3D"Arial">&gt; </FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; &nbsp;&nbsp;&nbsp;&nbsp;&nbsp; =
Hilmar</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; -- </FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; =
-----------------------------------------------------------------</FONT>=

<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; Hilmar =
Lapp&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&n=
bsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&n=
bsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; email: =
hlapp@gmx.net</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; NFI Vienna, =
IFD/Bioinformatics&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;=
&nbsp;&nbsp;&nbsp; phone: +43 1 86634 631</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; A-1235 =
Vienna&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;=
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;=
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; fax: +43 1 86634 =
727</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; =
-----------------------------------------------------------------</FONT>=

<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; =
_______________________________________________</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; Bioperl-l mailing list</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; Bioperl-l@bioperl.org</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; <A =
HREF=3D"http://bioperl.org/mailman/listinfo/bioperl-l" =
TARGET=3D"_blank">http://bioperl.org/mailman/listinfo/bioperl-l</A></FON=
T>
<BR><FONT SIZE=3D2 FACE=3D"Arial">&gt; </FONT>
</P>

<P><FONT SIZE=3D2 FACE=3D"Arial">Jason Stajich</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">jason@chg.mc.duke.edu</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial"><A =
HREF=3D"http://galton.mc.duke.edu/~jason/" =
TARGET=3D"_blank">http://galton.mc.duke.edu/~jason/</A></FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">(919)684-1806 (office) </FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">(919)684-2275 (fax) </FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial">Center for Human Genetics - Duke =
University Medical Center</FONT>
<BR><FONT SIZE=3D2 FACE=3D"Arial"><A =
HREF=3D"http://wwwchg.mc.duke.edu/" =
TARGET=3D"_blank">http://wwwchg.mc.duke.edu/</A> </FONT>
</P>
<BR>
<BR>

</BODY>
</HTML>
------_=_NextPart_001_01C033A2.FFC457B0--