[Bioperl-l] bioperl 1.4 SearchIO doesn't work parsing Blast output

Hubert Prielinger hubert.prielinger at gmx.at
Thu Feb 9 20:13:31 UTC 2006


dear roger,
this error message I got, when I tried to parse Blast output (version 
2.2.12) with bioperl 1.5.1, but it doesn't matter, because I have a lot 
of Blast output files
with version 2.2.13 and for that I don't get any error message.....it 
just doesn't work

Hubert



Roger Hall wrote:

>Guys - I'm looking at the error message:
>
>MSG: no data for midline Query  1   WWWKWRW  7
>STACK Bio::SearchIO::blast::next_result
>/usr/lib/perl5/site_perl/5.8.6/Bio/SearchIO/blast.pm:1151
>STACK toplevel
>/home/Hubert/installed/eclipse/workspace/Database_Search/Blast.pl:21
>
>This is my line of thought:
>1. "no data for midline $_" is a unique message generated by blast.pm in one
>location only at the point of a. reading three lines b. dropping lines with
>spaces only c. identifying the Query, Midline, and Match lines (0 <= $i < 3)
>2. There is a regexp match that fails in order to reach that error message
>3. The $_ value "Query  1   WWWKWRW  7" should not fail the expression
>4. It does anyway
>5. I cannot find the value "Query  1   WWWKWRW  7" anywhere in the blast
>reports
>
>I suspect a newline/chomp/metacharacter issue. Not finding the string
>anywhere has me thoroughly confused - I asked Hubert for the additional
>file, assuming that I didn't have it.
>
>My next thought is to write a quick script to test perl behavior on "Fedora
>Core 9".
>
>Thoughts?
>
>Did I misread the issue entirely? :}
>
>Roger
>
>
>-----Original Message-----
>From: bioperl-l-bounces at lists.open-bio.org
>[mailto:bioperl-l-bounces at lists.open-bio.org] On Behalf Of Chris Fields
>Sent: Thursday, February 09, 2006 10:16 AM
>To: 'Jason Stajich'; 'Hubert Prielinger'
>Cc: bioperl-l at bioperl.org
>Subject: Re: [Bioperl-l] bioperl 1.4 SearchIO doesn't work parsing Blast
>output
>
>
>  
>
>>-----Original Message-----
>>From: Jason Stajich [mailto:jason.stajich at duke.edu] 
>>Sent: Thursday, February 09, 2006 9:13 AM
>>To: Hubert Prielinger
>>Cc: Chris Fields; bioperl-l at bioperl.org
>>Subject: Re: [Bioperl-l] bioperl 1.4 SearchIO doesn't work 
>>parsing Blast output
>>
>>On Feb 8, 2006, at 4:41 PM, Hubert Prielinger wrote:
>>    
>>
>>>hi chris,
>>>thanks, I have upgraded to version 1.5.1 but it isn't still 
>>>      
>>>
>>working, 
>>    
>>
>>>do you have any ohter idea, the problem I have is that I 
>>>      
>>>
>>have to parse 
>>    
>>
>>>a lot of textfiles....
>>>or shall I look for another option to parse those files...
>>>
>>>regards
>>>Hubert
>>>      
>>>
>>The code from Bioperl 1.5.1 works fine for me for blast 
>>2.2.13 reports but unless you post your blast report we can't 
>>really determine the problem.
>>
>>If you are still getting the same error like this I am not 
>>convinced you have upgraded to 1.5.1 which includes a fix in 
>>the fact that NCBI changed the HSP result format to remove 
>>the ':' from the Query/Sbjct prefixes.  We fixed this as soon 
>>as it was apparent sometime in September.
>>
>>    
>>
>>>>>MSG: no data for midline Query  1   WWWKWRW  7
>>>>>STACK Bio::SearchIO::blast::next_result
>>>>>/usr/lib/perl5/site_perl/5.8.6/Bio/SearchIO/blast.pm:1151
>>>>>STACK toplevel
>>>>>
>>>>>          
>>>>>
>>/home/Hubert/installed/eclipse/workspace/Database_Search/Blast.pl:21
>>
>>If you are just getting no results but also no warnings wrt 
>>parsing, are you sure your logic is correct?
>>
>>If you remove your filters do you see all the HSPS?
>>
>>
>>while (my $result = $search->next_result) {
>>     print $result->query_name, "\n";
>>     #iterate over each hit on the query sequence
>>     while (my $hit = $result->next_hit) {
>>	print $hit->name, "\n";
>>         #iterate over each HSP in the hit
>>         while (my $hsp = $hit->next_hsp) {
>>	 print $hsp->evalue, " ", $hsp->length('sbjct'), " ", $hsp- 
>> >hit_string, "\n";	
>>        }
>>    }
>>}
>>    
>>
>
>I tested some of the BLAST results that Hubert sent Roger and me with a
>similar script to the above.  I removed the file parsing logic and it seemed
>to work just fine.  It may very well be a logic issue or that he hasn't
>installed the latest fix.
>    
>It's a funny thing, though.  When I tried using blastcl3 (v. 2.2.13), even
>though the returned output was from nr, the top of the blast output showed
>that it was v2.2.12:  
>
>BLASTP 2.2.12 [Aug-07-2005]
>
>I double-checked my local version and it's definitely v.2.2.13:
>-------------------------------------
>C:\Perl\Scripts>blastcl3 -
>
>blastcl3 2.2.13   arguments:...
>-------------------------------------
>
>If you use RemoteBlast using the same settings, the version in the header
>looks like this:
>
>BLASTP 2.2.13 [Nov-27-2005]
>
>I'm wondering if all the blast executables (blast and netblast) from NCBI
>have text output like v.2.2.12, while the wwwblast outputs a new format
>(2.2.13).  I'll ask blast-help at NCBI about this.
>
>  
>
>>To clarify some stuff -
>>Chris I don't necessarily think the XML is best way forward 
>>for BLAST reports generated locally, it isn't as detailed as 
>>the Text format and it is what most people expect to be able 
>>to scroll through and parse -- it is also harder for the 
>>format to change dramatically if you have a static binary on 
>>your machine =).  I think for remoteblast the XML format 
>>should be the way forward but I expect Bioperl to maintain 
>>support of any plain text BLAST report format that people use 
>>on a regular basis.
>>
>>    
>>
>
>Does XML lack some specific info that text output has?  Didn't know that.  I
>believe that XML should be default in RemoteBlast since it will not break,
>but I agree with you about text output.  I also agree that it will need
>somebody to maintain it constantly, much like RemoteBlast.
>
>  
>
>>-jason
>>    
>>
>>>Chris Fields wrote:
>>>
>>>      
>>>
>>>>My guess is you're running into text parsing problems in 
>>>>Bio::SearchIO::blast.  Upgrade to the latest developer version
>>>>(1.5.1) or
>>>>bioperl-live (CVS), then see the bug below.
>>>>
>>>>http://bugzilla.bioperl.org/show_bug.cgi?id=1934
>>>>
>>>>I think the first problem you ran into is solved in bioperl 1.5.1, 
>>>>the last problem (more recent, not related to the first) has been 
>>>>fixed but hasn't been committed to bioperl-live yet.  The fixed 
>>>>SearchIO::blast is available in the link above, but 
>>>>        
>>>>
>>realize it hasn't 
>>    
>>
>>>>been committed yet and may change.
>>>>
>>>>Christopher Fields
>>>>Postdoctoral Researcher - Switzer Lab Dept. of Biochemistry 
>>>>University of Illinois Urbana-Champaign
>>>>
>>>>
>>>>
>>>>        
>>>>
>>>>>-----Original Message-----
>>>>>From: bioperl-l-bounces at lists.open-bio.org
>>>>>[mailto:bioperl-l-bounces at lists.open-bio.org] On Behalf Of Hubert 
>>>>>Prielinger
>>>>>Sent: Wednesday, February 08, 2006 2:52 PM
>>>>>To: bioperl-l at bioperl.org
>>>>>Subject: [Bioperl-l] bioperl 1.4 SearchIO doesn't work 
>>>>>          
>>>>>
>>parsing Blast 
>>    
>>
>>>>>output
>>>>>
>>>>>Hi,
>>>>>If I want to parse a Blast Output (Version 2.2.12) with 
>>>>>Bio::SearchIO, I get the following error message:
>>>>>
>>>>>MSG: no data for midline Query  1   WWWKWRW  7
>>>>>STACK Bio::SearchIO::blast::next_result
>>>>>/usr/lib/perl5/site_perl/5.8.6/Bio/SearchIO/blast.pm:1151
>>>>>STACK toplevel
>>>>>
>>>>>          
>>>>>
>>/home/Hubert/installed/eclipse/workspace/Database_Search/Blast.pl:21
>>    
>>
>>>>>is that a bug......
>>>>>
>>>>>If I want to parse Blast Output (version 2.2.13), I don't get 
>>>>>anything.....
>>>>>I'm using bioperl 1.4
>>>>>
>>>>>before, I have installed bioperl 1.4, it worked fine 
>>>>>          
>>>>>
>>parsing Blast 
>>    
>>
>>>>>Output (version 2.2.12), but I don't remember which 
>>>>>          
>>>>>
>>bioperl version 
>>    
>>
>>>>>I had installed
>>>>>
>>>>>thanks in advance
>>>>>
>>>>>Hubert
>>>>>
>>>>>
>>>>>
>>>>>_______________________________________________
>>>>>Bioperl-l mailing list
>>>>>Bioperl-l at lists.open-bio.org
>>>>>http://lists.open-bio.org/mailman/listinfo/bioperl-l
>>>>>
>>>>>
>>>>>          
>>>>>
>>>>
>>>>
>>>>        
>>>>
>>>_______________________________________________
>>>Bioperl-l mailing list
>>>Bioperl-l at lists.open-bio.org
>>>http://lists.open-bio.org/mailman/listinfo/bioperl-l
>>>      
>>>
>>--
>>Jason Stajich
>>Duke University
>>http://www.duke.edu/~jes12
>>
>>    
>>
>
>Christopher Fields
>Postdoctoral Researcher - Switzer Lab
>Dept. of Biochemistry
>University of Illinois Urbana-Champaign  
>
>_______________________________________________
>Bioperl-l mailing list
>Bioperl-l at lists.open-bio.org
>http://lists.open-bio.org/mailman/listinfo/bioperl-l
>
>
>  
>




More information about the Bioperl-l mailing list