[Bioperl-l] parse blast report for the best evalue
DeeGee
gdorjee at hotmail.com
Mon Apr 9 19:40:02 UTC 2007
thank you, Chris.
^ ^*
Chris Fields wrote:
>
> You should probably use sort_hits() with a coderef that sorts by
> evalue to ensure that you retrieve the best evalue (significance()
> for hits) (see POD for Bio::Search::Result::ResultI). You could then
> do something like:
>
> my $hit;
>
> unless ($result->no_hits_found) {
> # pass coderef to sort by evalue
> $result->sort_hits(\&sort_by_evalue);
> # retrieve first (best) hit
> $hit = $result->next_hit;
> }
>
> # do whatever you want with the best Hit
>
> If you plan on retaining data from hits over a ton of different
> reports it may be best (memory-wise) to only retain the data you want
> for each hit instead of retaining the actual object. For instance,
> if you only care about the description and evalue set up a simple
> data structure to house what you want by the query data instead of
> retaining all the extra stuff in the Hit object you don't need (all
> the HSP data, etc).
>
> chris
>
> On Apr 8, 2007, at 11:18 PM, DeeGee wrote:
>
>>
>> hi all,
>> i'm trying to parse a blast report using Bio::SearchIO as follows,
>> but since
>> this blast report is generated with many against many (database) fasta
>> sequences, there're many individual blast reports (one for each of the
>> sequence from the query file). i was wondering if there is a way to
>> get only
>> the best hit (with best evalue) from each one of them.
>>
>> ##### part of my script ######
>> my $in = new Bio::SearchIO(-format => 'blast', -file =>
>> $blast_report);
>> while( my $result = $in->next_result ) {
>> while( my $hit = $result->next_hit ) {
>> ...........
>>
>> thanks.
>>
>>
>> --
>> View this message in context: http://www.nabble.com/parse-blast-
>> report-for-the-best-evalue-tf3545784.html#a9898358
>> Sent from the Perl - Bioperl-L mailing list archive at Nabble.com.
>>
>> _______________________________________________
>> Bioperl-l mailing list
>> Bioperl-l at lists.open-bio.org
>> http://lists.open-bio.org/mailman/listinfo/bioperl-l
>
> Christopher Fields
> Postdoctoral Researcher
> Lab of Dr. Robert Switzer
> Dept of Biochemistry
> University of Illinois Urbana-Champaign
>
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/bioperl-l
>
>
--
View this message in context: http://www.nabble.com/parse-blast-report-for-the-best-evalue-tf3545784.html#a9907757
Sent from the Perl - Bioperl-L mailing list archive at Nabble.com.
More information about the Bioperl-l
mailing list