[Bioperl-l] parsing a BLAST output
Angshu Kar
angshu96 at gmail.com
Sun Dec 4 20:32:20 EST 2005
Hi,
To begin with, I'm new to Bioperl.
Now, I've written the following simple piece of code to parse a WU-Blast
output which filters data *for a given e-value and >50% overlap*.
I'm writing the main algorithm here:
my $blast_report = $ARG[1];
my $threshold_evalue = $ARG[2];
my $in = new Bio::SearchIO(-format => 'blast', -file => $blast_report);
while (my $result = $in -> next_result)
{
while(my $hit = $result->next_hit)
{
if(($line{$hit->name} == $line{$result->query_accession}))
{
next;
}
if($hit->hsp->evalue <= $threshold_evalue)
{
if($hit->hsp->frac_indentical>=0.5)
{
print $line{$result->query_accession} . "\t" .
$line{$hit->name} . "\t" . $hit->hsp-evalue . "\n";
}
}
}
}
My questions are:
1. does the frac_identical gives the measure of % overlap? Or, are there any
other methods?
2. now, i don't have any blast data sets to test my code upon.could any of
the experienced users let me know whether the algorithm is fine?any
tip-offs on any point (from optimization to syntactical errors) are heartily
welcome.
3. could any one please let me know if i can find sample wu-blast outputs to
test my script upon?
Appreciate your guidance.
Thanks,
Angshu
More information about the Bioperl-l
mailing list