[Bioperl-l] off by one problem w/exonerate & search2gff.pl ?
Charles Hauser
chauser at duke.edu
Wed Jul 9 15:46:44 EDT 2003
All,
I am trying to reconcile coordinates generated by exonerate w/ those
using search2gff.pl.
$ search2gff.pl -i /usr/local/pkg/exonerate/exonerate-0.7.0/s1_100h_500s -f exonerate -t hit
Below are sample outputs from exonerate and search2gff.pl.
query target
cigar line 532->661 58163->58298
gff output 533->660 58171->58298
Query appears to be off by 1, whereas target start is off by 8???
Ideas why the target values are off?
Charles
exonerate output:
C4 Alignment display:
Model: est2genome
Raw score: 457
Aligned positions 532->661 of query
Aligned positions 58163->58298 of target
Query: 832009G03.x1
Target: scaffold_1
533 : GCCACGTCTCCAGTTTGAGGCCGCAAGCGTCTACCTCATCTCCCGGACATAGCCGGGGTT : 591
||||||| || ||| ||||||||||||||||| ||||||||||||||||||||||||||
58164 : GCCACGTATCGAGTCTGAGGCCGCAAGCGTCTGCCTCATCTCCCGGACATAGCCGGGGTC : 58222
592 : GGTTCG--CA-C-GGCACGGCCCGCCATA---TGTCCAGCCACGTCCCGCTGGCAAGGAC : 644
|||||| || | ||||||||||||| || ||||||||||||||||||| |||| ||
58223 : GGTTCGTACAGCAGGCACGGCCCGCCGTACTGTGTCCAGCCACGTCCCGCTTCCAAGCAC : 58282
645 : AAGCCCAGCCAGGCAA : 661
|||| ||| || ||||
58283 : AAGCACAG-CAAGCAA : 58298
cigar: 832009G03.x1 532 661 + scaffold_1 58163 58298 + 457 M 66 D 2 M 2 D 1 M 1 D 1 M 16 D 3 M 36 I 1 M 7
search2gff.pl output:
scaffold_1 EXONERATE similarity 58171 58298 457 + 0 Target "Sequence:832009G03.x1" 533 660
More information about the Bioperl-l
mailing list