[Bioperl-l] Blast problem

shalabh sharma shalabh.sharma7 at gmail.com
Mon Oct 5 20:38:13 UTC 2009


Hi All,         This not exactly a bioperl query but i thought may be its a
good place to ask.
I am using blastall to blast sequences against my in house database.

one of the query sequence is :-

>JCVI_PEP_1105095073661 /read_id=JCVI_READ_391469 /begin=1 /end=1075
/orientation=-1 /5_prime_stop=TAA /3_prime_stop=0
/orf_id=JCVI_ORF_1105095073660 /ttable=11 /length=358 /ergatis_id=7720
/sample_id=JCVI_SMPL_1103283000001 /sample_name=GS000a /number_of_sites=2
/site_id_1=JCVI_SITE_GS000_S11 /location_1="Sargasso Station 11"
/region_1="Sargasso Sea" /country_1=Bermuda /site_depth_1="5 m"

LLNQESFRATPYTSRIEKQGVTAYTVYNHMLLPAAFGSLEESYHHLKKNVQIWDVAGERQVEISGKDSSKLVQLMTCRDL

SKSKDGRCYYCPILDDEAGIINDPIVLRINENKWWISIADSDVILFAKGLAIGNKFEVKILEPNVDIMAVQGPKSFGLME

KVFGKKITELKFFDFDYFDFEGAKHLIAKSGWSKQGGYEIYVENIESGLKLYDRLFEIGKEFYIRPGCPNLIERIESGLL

SYGNDMDNGDNPFECGFDKFINLDADINFLGKEKLKKIKAEGIKKKLVGVKFDIKEISLSKSIDLKDESSNIIGELRSAC

YSPHFGKVIGIAMIKKPYCEVSQIVKAEIIIFNVKKKK


and exactly the same sequence is there in my database:


>JCVI_PEP_1105095073661

LLNQESFRATPYTSRIEKQGVTAYTVYNHMLLPAAFGSLEESYHHLKKNVQIWDVAGERQVEISGKDSSKLVQLMTCRDLSKSKDGRCYYCPILDDEAGIINDPIVLRINENKWWISIADSDVILFAKGLAIGNKFEVKILEPNVDIMAVQGPKSFGLMEKVFGKKITELKFFDFDYFDFEGAKHLIAKSGWSKQGGYEIYVENIESGLKLYDRLFEIGKEFYIRPGCPNLIERIESGLLSYGNDMDNGDNPFECGFDKFINLDADINFLGKEKLKKIKAEGIKKKLVGVKFDIKEISLSKSIDLKDESSNIIGELRSACYSPHFGKVIGIAMIKKPYCEVSQIVKAEIIIFNVKKKK


But the blast report that i am getting does not give me 100% identity, there
is some region thats not aligned (though) its exactly the same.


portion of a blast report:


Score =  665 bits (1716), Expect = 0.0,   Method: Compositional matrix
adjust.

 Identities = 341/358 (95%), Positives = 341/358 (95%)


Query: 1   LLNQESFRATPYTSRIEKQGVTAYTVYNHMLLPAAFGSLEESYHHLKKNVQIWDVAGERQ 60

                LLNQESFRATPYTSRIEKQGVTAYTVYNHMLLPAAFGSLEESYHHLKKNVQIWDVAGERQ

Sbjct: 1    LLNQESFRATPYTSRIEKQGVTAYTVYNHMLLPAAFGSLEESYHHLKKNVQIWDVAGERQ 60

---------------------------
--------------------------

Query: 241 SYGNDMDNGDNPFECGFDKFINLDADINFXXXXXXXXXXXXXXXXXLVGVKFDIKEISLS 300
                  SYGNDMDNGDNPFECGFDKFINLDADINF
     LVGVKFDIKEISLS
Sbjct: 241  SYGNDMDNGDNPFECGFDKFINLDADINFLGKEKLKKIKAEGIKKKLVGVKFDIKEISLS 300

Query: 301 KSIDLKDESSNIIGELRSACYSPHFGKVIGIAMIKKPYCEVSQIVKAEIIIFNVKKKK 358
                  KSIDLKDESSNIIGELRSACYSPHFGKVIGIAMIKKPYCEVSQIVKAEIIIFNVKKKK
Sbjct: 301  KSIDLKDESSNIIGELRSACYSPHFGKVIGIAMIKKPYCEVSQIVKAEIIIFNVKKKK 358


I would really appreciate if anyone can help me out.

Thanks
Shalabh




More information about the Bioperl-l mailing list