[Bioperl-l] Query seq with gaps in blast

Julio Fernandez Banet julipao at terra.es
Wed Jun 4 16:41:08 EDT 2003


Hello.
I would like to know how to get the subject sequence from an alignment with
gaps included.
As I'm trying to get the symbols in both sequences (query and subject) where
the homolog symbol is  " " I need to get the subject sequebce with gap to
get "-" when there and insertion in the query sequence.
It seems I get the gaps in the query (not sure) as The program gets "-"
symbol everytime there's a deletion and it only gives me an error with the
insertion.
Thanks a lot

My sequences to check the insertions are:
>seq 1
tcgccggcntcccggccaccctggtccccaggccccgtaccnctccgccaacgcctcgtcatggcngccccacnng
ccagcccagctggagggcacntgggcctgcccngcctgtgccctgcggctgctttgcagccacggaaacagcncac
cgtgctgctgggctttgaggcccaaccctggactgcggctgcctgggcgctatgaggtccgggcagaagtgggcaa
tggcgtgtccaggcacaacctctcctgcagcttttgacgtggtctccccagtggctgggctgcgggtcatcttacc
ctgccccccgcgacggccgccttctacgtgcccaccaacggctcagcctttggtgctccaggtggacttctggtgc
caacgccacggccacggcttcgcttggcctgggggcagtgtcagcgcccgcttttgagaaatgtctgccctgccct
ggtggccacctttcgtgcccggcttgcccctggga
>seq2
Tcgccggctcccggccaccctggtccccgggccccgtacctctccgccaacgcctcgtcatggctgccccacttgc
cagcccagctggagggcacttgggcctgccctgcctgtgccctgcggctgcttgcagccacggaacagctcaccgt
gctgctgggcttgaggcccaaccctggactgcggctgcctgggcgctatgaggtccgggcagaggtgggcaatggc
gtgtccaggcacaacctctcctgcagctttgacgtggtctccccagtggctgggctgcgggtcatctaccctgccc
cccgcgacggccgcctctacgtgcccaccaacggctcagccttggtgctccaggtggactctggtgccaacgccac
ggccacggctcgctggcctgggggcagtgtcagcgcccgctttgagaatgtctgccctgccctggtggccaccttc
gtgcccggctgcccctggga

The alignment is:
Query: 148   tcgccggcntcccggccaccctggtccccaggccccgtaccnctccgccaacgcctcgtc
207
             |||||||| |||||||||||||||||||| ||||||||||| ||||||||||||||||||
Sbjct: 24446 tcgccggc-tcccggccaccctggtccccgggccccgtacctctccgccaacgcctcgtc
24504

                   
Query: 208   atggcngccccacnngccagcccagctggagggcacntgggcctgcccngcctgtgccct
267
             ||||| |||||||  ||||||||||||||||||||| ||||||||||| |||||||||||
Sbjct: 24505 atggctgccccacttgccagcccagctggagggcacttgggcctgccctgcctgtgccct
24564

                   
Query: 268   gcggctgctttgcagccacggaaacagcncaccgtgctgctgggctttgaggcccaaccc
327
             |||||||| |||||||||||| |||||| |||||||||||||||| ||||||||||||||
Sbjct: 24565 gcggctgc-ttgcagccacgg-aacagctcaccgtgctgctgggc-ttgaggcccaaccc
24621

                   
Query: 328   tggactgcggctgcctgggcgctatgaggtccgggcagaagtgggcaatggcgtgtccag
387
             ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Sbjct: 24622 tggactgcggctgcctgggcgctatgaggtccgggcagaggtgggcaatggcgtgtccag
24681

                   
Query: 388   gcacaacctctcctgcagcttttgacgtggtctccccagtggctgggctgcgggtcatct
447
             ||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Sbjct: 24682 gcacaacctctcctgcagc-tttgacgtggtctccccagtggctgggctgcgggtcatc-
24739

                   
Query: 448   taccctgccccccgcgacggccgccttctacgtgcccaccaacggctcagcctttggtgc
507
             ||||||||||||||||||||||||| |||||||||||||||||||||||||| |||||||
Sbjct: 24740 taccctgccccccgcgacggccgcc-tctacgtgcccaccaacggctcagcc-ttggtgc
24797

                   
Query: 508   tccaggtggacttctggtgccaacgccacggccacggcttcgcttggcctgggggcagtg
567
             ||||||||||| |||||||||||||||||||||||||| |||| ||||||||||||||||
Sbjct: 24798 tccaggtggac-tctggtgccaacgccacggccacggc-tcgc-tggcctgggggcagtg
24854

                   
Query: 568   tcagcgcccgcttttgagaaatgtctgccctgccctggtggccacctttcgtgcccggct
627
             ||||||||||| |||||| ||||||||||||||||||||||||||| ||||||||||||
Sbjct: 24855 tcagcgcccgc-tttgag-aatgtctgccctgccctggtggccacc-ttcgtgcccggc-
24910

                   
Query: 628   tgcccctggga 638
             |||||||||||
Sbjct: 24911 tgcccctggga 24921

-- 




More information about the Bioperl-l mailing list