[Biojava-l] Local aln - contig assembly
    Khalil El Mazouari 
    khalil.elmazouari at gmail.com
       
    Sun Jun  9 19:32:32 UTC 2013
    
    
  
Hi,
I am trying to assemble overlapping sequences (direct and reverse) via local alignment. I am only interested in local alignments with 100% identity.
Which parameters (substitution matrix, gap penalties, ...) should I use to get local alignments with 100% identity?
Any other suggestions for assembling overlapping sequences in Java are welcome.
Thanks
khalil
   // Nucleotide substitution matrix (NUC 4.2) and gap open/extension penalties.
   SubstitutionMatrix<NucleotideCompound> matrix = SubstitutionMatrixHelper.getNuc4_2();
   SimpleGapPenalty gapP = new SimpleGapPenalty();
   gapP.setOpenPenalty((short) 5);
   gapP.setExtensionPenalty((short) 1);
   // Local (Smith-Waterman) pairwise alignment of query against target.
   SequencePair<DNASequence, NucleotideCompound> psa =
         Alignments.getPairwiseAlignment(query, target,
               PairwiseSequenceAlignerType.LOCAL, gapP, matrix);
========
Local Alignment Identity: 97.84688995215312%
query     GGGGAAAACACGAAAGGCCCTTGGTGGAGGCGCTTGAGACGGTGACAAGGGTTCCCTGGC  68
          |||||| || |||  ||||||||||||||||||||||||||||||| |||||||||||||
target    GGGGAAGAC-CGATGGGCCCTTGGTGGAGGCGCTTGAGACGGTGACCAGGGTTCCCTGGC 417
query     CCCAGTAGTCAAAGGTCCGTGAGGAGCTCCACTTGTGTGCACAGTAATATGTGGCTGAGT 128
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
target    CCCAGTAGTCAAAGGTCCGTGAGGAGCTCCACTTGTGTGCACAGTAATATGTGGCTGAGT 477
query     CCACAGGGTCCATGTTGGTCATTGTAAGGACCACCTGGTCTTTGGAGGTGTCCTTGGTGA 188
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
target    CCACAGGGTCCATGTTGGTCATTGTAAGGACCACCTGGTCTTTGGAGGTGTCCTTGGTGA 537
query     TGGTGAGCCTGCTCTTCAGAGATGGGCTGTAGCGCTTATCATCATTCCAATAAATGAGTG 248
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
target    TGGTGAGCCTGCTCTTCAGAGATGGGCTGTAGCGCTTATCATCATTCCAATAAATGAGTG 597
query     CAAGCCACTCCAGGGCCTTTCCTGGGGGCTGACGGATCCAGCCCACACCCACTCCACTAG 308
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
target    CAAGCCACTCCAGGGCCTTTCCTGGGGGCTGACGGATCCAGCCCACACCCACTCCACTAG 657
query     TGCTGAGTGAGAACCCAGAGAAGGTGCAGGTCAGCGTGAGGGTCTGTGTGGGTTTCACCA 368
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
target    TGCTGAGTGAGAACCCAGAGAAGGTGCAGGTCAGCGTGAGGGTCTGTGTGGGTTTCACCA 717
query     GCGTAGGACCAGACTCCTTCAAGGTGATCTGGGCCATGGCCGGCTGGGCCGCGAGTAA 426
          |||||||||||||||||||||||||| ||||||||| |||||||||| |||| |||||
target    GCGTAGGACCAGACTCCTTCAAGGTG-TCTGGGCCA-GGCCGGCTGG-CCGCAAGTAA 772
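An overlap with 100% identity and no gaps is simply an exact string match between the end of one read and the start of the other, so it can be found without a substitution matrix or gap penalties. The following is only a minimal sketch in plain Java, not BioJava API: the class name ExactOverlap, its methods and the minLen parameter are hypothetical, and it assumes the reads are available as upper-case strings and that the reverse read has to be reverse-complemented before merging.

```java
// Hypothetical helper, not part of BioJava: exact (100% identity, ungapped)
// suffix/prefix overlap between two reads held as plain upper-case strings.
final class ExactOverlap {

    // Reverse complement of a DNA string; anything other than A, C, G, T becomes N.
    static String reverseComplement(String s) {
        StringBuilder sb = new StringBuilder(s.length());
        for (int i = s.length() - 1; i >= 0; i--) {
            switch (s.charAt(i)) {
                case 'A': sb.append('T'); break;
                case 'C': sb.append('G'); break;
                case 'G': sb.append('C'); break;
                case 'T': sb.append('A'); break;
                default:  sb.append('N'); break;
            }
        }
        return sb.toString();
    }

    // Length of the longest suffix of 'left' that is identical to a prefix of
    // 'right'. Returns 0 if no exact overlap of at least minLen bases exists.
    static int overlapLength(String left, String right, int minLen) {
        int max = Math.min(left.length(), right.length());
        for (int len = max; len >= minLen && len > 0; len--) {
            if (left.regionMatches(left.length() - len, right, 0, len)) {
                return len;
            }
        }
        return 0;
    }

    // Joins the two reads on their exact overlap, or returns null if the
    // overlap is shorter than minLen.
    static String merge(String left, String right, int minLen) {
        int len = overlapLength(left, right, minLen);
        return len == 0 ? null : left + right.substring(len);
    }

    public static void main(String[] args) {
        String forward = "GGGGAAAACACG";
        String reverseAsSequenced = "AAACGTGT";
        // Bring the reverse read into the same orientation as the forward read.
        String reverse = reverseComplement(reverseAsSequenced);
        System.out.println(merge(forward, reverse, 4));
    }
}
```

This brute-force scan is quadratic in the worst case, which should be acceptable for a handful of Sanger-length reads. It reports nothing when the read ends contain sequencing errors, as they appear to in the alignment above, so low-quality ends would probably need trimming first.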