[Biopython] pairwise sequence alignment programs in biopython

John Berrisford jmb at ebi.ac.uk
Tue Jul 10 18:51:51 UTC 2018


Hi 

 

I'm looking at performing pairwise alignments of polymer sequences in
biopython. 

These will be protein or nucleotide sequences. They may include non-standard
residues which will be denoted as X. 

The sequences will be of varying length from around 20 residues up to
several thousand residues - put simply the range of sequences in the PDB. 

 

I'm looking for the best tool to use to do this in biopython

 

So far I have performed tests with pairwise2 and Align.PairwiseAligner. 

>From my tests it seems that pairwise2 has a limit of ~2000 residues - i.e.
if I give it a sequence of 2500 residues to compare against itself it
crashes. PairwiseAligner seems to be able to handle much longer sequences
without issue. 

 

I need to be able to set gap penalties - which is possible in both of these
programs. 

 

So my question are:

Are these the only options in biopython? - I would prefer a python
implementation rather than something that requires external compilation i.e.
Emboss Needle

Are these the best options?

Are they both maintained / stable? 

Are they comparable in their results?

Is the limitation in sequence length in pairwise2 a known issue? A quick
google search suggests most people use pairwise2, which is strange given its
sequence length limitation. 

 

Thank you

 

John 

 

--

John Berrisford

PDBe

European Bioinformatics Institute (EMBL-EBI)

European Molecular Biology Laboratory

Wellcome Genome Campus

Hinxton

Cambridge CB10 1SD UK

Tel: +44 1223 492529

 

 <https://www.pdbe.org/> https://www.pdbe.org

 <https://www.facebook.com/proteindatabank>
https://www.facebook.com/proteindatabank

 <https://twitter.com/PDBeurope> https://twitter.com/PDBeurope

 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.open-bio.org/pipermail/biopython/attachments/20180710/502b5728/attachment.html>


More information about the Biopython mailing list