[Biopython-dev] [Bug 2910] New: Parsing some pdb files results in	shorter peptide sequences than expected
    bugzilla-daemon at portal.open-bio.org 
    bugzilla-daemon at portal.open-bio.org
       
    Wed Sep  9 15:18:01 UTC 2009
    
    
  
http://bugzilla.open-bio.org/show_bug.cgi?id=2910
           Summary: Parsing some pdb files results in shorter peptide
                    sequences than expected
           Product: Biopython
           Version: 1.49
          Platform: PC
        OS/Version: Linux
            Status: NEW
          Severity: critical
          Priority: P2
         Component: Main Distribution
        AssignedTo: biopython-dev at biopython.org
        ReportedBy: schafer at rostlab.org
Parsing the one-letter sequence for a specific chain out of a given pdb file
often seems to result in shorter sequences than expected. 
The following code demonstrates this behavior for structure 1a2d chain A.
Aminoacid #118 VAL after the HETATOM (#117) block is missing in the result. 
------------------CODE----------------
from Bio.PDB.PDBParser import PDBParser
from Bio.PDB.Polypeptide import *
parser = PDBParser()
ppb = PPBuilder()
structure = parser.get_structure('tmp', '1a2d.pdb')
polypeptides = ppb.build_peptides(structure[0]['A'])
sequence = str(polypeptides[0].get_sequence())
print sequence
------------------CODE----------------
Another example is structure 13gs chain C and D. Both sequences are ECG, the
code above however returns only CG.
So this behavior seems to be indepedent from a present HETATOM block.
This bug is also present in version 1.51.
-- 
Configure bugmail: http://bugzilla.open-bio.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
    
    
More information about the Biopython-dev
mailing list