[Biojava-l] Exception thrown when parsing GenBank file

Peter Cock p.j.a.cock at googlemail.com
Mon Nov 14 17:29:31 UTC 2011


On Mon, Nov 14, 2011 at 4:53 PM, Dietmar Birzer wrote:
>
> Hi all,
>
> I am currently trying to debug a little software application which
> uses BioJava's core-1.8.1.jar library because it has started to
> throw exceptions a while ago.
>
> I guess the problem is, that the GenbankLocationParser is not
> able to handle "Het" entries in the features section of the
> GenBank/GenPept format, e.g.
>
>  Het             join(bond(9),bond(125))
>                     /heterogen="( NA,   5 )"
>
> for database id 14719485 (http://www.ncbi.nlm.nih.gov/protein/14719485) .

Interesting - note the bond locations are not in the official
DDBJ/EMBL/GenBank feature table specification (v9, Oct 2011):
http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html

However, as noted on http://www.bioperl.org/wiki/BioPerl_Locations
that seems to be intended only for nucleotides and not proteins as here.
It might be worth contacting the NCBI to find out if there is an official
specification covering these location strings?

Peter




More information about the Biojava-l mailing list