[Biojava-l] FW: Location Problems
Cox, Greg
gcox@netgenics.com
Tue, 23 Oct 2001 13:12:37 -0400
There's a architectural problem with how locations are represented. The
problem is that strandedness is applied at the feature level, but in some
genbank files applies at the sub-location level. For example, MUSPRCON01
has the feature:
gene join(complement(244..1502),L21211.1:1..105,
L21222.1:1..141,L21212.1:1..89,L21213.1:1..101,
L21214.1:1..185,L21215.1:1..225,L21216.1:1..113,
L21217.1:1..116,L21218.1:1..130,L21219.1:1..198,
L21223.1:1..137,L21220.1:1..141,L21221.1:1..664)
/gene="PC4"
This highlights two things; first, remote feature probably has to be
re-worked. It has no concept of strand, and should. The other is that
there's at least one case where some component parts of a compound location
are on different strands.
I'm not making any proposals right now, just highlighting the
problem. Let me know if you have brilliant thoughts.
Greg