[BioSQL-l] Treating GenBank source features as top level annotation

Peter biopython at maubp.freeserve.co.uk
Wed Nov 18 13:34:38 UTC 2009


On Wed, Nov 18, 2009 at 1:10 PM, Chris Fields <cjfields at illinois.edu> wrote:
>
> Just to note, there are a few cases where there are two or more source features.
> This pops up mainly with chimeric sequences, for example:
>
> http://www.ncbi.nlm.nih.gov/nuccore/21727885
>
> We have run into this a couple of times on the bioperl list.  In this case, each
> feature is limited to specific locations on the sequence and doesn't pertain to
> the entire sequence.  NCBI only notes the first source on the ORGANISM line;
> last time I checked, EMBL used both.
>
> chris

Wow - cool example. It was worth starting this thread just to learn about
this interesting corner case. I wonder if this is a common enough case to
warrant leaving the source features as they are?

Peter




More information about the BioSQL-l mailing list