[Bioperl-l] DBSOURCE in a new GenBank record

Wang, Kai Wang.Kai@mayo.edu
Fri, 9 Nov 2001 15:41:09 -0600


Hi everyone,

I found genbank released a new sequence today with a "DBSOURCE" entry
between "VERSION" and "KEYWORD" entry. The accession number is NP_057168 and
release date is 08-NOV-2001.

It seems the "DBSOURCE" is used to represent "NCBI Reference Sequence" or
RefSeq. This standard provides a foundation for the functional annotation of
human genome.

I checked the latest CVS, which was revised by Dr.Birney several hours ago.
I did not find the current SeqIO::genbank can process this entry. I think it
is better to add the following code into the source code for compatability
with the new GenBank release.

#DBSOURCE
if (/^DBSOURCE\s+REFSEQ:\s+(.+)/) {
         $seq->dbsource_refseq($1);
}

It seems to me in the future they probably add something else into the
DBSOURCE entry besides "REFSEQ", so I use dbsource_refseq instead of only
dbsource.

How is your opinion?