[BioSQL-l] KW field from Swiss-Prot in Biosql schema missing
Hilmar Lapp
hlapp at gnf.org
Tue Mar 9 14:16:17 EST 2004
On Tuesday, March 9, 2004, at 01:07 AM, Raphael A. Bauer wrote:
> That's not the case in my biosql Schema. Only the GN (Gene Name) Lines
> from Swiss-Prot are parsed into Bioentry_qualifier_value.
> The KW Field is still missing.
>
You will need at least the latest stable release of bioperl, which is
1.4.x The 1.2 series stores the keyword in a way that will not get
persisted by bioperl-db.
To verify, read in a swissprot-formatted sequence with keywords and
then print the 'keyword'-tagged annotations:
$seqio = Bio::SeqIO->new(-format => "swiss", -file =>
">myswissprot.dat");
$seq = $seqio->next_seq();
foreach my $kw ($seq->annotation->get_Annotations("keyword")) {
print "KW: ",$kw->value,"\n";
}
This should print you all keywords that the sequence had in the input
file.
The trick about serializing keywords to biosql is that there is no
trick. Bio::Seq::RichSeq (instances of which are returned by the
swissprot parser) stores keywords as Bio::Annotation::SimpleValue
objects behind the scenes (since 1.4.x), and bioperl-db just serializes
all annotation objects.
With bioperl 1.2 this will not work because that one stores keywords in
a separate property.
-hilmar
--
-------------------------------------------------------------
Hilmar Lapp email: lapp at gnf.org
GNF, San Diego, Ca. 92121 phone: +1-858-812-1757
-------------------------------------------------------------
More information about the BioSQL-l
mailing list