[Bioperl-l] Problem with load_seqdatabase

albert vilella vilella at bio.ub.es
Wed Jun 4 15:53:58 EDT 2003


Hi,

I've been trying to load a swissprot dataset into a biosql database
using load_seqdatabase.pl, but I get an error:

./load_seqdatabase.pl -host localhost -dbname biosql -dbuser root
-dbpass '*******' -namespace bioperl -format swiss
/data/database/sprot41.dat

------------- EXCEPTION  -------------
MSG: swissprot stream with no ID. Not swissprot in my book
STACK Bio::SeqIO::swiss::next_seq
/usr/lib/perl5/site_perl/5.8.0/Bio/SeqIO/swiss.pm:180
STACK toplevel ./load_seqdatabase.pl:386
                                                                                                                            
--------------------------------------

Apparently, the next_seq subrutine gets stucked in the first entry while
parsing the ID field:

swiss.pm
----------------------------------------------------------------------

$line =~ /^ID\s+([^\s_]+)(_([^\s_]+))?\s+([^\s;]+);\s+([^\s;]+);/ ||
$self->throw("swissprot stream with no ID. Not swissprot in my book");

----------------------------------------------------------------------

This is strange because I can read the same entry in the same file with:

#! /usr/bin/perl -w

use strict;
use Bio::SeqIO;
use Bio::Seq;

my $file = shift @ARGV;
my $in = Bio::SeqIO->new ( -file => $file,
			   -format => 'swiss');
my $seq = $in->next_seq();
print "Seq: ", $seq->accession_number(), " -- ", $seq->desc(), "\n\n";

Anybody experiencing similar problems? Any guess of what is happening?

Thanks in advance,

Albert Vilella
Molecular Evolution - Dept. Genetics
Universitat de Barcelona
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
Url : http://portal.open-bio.org/pipermail/bioperl-l/attachments/20030604/18ba2841/attachment.bin


More information about the Bioperl-l mailing list