[Bioperl-l] Indexing est fasta file.

Ivan Sendin ivansendin at yahoo.com
Thu Jul 10 13:16:49 EDT 2003


Brian,

> Puzzling. One thing I'd try if this had happened to me would be to 
switch
> from SDBM to DB_File as the indexing method. In order to do this 
you'll have
> to install DB_File (from RPM say, or from www.sleepycat.com) and then
> install the DB_File Perl module. This doesn't sound like a problem 
with
> Bioperl per se, it seems like a problem with SDBM, that's my guess.
 
I'll try DB_File.


> One question though: why do you say that the keys are both
> "gi|6861423|gb|AW357480.1|AW357480" and "6861423" for that sequence? 
Are you

Because there are old scripts that search for
"gi|6861423|gb|AW357480.1|AW357480" and new scripts using only gi...
I will fix it someday.

I've tried index est file again using "gi" as key and the error
happened again:

15944439
15944440
sdbm store returned -1, errno 22, key "15944440" at
/usr/local/lib/perl5/site_perl/5.8.0/Bio/Index/Abstract.pm line 713, 
<FASTA>
line 63813394.

> using the idparser() method to specify /gi\|(\d+)/ as the key? You 
might
> want to show us the code, this is often illuminating.

The code is in the end of first mail... some "Page Down"
from here...


Thanks,

Ivan

> -----Original Message-----
> From: bioperl-l-bounces at portal.open-bio.org
> [mailto:bioperl-l-bounces at portal.open-bio.org]On Behalf Of Ivan Sendin
> Sent: Friday, July 04, 2003 8:35 AM
> To: bioperl-l at bioperl.org
> Subject: RE: [Bioperl-l] Indexing est fasta file.
> 
> 
> --- Brian Osborne <brian_osborne at cognia.com> wrote:
> > Ivan,
> >
> > My Google search says:
> > On failure, the tie call returns an undefined value and probably sets $!
> to
> > contain the reason the file could not be tied.
> > sdbm store returned -1, errno 22, key "..." at ...
> > This warning is emmitted when you try to store a key or a value that is
> too
> > long. It means that the change was not recorded in the database. See BUGS
> > AND WARNINGS below.
> > Your key can't be too long - is there something unusual about this
> > particular sequence or "value"?
> >
> 
> Brian,
> 
> There is not unusual with this sequence.
> I runned the script again, printing some debug info:
> ......
> gi|6861420|gb|AW357477.1|AW357477 6861420
> gi|6861421|gb|AW357478.1|AW357478 6861421
> gi|6861422|gb|AW357479.1|AW357479 6861422
> gi|6861423|gb|AW357480.1|AW357480 6861423
> sdbm store returned -1, errno 22, key "6861423" at
> /usr/local/lib/perl5/site_perl/5.8.0/Bio/Index/Abstract.pm line 713, <FASTA>
> line 22096655.
> 
> The keys for the last sequence are "gi|6861423|gb|AW357480.1|AW357480"
> and "6861423".
> 
> 
> Ivan
> 
> 
> 
> 
> > Brian O.
> >
> >
> > -----Original Message-----
> > From: bioperl-l-bounces at portal.open-bio.org
> > [mailto:bioperl-l-bounces at portal.open-bio.org]On Behalf Of Ivan Sendin
> > Sent: Thursday, July 03, 2003 3:47 PM
> > To: bioperl-l at bioperl.org
> > Subject: [Bioperl-l] Indexing est fasta file.
> >
> > Hi,
> >
> > I'm trying to make an index on est file, but
> > when I run my script I got this error:
> >
> > sdbm store returned -1, errno 22, key "6861423" at
> > /usr/local/lib/perl5/site_perl/5.8.0/Bio/Index/Abstract.pm line 713,
> <FASTA>
> > line 22096655.
> >
> >
> > The script is very simple:
> >
> > ...
> >  my $inx = Bio::Index::Fasta->new(
> >                                    -filename => $Index_File_Name,
> >                                    -write_flag => 1
> >                                   );
> >   $inx->id_parser(\&parse_ncbi_id);
> >   $inx->make_index($fasta);
> > }
> >
> > sub parse_ncbi_id {
> >   my @retvals;
> >   my $p = $_[0];
> >   if( $p =~ /^>(\S+)/ ) {
> >     my $val = $1;
> >     push @retvals, $val;
> >     while ( $p =~/gi\|(\d*)/g) {
> >       push(@retvals,$1);
> >     }
> >   }
> >   return @retvals;
> > }
> >
> >
> > Anybody knows what is wrong?
> >
> > The size of est file (11077527557 bytes) is a issue?
> >
> >
> > Thanks,
> >
> >
> > Ivan Sendin
> >
> > __________________________________
> > Do you Yahoo!?
> > SBC Yahoo! DSL - Now only $29.95 per month!
> > http://sbc.yahoo.com
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l at portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> >
> >
> > _______________________________________________
> > Bioperl-l mailing list
> > Bioperl-l at portal.open-bio.org
> > http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 
> 
> =====
> ------------------------------------------------------------
>  Ivan da Silva Sendin  -   Bioinformatics
>  Raw IP,TCP & UDP with Java: http://jpacket.sourceforge.net
>                                Campinas - Brasil
> ------------------------------------------------------------
> 
> __________________________________
> Do you Yahoo!?
> SBC Yahoo! DSL - Now only $29.95 per month!
> http://sbc.yahoo.com
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
> 
> 


__________________________________
Do you Yahoo!?
SBC Yahoo! DSL - Now only $29.95 per month!
http://sbc.yahoo.com


More information about the Bioperl-l mailing list