[Bioperl-l] Trouble with Bio::DB::Fasta and large files
Lincoln Stein
lstein@cshl.org
Tue, 5 Nov 2002 21:14:15 -0500
I believe you are hitting the 2 GB file limit on some Unix systems. In
general, you will have to do three things:
1) make sure that your kernel supports large files > 2 Gb
Recompile the kernel if not.
2) make sure that you have a recent version of the C library,
libc, that supports large files. Install a new one if not (good luck!)
3) make sure that you have a version of Perl that was compiled
with large file support. Recompile with large file support turned
on if not.
It's a big pain. We just had to do this for one of our servers when we
experienced a similar problem.
Lincoln
On Tuesday 05 November 2002 07:32 pm, Tyler wrote:
> I have been using Bio::DB::Fasta to extract sequences from fasta BLAST
> databases for zebrafish and fugu with no problems. I've used both the
> tied hash and object oriented implementations and they work great with
> these databases. Thanks Lincoln.
>
> However, when trying to use Bio::DB::Fasta on local mouse or human
> genome databases (ensembl raw data) they throw the "Invalid file or
> dirname" exception. The mouse fasta file is 2.7GB and the human one is
> 3.2GB, as opposed to 1.2GB for zebrafish and 340MB for fugu. All
> scripts are the same except for the name of the database file. All
> databases work fine with standalone blast (both the web interface and
> the bioperl interface).
>
> Is there a work around for dealing with these extremely large files?
>
> -Tyler
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@bioperl.org
> http://bioperl.org/mailman/listinfo/bioperl-l