[Bioperl-l] Memory not sufficient when storing human chromosom 1 in BioSQL
Torsten Seemann
torsten.seemann at infotech.monash.edu.au
Fri Jul 4 06:54:29 UTC 2008
>> (ftp://ftp.ncbi.nih.gov/genomes/H_sapiens/CHR_01/hs_ref_chr1.gbk.gz) I
>> receive the error message 'Out of memory'. This takes about one hour. My
> Have you tried just loading the sequence into memory using Bio::SeqIO? The
> problem may be the size of the file itself.
# gunzip -l hs_ref_chr1.gbk.gz
compressed uncompressed ratio uncompressed_name
95180772 296770940 67.9% hs_ref_chr1.gbk
# zgrep -c ^LOCUS hs_ref_chr1.gbk.gz
49
# zgrep -c -i '^ [A-Z-]' hs_ref_chr1.gbk.gz
25853
It's about 300 MB of ASCII in 49 records and 26,000 features.
Not sure what that would explode to if wrapped in Bio::SeqFeature-like
objects? Factor of 4 perhaps?
--
--Torsten Seemann
--Victorian Bioinformatics Consortium, Dept. Microbiology, Monash University
More information about the Bioperl-l
mailing list