[Biojava-dev] SequenceDB way too big!!!
Murat Tasan
tasan at eecs.cwru.edu
Thu Jan 30 16:13:29 EST 2003
I've just started using biojava to help in my development of sequence
analysis and searching projects, but I have run into a huge (no pun
intended) problem. I make a call to SeqIOTools.readFasta(...), to read a
FASTA file of approximately 14MB in size. Because the file is only 14MB
or so, I figure getting a SequenceDB object from it will AT WORST take up
twice that in memory (~30MB) (with all of the extra information associated
with sequences... although my FASTA file has just sequence data).
Instead, my virtual machine eventually crashes out as I run out of memory.
I watched the execution using 'top' and witnessed over 75MB being
allocated to the running process.
Is there a more efficient implementation of this? Better yet, can anyone
tell me why so much space is being taken up for only 14MB of sequence
data?
Thanks for any help!!!!
Murat
--
Murat Tasan
mxt6 at po.cwru.edu
tasan at eecs.cwru.edu
http://genomics.cwru.edu
More information about the biojava-dev
mailing list