[EMBOSS] dbxflat and size of index files

Jérôme Laroche jerome.laroche at bioinfo.ulaval.ca
Wed Oct 31 20:46:50 UTC 2007


Hello,

I use dbxflat to index uniprot (sprot and trembl) flat files for  
which the size is 1.2 G for sprot and 11 G for trembl. The resulting  
files are amazingly huge: 11 G. Is it normal?

Another example with Genbank flat files: the division gbsts has a  
size of 3.3 G. Indexing with dbxflat give 6.8 G of index files but  
with dbiflat give only 199 M of index files. I know its not necessary  
to index genbank flat files with dbxflat because each individual file  
is not bigger than 300 M. I did this just for the demonstration.

Apart of this, all is working very well.

Thank you in advance.


Jérôme Laroche

Centre de bioinformatique et de biologie computationnelle
Université Laval





More information about the EMBOSS mailing list