[EMBOSS] Data Lib sizes and indexing progs

Alan Bleasby ableasby at hgmp.mrc.ac.uk
Tue Jun 21 15:27:56 UTC 2005


The new indexing programs are done (in CVS). The programs are
dbxflat, dbxfasta and dbxgcg, and they operate like their
'dbi' counterparts. The dbx and dbi programs will be available
in the next release.

So, for EMBL, you would typically index the *.dat files.
As before, you can create id, acc, sv, key, org and des indexes
(though many sites just index id and acc).

An indexing job on the whole of the recently released EMBL will
produce id, acc and key indexes of the following sizes. They
should give you some idea of the extra disc space you'll need.

-rw-r--r--  1 root root      19950 Jun 19 14:11 embli.ent
-rw-r--r--  1 root root        122 Jun 20 13:41 embli.pxac
-rw-r--r--  1 root root        122 Jun 20 13:41 embli.pxid
-rw-r--r--  1 root root        126 Jun 20 13:41 embli.pxkw
-rw-r--r--  1 root root 8755992576 Jun 20 13:41 embli.xac
-rw-r--r--  1 root root 7482558464 Jun 20 13:41 embli.xid
-rw-r--r--  1 root root 4046751744 Jun 20 13:41 embli.xkw
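
To turn the listing above into a single disc space figure, the sizes
can simply be added up. The following is a minimal sketch, not part of
EMBOSS: the helper name parse_sizes is made up for illustration, and it
assumes the usual 9-column 'ls -l' layout shown above (size in the
fifth column, file name in the ninth).

```python
# Sum the index file sizes from the 'ls -l' listing quoted above.
# Assumption: standard 9-column 'ls -l' output with no spaces in file names.

LISTING = """\
-rw-r--r--  1 root root      19950 Jun 19 14:11 embli.ent
-rw-r--r--  1 root root        122 Jun 20 13:41 embli.pxac
-rw-r--r--  1 root root        122 Jun 20 13:41 embli.pxid
-rw-r--r--  1 root root        126 Jun 20 13:41 embli.pxkw
-rw-r--r--  1 root root 8755992576 Jun 20 13:41 embli.xac
-rw-r--r--  1 root root 7482558464 Jun 20 13:41 embli.xid
-rw-r--r--  1 root root 4046751744 Jun 20 13:41 embli.xkw
"""


def parse_sizes(text):
    """Return a {file name: size in bytes} dict from 'ls -l' style text."""
    sizes = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 9:
            continue  # skip blank or malformed lines
        sizes[fields[8]] = int(fields[4])
    return sizes


sizes = parse_sizes(LISTING)
total_bytes = sum(sizes.values())

# Report each file, largest first, then the overall total.
for name, size in sorted(sizes.items(), key=lambda item: -item[1]):
    print(f"{name:12s} {size:>14,d} bytes")
print(f"{'total':12s} {total_bytes:>14,d} bytes ({total_bytes / 2**30:.1f} GiB)")
```

Almost all of the space goes to the three .xac, .xid and .xkw files;
the .ent and .px* files are negligible by comparison.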

HTH

Alan


