[EMBOSS] Data Lib sizes and indexing progs
Alan Bleasby
ableasby at hgmp.mrc.ac.uk
Tue Jun 21 15:27:56 UTC 2005
The new indexing programs are done (in CVS). The programs are:
dbxflat, dbxfasta and dbxgcg and they operate like their
'dbi' couterparts. The dbx and dbi programs will be available
in the next release.
So, for EMBL, you would typically index the *.dat files.
As before, you can create id,acc,sv,key,org & des indexes
(though many sites just index id and acc).
An indexing job on the whole of the recently released EMBL will
produce id, acc and key indexes of the following sizes. They
should give you some idea of the extra disc space you'll need.
-rw-r--r-- 1 root root 19950 Jun 19 14:11 embli.ent
-rw-r--r-- 1 root root 122 Jun 20 13:41 embli.pxac
-rw-r--r-- 1 root root 122 Jun 20 13:41 embli.pxid
-rw-r--r-- 1 root root 126 Jun 20 13:41 embli.pxkw
-rw-r--r-- 1 root root 8755992576 Jun 20 13:41 embli.xac
-rw-r--r-- 1 root root 7482558464 Jun 20 13:41 embli.xid
-rw-r--r-- 1 root root 4046751744 Jun 20 13:41 embli.xkw
HTH
Alan
More information about the EMBOSS
mailing list