DBIBLAST error when creating blast database
Marcus Claesson
marcus at chah.ucc.ie
Mon Dec 2 16:49:17 UTC 2002
> The latest formatdb from NCBI creates "version 4" blast index files. NCBI
> have not provided any documentation on this format, so it is not supported
> by EMBOSS. To produce the 'old' (non-ASN.1) index files, use:
>
> formatdb -A F
>
> That should fix it.
Not fully, but at least I don't get any error messages. The ecoli.aa db
won't show up when I run 'showdb'. Here's what I did:
[marcus at nsfm39 db]$ formatdb -A F -i ecoli.aa -p T -o T
[marcus at nsfm39 db]$ ll|grep "Dec 2"
-rw------- 1 marcus marcus 1774183 Dec 2 14:15 ecoli.aa
-rw-rw-r-- 1 marcus marcus 387530 Dec 2 15:29 ecoli.aa.phr
-rw-rw-r-- 1 marcus marcus 34372 Dec 2 15:29 ecoli.aa.pin
-rw-rw-r-- 1 marcus marcus 34312 Dec 2 15:29 ecoli.aa.pnd
-rw-rw-r-- 1 marcus marcus 180 Dec 2 15:29 ecoli.aa.pni
-rw-rw-r-- 1 marcus marcus 354726 Dec 2 15:29 ecoli.aa.psd
-rw-rw-r-- 1 marcus marcus 8287 Dec 2 15:29 ecoli.aa.psi
-rw-rw-r-- 1 marcus marcus 1363280 Dec 2 15:29 ecoli.aa.psq
-rw-rw-r-- 1 marcus marcus 642786 Dec 2 15:29 formatdb.log
[marcus at nsfm39 db]$ dbiblast
Index a BLAST database
Database name: ecoli.aa
Database directory [.]:
Wildcard database filename [ecoli.aa]: ecoli.aa.*
Release number [0.0]:
Index date [00/00/00]:
N : nucleic
P : protein
? : unknown
Sequence type [unknown]: P
1 : wublast and setdb/pressdb
2 : formatdb
0 : unknown
Blast index version [unknown]: 2
[marcus at nsfm39 db]$ showdb
Displays information on the currently available databases
# Name Type ID Qry All Comment
# ==== ==== == === === =======
tpir P OK OK OK PIR using NBRF access for 4 files
tsw P OK OK OK Swissprot native format with EMBL CD-ROM
index
tswnew P OK OK OK Swissnew as 3 files in native format with
EMBL CD-ROM index
twp P OK OK OK EMBL new in native format with EMBL CD-ROM
index
gb N OK - - Genbank IDs
gba N OK - - Genbank ACs
tembl N OK OK OK EMBL in native format with EMBL CD-ROM
index
tgb N OK - - Genbank IDs
tgenbank N OK OK OK GenBank in native format with EMBL CD-ROM
index
[marcus at nsfm39 db]$ ll|grep "Dec 2"
-rw-rw-r-- 1 marcus marcus 300 Dec 2 15:30 acnum.hit
-rw-rw-r-- 1 marcus marcus 300 Dec 2 15:30 acnum.trg
-rw-rw-r-- 1 marcus marcus 322 Dec 2 15:30 division.lkp
-rw------- 1 marcus marcus 1774183 Dec 2 14:15 ecoli.aa
-rw-rw-r-- 1 marcus marcus 387530 Dec 2 15:29 ecoli.aa.phr
-rw-rw-r-- 1 marcus marcus 34372 Dec 2 15:29 ecoli.aa.pin
-rw-rw-r-- 1 marcus marcus 34312 Dec 2 15:29 ecoli.aa.pnd
-rw-rw-r-- 1 marcus marcus 180 Dec 2 15:29 ecoli.aa.pni
-rw-rw-r-- 1 marcus marcus 354726 Dec 2 15:29 ecoli.aa.psd
-rw-rw-r-- 1 marcus marcus 8287 Dec 2 15:29 ecoli.aa.psi
-rw-rw-r-- 1 marcus marcus 1363280 Dec 2 15:29 ecoli.aa.psq
-rw-rw-r-- 1 marcus marcus 86080 Dec 2 15:30 entrynam.idx
-rw-rw-r-- 1 marcus marcus 642786 Dec 2 15:29 formatdb.log
Thus, nothing new in showdb...
Thanks for helping me Peter!
/Marcus
More information about the EMBOSS
mailing list