question about dbiblast
Jean Mao
maoj at mail.nih.gov
Fri May 30 14:49:17 UTC 2003
Hi, I am new in emboss db config. Need some help in indexing blast db.
I have in dir following files:
-rw-rw-r-- 1 maoj Seqdb 190733 May 30 08:19 drosoph.nt.nhr
-rw-rw-r-- 1 maoj Seqdb 14108 May 30 08:19 drosoph.nt.nin
-rw-rw-r-- 1 maoj Seqdb 9360 May 30 08:19 drosoph.nt.nnd
-rw-rw-r-- 1 maoj Seqdb 84 May 30 08:19 drosoph.nt.nni
-rw-rw-r-- 1 maoj Seqdb 174584 May 30 08:19 drosoph.nt.nsd
-rw-rw-r-- 1 maoj Seqdb 3699 May 30 08:19 drosoph.nt.nsi
-rw-rw-r-- 1 maoj Seqdb 31368306 May 30 08:19 drosoph.nt.nsq
i believe these are files from NCBI and generated use formatdb version 2.2.5. I run dbiblast in this directory:
Index a BLAST database
Database name: drosoph
Database directory [.]:
Wildcard database filename [drosoph]: drosoph.nt.*
Release number [0.0]:
Index date [00/00/00]:
N : nucleic
P : protein
? : unknown
Sequence type [unknown]: N
1 : wublast and setdb/pressdb
2 : formatdb
0 : unknown
Blast index version [unknown]: 2
then I got many lines of the following message:
Warning: Duplicate ID skipped: '0?0? ?^DROSOPHILA' All hits will point to first ID found
The following new files were generated:
-rw-rw-r-- 1 maoj Seqdb 322 May 30 10:44 division.lkp
-rw-rw-r-- 1 maoj Seqdb 496 May 30 10:44 entrynam.idx
-rw-rw-r-- 1 maoj Seqdb 300 May 30 10:44 acnum.trg
-rw-rw-r-- 1 maoj Seqdb 300 May 30 10:44 acnum.hit
I then edit my ~/.embossrc file by add the following lines:
DB drosoph [
type: N
method: blast
format: ncbi
dir: /data/maoj/emboss/db/blast/drosoph
indexdir: /data/maoj/emboss/db/blast/drosoph
file: "drosoph.nt.*"
release: "0.0"
comment: "blast drosoph"
]
Then I run showdb:
% showdb
Displays information on the currently available databases
# Name Type ID Qry All Comment
# ==== ==== == === === =======
drosoph N OK OK OK blast drosoph
test N OK OK OK Test DB
The test db is genbank format and was running fine.
Then I run seqret:
% seqret
Reads and writes (returns) sequences
Input sequence(s): drosoph:A*
Error: BLAST Query failed
Error: Unable to read sequence 'drosoph:A*'
Input sequence(s): drosoph:*
EMBOSS An error in ajseqdb.c at line 4006:
error reading file /data/maoj/emboss/db/blast/drosoph/drosoph.nt.nhr
Please advise what I might did wrong.
Thank you very much!!!
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.open-bio.org/pipermail/emboss/attachments/20030530/21abd710/attachment-0001.html>
More information about the EMBOSS
mailing list