question about dbiblast

Jean Mao maoj at mail.nih.gov
Fri May 30 14:49:17 UTC 2003


Hi, I am new in emboss db config. Need some help in indexing blast db.

I have in dir following files:
-rw-rw-r--    1 maoj     Seqdb       190733 May 30 08:19 drosoph.nt.nhr
-rw-rw-r--    1 maoj     Seqdb        14108 May 30 08:19 drosoph.nt.nin
-rw-rw-r--    1 maoj     Seqdb         9360 May 30 08:19 drosoph.nt.nnd
-rw-rw-r--    1 maoj     Seqdb           84 May 30 08:19 drosoph.nt.nni
-rw-rw-r--    1 maoj     Seqdb       174584 May 30 08:19 drosoph.nt.nsd
-rw-rw-r--    1 maoj     Seqdb         3699 May 30 08:19 drosoph.nt.nsi
-rw-rw-r--    1 maoj     Seqdb     31368306 May 30 08:19 drosoph.nt.nsq

i believe these are files from NCBI and generated use formatdb version 2.2.5. I run dbiblast in this directory:
Index a BLAST database
Database name: drosoph
Database directory [.]: 
Wildcard database filename [drosoph]: drosoph.nt.*
Release number [0.0]: 
Index date [00/00/00]: 
         N : nucleic
         P : protein
         ? : unknown
Sequence type [unknown]: N
         1 : wublast and setdb/pressdb
         2 : formatdb
         0 : unknown
Blast index version [unknown]: 2

then I got many lines of the following message:
Warning: Duplicate ID skipped: '0?0? ?^DROSOPHILA' All hits will point to first ID found

The following new files were generated:
-rw-rw-r--    1 maoj     Seqdb          322 May 30 10:44 division.lkp
-rw-rw-r--    1 maoj     Seqdb          496 May 30 10:44 entrynam.idx
-rw-rw-r--    1 maoj     Seqdb          300 May 30 10:44 acnum.trg
-rw-rw-r--    1 maoj     Seqdb          300 May 30 10:44 acnum.hit

I then edit my ~/.embossrc file by add the following lines:
DB drosoph [
        type: N
        method: blast
        format: ncbi
        dir: /data/maoj/emboss/db/blast/drosoph
        indexdir: /data/maoj/emboss/db/blast/drosoph
        file: "drosoph.nt.*"
        release: "0.0"
        comment: "blast drosoph"
        ]

Then I run showdb:
% showdb
Displays information on the currently available databases
# Name        Type ID  Qry All Comment
# ====        ==== ==  === === =======
drosoph       N    OK  OK  OK  blast drosoph
test               N    OK  OK  OK  Test DB

The test db is genbank format and was running fine.

Then I run seqret:
% seqret
Reads and writes (returns) sequences
Input sequence(s): drosoph:A*
Error: BLAST Query failed
Error: Unable to read sequence 'drosoph:A*'
Input sequence(s): drosoph:*

   EMBOSS An error in ajseqdb.c at line 4006:
error reading file /data/maoj/emboss/db/blast/drosoph/drosoph.nt.nhr

Please advise what I might did wrong. 

Thank you very much!!!

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.open-bio.org/pipermail/emboss/attachments/20030530/21abd710/attachment-0001.html>


More information about the EMBOSS mailing list