[EMBOSS] dbiflat and entret...

Ted Chiang tchiang at bioinfo.sickkids.on.ca
Fri Mar 5 20:54:46 UTC 2004


Thanks, but that doesn't seem to work.  Here is a record of what I did.


$dbiflat -fields=taxon
Index a flat file database
      EMBL : EMBL
     SWISS : Swiss-Prot, SpTrEMBL, TrEMBLnew
        GB : Genbank, DDBJ
    REFSEQ : Refseq
Entry format [SWISS]:
Database directory [.]:
Wildcard database filename [*.dat]: *.test
Database name: swtest
Release number [0.0]:
Index date [00/00/00]:


$ls -l
-rw-r--r--           322 Mar  5 15:46 division.lkp
-rw-r--r--         12920 Mar  5 15:46 entrynam.idx
-rw-r--r--       2081272 Mar  5 12:27 sw.test
-rw-r--r--         29176 Mar  5 15:46 taxon.hit
-rw-r--r--         64411 Mar  5 15:46 taxon.trg


$showdb -database swtest
Displays information on the currently available databases
# Name        Type ID  Qry All Comment
# ====        ==== ==  === === =======
swtest        P    OK  OK  OK  Swissprot indexed by field <taxon>


$entret swtest-taxon:homo\*
Reads and writes (returns) flatfile entries
Error: USA 'swtest-taxon:homo*' query field 'taxon' not defined for
database 'swtest'
Error: Unable to read sequence 'swtest-taxon:homo*'


$entret swtest:*
Reads and writes (returns) flatfile entries
Output file [hs11_arath.entret]: all_entries

$ls -l
-rw-r--r--    1 tchiang  jamie       2081272 Mar  5 15:51 all_entries
-rw-r--r--    1 tchiang  jamie           322 Mar  5 15:46 division.lkp
-rw-r--r--    1 tchiang  jamie         12920 Mar  5 15:46 entrynam.idx
-rw-r--r--    1 tchiang  jamie       2081272 Mar  5 12:27 sw.test
-rw-r--r--    1 tchiang  jamie         29176 Mar  5 15:46 taxon.hit
-rw-r--r--    1 tchiang  jamie         64411 Mar  5 15:46 taxon.trg


----------------------------------------------


-Ted


=====================================
Ted Chiang, Analyst
Centre for Computational Biology 
Hospital for Sick Children, Toronto
416.813.7028
tchiang at bioinfo.sickkids.on.ca
=====================================


On Fri, 5 Mar 2004, [ISO-8859-1] Jérôme Laroche wrote:

> Try this:
> 
> entret swtest-taxon:homo
> 
> If you want to use wildcard, you type :
> 
> entret swtest-taxon:homo\*
> 
> 
> Jérôme
> 
> Centre de bioinformatique
> Pavillon Charles-Eugène-Marchand
> Bureau 4233d
> tél: 418-656-2131 poste 6184
> fax: 418-656-7176
> www.bioinfo.ulaval.ca
> Le 04-03-05, à 13:36, Ted Chiang a écrit :
> 
> >
> > Hi,
> >
> > I ran dbiflat on a swissprot flatfile with the option '-fields taxon' 
> > in
> > order to index the db by species.  All went well, and there were files:
> >
> > taxon.hit
> > taxon.trg
> >
> > produced in addition to the division.lkp and entrynam.idx.
> >
> > My question is how do I now retrieve only the records that are for a 
> > given
> > species, ie.
> >
> > $ entret swtest:HOMO
> > Reads and writes (returns) flatfiles entries
> > Error: Database Entry 'HOMO' not found
> > Error: Unable to read sequence 'swtest:HOMO'
> >
> >
> >
> > I know the dbiflat worked, because when I run
> >
> > $ entret swtest:*
> >
> > I get all the files in the database.
> >
> >
> > Thanks for the help.
> >
> >
> >
> > -Ted
> >
> >
> > =====================================
> > Ted Chiang, Analyst
> > Centre for Computational Biology
> > Hospital for Sick Children, Toronto
> > 416.813.7028
> > tchiang at bioinfo.sickkids.on.ca
> > =====================================
> >
> 
> 




More information about the EMBOSS mailing list