indexing refseq with dbiflat

Roehrig, Sascha s.roehrig at xantos.de
Fri Sep 6 11:33:04 UTC 2002


Dear all,
 
I am having trouble indexing RefSeq (yesterday's release, in GenBank
format) with dbiflat. During indexing I get a lot of warnings about
duplicate IDs:
 
Index a flat file database
Warning: Duplicate ID skipped: '0610012A05Rik'
Warning: Duplicate ID skipped: '0610043B10Rik'
Warning: Duplicate ID skipped: '1110004B13Rik'
Warning: Duplicate ID skipped: '1110020A23Rik'
Warning: Duplicate ID skipped: '14'
Warning: Duplicate ID skipped: '14'
Warning: Duplicate ID skipped: '14'
Warning: Duplicate ID skipped: '14'
Warning: Duplicate ID skipped: '1500000C01Rik'
...
...
 
After indexing, I cannot retrieve many entries that are present in the
flat file, e.g.:
 
NM_000303
NM_005693
...
...
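
To see how widespread the duplication is, the flat file can be scanned
directly. The following is a minimal Python sketch, not part of EMBOSS. It
assumes that the warnings refer to the name field of the GenBank LOCUS line
(an assumption; I have not confirmed where dbiflat takes its IDs from). The
function names locus_names and duplicate_ids are made up for illustration.

```python
import sys
from collections import Counter


def locus_names(lines):
    """Yield the name field (second token) of every LOCUS line."""
    for line in lines:
        if line.startswith("LOCUS"):
            fields = line.split()
            if len(fields) > 1:
                yield fields[1]


def duplicate_ids(lines):
    """Return {name: count} for LOCUS names that occur more than once.

    Assumption: the indexer uses the LOCUS name as the entry ID.
    """
    counts = Counter(locus_names(lines))
    return {name: n for name, n in counts.items() if n > 1}


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python script.py <genbank_flatfile>
    with open(sys.argv[1]) as handle:
        for name, n in sorted(duplicate_ids(handle).items()):
            print(f"{name}\t{n}")
```

If names such as '14' show up here with a count above one, the duplicates
are in the flat file itself rather than produced by the indexing step.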
 
Any suggestions would be greatly appreciated. I noticed that one of the
changes in version 2.4.1 was a fix for the indexing of RefSeq, but I am
using 2.5.0 and still see the problem.
 
Best regards
 
Sascha