[EMBOSS] Problem indexing PDB fasta file
simon andrews (BI)
simon.andrews at bbsrc.ac.uk
Mon Apr 10 09:40:30 UTC 2006
On 10 Apr 2006, at 10:12, Peter Rice wrote:
> Enrique de Andres Saiz wrote:
>> I have been looking the PDB fasta file and I see that, for the
>> previous
>> warning, there are an entry whoose id is '1FNT_A' and another one
>> whoose
>> id is '1FNT_a'. Then, this make me think that EMBOSS is
>> case-insensitive. Is this true? Are there any way to distinguish
>> between
>> the two id's?
>
> Yes, EMBOSS is case-insensitive. So is the Staden/EMBLCD indexing
> standard
> that dbifasta uses.
>
> The standard also only allows one entry with each ID.
If anyone's interested I've got a small perl script which reformats the
PDB database into a more sensible format and sorts out the problems
with case sensitive ids and a number of other odd conventions used in
PDB.
I'm happy to supply a copy to anyone who wants it.
TTFN
Simon.
--
Simon Andrews PhD
Bioinformatics Dept.
The Babraham Institute
simon.andrews at bbsrc.ac.uk
+44 (0) 1223 496463
More information about the EMBOSS
mailing list