[EMBOSS] db formatting (?) and parsing issue -- emboss version 5.0.0
georgios at biotek.uio.no
Wed Jan 28 14:15:43 UTC 2009
Indeed there was an \r \n to blame. Didn't spot that with of, because it
was only one instance at the beginning of the file and not on every
line. dos2unix to the rescue and we are back in business.
Peter Rice wrote:
> Hi George,
>> Now, then, we have a second file called 'bhits' (697 sequences). This
>> file has exactly the same format as 'ahits', but when we try to
>> extract the identified sequences, we get the following:
>> for seq in `cat bhits`; do seqret -filter staphyl68-id:$seq; done
>> Died: seqret terminated: Bad value for '-sequence' with -auto defined
>> 'rror: Unable to read sequence 'staphyl68-id:FLTU7OB01AJHZO
>> (one error per sequence ID)
> Umm ... does the message really start with 'rror'?
> That suggests some non-printing character is involved in the ID. Have
> you checked bhits does not have any strange characters?
> The error message should be:
> Error: Unable to read sequence 'staphyl68-id:FLTU7OB01AJHZO'
> So something at the end of the ID seems to have moved the final quite to
> the start of the line.
> I can get the same effect by using noreturn -system pc to change the
> carriage control characters in bhits.
> I suspect that is the cause of your problem.
> Let me know if that doesn't solve it.
More information about the EMBOSS