compseq: is U an amino acid

JAEN (Jacob Engelbrecht) jaen at
Wed Aug 21 07:34:30 UTC 2002

I have been using compseq for protein sequences and wondered why 'U' is
reported as an amino acid?
I looked in the code (nucleus/embnmer.c) and found it was specifically
accounted for, whereas 'X' which in many databases  as unknown is not
specifically accounted for.

Would it not make sense to have options which made specific symbols part of
the alphabet or left them out: 
-leaveout XU or -include BZXU

Jacob Engelbrecht, Phd
Insulin Research
Novo Nordisk
6A1.038 Novo Alle
DK-2880 Bagsvaerd
tel: +45 4442 4403
mail: jaen at

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <>

More information about the EMBOSS mailing list