pattern matching problem
José R. Valverde
jrvalverde at cnb.uam.es
Tue May 6 14:44:58 UTC 2003
On Mon, 5 May 2003 13:40:36 +0200
"raouljp at libero.it" <raouljp at libero.it> wrote:
> is the result :
> AATTCGATGCAACGCGAAGA
>
> "N" seems to be a jolly char.
> I need an exact match but, how ?
>
Indeed! N stands for "any base" in the IUPAC codes. It is like a '.' in
most regexps.
I understand that what you want is to find sequences with actual 'N's in
them, i.e. the N is not to be substituted by its meaning, just like you
can use 'abc\*de' in common regexps.
If you try the program 'dreg' (or look at its source) you'll be
able to do that: use "AATTCNATNCAACGCGAAGA" as the regexp and there you
are. Since it uses standard regexps, N is no longer interpreted as an
IUPAC code but as a literal character.
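
To make the difference concrete, here is a minimal sketch in Python. It
uses Python's own 're' module, not dreg or any EMBOSS code, and the
function names (iupac_to_regex, find_literal, find_iupac) are made up
for this illustration. It searches the same pattern twice: once with N
taken literally, as a plain regexp would, and once with each IUPAC code
expanded into a character class.

```python
import re

# IUPAC nucleotide ambiguity codes written as regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]",
    "B": "[CGT]", "D": "[AGT]", "H": "[ACT]", "V": "[ACG]",
    "N": "[ACGT]",
}


def iupac_to_regex(pattern):
    """Expand every IUPAC code in the pattern into a character class."""
    return "".join(IUPAC[base] for base in pattern.upper())


def find_literal(pattern, sequence):
    """Start positions where the pattern occurs exactly, N meaning a real 'N'."""
    return [m.start() for m in re.finditer(re.escape(pattern), sequence)]


def find_iupac(pattern, sequence):
    """Start positions where the pattern matches with IUPAC codes expanded."""
    return [m.start() for m in re.finditer(iupac_to_regex(pattern), sequence)]


if __name__ == "__main__":
    pattern = "AATTCNATNCAACGCGAAGA"
    # Hypothetical test sequence: one resolved copy, then one copy with real Ns.
    sequence = "GG" + "AATTCGATGCAACGCGAAGA" + "CC" + pattern + "TT"
    print("literal N:", find_literal(pattern, sequence))
    print("IUPAC N:  ", find_iupac(pattern, sequence))
```

With this test sequence the literal search finds only the copy that
really contains the two Ns, while the IUPAC search finds only the copy
where they are resolved to G. That is the exact-match behaviour asked
for above.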
--
These opinions are mine and only mine. Hey man, I saw them first!
José R. Valverde
Artificial Intelligence is of no use when the Natural kind is lacking