pattern matching problem

José R. Valverde jrvalverde at cnb.uam.es
Tue May 6 14:44:58 UTC 2003


On Mon,  5 May 2003 13:40:36 +0200
"raouljp at libero.it" <raouljp at libero.it> wrote:
>  is the result :
> AATTCGATGCAACGCGAAGA
> 
> "N"  seems to be a jolly char.
> I need an exact match but, how  ?
> 
Indeed! N stands for "anything" in the IUPAC codes. It is like a '*' in
most regexps.

I understand that what you want is to find sequences with actual 'N's in
them, i.e. the N is not to be substituted by its meaning, just like you 
can use 'abc\*de' in common regexps.

If you try using the program 'dreg' -or looking at its source- you'll be
able to do that: use "AATTCNATNCAACGCGAAGA" as the regexp and there you
are. Since it uses standard regexps, N is no longer interpreted as a
IUPAC code.

-- 
	These opinions are mine and only mine. Hey man, I saw them first!

			    José R. Valverde

	De nada sirve la Inteligencia Artificial cuando falta la Natural



More information about the emboss-dev mailing list