[EMBOSS] Searching for repeats in fuzznuc
Steve Taylor
staylor at molbiol.ox.ac.uk
Thu Jun 21 16:42:57 UTC 2007
Hi Peter,
>
>> I would like to search for a specific repeat using fuzznuc. It is
>> pretty easy using a regexp in preg (yes, I know preg should really
>> only be used for protein sequences:-)) via
>>
>> preg -pattern '(TG{5,20}){2,10}'
>>
>> but is there a way to do something similar in fuzznuc, since I would
>> like to introduce mismatches.
>
>
> You could use dreg rather than preg :-)
>
Excellent first suggestion! :-)
> Fuzznuc now accepts files with a pattern on each line:
>
> % cat tg.pat
> TG(5,20)TG(5,20)
> TG(5,20)TG(5,20)TG(5,20)
> TG(5,20)TG(5,20)TG(5,20)TG(5,20)
> TG(5,20)TG(5,20)TG(5,20)TG(5,20)TG(5,20)
>
> or, if you want to name them, ...
>
> % cat tg.pat
> >tg2
> TG(5,20)TG(5,20)
> >tg3
> TG(5,20)TG(5,20)TG(5,20)
> >tg4
> TG(5,20)TG(5,20)TG(5,20)TG(5,20)
> >tg5
> TG(5,20)TG(5,20)TG(5,20)TG(5,20)TG(5,20)
>
> You can use the file with:
>
> % fuzznuc -pattern @tg.pat
Thanks. That looks like a useful workaround.
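To cover the whole {2,10} range of the original regexp, the pattern file could be generated rather than typed by hand. Here is a minimal sketch in plain sh, assuming each pattern is named by a header line starting with ">" as in the named example; the names tg2 to tg10 and the filename tg.pat are just illustrative extensions of that example:

```shell
#!/bin/sh
# Write a named pattern file with 2 to 10 copies of TG(5,20),
# mirroring the {2,10} range of the original regexp.
# Assumption: a line starting with ">" names the pattern that follows.
unit='TG(5,20)'
out=tg.pat

: > "$out"                      # start from an empty file

n=2
while [ "$n" -le 10 ]; do
    pat=''
    i=0
    # Concatenate the repeat unit n times.
    while [ "$i" -lt "$n" ]; do
        pat="$pat$unit"
        i=$((i + 1))
    done
    # One name line and one pattern line per repeat count.
    printf '>tg%s\n%s\n' "$n" "$pat" >> "$out"
    n=$((n + 1))
done
```

The resulting file can be passed to fuzznuc as @tg.pat in the same way. The shorter patterns will presumably also match inside longer repeats, so overlapping hits are to be expected.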
Out of interest, are there any plans for a mismatch option in dreg?
Steve