[EMBOSS] non-overlapping matches in fuzznuc?
Aengus Stewart
aengus.stewart at cancer.org.uk
Wed Oct 12 15:50:36 UTC 2011
Hi Folks,
I couldnt see a command line option to do what I wanted ie return non-overlapping hits.
This is best explained with some sample output.
#=======================================
#
# Sequence: chr1_174353258_174354335 from: 1 to: 200
# HitCount: 9
#
# Pattern_name Mismatch Pattern
# pattern1 3 CC[AT](6)GG
#
# Complement: No
#
#=======================================
Start End Strand Pattern_name Mismatch Sequence
54 63 + pattern1 3 GCCAAATAAG
55 64 + pattern1 . CCAAATAAGG
56 65 + pattern1 2 CAAATAAGGG
104 113 + pattern1 1 CCTAAATAAG
105 114 + pattern1 1 CTAAATAAGG
106 115 + pattern1 3 TAAATAAGGG
179 188 + pattern1 2 CCTTGCTTGG
190 199 + pattern1 3 CCGATTAGAG
191 200 + pattern1 3 CGATTAGAGC
As you can see this is actually only 4 hits rather than the 9 reported.
I can do this myself with another script but I was wondering if it could be an option?
regards
Aengus
--
-----------------------------------------------------------------------
Aengus Stewart Tel: +44 (0)20 7269 3679
Head of Bioinformatics and BioStatistics
CRUK London Research Institute
Lincoln's Inn Fields, Holborn, London, WC2A 3LY, UK
-----------------------------------------------------------------------
This electronic message contains information which may be privileged and
confidential. The information is intended to be for the use of the
individual(s) or entity named above. Be aware that any third party
disclosure, distribution, copying or use of this communication, without
prior permission, is strictly prohibited.
NOTICE AND DISCLAIMER
This e-mail (including any attachments) is intended for the above-named person(s). If you are not the intended recipient, notify the sender immediately, delete this email from your system and do not disclose or use for any purpose.
We may monitor all incoming and outgoing emails in line with current legislation. We have taken steps to ensure that this email and attachments are free from any virus, but it remains your responsibility to ensure that viruses do not adversely affect you.
Cancer Research UK
Registered in England and Wales
Company Registered Number: 4325234.
Registered Charity Number: 1089464 and Scotland SC041666
Registered Office Address: Angel Building, 407 St John Street, London EC1V 4AD.
More information about the EMBOSS
mailing list