[EMBOSS] Why does tfscan generate duplicate results?

Tao Zhu tzhu at mail.bnu.edu.cn
Fri Aug 3 00:53:19 UTC 2012


I am using tfscan from EMBOSS 6.5.7.0.

My input sequence is an intron sequence from Arabidopsis thaliana
(attached as test.fasta).

I ran:

$ tfscan -sequence test.fasta -menu P -mismatch 0 -outfile test.out

The result is:

########################################
# Program: tfscan
# Rundate: Fri  3 Aug 2012 08:55:50
# Commandline: tfscan
#    -sequence test.fasta
#    -menu P
#    -mismatch 0
#    -outfile test.out
# Report_format: seqtable
# Report_file: test.out
########################################

#=======================================
#
# Sequence: Atha     from: 1   to: 3388
# HitCount: 20
#=======================================

  Start     End  Strand Accession Factor
               Sequence
   3108    3114       + R03715
               ggttaat
   3108    3114       + R03715
               ggttaat
   2482    2488       + R03710
               gaaagaa
   3354    3359       + R02731    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatctc
   2185    2190       + R02731    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatctc
   1008    1013       + R02731    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatctc
   1261    1266       + R02731    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. gagata
   3354    3359       + R02729    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatctc
   2185    2190       + R02729    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatctc
   1008    1013       + R02729    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatctc
   2726    2731       + R02728    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatcta
    989     994       + R02728    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatcta
    223     228       + R02728    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatcta
   2726    2731       + R02728    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatcta
    989     994       + R02728    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatcta
    223     228       + R02728    T00627; NIT2;Quality: 2; Species:
Neurospora crassa. tatcta
   2876    2879       + R01203
               ctcc
   2449    2452       + R01203
               ctcc
   2141    2144       + R01203
               ctcc
   2877    2883       + R01202
               tccacct

#---------------------------------------
#---------------------------------------

#---------------------------------------
# Reported_sequences: 1
# Reported_hitcount: 20
#---------------------------------------

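The output contains duplicate entries. For example, the hit at
3108-3114 on the + strand (R03715) is listed twice and counted twice in
HitCount, and the three R02728 hits (223-228, 989-994, 2726-2731) are
each listed twice as well. Why does this happen?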
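
To make the repeated rows easier to spot, something like the following
could be run over the report. It is only a rough sketch: it assumes the
seqtable layout shown above (Start, End, Strand, Accession at the start
of each hit row), and the function name find_duplicate_hits is my own,
not part of EMBOSS.

```python
import re
from collections import Counter

# Matches a hit row such as "   3108    3114       + R03715".
# Assumption: every hit row starts with start, end, strand, accession.
HIT_ROW = re.compile(r"^\s*(\d+)\s+(\d+)\s+([+-])\s+(\S+)")


def find_duplicate_hits(report_text):
    """Return {(start, end, strand, accession): count} for rows seen more than once."""
    counts = Counter()
    for line in report_text.splitlines():
        match = HIT_ROW.match(line)
        if match:
            start, end, strand, accession = match.groups()
            counts[(int(start), int(end), strand, accession)] += 1
    return {hit: n for hit, n in counts.items() if n > 1}


# Short excerpt of the report above, used here as sample input.
sample = """\
   3108    3114       + R03715
               ggttaat
   3108    3114       + R03715
               ggttaat
   2482    2488       + R03710
               gaaagaa
"""

duplicates = find_duplicate_hits(sample)
for (start, end, strand, accession), n in sorted(duplicates.items()):
    print(f"{start}-{end} {strand} {accession}: listed {n} times")
```

To check the whole report instead of the excerpt, the contents of
test.out can be read and passed to the same function. Comment lines
starting with "#" and the wrapped Factor/Sequence lines do not match
the pattern and are skipped.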

-- 
Tao Zhu, College of Life Sciences, Beijing Normal University, Beijing
100875, China
Email: tzhu at mail.bnu.edu.cn

-------------- next part --------------
A non-text attachment was scrubbed...
Name: test.fasta
Type: application/x-wine-extension-fasta
Size: 3394 bytes
Desc: not available
URL: <http://lists.open-bio.org/pipermail/emboss/attachments/20120803/0ecf718a/attachment-0002.bin>

