[EMBOSS] How to apply the einverted and etandom to a fasta file
yun zheng
mincloud at gmail.com
Sun Oct 29 17:39:35 UTC 2006
Hi,
I am a new user of emboss. I am trying to find repeat sequences in a
nucleotide sequence file that have many sequences.
Can anybody tell me how to use einverted and etandem to analyze all the
sequences in a fasta file?
Many Thanks.
Sincerely
Zheng, yun
Dept of Computer Science and Engineering
Washington Univ in St Louis
Campus Box 1045
1 Brookings Drive
Jolley Hall 505
St Louis, MO 63130
Details:
I install a version on the linux platform. And the command is like follows,
where the default value is used.
>einverted -sequence test.fasta -outfile test.outfile -outseq
>test-i.fasta
Finds DNA inverted repeats
Gap penalty [12]:
Minimum score threshold [50]:
Match score [3]:
Mismatch score [-4]:
But the output file seems always to be empty.
When I try etandom
>etandem -sequence test.fasta -outfile test-t.out -origfile test.etandem
Looks for tandem repeats in a nucleotide sequence
Minimum repeat size [10]:
Maximum repeat size [10]: 18
However, it seems that only the first sequence is analyzed by the einverted
and etandom. The test-t.out file is as follows.
########################################
# Program: etandem
# Rundate: Sat Oct 28 2006 17:24:30
# Commandline: etandem
# -sequence test.fasta
# -outfile test-t.out
# -origfile test.etandem
# -maxrepeat 18
# Report_format: table
# Report_file: test-t.out
########################################
#=======================================
#
# Sequence: D9X6RJV01EER0J from: 1 to: 55
# HitCount: 0
#
# Threshold: 20
# Minrepeat: 10
# Maxrepeat: 18
# Mismatch: No
# Uniform: No
#
#=======================================
Start End Score Size Count Identity Consensus
#---------------------------------------
#---------------------------------------
Many thanks.
More information about the EMBOSS
mailing list