[EMBOSS] EMBOSS equivalents for GCG programs
Peter Rice
pmr at ebi.ac.uk
Fri Nov 7 14:46:21 UTC 2003
Derek Gatherer wrote:
> I have to confess, I'm a bit baffled by this. Obviously, EMBOSS needle
> would do a NW alignment, but the gap opening penalty default in needle
> is 10 and the gap extension is 0.5. I'm not sure what is implied in Gap
> by "gap creation", or whether or not needle allows any end gap
> penalties. Does anybody have any particular recommendations about the
> appropriate needle parameters to use to emulate Gap's behaviour?
In a Needleman-Wunsch global alignment, the entire length of each
sequence is aligned. This can be thought of as an overlap between the
two sequences (one can be completely within the otehr, or their ends can
overlap).
This leaves no penalty for the hanging ends of the overlap. In
bioinformatics, it is usually reasonable to assume that the sequences
are incomplete and there should be no penalty for failing to align the
missing bases.
In GCG's gap, "endweight" means that the hanging ends are treated as two
more rather long gaps. This tends to force the ends of the sequences to
be aligned (whether they should be or not) to reduce the number of gaps,
and is usually not the option you want.
it would not be too difficult to support endweights in needle (the
difference is in setting up the gap penalties in the matrix at the start
and end, the rest is the same).
But ... is it useful?
> I'm also looking for an equivalent of the GCG program Diverge, that
> calculates Ka/Ks values. I think there is no equivalent to this in
> EMBOSS, but I may have missed something?
diverge is a strange best that was not too happy with large input data
sets. GCG replaced it by "newdiverge" some time back with a different
algorithm.
EMBOSS supports phylip 3.5, which has distance methods, as an EMBASSY
package. You may find what you need there, and phylip 3.6 will be stable
soon and will appear as a new and more closely integrated EMBASSY
package soon after.
Again ... would phylip 3.6 in EMBOSS be useful?
regards,
Peter Rice
More information about the EMBOSS
mailing list