[Bioperl-l] clustalw output format
Jason Eric Stajich
jason@cgt.mc.duke.edu
Mon, 15 Oct 2001 19:49:08 -0400 (EDT)
Thanks Peter - will work on implementing this - I guess I should have
perused the manual more carefully.
-jason
On Mon, 15 Oct 2001, Peter Schattner wrote:
> Jason Eric Stajich wrote:
>
> > In a Clustal alignment what do the '.' and ':' signify - I assume
> > conservative transitions either by hydrophobicity and charge or from
> > BLOSUM matrices, not sure how I would calculate them though without
> > knowin exactly what they code for.
> >
> > I
>
> Here's an except from the clustalw documentation that should help:
>
> 12.The conservation line output in the clustal format alignment file has
> been changed. Three characters are now used: '*' indicates positions which
> have
> a single, fully conserved residue ':' indicates that one of the
> following 'strong' groups is fully conserved:-
>
> STA
> NEQK
> NHQK
> NDEQ
> QHRK
> MILV
> MILF
> HY
> FYW
>
> '.' indicates that one of the following 'weaker' groups is fully
> conserved:-
>
> CSA
> ATV
> SAG
> STNK
> STPA
> SGND
> SNDEQK
> NDEQHK
> NEQHRK
> FVLIM
> HFY
>
> These are all the positively scoring groups that occur in the Gonnet
> Pam250 matrix. The strong and weak groups are defined as strong score >0.5
> and weak score =<0.5 respectively.
>
> - Peter
>
--
Jason Stajich
Duke University
jason@cgt.mc.duke.edu