[Biojava-l] Alignment consensus calculation
mark.schreiber at novartis.com
mark.schreiber at novartis.com
Fri May 19 02:16:00 UTC 2006
Hi -
To get a Distribution[] over an alignment you could use
DistributionTools.distOverAlignment(a) or one of the other overloaded
methods.
To get a consensus you could simply find the most frequent Symbol in each
Distribution. To make a more sophisticated consensus you could have
thresholds below which you would report an ambiguity.
eg if:
a = 0.50
t = 0.40
c = 0.0
g = 1.0
Your routine would need to decide if the consensus should be 'a' or 'w' or
the IUPAC symbol for [atg] which I cannot remember. You would probably use
some sort of cutoff value. It might be a routine like this:
public SymbolList consensus(Alignment a, double threshold){
....
}
It might be a method that others find useful so please post it back to the
list.
Hope this helps,
- Mark
Mark Schreiber
Research Investigator (Bioinformatics)
Novartis Institute for Tropical Diseases (NITD)
10 Biopolis Road
#05-01 Chromos
Singapore 138670
www.nitd.novartis.com
phone +65 6722 2973
fax +65 6722 2910
"Nathan S. Haigh" <n.haigh at sheffield.ac.uk>
Sent by: biojava-l-bounces at lists.open-bio.org
05/18/2006 11:44 PM
Please respond to n.haigh
To: <biojava-l at lists.open-bio.org>
cc: (bcc: Mark Schreiber/GP/Novartis)
Subject: [Biojava-l] Alignment consensus calculation
I was wondering if there were any methods for generating a consensus
sequence for alignments? Or any suggestions for calculating the frequency
of
symbols at each position in an alignment.
I had a look at the DistributionTools after seeing a past e-mail to the
list
but couldn't figure if this would do the job as I'm new to Java.
Thanks
Nath
----------------------------------------------------------------------------
------
Dr. Nathan S. Haigh
Bioinformatics PostDoctoral Research Associate
Room B2 211 Tel: +44 (0)114 22
20112
Department of Animal and Plant Sciences Mob: +44 (0)7742
533
569
University of Sheffield Fax: +44 (0)114 22
20002
Western Bank Web:
www.bioinf.shef.ac.uk
Sheffield
www.petraea.shef.ac.uk
S10 2TN
----------------------------------------------------------------------------
------
---
avast! Antivirus: Outbound message clean.
Virus Database (VPS): 0620-2, 18/05/2006
Tested on: 18/05/2006 16:44:01
avast! - copyright (c) 1988-2006 ALWIL Software.
http://www.avast.com
_______________________________________________
Biojava-l mailing list - Biojava-l at lists.open-bio.org
http://lists.open-bio.org/mailman/listinfo/biojava-l
More information about the Biojava-l
mailing list