[Biojava-l] Short names for Amino acid symbols

George Waldon gwaldon at geneinfinity.org
Tue Aug 5 00:22:20 UTC 2008


The link http://biojava.org/wiki/BioJava:Cookbook:Translation:OneLetterAmbiDaniel does not seem to work.

Did you try SymbolTonenization? Something like:

Symbol s;
SymbolTokenization  tok= ProteinTools.getTAlphabet().getTokenization("token");
String 	s = tok.tokenizeSymbol(s);

Should give you the short name of any given symbol.

- George

> -----Original Message-----
> From: biojava-l-bounces at lists.open-bio.org [mailto:biojava-l-
> bounces at lists.open-bio.org] On Behalf Of Peter Robinson
> Sent: Sunday, July 27, 2008 8:57 AM
> To: biojava-l at lists.open-bio.org
> Subject: [Biojava-l] Short names for Amino acid symbols
> 
> Hi,
> 
> thanks to all on the list who helped me get started with Biojava, and by
> the way, the online documents are quite helpful!
> 
> I am trying to develop some code to look for signs of positive selection
> in human sequences by making multiple alignments of protein sequences
> and mapping the nucleotide sequences onto this alignment and checking
> synonymous and nonsynonymous nucleotide substitutions in several species
> (etc).
> 
> A few small questions;
> 1) I have written a class to encapsulate all I need from a given Genbank
> mRNA sequence; the entire mRNA, the CDS and the corresponding protein
> sequence. I have some methods such as the following:
> 
>  private void setCDSSequence() {
>         Feature CDS = getCDSFeature(this.completeSequence);
>         Location loc = CDS.getLocation();
>         SymbolList symL = this.completeSequence.subList(loc.getMin(),
> loc.getMax()-3); //-3 to remove stop codon
>         this.CDS= symL;
>     }
> 
> Question: Why is there (seemingly) no way in Biojava to create a
> Sequence object instead of a SymbolList object? Or did I miss something?
> 
> 2)  I would then like to printout the protein alignment to check for
> correctness, and it seems there is no way of getting from a symbol to
> the one-letter aminoacid code. That is,
> 
> proteinAlignment.get(j).symbolAt(k).getName()
> 
> will return "Ala" instead of "A" etc. Is there a good way of getting the
> short symbols?
> 
> Thanks, Peter
> _______________________________________________
> Biojava-l mailing list  -  Biojava-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/biojava-l



More information about the Biojava-l mailing list