[Biojava-dev] alternate tokenization

Richard Holland holland at ebi.ac.uk
Fri Oct 12 22:21:59 UTC 2007


interesting idea. we kind of have this already but not really - the stuff
is all there but not wired up right.

best to add it to the wiki so we don't forget to finish the wiring. :)

cheers,
Richard

On Fri, October 12, 2007 11:12 pm, george waldon wrote:
> Hello,
>
> I would like to add alternate tokenizations in AlphabetManager.xml for the
> DNA alphabet and the PROTEIN-TERM alphabet.
>
> These tokenizations will be used for stringifying DNA sequences in capital
> letters, like in "ATCG", and proteins with a mix of capital and small
> letters, like in "SerGluPro".
>
> Comments welcome,
> George
>
> _______________________________________________
> biojava-dev mailing list
> biojava-dev at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/biojava-dev
>


-- 
Richard Holland
BioMart (http://www.biomart.org/)
EMBL-EBI
Hinxton, Cambridgeshire CB10 1SD, UK




More information about the biojava-dev mailing list