[Biojava-dev] Code Update

Andy Yates ayates at ebi.ac.uk
Sun Feb 7 00:27:18 UTC 2010


Hi Mark,

I thought that there had been some cases of Valine in bacteria not  
being modified but that said it's probably me mis-understanding what  
the other person was saying :). However there is the in-flexibility in  
other resources/programs which can throw the toys out of the pram the  
moment they are presented with a peptide not starting with M.

What I was thinking is to either have this option available (much in  
the same way I've got stop codon trimming working) or using the  
planned edit capability. If I do put the optional translation as a  
boolean for the moment then that bit becomes a bit more feature  
complete and then a better solution can be applied later on :).

Anyway next on the hit list I think is locations ....

Andy

On 6 Feb 2010, at 15:45, Mark Schreiber wrote:

> Hi Andy,
>
> Great work!
>
> Regarding the issue of non Met start codons such as TTG,  
> biologically speaking they are still translated as Met. fMet is the  
> only tRNA that can initiate. Presumably there is some flexibility in  
> the recognition of the start codon.
>
> Maybe this is what you where aiming for but it would be good to have  
> the option of making the first codon translate to Met no matter what  
> the codon.
>
> Mark
>
>
>> On 06-Feb-2010 11:13 PM, "Andy Yates" <ayates at ebi.ac.uk> wrote:
>>
>> Finally it's in. I've managed to get enough time to finish the  
>> transcription code off. The main features of this check-in are:
>>
>> * DNA -> RNA -> Codon -> Peptide translation
>> * Support for all IUPAC tables
>> * New views for reversing sequences & complementing them
>> * Windowed sequence views giving portions of a sequence as requested
>> * TranscriptionEngine & TranscriptionEngine.Builder deal with the  
>> business of assembling the classes together as required
>> * Singletons provided from the classes they are in (e.g.  
>> IUPACParser has one) *but* no class requires a singleton!
>> * Utilities for working with IO streams & classpath resources  
>> (useful for testing)
>> * Test case shows 1000 translations of BRCA2 (from DNA) in  
>> 0.7seconds (on my MacBook Pro; YMMV); test case will break if it  
>> takes longer than a second
>> ** This is a vast improvement over my first attempt that had a rate  
>> of 1 per second hence why that was not checked in
>>
>> Limitations are:
>>
>> * Not much checking WRT lengths of sequence given to the code; need  
>> a strict & lenient mode
>> * Stop codon trimming controlled by a boolean
>> * No init-met translation (very important as some programs get a  
>> bit annoyed if they're given a V as an initiator AA)
>> * Not sure if there is a way to ask if a codon is a start codon  
>> easily; I'm sure it can be done just not as easily as we may want
>> * No way of modifying a badly translated peptide which we expect to  
>> badly translate
>>
>> However it's workable & means if you have a DNASequence technically  
>> you can get a peptide by saying:
>>
>> DNASequence s = getSeq();
>> ProteinSequence p = s.getRNASequence().getProteinSequence();
>>
>> Now how's that for easy :)
>>
>> Share & enjoy!
>>
>> Andy
>>
>> _______________________________________________
>> biojava-dev mailing list
>> biojava-dev at lists.open-bio...
>>
>

-- 
Andrew Yates                   Ensembl Genomes Engineer
EMBL-EBI                       Tel: +44-(0)1223-492538
Wellcome Trust Genome Campus   Fax: +44-(0)1223-494468
Cambridge CB10 1SD, UK         http://www.ensemblgenomes.org/







More information about the biojava-dev mailing list