[Bioperl-l] automation of translation based on alignment
Florent Angly
florent.angly at gmail.com
Mon Mar 22 00:44:11 UTC 2010
Hi Ross,
It seems like your answer is in the link you put. On this link, all the
coding sequences are already identified and their aminoacid sequence
provided. You simply need to parse all the GenBank entries to extract
this information. You may use EUtilities to achieve this online:
http://www.bioperl.org/wiki/HOWTO:EUtilities_Cookbook
Florent
On 21/03/10 09:55, Ross KK Leung wrote:
> Dear bioperl users,
>
>
>
> I am working on virus sequences and one of the Genbank file is here:
>
>
>
> http://www.ncbi.nlm.nih.gov/nuccore/DQ089804.1?ordinalpos=1
> <http://www.ncbi.nlm.nih.gov/nuccore/DQ089804.1?ordinalpos=1&itool=EntrezSys
> tem2.PEntrez.Sequence.Sequence_ResultsPanel.Sequence_RVDocSum>
> &itool=EntrezSystem2.PEntrez.Sequence.Sequence_ResultsPanel.Sequence_RVDocSu
> m
>
>
>
> with 1000 such nucleotide sequences, I'd like to translate the corresponding
> protein coding sequences. The difficulties lie in:
>
>
>
> 1) The genome sequence is circular
>
> 2) The genes are overlapping
>
>
>
> I don't have all the 1000 Genbank files but I plan to use the above guide
> one to direct the automation process. Has bioperl implemented specialized
> functions to handle this kind of problem?
>
>
>
> Thanks a lot for your advice, Ross
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/bioperl-l
>
More information about the Bioperl-l
mailing list