[Bioperl-l] Bio::Root::Exception when using Bio::Seq
Torsten Seemann
torsten.seemann at infotech.monash.edu.au
Mon Jun 19 23:30:19 UTC 2006
> -------------------- WARNING ---------------------
> MSG: seq doesn't validate, mismatch is 1
> MSG: Attempting to set the sequence to [ACTG*] which does not look
> healthy
> NOTE: ACTG* represents a sequence of those 4 characters(a valid DNA
> sequence)
> For some reason it is failing on a particular sequence, which is a valid
> DNA sequence. If anyone has any ideas on why this is I would appreciate
> it.
Usually a '*' indicates a STOP codon in a protein sequence.
I don't think it is valid in a DNA sequence?
So my guess is that BioPerl is auto-detecting it as Protein sequence,
as A,C,T,G are all valid amino acids, and * is a stop codon.
So I think BioPerl is doing the right thing.
If you want to force it, try adding a " -alphabet=>'protein' " parameter to the
Bio:Seq constructor.
--
Torsten Seemann
Victorian Bioinformatics Consortium, Monash University, Australia
http://www.vicbioinformatics.com/
More information about the Bioperl-l
mailing list