I must point out that, from the point of view of the FASTA program (and, I think, BLAST as well), any sequence of printable characters is a "valid" FASTA protein or DNA sequence. Letters that do not correspond to IUPAC amino acid or nucleotide codes are simply ignored, as are numbers, spaces, tabs, etc.

Bill Pearson
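To illustrate the point above, here is a minimal Python sketch of that kind of filtering. It is not the actual FASTA or BLAST code. The function name `keep_iupac` and the two alphabet sets are assumptions made for illustration, and the real programs may accept a somewhat different set of letters.

```python
# Assumed alphabets for illustration only; the real programs may differ.
IUPAC_NUCLEOTIDE = set("ACGTURYKMSWBDHVN")
IUPAC_PROTEIN = set("ACDEFGHIKLMNPQRSTVWYBZX")


def keep_iupac(raw, alphabet):
    """Return only the characters of raw that belong to alphabet.

    Comparison is case-insensitive; digits, whitespace, punctuation and
    letters outside the alphabet are silently dropped.
    """
    return "".join(ch for ch in raw.upper() if ch in alphabet)


if __name__ == "__main__":
    # Digits, the space, the tab, the dash and the non-nucleotide "x" vanish.
    print(keep_iupac("acgt 123\tNN-xx", IUPAC_NUCLEOTIDE))
```

Under these assumptions, a line full of numbers and spacing still yields a usable sequence, which is why such input is not rejected as malformed.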