[Biopython-dev] Newick Parser

Nigel Delaney nigel.delaney at outlook.com
Sat Sep 28 15:55:14 UTC 2013


You mean a Unicode byte order mark (BOM)?

 

Yep.

 

Does it even make sense to allow non-ASCII in Newick format?

 

I think that's a matter of opinion.  The specs I found discussed how to
parse the string, but not how to encode the string.  

 

The advantages I can see are allowing people to use the extended characters
for node/tip label names, and being robust if different
text-editors/programs muck with the files (which I would suspect are usually
ASCII).

 

The disadvantage is that it's another case to handle in code, so could just
be ignored or throw an exception.

 

Not sure what the preferred choice for biopython would be.




More information about the Biopython-dev mailing list