<div dir="ltr">I emailed NCBI. I'll post on both the mailing list and the github issue when/if I hear back.<div><br></div><div>Cheers,</div><div><br></div><div>Lenna</div></div><div class="gmail_extra"><br><div class="gmail_quote">On Mon, Dec 7, 2015 at 4:32 PM, Peter Cock <span dir="ltr"><<a href="mailto:p.j.a.cock@googlemail.com" target="_blank">p.j.a.cock@googlemail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><span class="">On Mon, Dec 7, 2015 at 7:32 PM, Lenna Peterson <<a href="mailto:arklenna@gmail.com">arklenna@gmail.com</a>> wrote:<br>
> The problem is that the DOCTYPE is missing from the XML file.<br>
><br>
> For example, a nucleotide XML file begins like this:<br>
><br>
> <?xml version="1.0"?><br>
>  <!DOCTYPE GBSet PUBLIC "-//NCBI//NCBI GBSeq/EN"<br>
> "<a href="http://www.ncbi.nlm.nih.gov/dtd/NCBI_GBSeq.dtd" rel="noreferrer" target="_blank">http://www.ncbi.nlm.nih.gov/dtd/NCBI_GBSeq.dtd</a>"><br>
>  <GBSet><br>
><br>
> As far as I can tell, this is the appropriate DTD:<br>
> <a href="http://www.nlm.nih.gov/databases/dtd/nlmcatalogrecordset_150101.dtd" rel="noreferrer" target="_blank">http://www.nlm.nih.gov/databases/dtd/nlmcatalogrecordset_150101.dtd</a><br>
><br>
> However, because the DTD is not specified in the file, the parser does not<br>
> know where to find it.<br>
><br>
> Cheers,<br>
><br>
> Lenna<br>
><br>
<br>
</span>This looks like <a href="https://github.com/biopython/biopython/issues/354" rel="noreferrer" target="_blank">https://github.com/biopython/biopython/issues/354</a><br>
again - could someone write to the NCBI again since this doesn't seem<br>
to have been fixed yet on their side?<br>
<span class="HOEnZb"><font color="#888888"><br>
Peter<br>
</font></span></blockquote></div><br></div>