<div dir="ltr">Thank you, Peter. I did contact NCBI and the preferred work-around is to download the rather large bioproject.xml file from the ftp site and work with that.</div><div class="gmail_extra"><br><div class="gmail_quote">On Tue, Jul 11, 2017 at 10:52 AM, Peter Cock <span dir="ltr"><<a href="mailto:p.j.a.cock@googlemail.com" target="_blank">p.j.a.cock@googlemail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi Kirk,<br>

<br>

Unfortunately based on the NCBI documentation, with efetch<br>

and the genome database it appears only docsum and uilist<br>

are supported:<br>

<br>

<a href="https://www.ncbi.nlm.nih.gov/books/NBK25499/table/chapter4.T._valid_values_of__retmode_and/" rel="noreferrer" target="_blank">https://www.ncbi.nlm.nih.gov/<wbr>books/NBK25499/table/chapter4.<wbr>T._valid_values_of__retmode_<wbr>and/</a><br>

<br>

Unless one of the Biopython community has tried something<br>

similar, you might be best contacting the NCBI Entrez team<br>

directly to ask. Please do let us know if you solve it.<br>

<br>

Good luck,<br>

<br>

Peter<br>

<div><div class="h5"><br>

On Tue, Jul 11, 2017 at 3:56 PM, Kirk Vander Meulen <<a href="mailto:kvander11@gmail.com">kvander11@gmail.com</a>> wrote:<br>

> I am trying to extract metadata, specifically things like temperature range<br>

> and oxygen requirement status, for a large list of organisms. I am having<br>

> trouble accomplishing this by searching the genome database using either<br>

> efetch or esummary. No matter the route I take, it appears I end up with the<br>

> "docsum" version of data. As an example, I am searching for the genome id#<br>

> 13296. If I search the genome database on NCBI's website, I get a results<br>

> page corresponding to a page ending with ".../genome/?term=13296", which<br>

> often contains my desired metadata. But when using the Entrez module, I get<br>

> results corresponding to ".../genome/?term=13296&<wbr>report=docsum", which<br>

> contains pretty bare-bones level of information.<br>

><br>

> Here is an example of what I have tried and the result:<br>

><br>

>>>>record=Entrez.esummary(gb=<wbr>"genome,id="13296")<br>

>>>>Entrez.read(record)<br>

> [{'Number_of_Organelles': '0', 'Number_of_Plasmids': '0', 'Create_Date':<br>

> '2012/03/29 00:00', 'ProjectID': '157423', 'Assembly_Accession':<br>

> 'GCA_000284295.1', 'Number_of_Chromosomes': '1', 'Options': '', 'DefLine':<br>

> 'Actinoplanes missouriensis overview', u'Item': [], 'Organism_Kingdom':<br>

> 'Bacteria', 'Assembly_Name': 'ASM28429v1', 'AssemblyID': '408138',<br>

> 'Organism_Name': 'Actinoplanes missouriensis', u'Id': '13296'}]<br>

><br>

><br>

> I have tried playing with rettype="full", retmode="text" and the like, as<br>

> well as using efetch instead of esummary, but regardless I only get this<br>

> more limited set of data. Is there a different approach I should be taking<br>

> here? Thank you very much for any pointers.<br>

><br>

> Kirk<br>

><br>

</div></div>> ______________________________<wbr>_________________<br>

> Biopython mailing list  -  <a href="mailto:Biopython@mailman.open-bio.org">Biopython@mailman.open-bio.org</a><br>

> <a href="http://mailman.open-bio.org/mailman/listinfo/biopython" rel="noreferrer" target="_blank">http://mailman.open-bio.org/<wbr>mailman/listinfo/biopython</a><br>

</blockquote></div><br></div>