[Biopython] download sequences by date from Genbank
Michiel de Hoon
mjldehoon at yahoo.com
Thu Jun 28 01:01:59 UTC 2012
Hi Lifeng,
Have a look at esearch in the NCBI E-Utilities:
http://www.ncbi.nlm.nih.gov/books/NBK25499/#chapter4.ESearch
You can access the E-Utilities and parse the results with Bio.Entrez as described in the Biopython manual.
Best,
-Michiel
--- On Wed, 6/27/12, Lifeng Lin <linlifeng at gmail.com> wrote:
> From: Lifeng Lin <linlifeng at gmail.com>
> Subject: [Biopython] download sequences by date from Genbank
> To: biopython at lists.open-bio.org
> Date: Wednesday, June 27, 2012, 5:20 PM
> Hi folks,
>
> Is there an elegant way of downloading sequences from
> Genbank and using
> date as a cutoff?
>
> I am trying to maintain an up-to-date local version of all
> sequences for a
> certain number of species. When "synching" with Genbank, all
> i can think of
> is retrieving all GI numbers for these species once again,
> compare them
> with what i have locally, and generate a list of new
> sequences and append
> them. I have a hunch that there might be a better way of
> doing this, for
> example, if there is a date filter that we can apply for
> Genbank download,
> then all the trouble for comparisons would be saved.
>
> Any suggestions?
>
> best,
> L.
> _______________________________________________
> Biopython mailing list - Biopython at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/biopython
>
More information about the Biopython
mailing list