[Bioperl-l] SeqHound
Susan J. Miller
sjmiller at email.arizona.edu
Wed Feb 6 20:57:35 UTC 2008
Barry Moore wrote:
> Susan,
>
> I'm joining this discussion late so my apologies if I'm missing the
> original point. If you're trying to routinely download thousands of
> sequences from GenBank or SeqHound you probably want to be using ftp to
> download the flat files and query/parse locally. If you're trying to
> stay on top of the latest Drosophila ESTs, then how about setting up a
> nightly cron job to download the incremental updates from NCBI's FTP site
> (ftp://ftp.ncbi.nih.gov/genbank/daily-nc) and parse that for Drosophila
> EST sequences. The EST division is huge, but I would think nightly
> incrementals should be manageable.
Hi Barry,
I'll try your suggestion. I guess my interpretation of the
documentation for SeqHound was erroneous. (Who knows what 'large
numbers of sequences' means?) I tried using SeqHound's get_Stream_by_id
method to fetch 10,000 sequences in batches of 500 and got a timeout error.
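
For the nightly parse Barry describes, a minimal sketch in Python (not
BioPerl) might look like the following. It assumes the incremental file
has already been downloaded and decompressed, that each record ends with
a '//' line, and that the division code appears as its own token on the
LOCUS line, as in the standard GenBank flat-file layout. The function
names iter_genbank_records and is_drosophila_est are hypothetical and
not part of any library.

```python
import io
from typing import Iterator, TextIO


def iter_genbank_records(handle: TextIO) -> Iterator[str]:
    """Yield one flat-file record at a time; a record ends with a '//' line."""
    lines = []
    for line in handle:
        lines.append(line)
        if line.rstrip() == "//":
            yield "".join(lines)
            lines = []


def is_drosophila_est(record: str) -> bool:
    """Return True if the LOCUS line carries the EST division code and
    the ORGANISM line names a Drosophila species."""
    is_est = False
    is_fly = False
    for line in record.splitlines():
        if line.startswith("LOCUS"):
            # Assumption: the division code is a whitespace-separated token.
            is_est = "EST" in line.split()
        elif line.strip().startswith("ORGANISM"):
            is_fly = "Drosophila" in line
    return is_est and is_fly
```

Because records are read one at a time, memory use stays flat however
large the file is. To read a compressed file directly, the handle could
come from gzip.open(path, "rt") rather than a plain open(); the path
depends on which daily file was fetched.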
--
Regards,
-susan
Susan J. Miller
Manager, Scientific Data Analysis
Biotechnology Computing Facility
Arizona Research Laboratories
(520) 626-2597