[Bioperl-l] Get GIs from Taxonomy ID
Chris Fields
cjfields at illinois.edu
Thu Jun 10 21:35:48 UTC 2010
You can get up-to-date files mapping GI to TaxID here (nr and nt):
ftp://ftp.ncbi.nlm.nih.gov/pub/taxonomy/
chris
On Jun 10, 2010, at 4:11 PM, Smithies, Russell wrote:
> Eutils will do it with the right query:
>
>
>
>
> use Bio::DB::EUtilities;
>
> my $factory = Bio::DB::EUtilities->new(-eutil => 'esearch',
> -db => 'nucleotide',
> -term => 'txid9940[Organism:noexp]',
> -email => 'mymail at foo.bar',
> -retmax => 1000000);
>
> # query hits
> print "Count = ",$factory->get_count,"\n";
> # UIDs
> my @ids = $factory->get_ids;
>
>
> --Russell
>
>
>> -----Original Message-----
>> From: bioperl-l-bounces at lists.open-bio.org [mailto:bioperl-l-
>> bounces at lists.open-bio.org] On Behalf Of Dave Messina
>> Sent: Friday, 11 June 2010 8:01 a.m.
>> To: armendarez77 at hotmail.com
>> Cc: bioperl-l at lists.open-bio.org
>> Subject: Re: [Bioperl-l] Get GIs from Taxonomy ID
>>
>> Hi Veronica,
>>
>> These days when you run BLAST at the NCBI server, you're running BLAST+,
>> which is their complete rewrite of (and replacement for) BLAST.
>>
>> You can also download BLAST+ and do pretty much everything on your local
>> machine that you can do on their server, including limit by taxonomy.
>>
>> I think this is the right parameter:
>>
>> http://www.ncbi.nlm.nih.gov/blast/html/blastcgihelp.html#entrez_que
>> ry
>>
>> Incidentally, BLAST+ has this awesome feature whereby you can, from the
>> command line, run searches remotely on their server against their
>> databases from the command line, just by adding the --remote flag.
>>
>>
>> (You can run BLAST+ via the BioPerl wrapper module StandAloneBlastPlus, by
>> the way.)
>>
>> Dave
>>
>>
>>
>> On Jun 10, 2010, at 4:54 PM, <armendarez77 at hotmail.com> wrote:
>>
>>>
>>> Hello,
>>>
>>> Is there a BioPerl method that will give a list of GIs for a specified
>> NCBI taxonomy Id?
>>>
>>> I've previously tried using Urlapi to BLAST primers against the nr
>> database on the NCBI server, but recently I keep getting a
>>> 'Bad Gateway' error. While my system admin is looking into this, I've
>>> decided to go another route. Therefore, I've downloaded the NCBI nr
>> database.
>>>
>>> The problem I've run into is restricting the BLAST against the nr
>> database to a subset of sequences. The NCBI Blast tools have an option (-
>> l) that does this, but it requires a list of GI's.
>>>
>>> When I was using Urlapi, I restricted sequences using Taxonomy Ids
>> (Entrez Query). Therefore, is there a way to get all GIs within a
>> Taxonomy Id? I've seen that woth Bio::Taxonomy I can give a GI and get a
>> Tax Id, but not the reverse.
>>>
>>>
>>> Thank you,
>>>
>>> Veronica
>>>
>>>
>>>
>>>
>>> _________________________________________________________________
>>> Hotmail is redefining busy with tools for the New Busy. Get more from
>> your inbox.
>>>
>> http://www.windowslive.com/campaign/thenewbusy?ocid=PID28326::T:WLMTAGL:ON
>> :WL:en-US:WM_HMP:042010_2
>>> _______________________________________________
>>> Bioperl-l mailing list
>>> Bioperl-l at lists.open-bio.org
>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l
>>
>>
>> _______________________________________________
>> Bioperl-l mailing list
>> Bioperl-l at lists.open-bio.org
>> http://lists.open-bio.org/mailman/listinfo/bioperl-l
> =======================================================================
> Attention: The information contained in this message and/or attachments
> from AgResearch Limited is intended only for the persons or entities
> to which it is addressed and may contain confidential and/or privileged
> material. Any review, retransmission, dissemination or other use of, or
> taking of any action in reliance upon, this information by persons or
> entities other than the intended recipients is prohibited by AgResearch
> Limited. If you have received this message in error, please notify the
> sender immediately.
> =======================================================================
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/bioperl-l
More information about the Bioperl-l
mailing list