[Bioperl-l] Getting taxa from gi
shalabh sharma
shalabh.sharma7 at gmail.com
Wed Sep 28 19:53:07 UTC 2011
Hi All,
I know this has been discussed before, but this is a somewhat new
problem that I am facing.
I want to get taxonomy (full lineage) information for a huge list of GIs.
I am using Bio::DB::GenBank for this with perl-5.12.3.
Here is my small script.
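#! /usr/local/perl-5.12.3/bin/perl -w
use strict;
use warnings;
use Bio::DB::GenBank;

# Accessions to look up (one example shown here)
my @ids = qw( CP000490 );
my $gbh = Bio::DB::GenBank->new();

foreach my $id ( @ids ) {
    # say "* ID: $id";
    my $seq = $gbh->get_Seq_by_acc( $id );
    # Skip accessions that could not be retrieved
    unless ( defined $seq ) {
        warn "Could not retrieve $id\n";
        next;
    }
    my $org = $seq->species;
    #print "$org\n";
    # classification() lists names from the most specific rank upwards
    my $class = join '-', $org->classification;
    print "$class\n";
}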
The output is:
Paracoccus denitrificans PD1222-Paracoccus-Rhodobacteraceae-Rhodobacterales-Alphaproteobacteria-Proteobacteria-Bacteria
which is fine, but I also want to get the taxon ID and, if possible, the
taxon IDs for every rank in the lineage.
Ideally I would like to get something like this:
318586 - - - - - - - 1224 - 2
I would really appreciate your help.
Thanks
Shalabh
--
Shalabh Sharma
Scientific Computing Professional Associate (Bioinformatics Specialist)
Department of Marine Sciences
University of Georgia
Athens, GA 30602-3636