[Bioperl-l] Lineage from GB files
Surya Saha
ss2489 at cornell.edu
Tue Jan 31 21:49:37 UTC 2012
Hi Adlai,
It really depends on what items are present the Genbank/EMBL. You can use
the NCBI Taxonomy database and Taxonomy
modules<http://search.cpan.org/search?query=taxonomy&mode=all> in
CPAN to identify the taxonomic hierarchy of an accession, for e.g., you can
map the GI to Taxonomy ID and extract the taxonomy
using Bio::LITE::Taxonomy::NCBI.
Here's a script<https://github.com/hyphaltip/mobedac-fungi/blob/master/scripts/taxonomy.pl>
(not
authored by me) on Github that might get you started.
-Surya
On Fri, Jan 27, 2012 at 6:27 AM, Adlai Burman <adlai at refenestration.com>wrote:
> Does anyone know if there is a way to batch extract taxa such as class,
> order in Perl from, e/g/ genbank, EMBL records? I know that genus/species
> and some of the higher taxa are easy to parse from gb records but the
> interior are inconsistent strings (e.g. element x sometimes is a subclass
> and sometimes a family.
> Any help would really be appreciated.
>
> Thanks.
> Adlai
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/bioperl-l
>
More information about the Bioperl-l
mailing list