[Bioperl-l] [bioperl newbie] Retrieving link to protein from PubChem
Chris Fields
cjfields at illinois.edu
Mon Nov 9 16:27:10 UTC 2009
On Nov 9, 2009, at 10:05 AM, saikari keitele wrote:
> Hi,
>
> I'm using Bioperl to retrieve records from PubChem.
> I'm trying to find a way-but have been unsuccessful- to retrieve
> from a
> compound record, the reference to the protein(s) that can synthesize
> the
> compound.
> Thanks very much.
>
> saikari
The below bioperl script returns the GI for proteins that correspond
to the substance passed on the command line; invoke using 'perl
pc_substance.pl substance_requested'. It probably needs more fiddling
to catch everything but it should get you started.
For other bits and pieces (such as how to retrieve the raw sequence
files), please see the EUtilities HOWTO:
http://www.bioperl.org/wiki/HOWTO:EUtilities_Cookbook
chris
----------------------------------------
#!/usr/bin/perl -w
use 5.010;
use strict;
use warnings;
use Bio::DB::EUtilities;
my $substance = shift;
my $eutil = Bio::DB::EUtilities->new(-eutil => 'esearch',
-db => 'pcsubstance',
-term => $substance,
-usehistory => 'y');
my $hist = $eutil->next_History || die;
$eutil->reset_parameters(-eutil => 'elink',
-history => $hist,
-db => 'protein',
-dbfrom => 'pcsubstance',
-retmax => 1000);
say join(',',$eutil->get_ids);
More information about the Bioperl-l
mailing list