[Bioperl-l] [bioperl newbie] Retrieving link to protein from	PubChem
    Chris Fields 
    cjfields at illinois.edu
       
    Mon Nov  9 16:27:10 UTC 2009
    
    
  
On Nov 9, 2009, at 10:05 AM, saikari keitele wrote:
> Hi,
>
> I'm using Bioperl to retrieve records from PubChem.
> I'm trying to find a way-but have been unsuccessful- to retrieve  
> from a
> compound record, the reference to the protein(s) that can synthesize  
> the
> compound.
> Thanks very much.
>
> saikari
The below bioperl script returns the GI for proteins that correspond  
to the substance passed on the command line; invoke using 'perl  
pc_substance.pl substance_requested'.  It probably needs more fiddling  
to catch everything but it should get you started.
For other bits and pieces (such as how to retrieve the raw sequence  
files), please see the EUtilities HOWTO:
http://www.bioperl.org/wiki/HOWTO:EUtilities_Cookbook
chris
----------------------------------------
#!/usr/bin/perl -w
use 5.010;
use strict;
use warnings;
use Bio::DB::EUtilities;
my $substance = shift;
my $eutil = Bio::DB::EUtilities->new(-eutil => 'esearch',
                                      -db => 'pcsubstance',
                                      -term => $substance,
                                      -usehistory => 'y');
my $hist = $eutil->next_History || die;
$eutil->reset_parameters(-eutil => 'elink',
                        -history => $hist,
                        -db      => 'protein',
                        -dbfrom  => 'pcsubstance',
                        -retmax  => 1000);
say join(',',$eutil->get_ids);
    
    
More information about the Bioperl-l
mailing list