[Bioperl-l] Bio::DB::GFF::Aggregator problem, new wormbase models.

Philip MacMenamin pm66 at nyu.edu
Wed Feb 18 11:41:46 EST 2004


Thanks very much Todd...

What version of wormbase are you using? I am using WS118. I am not able to 
get this aggregator to return me the UTR bits. 

For instance, I connect to the db using your agg:
my $db = new Bio::DB::GFF(-adaptor=>'dbi::mysqlopt',
#			  -dsn=>'dbi:mysql:wormbase110;host=localhost',
			  -dsn=>'dbi:mysql:wormbase118;host=localhost',
			  -user=>$user,
			  -pass=>$passwd,
			  -aggregator => [qw(wormabse_cds{coding_exon,5_UTR,3_UTR/CDS})],
			 ) or die();

#...ask for a segment in the AH6.5 region:
my $panelSeg = $db->segment(CDS=>'AH6.5');

#...make a searchSegment a little larger to pick everything:
my $searchSeg =$db->segment($panelSeg->sourceseq, 
($panelSeg->abs_start-1000), ($panelSeg->abs_end+1000));

#and, then get the features that wormabse_cds pulls:
my @all_transcripts = $searchSeg->features('wormabse_cds');
foreach my $transcript ( @all_transcripts )
{
    print $transcript, $transcript->abs_start,' ', $transcript->abs_end,"\n";
}

I assume this is the right way to do things. But, the problem is that this 
does not get my UTRs. This does:
my @UTRs = $searchSeg->features('UTR:UTR');
foreach my $UTR (@UTRs)
{
    print $UTR," ",$UTR->start," ",$UTR->end,"\n";
}
But, these are not aggregated obviously.

This is the output of the little script above:

UTR:UTR(5_UTR:AH6.5) 9524078 9524086
UTR:UTR(3_UTR:AH6.5) 9525782 9526248
...
wormabse_cds:curated(AH6.5)9524087 9525781

And you can see that the wormabse_cds does not overlap with the UTRs.

Sorry about this. I have been trying all sorts of things... it just keeps on 
missing the UTRs in the new wormbase models. And we can't re-sync the site 
here till this works.

Philip.
On Tuesday 17 February 2004 05:10 pm, Todd Harris wrote:
> Hi Phillip -
>
> You need to aggregate the separate parts of the CDS.  Create a wormbase_cds
> (or whatever you wish to call it), aggregating the following features using
> the CDS group: coding_exon,5_UTR,3_UTR.
>
> The following stanza should do the trick.
>
> $dbgff = (-adaptor => 'dbi::mysql',
>           -dsn     => 'dbi:mysql:database=your_database;host=your_host',
>           -aggregators => [qw(wormabse_cds{coding_exon,5_UTR,3_UTR/CDS})],
>           -user    => 'your_username',
>           -pass    => 'your_dbgff_pass');
>
> This should do the trick for properly aggregating genes under the new
> WormBase CDS class.
>
> Todd Harris


More information about the Bioperl-l mailing list