[Bioperl-l] UCSC database -> GFF

Lincoln Stein lstein at cshl.edu
Mon Jul 21 11:14:22 EDT 2003


The modules are sax based.  For downloading smallish whole genome datasets, 
such as the gene models, bio::das will be fine.  For ESTs and repeats, don't 
even try!

Lincoln


On Friday 18 July 2003 03:46 am, Ewan Birney wrote:
> On Thu, 17 Jul 2003, Allen Day wrote:
> > yes, but there is a problem with the ucsc das server not providing all
> > the information that is available in the tables.  utr info for refgenes,
> > for example.
> >
> > also, my last understanding was that the bio::das packages were dom
> > based... you needed to read the whole xml before gffifying the features.
> > not so desireable with the est tracks...
>
> I (*think*) they are SAX based...
>
>
> DAS is not a good way to download full datasets however... it was never
> designed for that and doesn't scale well.
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
========================================================================
Lincoln D. Stein                           Cold Spring Harbor Laboratory
lstein at cshl.org			                  Cold Spring Harbor, NY
========================================================================




More information about the Bioperl-l mailing list