[Bioperl-l] UCSC database -> GFF
Paul Edlefsen
pedlefsen at systemsbiology.org
Thu Jul 17 17:21:15 EDT 2003
Hmm. I've been working with them and they have assumed that they are
Sax based. Which is it?
Allen Day wrote:
>yes, but there is a problem with the ucsc das server not providing all the
>information that is available in the tables. utr info for refgenes, for
>example.
>
>also, my last understanding was that the bio::das packages were dom
>based... you needed to read the whole xml before gffifying the features.
>not so desireable with the est tracks...
>
>-allen
>
>
>On Thu, 17 Jul 2003, Lincoln Stein wrote:
>
>
>
>>I know that this response is rather delayed, but yes, this would work.
>>
>>Lincoln
>>
>>On Monday 23 June 2003 07:14 pm, Paul Edlefsen wrote:
>>
>>
>>>Could we just use our Bio::DB::Das stuff to read in the features, then
>>>use the features' gff_string() method to get the string? Does
>>>Bio::DB::GFF dealy allow us to write features back to a GFF db directly?
>>>
>>>:Paul
>>>
>>>Lincoln Stein wrote:
>>>
>>>
>>>>Hi Allen,
>>>>
>>>>I hope someone (Mummi?) will pick it up. It's necessary now that NCBI has
>>>>decided to make their human annotations effectively unloadable. Maybe
>>>>ENSEMBL can have a look at their EMBL output and try to find a way to get
>>>>more track info into it?
>>>>
>>>>Lincoln
>>>>
>>>>On Tuesday 17 June 2003 04:23 pm, Allen Day wrote:
>>>>
>>>>
>>>>>I wrote it ~1 year ago.
>>>>>
>>>>>-Allen
>>>>>
>>>>>On Tue, 17 Jun 2003, Lincoln Stein wrote:
>>>>>
>>>>>
>>>>>>Where did that come from? I was just talking with Mummi about how badly
>>>>>>we need this and how difficult it will be to do it right.
>>>>>>
>>>>>>Lincoln
>>>>>>
>>>>>>On Tuesday 17 June 2003 02:07 pm, Jason Stajich wrote:
>>>>>>
>>>>>>
>>>>>>>Does core/scripts/Bio-DB-GFF/load_ucsc.pl work for you?
>>>>>>>
>>>>>>>On Tue, 17 Jun 2003, Paul Edlefsen wrote:
>>>>>>>
>>>>>>>
>>>>>>>>The Generic Genome Browser comes with many helpful scripts for
>>>>>>>>converting various data sources into GFF format. I am wondering if
>>>>>>>>anyone in the bioperl community has written one for the UCSC data, as
>>>>>>>>is available at
>>>>>>>>ftp://genome.cse.ucsc.edu/goldenPath/10april2003/database and
>>>>>>>>described at http://genome.ucsc.edu/goldenPath/gbdDescriptions.html
>>>>>>>>-- I noticed that Allen Day wrote some handy aggregators for that
>>>>>>>>data such as Bio::DB::GFF::ucsc_genscan, so I wanted to check before
>>>>>>>>I start writing Generic-Genome-Browser/bin/process_ucsc.PLS.
>>>>>>>>
>>>>>>>>Thanks,
>>>>>>>>
>>>>>>>> :Paul
>>>>>>>>
>>>>>>>>
>>
>>
>
>
>
--
+-----O------------------------------------+
| o-o Paul T. Edlefsen
| o---o Computational Biologist
| o----o mailto:paul at systemsbiology.org
| O----O Institute for Systems Biology
| 0--o 1441 North 34th Street
| O Seattle, Washington 98103-8904
| o-o callto:1-206-732-1336
+-o---o------------------------------------+
More information about the Bioperl-l
mailing list