[Bioperl-l] Re: flybase release 4.2 gff
Don Gilbert
gilbertd at bio.indiana.edu
Mon Sep 26 15:35:08 EDT 2005
Michael,
The GFF data from FlyBase is in GFF version 3 format, and at a quick look
the current 4.2 release is as valid GFF3 as the earlier 4.x releases. I
don't know which version of BioPerl added GFF3 support. I'm using 1.5.x,
which reads the FlyBase release 4 GFF3 OK.
Did you have problems with just this recent dmel 4.2 release?
Though I'm no longer with the flybase project, the data dumper they
are using is the one I wrote for dumping GMOD Bulkfiles from Chado
databases (and it works for other Chado databases). Find this software
at http://cvs.sourceforge.net/viewcvs.py/gmod/schema/GMODTools/
At some point this will be packaged as a simpler-to-use Chado data dumper.
If this isn't producing proper GFF3 output, I'll be happy to correct
it.
BTW, I've loaded the dmel release 4.1 GFF into GBrowse v1.63
using several database adaptors for testing. Find configurations for
D. melanogaster features in GBrowse at
ftp://eugenes.org/eugenes/gbrowse/conf/gbrowse.conf/
  07.dmel_mysql.conf    == MySQL database
  06.dmel_bdb.conf      == BerkeleyDB database
  10.dmel_lucene.conf   == Lucene database
  05.dmel_lucegene.conf == Lucegene database
Both of these Lucene adaptors are as fast as or faster than MySQL, and
you can copy the data indices and run locally with minimal setup, along with
all the other Drosophila species annotations I've put up at
http://insects.eugenes.org/species/maps/
More info at
ftp://eugenes.org/eugenes/gbrowse/lucene-gbrowse-lite.txt
-- Don Gilbert
> From Michael Eisen mbeisen at lbl.gov
> Wed Sep 21 18:00:24 EDT 2005
>
> Has anyone written a script to convert flybase's release 4.2 Drosophila
> melanogaster annotations to gff suitable for bioperl?
Data from ftp://flybase.net/genomes/dmel/dmel_r4.2_20050909/gff/
##gff-version 3
##sequence-region 2L 1 22407834
#organism dmel
#source FlyBase Chado DB r4.2.1
#date 20050807
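For a quick look at a file's header before loading it, something like the following Python sketch may help. It is not part of BioPerl or GMODTools; the function name parse_gff3_header and the dictionary layout are made up for illustration, and it only reads the leading '#' lines shown above.

```python
def parse_gff3_header(lines):
    """Collect the leading '#' lines of a GFF3 file.

    Hypothetical helper for illustration only: '##' lines are treated as
    directives, single-'#' lines as free-form 'key value' comments.
    """
    header = {"version": None, "regions": [], "comments": {}}
    for line in lines:
        line = line.rstrip("\n")
        if not line.startswith("#"):
            break  # first feature line ends the header
        parts = line.split()
        if parts[0] == "##gff-version" and len(parts) > 1:
            header["version"] = parts[1]
        elif parts[0] == "##sequence-region" and len(parts) == 4:
            header["regions"].append((parts[1], int(parts[2]), int(parts[3])))
        elif not line.startswith("##"):
            # e.g. "#source FlyBase Chado DB r4.2.1"
            key, _, value = line[1:].strip().partition(" ")
            header["comments"][key] = value
    return header


sample = [
    "##gff-version 3",
    "##sequence-region 2L 1 22407834",
    "#organism dmel",
    "#source FlyBase Chado DB r4.2.1",
    "#date 20050807",
]
header = parse_gff3_header(sample)
print(header["version"], header["regions"])
```

A version of "3" here is only a first sanity check; it says nothing about whether the feature lines themselves are valid GFF3.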
$gbl/bin/bp_load_gff.pl --adaptor dbi::mysqlopt --user dgg \
  --dsn 'dbi:mysql:database=dmel_r42_20050909;host=localhost' \
  --create $w/dmel420/gffb/*gff.gz
/bio/argos/flybase/work.local/dmel420/gffb/dmel-2L-r4.2.1.gff.gz: loading...
/bio/argos/flybase/work.local/dmel420/gffb/dmel-2L-r4.2.1.gff.gz: 557362 records loaded
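To cross-check a load like the one above, a rough count of feature lines in the gzipped file can be made independently of BioPerl. This is only a sketch: the function name count_gff3_features is made up, and it assumes a "record" is a non-comment line with nine tab-separated columns before any ##FASTA section. The loader may count records differently, so the numbers need not match exactly.

```python
import gzip
import sys


def count_gff3_features(path):
    """Count nine-column feature lines in a gzipped GFF3 file.

    Hypothetical helper: skips '#' lines and blank lines, and stops at a
    '##FASTA' directive if one is present.
    """
    count = 0
    with gzip.open(path, "rt") as handle:
        for line in handle:
            if line.startswith("##FASTA"):
                break
            if line.startswith("#") or not line.strip():
                continue
            if len(line.rstrip("\n").split("\t")) == 9:
                count += 1
    return count


if __name__ == "__main__":
    # Usage: python count_gff.py file1.gff.gz [file2.gff.gz ...]
    for path in sys.argv[1:]:
        print(f"{path}: {count_gff3_features(path)} feature lines")
```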
-- d.gilbert--bioinformatics--indiana-u--bloomington-in-47405
-- gilbertd at indiana.edu--http://marmot.bio.indiana.edu/