[BioSQL-l] Loading SwissProt to BioSQL?
mark.schreiber at group.novartis.com
mark.schreiber at group.novartis.com
Thu Jan 27 00:49:22 EST 2005
Richard Holland and I are trying to compile a document stating where
things should go to make the projects more compatible.
Initially we are finding out where things currently go for bioperl and
biojava and then we can argue about where they should go. If anyone can do
something similar for biopython and bioruby that would be cool.
- Mark
Hilmar Lapp <hlapp at gnf.org>
Sent by: biosql-l-bounces at portal.open-bio.org
01/27/2005 03:01 AM
To: Nathan Edwards <nedwards at umiacs.umd.edu>
cc: biopython at biopython.org, biosql-l at open-bio.org, (bcc: Mark
Schreiber/GP/Novartis)
Subject: Re: [BioSQL-l] Loading SwissProt to BioSQL?
I can't comment on the biopython side. Just one thought, if perl isn't
too scary for you and if you think you can manage to install the
requirement (or if you have a sysadmin you can bug with this), loading
the database through bioperl-db's load_seqdatabase.pl may get you set
up to then retrieve the content through biophyton.
It is in this case also untested though whether the storage of
annotation and feature annotation is compatible with how biopython
wants to find it. We just came upon a similar problem between bioperl
and biojava.
-hilmar
On Jan 26, 2005, at 9:12 AM, Nathan Edwards wrote:
>
> What is the current recommended solution for those wanting to load
> SwissProt database files (uniprot_sprot.dat) to BioSQL via BioPython?
>
> As far as I can tell, the situtation is this:
>
> * The SwissProt .dat file parsers under Bio.SwissProt.SProt don't
> produce SeqRecord objects required by BioSeqDatabase.load.
>
> * The SwissProt .dat file parsers under FormatIO can't parse SwissProt
> .dat files, period.
>
> Does *anyone* have a working SwissProt .dat file to BioSQL solution
> working in Python?
>
> The most recent solution I see suggested:
>
> http://biopython.org/pipermail/biopython/2004-May/002088.html
>
> doesn't work as advertised: SProt.SequenceParser produces SeqRecord
> objects with only minimal instantiation of SeqRecord fields (sequence
> and accession) and BioSeqDatabase dies because it expects a field that
> SequenceParser never instantiates; and FormatIO.readFile doesn't
> recognize the uniprot_sprot.dat as any format it recognizes, and if
> forced, with format='swissprot/38' or format='swissprot/40' it dies
> while parsing.
>
> Note: My entire BioPython, BioSQL, etc installation is new, pristine,
> the latest update from CVS.
>
> Thanks,
>
> nathan
>
> --
> Nathan Edwards, Ph.D.
> Center for Bioinformatics and Computational Biology
> 3119 Agriculture/Life Sciences Surge Building #296
> University of Maryland, College Park, MD 20742-3360
> Phone: +1 301-405-9901
> Email: nedwards at umiacs.umd.edu
> WWWeb: http://www.umiacs.umd.edu/~nedwards
> _______________________________________________
> BioSQL-l mailing list
> BioSQL-l at open-bio.org
> http://open-bio.org/mailman/listinfo/biosql-l
>
--
-------------------------------------------------------------
Hilmar Lapp email: lapp at gnf.org
GNF, San Diego, Ca. 92121 phone: +1-858-812-1757
-------------------------------------------------------------
_______________________________________________
BioSQL-l mailing list
BioSQL-l at open-bio.org
http://open-bio.org/mailman/listinfo/biosql-l
More information about the BioSQL-l
mailing list