[Bioperl-l] genbank mirror

Epstein, Jonathan A (NIH/NICHD) [E] epsteinj at mail.nih.gov
Wed Oct 4 11:25:57 UTC 2006

There's Seqhound:

We set this up locally, and it's probably the most comprehensive free solution out there, but it's non-trivial to setup. Also, since the Blueprint&BIND have lost most of their funding, I'm not sure how long you can count on SeqHound to remain operational (although for now it is being updated).


-----Original Message-----
From: Carlo Lapid [mailto:cmlapid at up.edu.ph]
Sent: Tue 10/3/2006 10:06 PM
To: bioperl-l at bioperl.org
Subject: [Bioperl-l] genbank mirror

I'm trying to set up a local mirror of a large part of the Genbank database.
For users to access the local database, I need to create a web-based search
tool, much like Entrez of NCBI, or SRS of EBI; that can parse the Genbank
flat files I've downloaded based on a query entered by the user.

I'm trying to use Bioperl to create this from scratch, but I'm having a very
hard time, especially since I want the user to have reasonable flexibility
in customizing his search. The best that I've been able to accomplish is a
search function that retrieves genbank sequence objects based on their
primary IDs or accession numbers; by using the fetch method of the
Bio::Index::GenBank module. But this doesn't help users who don't know the
exact IDs for the sequences they want.

Can anybody suggest a way to use Bioperl to search for an ordinary word or
phrase, like "16S gene", which could be matched against the description
field, or the entire genbank entry? (Alternatively, is there some other
freely available tool or software that can do this?) I've been scouring the
Bioperl documentation, but I couldn't find anything. I just need to be
pointed in the right direction. What I thought was a relatively simple
problem has been driving me crazy for days; if anybody has any suggestions I
would really, really appreciate it.
Bioperl-l mailing list
Bioperl-l at lists.open-bio.org

More information about the Bioperl-l mailing list