[Biojava-l] Re: Proposed addition to the SequenceDB interface

Marc Colosimo MEColosimo@alumni.carnegiemellon.edu
Fri, 15 Mar 2002 13:51:07 -0500


Thomas Down <td2@sanger.ac.uk> wrote:

> Hi...
>
> I'm considering adding a filter(FeatureFilter); method to
> SequenceDB, which allows features to be extracted from a
> whole database, rather than just a single sequence.  Typical
> usage would be:
>
>   SequenceDB seqDB = ...
>   FeatureHolder mygene = seqDB.filter(
>       new FeatureFilter.ByAnnotation("gene.id", "BRCA2")
>   );

Would this return the feature as some sort of generic gene.id feature? My
growing concern is that for each file/db/SQL format we are adding features with
their original names rather than some defined BioJava enforced named feature. I
noticed a dtd for features. Unfortunately, I don't know much about XML besides
the simple things. Could we make something like gene_id, accession_no, etc...
? By using these set names, you don't have to know what a gene_id tag is for
EMBL, genbank, SQL,.......

Or have I missed this ability in BioJava somehow?

Marc