[BioSQL-l] Indexing of (seqfeature) locations?

Peter biopython at maubp.freeserve.co.uk
Wed Aug 26 10:53:40 UTC 2009


Hi BioSQL folks,

The BioSQL schema includes a few indexes on the location table
(quoting the MySQL schema here, but the PostgreSQL one looks the same):

CREATE INDEX seqfeatureloc_start ON location(start_pos, end_pos);
CREATE INDEX seqfeatureloc_dbx   ON location(dbxref_id);
CREATE INDEX seqfeatureloc_trm   ON location(term_id);

Will these indexes help with range searches like this?

"SELECT ... WHERE 2000 <= location.start_pos
AND location.end_pos <= 5000 AND ..."

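Or, for this kind of query, would it help to add separate single-column
indexes (the index names below are just placeholders, since
seqfeatureloc_start is already taken by the existing index):

CREATE INDEX seqfeatureloc_startpos ON location(start_pos);
CREATE INDEX seqfeatureloc_endpos   ON location(end_pos);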

A motivational use case would be to pull out an operon, or a
region of a record as part of a genome browser.
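
One way to check this empirically is to ask the database for its query
plan. The following is only a minimal sketch: it uses SQLite (via Python's
sqlite3 module) as a stand-in, a cut-down hypothetical version of the
location table, and synthetic coordinates. The MySQL and PostgreSQL planners
may well choose differently, so the same query should be run through their
own EXPLAIN on a real BioSQL database.

```python
import sqlite3

# In-memory SQLite database as a stand-in; NOT the real BioSQL schema.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE location (
    location_id   INTEGER PRIMARY KEY,
    seqfeature_id INTEGER,
    start_pos     INTEGER,
    end_pos       INTEGER
);
-- The composite index quoted from the BioSQL schema.
CREATE INDEX seqfeatureloc_start ON location(start_pos, end_pos);
""")

# Synthetic features: feature i spans [10*i, 10*i + 500].
rows = [(i, 10 * i, 10 * i + 500) for i in range(1, 10001)]
conn.executemany(
    "INSERT INTO location (seqfeature_id, start_pos, end_pos) VALUES (?, ?, ?)",
    rows,
)
conn.execute("ANALYZE")  # gather statistics for the planner

query = (
    "SELECT seqfeature_id FROM location "
    "WHERE 2000 <= start_pos AND end_pos <= 5000"
)

# The last column of each plan row is the human-readable description.
plan = conn.execute("EXPLAIN QUERY PLAN " + query).fetchall()
for row in plan:
    print(row[-1])

# Features fully contained in the region 2000..5000.
hits = conn.execute(query).fetchall()
print(len(hits), "features in region")
```

The printed plan shows whether the planner scans the whole table or uses
seqfeatureloc_start, and on which column it applies the range constraint.
Creating the single-column indexes and re-running the plan would show
whether they change anything.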

Thanks,

Peter


