Bioperl: content.pm
Clay Shirky
clay@shirky.com
Wed, 19 Jan 2000 07:25:10 -0500 (EST)
> c) is there another way? (as Tony Blair would say, a Third Way)
Wouldn't hashing the string in user-specified sizes make the crunching
the time-consuming part and the queries fast? (assuming it gets TIEd
to a database, or is queried on a command-line, that is.)
If you did something like:
$strand{$nucleotide_group} .= ":$offset";
where $nucleotide_group was anything from 1 to 6 nucleotides, and
offset was the number of positions, you could do things like look for
specific codons at specific offsets, or make a count of the number of
specific codons, or look for dicodon pairs. The position of the
offests would give you the location of the nucleotide group, just like
a regular array, but the total number of offests linked to any given
nucleotide group would also give you information about relative
frequency.
-clay
=========== Bioperl Project Mailing List Message Footer =======
Project URL: http://bio.perl.org/
For info about how to (un)subscribe, where messages are archived, etc:
http://www.techfak.uni-bielefeld.de/bcd/Perl/Bio/vsns-bcd-perl.html
====================================================================