[Biojava-l] Interested in the "cloudization" of BioJava
Andreas Prlic
andreas at sdsc.edu
Thu Apr 5 15:49:08 UTC 2012
Hi Arthur,
> 1) The first one and the one i find most interesting can be to try to
> introduce the map-reduce framework to help to speed-up the pairwise
> alignment in the creation of the muliple sequence alignment.
That would be a possible application.
> 2)If the input files are big enough, it can be interesting to perform the
> parsing on this files while using a distributed infrastructure to speedup
> the process,
I am not sure if I have encountered such large files as of yet. Do you
have an example?
> 3)Another idea can be to try to have a hadoopify version of blast, in which
> the input file also can be splitted and then for each sequence in a chunk,
> the node would perform a local blast query.
I agree, another possible application...
What frameworks did you think about using?
Andreas
More information about the Biojava-l
mailing list