[Biojava-l] Sequence retrieval

Jeffrey Rosenfeld jeffr at amnh.org
Thu Jul 24 19:25:39 EDT 2003


I am new to this list, so my question might have already been 
discussed, but I cannot find any reference to it in the archive, so 
here goes:
I am trying to find a quick java-only way to retrieve sequences from a 
blast database.  I am writing a program that needs to obtain large 
amounts of sequences from a fairly large database.  I have tried using 
fastacmd, but there is a great slowdown because of teh need to start up 
an external process for each sequence query.  (I cannot execute one 
large fastacmd job because of the large amounts of sequence that I am 
querying. ) I know that biojava has many different formats for storing 
sequences, but I don't want to have to keep two databases of my 
sequences updated.  I am already using the blast database for blast, so 
I don't want another database.
Is there a simple way to implement fastacmd or somethign similar in 
java?  It should not be too hard to do either using JNI or reverse 
engineering the fastacmd code.

Thanks,

Jeff
Jeffrey Rosenfeld
Department of Invertebrate Zoology
American Museum of Natural History

79th Street@ Central Park West
New York, NY 10024
(212)313-7646



More information about the Biojava-l mailing list