[Biojava-l] Parsing MegaBLAST output files?

James Diggans jdiggans at excelsiortech.com
Mon Nov 22 01:38:20 EST 2004


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1



All, I'm attempting to use BioJava to parse the output from NCBI's
commandline MegaBLAST and receiving an error:

'Could not recognise the format of this file as one supported by the
framework.'

in a SAXException thrown by BlastLikeSAXParser. An old post to the
mailing list:

http://www.biojava.org/pipermail/biojava-dev/2002-October/000150.html

seems to indicate that this was fixed long ago via this commit to CVS:

http://cvs.biojava.org/cgi-bin/viewcvs/viewcvs.cgi/biojava-live/src/org/biojava/bio/program/ssbind/HeaderStAXHandler.java.diff?r1=1.3&r2=1.4&cvsroot=biojava

The MegaBLAST file I'm trying to parse is clean and my attempt at a
parse consists of (largely pulled from the recipe from BioJava in Anger):

- ------------------
InputStream is = new FileInputStream(blastResult);

BlastLikeSAXParser parser = new BlastLikeSAXParser();
SeqSimilarityAdapter adapter = new SeqSimilarityAdapter();
parser.setContentHandler(adapter);

alignmentResults = new ArrayList();
SearchContentHandler builder = new
	BlastLikeSearchBuilder(alignmentResults,
~                new DummySequenceDB("queries"),
		new DummySequenceDBInstallation());

adapter.setSearchContentHandler(builder);

parser.parse(new InputSource(is));
- ------------------

Any ideas on why I'm getting the SAXException? Thanks ...
- -j

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.3-nr1 (Windows XP)
Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org

iD8DBQFBoYlc75jgGJzUhNkRAu8zAJ9gTNoPouk4/29EDpWKcQVx5EB34gCg2MkD
DndldC3zi3bD2QKWgqMNOxs=
=TS47
-----END PGP SIGNATURE-----


More information about the Biojava-l mailing list