[Biojava-l] reading nib sequence files

mark.schreiber at group.novartis.com
Mon Jan 24 03:37:16 EST 2005


I'd need to brush up on my NIO, and my C!





Thomas Down <td2 at sanger.ac.uk>
01/24/2005 04:34 PM

 
        To:     "Richard HOLLAND" <hollandr at gis.a-star.edu.sg>
        cc:     "<baggott2 at llnl.gov>", biojava-list List <biojava-l at biojava.org>,
                Mark Schreiber/GP/Novartis at PH
        Subject:        Re: [Biojava-l] reading nib sequence files



On 24 Jan 2005, at 02:48, Richard HOLLAND wrote:

> It's a compressed binary format. I doubt BioJava would be able to read
> it without a lot of effort as the current parser framework is set up
> for text input only.

Nib support probably wouldn't fit into the text-oriented parsing 
framework, but I'm sure it could be supported somehow if there were 
demand.  A quick Google search doesn't turn up any format documentation, 
but Jim Kent's IO code is at:

           http://www.soe.ucsc.edu/~kent/src/unzipped/lib/nib.c

One interesting way to handle this might be to open the nib file as a 
MappedByteBuffer, and back a SymbolList directly using that -- 
potentially giving us an efficient way of working with huge sequences.  
Any interest in that?
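
A rough sketch of the memory-mapped idea, not a tested BioJava 
implementation. It assumes Java 7+ (java.nio.file) and the following nib 
layout, which should be checked against nib.c before relying on it: a 
4-byte signature 0x6BE93D3A in the writing machine's byte order, a 4-byte 
base count, then two bases per byte with the first base in the high 
nibble, codes 0-4 meaning T, C, A, G, N, and bit 0x8 marking a masked 
(lowercase) base. The class name NibView and its methods are hypothetical; 
a real version would implement SymbolList rather than return chars.

```java
import java.io.IOException;
import java.nio.ByteOrder;
import java.nio.MappedByteBuffer;
import java.nio.channels.FileChannel;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

/** Hypothetical read-only view over a nib file; not part of BioJava. */
final class NibView {
    // Assumed layout: 4-byte signature, 4-byte base count, then packed bases.
    private static final int SIGNATURE = 0x6BE93D3A;
    private static final int HEADER_BYTES = 8;
    // Assumed nibble codes 0..4; bit 0x8 is treated as a "masked" flag.
    private static final char[] CODES = {'T', 'C', 'A', 'G', 'N'};

    private final MappedByteBuffer buf;
    private final int length;

    private NibView(MappedByteBuffer buf, int length) {
        this.buf = buf;
        this.length = length;
    }

    static NibView open(Path path) throws IOException {
        try (FileChannel ch = FileChannel.open(path, StandardOpenOption.READ)) {
            if (ch.size() < HEADER_BYTES) {
                throw new IOException("Too short to be a nib file: " + path);
            }
            // The mapping stays valid after the channel is closed.
            MappedByteBuffer buf = ch.map(FileChannel.MapMode.READ_ONLY, 0, ch.size());
            // Try both byte orders, since the header follows the writer's endianness.
            buf.order(ByteOrder.LITTLE_ENDIAN);
            if (buf.getInt(0) != SIGNATURE) {
                buf.order(ByteOrder.BIG_ENDIAN);
                if (buf.getInt(0) != SIGNATURE) {
                    throw new IOException("Bad nib signature: " + path);
                }
            }
            return new NibView(buf, buf.getInt(4));
        }
    }

    int length() {
        return length;
    }

    /** Base at a 1-based position, matching SymbolList's indexing convention. */
    char baseAt(int pos) {
        if (pos < 1 || pos > length) {
            throw new IndexOutOfBoundsException("pos " + pos);
        }
        int i = pos - 1;
        int packed = buf.get(HEADER_BYTES + i / 2) & 0xFF;
        // Even offsets live in the high nibble, odd offsets in the low nibble.
        int nibble = (i % 2 == 0) ? (packed >>> 4) : (packed & 0x0F);
        int code = nibble & 0x7;
        char base = code < CODES.length ? CODES[code] : 'N';
        return (nibble & 0x8) != 0 ? Character.toLowerCase(base) : base;
    }
}
```

Nothing is decoded up front: each baseAt call reads one byte from the 
mapping, so the OS pages in only the regions actually touched. A single 
mapping is limited to 2 GB, which at two bases per byte is still about 
4 Gbases.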

           Thomas.





