[Biojava-dev] PhredFormat problems

VERHOEF Frans verhoeff2 at gis.a-star.edu.sg
Thu Nov 27 04:14:30 EST 2003


Hi,

 

I've got some issues with the PhredFormat class. In the
readSequenceData(BufferedReader, SymbolTokenization, SeqIOListener )
method, a quality sequence is read and parsed into the individual
quality scores. Those individual scores are each given to
WordStreamParser (in the file WordTokenization) through the characters
method, which is pasting all the scores in a StringBuffer. However, the
spaces in between the quality scores are disappearing in the process.

When the close() method of WordStreamParser is called, it is trying to
do parse the quality score StringBuffer again, but since there are no
spaces in between the different values, it cannot parse it, with as a
result that once it tries to convert the string to an integer, it has a
problem because the number is huge.

 

So I assume the readSequenceData should not parse the quality sequence
string, but just hand the WordStreamParser the complete String. Am I
right? Can anyone still follow me?? ;-)

 

I also wonder whether anyone is using the phred classes.

 

Kind regards,

 

Frans Verhoef

Bioinformatics Specialist

Genome Institute of Singapore

Genome, #02-01, 60 Biopolis Street, Singapore 138672

Tel: +65 6478 8000

DID: +65 6478 8060

HP: +65 9848 4325

Email: verhoeff2 at gis.a-star.edu.sg

 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://portal.open-bio.org/pipermail/biojava-dev/attachments/20031127/5a1851f3/attachment.htm


More information about the biojava-dev mailing list