[Biojava-l] [Biojava-dev] Problem converting ab1 to fastq files in Biojava 1.8
Andy Yates
ayates at ebi.ac.uk
Thu Sep 20 15:17:02 UTC 2012
Hi,
It's been about 7-8 years since I last looked into the ABIFChromatogram object and I all remember failing to extract quality scores. Can you convert the AB1 file into an SCF trace and then use biojava? The staden io package comes with a binary called convert_trace which will get you into SCF. No guarantee the scores will be converted as well but it's worth a try I guess.
Andy
On 20 Sep 2012, at 16:06, Nick England wrote:
> Sebastian,
>
> I have also tried to obtain the AB1 quality scores using BioJava, but
> was not successful. I can obtain them from scf files, but not AB1.
> Looking at the API it appears that they are not available, but I would
> be happy to be shown wrong!
>
> Nick
>
> On 20 September 2012 15:49, Sebastian garcia lopez
> <deltadedirac at gmail.com> wrote:
>> Good day to all,
>>
>> Excuse me for my english, I wil try to explain in a better way my problem:
>> I need to parser ab1 file into fastq file, the problem is that I do not
>> know the form to obtain the quality scores from ab1. to obtain the
>> sequence, I use ABIFChromatogram, and in fact, I can obtain the sequence,
>> yet I do not know how I can obtain the quality scores to build my fastq
>> file. The problem is that the "trace-offsets" do not correspond with
>> quality scores. If it is helpful, I put a fragment of the code that I am
>> using
>>
>>
>> ABIFChromatogram y = new ABIFChromatogram();//ya casi lo logro, falta ver
>> como leer esos alignment
>> y=ABIFChromatogram.create(new File(Path));
>> Alignment to=y.getBaseCalls();
>> SymbolList dnaSeq=to.symbolListForLabel("dna");
>> SymbolList trace=to.symbolListForLabel("trace-offsets");
>>
>> System.out.println(dnaSeq.seqString());
>> System.out.println(trace.seqString());
>>
>> Please, if somebody know the mode to obtain the scores from ab1 files in
>> Biojava, please let me know.
>>
>> Thank you.
>>
>> --
>> Sebastián García López
>> Electronic Engineer
>> Universidad Nacional de Colombia at Manizales
>>
>> Ms.Eng. Industrial Automation Student
>> Control and Digital Signal Processing Research Group (GC&PDS)
>> Universidad Nacional de Colombia at Manizales
>> MCP-Microsoft Certified Professional
>>
>> Email: deltadedirac at gmail.com
>> sgarcialop at unal.edu.co
>> Skype: sebastiang55
>> Mobile: +57 3147569794
>>
>> _______________________________________________
>> biojava-dev mailing list
>> biojava-dev at lists.open-bio.org
>> http://lists.open-bio.org/mailman/listinfo/biojava-dev
>
> _______________________________________________
> Biojava-l mailing list - Biojava-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/biojava-l
More information about the Biojava-l
mailing list