[Bioperl-l] Status of assembly modules

Florent Angly florent.angly at gmail.com
Fri Jun 18 05:39:39 UTC 2010


Hi Joshua,

Yes, there have been interesting improvements in the BioPerl assembly
modules since v1.6.1. You can find these changes in the development
version of BioPerl at http://github.com/bioperl/. I'll take this
opportunity to introduce the new features to people who don't follow
the commit messages:

First, there is support for more file formats from high-throughput 
platforms, including those generated by de novo assembly and comparative 
assembly tools, such as:
     * Roche 454 GS Assembler, aka Newbler (the ACE-454 variant)
     * Maq
     * SAM
     * Bowtie
There is also support for running many more of these tools through
Bioperl-run (Bio::Tools::Run):
     * Roche 454 GS Assembler, aka Newbler
     * Minimo
     * Maq
     * Samtools
     * Bowtie
For writing assembly files, I added the option to write ACE files,
which is quite useful because many assembly programs recognize this
format. So now you can read assemblies, modify them as you see fit,
and export them to other programs by writing the modified assembly to
an ACE file.
The internals of the IO parsers have become more granular: it is now
possible to read/write assembly files entirely, or one contig at a
time. Working one contig at a time is great for reducing memory usage.
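
To illustrate why working one contig at a time helps with memory, here
is a minimal sketch in Python. It is not BioPerl code and not how the
BioPerl parsers are implemented. It only assumes the basic ACE layout,
in which each contig record starts with a "CO" line and each read with
an "RD" line. The names iter_contigs, count_reads and SAMPLE_ACE are
made up for this example, and the sample data is heavily abridged (no
BQ, BS, QA or DS lines).

```python
import io
from typing import Iterator, List, TextIO, Tuple

# Heavily abridged ACE-like data, for illustration only.
SAMPLE_ACE = """\
AS 2 3

CO Contig1 8 2 1 U
ACGTACGT

AF read1 U 1
AF read2 U 3

RD read1 8 0 0
ACGTACGT

RD read2 6 0 0
GTACGT

CO Contig2 4 1 1 U
ACGT

AF read3 U 1

RD read3 4 0 0
ACGT
"""


def iter_contigs(handle: TextIO) -> Iterator[Tuple[str, List[str]]]:
    """Yield (contig_name, record_lines) for one contig at a time.

    Only the lines of the current contig are held in memory, so the
    peak memory use depends on the largest contig, not the whole file.
    """
    name = None
    lines: List[str] = []
    for line in handle:
        if line.startswith("CO "):
            # A new contig starts: hand over the previous one, if any.
            if name is not None:
                yield name, lines
            name = line.split()[1]
            lines = [line]
        elif name is not None:
            lines.append(line)
        # Lines before the first "CO" (e.g. the "AS" header) are skipped.
    if name is not None:
        yield name, lines


def count_reads(record_lines: List[str]) -> int:
    """Count the reads in one contig record by counting "RD" lines."""
    return sum(1 for line in record_lines if line.startswith("RD "))


if __name__ == "__main__":
    for contig_name, record in iter_contigs(io.StringIO(SAMPLE_ACE)):
        print(contig_name, count_reads(record))
```

Because iter_contigs is a generator, the caller can process or write
out each contig and then let it go before the next one is read, which
is the same idea as reading an assembly one contig at a time instead
of slurping the whole file.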

That's about it...

Regards,

Florent

PS/ Josh, you filed bug reports related to several of these issues 
(http://bugzilla.open-bio.org/show_bug.cgi?id=2726, 
http://bugzilla.open-bio.org/show_bug.cgi?id=2483). I am closing the 
ones that were not closed yet and thank you for submitting patches.


On 18/06/10 14:00, Joshua Udall wrote:
> Florent -
>
> I didn't want to ask a direct question on-list to perhaps avoid
> confusion.  Were you able to improve/submit a ContigIO to bioperl that
> works with one entry at a time (instead of slurping the entire ace
> file)?
>
>    



